ID A0A0T6B2Q4_9SCAR Unreviewed; 473 AA.
AC A0A0T6B2Q4;
DT 17-FEB-2016, integrated into UniProtKB/TrEMBL.
DT 17-FEB-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Peptidase {ECO:0000313|EMBL:KRT81521.1};
GN ORFNames=AMK59_5081 {ECO:0000313|EMBL:KRT81521.1};
OS Oryctes borbonicus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Scarabaeiformia;
OC Scarabaeidae; Dynastinae; Oryctes.
OX NCBI_TaxID=1629725 {ECO:0000313|EMBL:KRT81521.1, ECO:0000313|Proteomes:UP000051574};
RN [1] {ECO:0000313|EMBL:KRT81521.1, ECO:0000313|Proteomes:UP000051574}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OB123 {ECO:0000313|EMBL:KRT81521.1};
RC TISSUE=Whole animal {ECO:0000313|EMBL:KRT81521.1};
RA Meyer J.M., Markov G.V., Baskaran P., Herrmann M., Sommer R.J.,
RA Roedelsperger C.;
RT "Draft genome of the scarab beetle Oryctes borbonicus.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRT81521.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIG01016131; KRT81521.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0T6B2Q4; -.
DR OrthoDB; 1085298at2759; -.
DR Proteomes; UP000051574; Unassembled WGS sequence.
DR GO; GO:0004869; F:cysteine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00042; CY; 1.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.10.450.10; -; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR000010; Cystatin_dom.
DR InterPro; IPR046350; Cystatin_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR018073; Prot_inh_cystat_CS.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00031; Cystatin; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00043; CY; 1.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54403; Cystatin/monellin; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00287; CYSTATIN; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000051574};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 32..140
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 168..225
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 254..471
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 473 AA; 53708 MW; A4C346683595AA95 CRC64;
MVVPTFLIPA LLSRCPMRMN KEQKFRSKRS AVLVGGIVPV STTDPDVADL VEYTLTNLDQ
QSTLDQKYKV TEIVSATKQV VAGSLYNIEA RIVPSDCPKT DSRARSECGI LKEEDPQLCQ
IKIWDRPWLP NGRQTSVTCD QATYKFRSKR SVGRVYYEDD FSVQFNAFRD FKEKYNKIYP
TTSEELRRFK IFRNNLKKIH LLNINERGTA VYGITKFSDL TYAEFRAKHM GLRTDLALEN
EITFPQAEIP DIELPDEFDW REKGAVTEVK NQQSCGSCWA FSVTGNVEGQ YAIKHQKLLE
FSEQELVDCD KYDEGCGGGL MDNAYRAIEE IGGLELESDY PYDAENEKCH YDKNLFKVEL
SGAVNISQNE DDMAKWLVQN GPISIAINAN AMQFYVGGVS HPWKFLCNPK SLDHGVLIVG
YGVHSYPTFK KTLPYWIVKN SWGKSWGEQG YYLVYRGDGT CGLNQTPSSA IVA
//