ID A0A210PXH4_MIZYE Unreviewed; 634 AA.
AC A0A210PXH4;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Cathepsin F {ECO:0000313|EMBL:QBA97386.1};
DE SubName: Full=Cysteine proteinase {ECO:0000313|EMBL:OWF41198.1};
GN ORFNames=KP79_PYT09842 {ECO:0000313|EMBL:OWF41198.1};
OS Mizuhopecten yessoensis (Japanese scallop) (Patinopecten yessoensis).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Bivalvia;
OC Autobranchia; Pteriomorphia; Pectinida; Pectinoidea; Pectinidae;
OC Mizuhopecten.
OX NCBI_TaxID=6573 {ECO:0000313|EMBL:OWF41198.1, ECO:0000313|Proteomes:UP000242188};
RN [1] {ECO:0000313|Proteomes:UP000242188}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PY_sf001 {ECO:0000313|Proteomes:UP000242188};
RX PubMed=28812685; DOI=10.1038/s41559-017-0120;
RA Wang S., Zhang J., Jiao W., Li J., Xun X., Sun Y., Guo X., Huan P.,
RA Dong B., Zhang L., Hu X., Sun X., Wang J., Zhao C., Wang Y., Wang D.,
RA Huang X., Wang R., Lv J., Li Y., Zhang Z., Liu B., Lu W., Hui Y., Liang J.,
RA Zhou Z., Hou R., Li X., Liu Y., Li H., Ning X., Lin Y., Zhao L., Xing Q.,
RA Dou J., Li Y., Mao J., Guo H., Dou H., Li T., Mu C., Jiang W., Fu Q.,
RA Fu X., Miao Y., Liu J., Yu Q., Li R., Liao H., Li X., Kong Y., Jiang Z.,
RA Chourrout D., Li R., Bao Z.;
RT "Scallop genome provides insights into evolution of bilaterian karyotype
RT and development.";
RL Nat. Ecol. Evol. 1:120-120(2017).
RN [2] {ECO:0000313|EMBL:OWF41198.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PY_sf001 {ECO:0000313|EMBL:OWF41198.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:OWF41198.1};
RA Wang S., Zhang J., Jiao W., Li J., Xun X., Sun Y., Guo X., Huan P.,
RA Dong B., Zhang L., Hu X., Sun X., Wang J., Zhao C., Wang Y., Wang D.,
RA Huang X., Wang R., Lv J., Li Y., Zhang Z., Liu B., Lu W., Hui Y., Liang J.,
RA Zhou Z., Hou R., Li X., Liu Y., Li H., Ning X., Lin Y., Zhao L., Xing Q.,
RA Dou J., Li Y., Mao J., Guo H., Dou H., Li T., Mu C., Jiang W., Fu Q.,
RA Fu X., Miao Y., Liu J., Yu Q., Li R., Liao H., Li X., Kong Y., Jiang Z.,
RA Chourrout D., Li R., Bao Z.;
RT "Scallop genome provides insights into evolution of bilaterian karyotype
RT and development.";
RL Nat. Ecol. Evol. 1:120-120(2017).
RN [3] {ECO:0000313|EMBL:QBA97386.1}
RP NUCLEOTIDE SEQUENCE.
RA Guo H., Wang Y.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NEDP02005416; OWF41198.1; -; Genomic_DNA.
DR EMBL; MH318608; QBA97386.1; -; mRNA.
DR AlphaFoldDB; A0A210PXH4; -.
DR SMR; A0A210PXH4; -.
DR STRING; 6573.A0A210PXH4; -.
DR EnsemblMetazoa; XM_021516986.1; XP_021372661.1; LOC110462801.
DR OrthoDB; 1085298at2759; -.
DR BRENDA; 3.4.22.41; 4566.
DR Proteomes; UP000242188; Unassembled WGS sequence.
DR GO; GO:0004869; F:cysteine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00042; CY; 3.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.10.450.10; -; 3.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR000010; Cystatin_dom.
DR InterPro; IPR046350; Cystatin_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR018073; Prot_inh_cystat_CS.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00031; Cystatin; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF16845; SQAPI; 2.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00043; CY; 3.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54403; Cystatin/monellin; 3.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00287; CYSTATIN; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 2: Evidence at transcript level;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000242188};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..634
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5036313223"
FT DOMAIN 18..112
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 116..200
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 204..313
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 336..395
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 421..632
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 634 AA; 70179 MW; AAA1738ED6CEB9CA CRC64;
MNVLIVLSCV VACCYGSGVP GGYSKADISD PEIIGYANFF VSQYNLRTKA QGGNAAEYTN
VQLVSANQQV VSGMNYDLHL RMSNGAVTQM CDVVIYVQPW TNTKRVDTQV CHPVTKRAGG
YEEHSVDAEV MMMANFALNK MNVLLKQSNS LVRVVSASTQ VVAGTNYKLR LELDQNKFCD
VVVFNQPWTQ TTQLTSNSCD STKRQLGAPH AISSSDIHVQ EVFTQAIELA NGMSNNMYRL
SAVSIQHVTK QVVAGMKYEF DMMMGESSCR NNGESKGLTA VNCPVKSGGM STTFHVVGIW
QSWTTPEYHV TVTPVVDNGV VDKPKETQQL STMEKFLSFK RKHSKVYSSM AEEKKRYRIF
ESNMKLAAKI QQTERPGSTA RYGATKFADL TEKEFRQHVG YKWDLKGNVG MQKAEIPRGT
TPEAFDWRDH NAVTEVKNQG SCGSCWAFST TGNIEGQWAI KQGKLVSLSE QELVDCDKVD
EGCNGGLPSQ AYKEIIRLGG LETEKEYKYE GEDEKCLFNR SDVRVTINSS VSISSDEKEM
AAWLAKNGPI SIGINAFAMQ FYMGGISHPW SFFCSPKELD HGVLIVGYGV QDGTPYWAVK
NSWGPDWGEK GYYLVYRGGG VCGLNTMCTS SVID
//