ID A0A0K9NNM0_ZOSMR Unreviewed; 383 AA.
AC A0A0K9NNM0;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Cysteine proteinase cathepsin F {ECO:0000313|EMBL:KMZ58361.1};
GN ORFNames=ZOSMA_77G00200 {ECO:0000313|EMBL:KMZ58361.1};
OS Zostera marina (Eelgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Zosteraceae; Zostera.
OX NCBI_TaxID=29655 {ECO:0000313|EMBL:KMZ58361.1, ECO:0000313|Proteomes:UP000036987};
RN [1] {ECO:0000313|Proteomes:UP000036987}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Finnish {ECO:0000313|Proteomes:UP000036987};
RX PubMed=26814964; DOI=10.1038/nature16548;
RA Olsen J.L., Rouze P., Verhelst B., Lin Y.-C., Bayer T., Collen J.,
RA Dattolo E., De Paoli E., Dittami S., Maumus F., Michel G., Kersting A.,
RA Lauritano C., Lohaus R., Toepel M., Tonon T., Vanneste K., Amirebrahimi M.,
RA Brakel J., Bostroem C., Chovatia M., Grimwood J., Jenkins J.W.,
RA Jueterbock A., Mraz A., Stam W.T., Tice H., Bornberg-Bauer E., Green P.J.,
RA Pearson G.A., Procaccini G., Duarte C.M., Schmutz J., Reusch T.B.H.,
RA Van de Peer Y.;
RT "The genome of the seagrass Zostera marina reveals angiosperm adaptation to
RT the sea.";
RL Nature 530:331-335(2016).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KMZ58361.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LFYR01001945; KMZ58361.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0K9NNM0; -.
DR STRING; 29655.A0A0K9NNM0; -.
DR OMA; CQFERSK; -.
DR OrthoDB; 1085298at2759; -.
DR Proteomes; UP000036987; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF947; CATHEPSIN O; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000036987};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..383
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018544270"
FT DOMAIN 56..112
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 144..369
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 383 AA; 42866 MW; 2B75A3F90A664D8C CRC64;
MDRYLGRSRC SFTFLAIAVL FTLLSPAKEV LSDPLIQQVI SDLYSDEPIL DAEEHFETFK
RTFGKKYKSD EEHTSRFEIF KGNMRKARIH QKLDPSAIHG VTQFSDLTYE EFEKGYLGVD
NGMEWNLFRN ASGETEAPIL PTEVPDDFDW RKHGAVTEVE DQGSCGSCWS FSAVGALEGA
NYLATGKLIS LSKQQLLDCD HECDPSDPDS CDSGCQGGLM RNAFEYLFKA GGIQSEKSYP
YRGEDGHKCQ FERSKIAASV KSFSVISSDP NQIAANLVKN GPLAVGINAI YMQTYRGGVS
CPYLCCKNVD HGVLLVGYGK AAYSQIRMKK KPYWIIKNSW GDQWGEDGYY KICRERNICG
VDSMVSMVSV EDIKSTTYTS HSH
//