GenomeNet

Database: UniProt
Entry: F2TXF2_SALR5
LinkDB: F2TXF2_SALR5
Original site: F2TXF2_SALR5 
ID   F2TXF2_SALR5            Unreviewed;       381 AA.
AC   F2TXF2;
DT   31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT   31-MAY-2011, sequence version 1.
DT   27-MAR-2024, entry version 44.
DE   SubName: Full=Cathepsin {ECO:0000313|EMBL:EGD76061.1};
GN   ORFNames=PTSG_00770 {ECO:0000313|EMBL:EGD76061.1};
OS   Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC   Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX   NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN   [1] {ECO:0000313|Proteomes:UP000007799}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA   Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA   Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA   Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA   Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA   Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA   Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA   Birren B.;
RT   "Annotation of Salpingoeca rosetta.";
RL   Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL832956; EGD76061.1; -; Genomic_DNA.
DR   RefSeq; XP_004998236.1; XM_004998179.1.
DR   AlphaFoldDB; F2TXF2; -.
DR   STRING; 946362.F2TXF2; -.
DR   EnsemblProtists; EGD76061; EGD76061; PTSG_00770.
DR   GeneID; 16078832; -.
DR   KEGG; sre:PTSG_00770; -.
DR   eggNOG; KOG1543; Eukaryota.
DR   InParanoid; F2TXF2; -.
DR   OMA; VYYDEEC; -.
DR   OrthoDB; 5472948at2759; -.
DR   Proteomes; UP000007799; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF971; CATHEPSIN L-RELATED; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007799};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..381
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018744791"
FT   DOMAIN          20..80
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          106..305
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   381 AA;  40814 MW;  6FBF79DB9395FFDC CRC64;
     MMLKIAVLLA MAVAANAMSF DDFKTTFEKQ YESPEEEARR FAIFADNLAF IARHNAEAAR
     GLHTHTVGVN QFADLTNEEY RQLYLRPYPT ELLGRERQEV WLDGPNAGSV DWRQKGAVTP
     IKNQGQCGSC WSFSTTGSVE GAHAIATGNL VSLSEQQLVD CSGSFGNQGC NGGLMDNAFK
     YIISNGGLDT EQDYPYTARD GVCDKSKESK HAVSISGYKD VPQNNEDQLA AAVEKGPVSV
     AIEADQQSFQ MYSSGVFSGP CGTNLDHGVL VVGYTSDYWI VKNSWGASWV TRGGCHSGEQ
     AVRIEGISGS FCSPSCSSTS PCPTNVPPGT TAQPECVLET QGSSQPTNCA LICNPEENDG
     GCPQNASCKP IQGVGICTYD S
//
DBGET integrated database retrieval system