GenomeNet

Database: UniProt
Entry: A0A1S4CGT5_TOBAC
LinkDB: A0A1S4CGT5_TOBAC
Original site: A0A1S4CGT5_TOBAC 
ID   A0A1S4CGT5_TOBAC        Unreviewed;       376 AA.
AC   A0A1S4CGT5;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   SubName: Full=Cysteine proteinase COT44-like {ECO:0000313|RefSeq:XP_016500184.1};
GN   Name=LOC107818652 {ECO:0000313|RefSeq:XP_016500184.1};
OS   Nicotiana tabacum (Common tobacco).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC   Nicotiana.
OX   NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016500184.1};
RN   [1] {ECO:0000313|Proteomes:UP000084051}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX   PubMed=24807620; DOI=10.1038/ncomms4833;
RA   Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA   Goepfert S., Peitsch M.C., Ivanov N.V.;
RT   "The tobacco genome sequence and its comparison with those of tomato and
RT   potato.";
RL   Nat. Commun. 5:3833-3833(2014).
RN   [2] {ECO:0000313|RefSeq:XP_016500184.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_016500184.1; XM_016644698.1.
DR   AlphaFoldDB; A0A1S4CGT5; -.
DR   SMR; A0A1S4CGT5; -.
DR   STRING; 4097.A0A1S4CGT5; -.
DR   PaxDb; 4097-A0A1S4CGT5; -.
DR   GeneID; 107818652; -.
DR   KEGG; nta:107818652; -.
DR   OMA; LMFIDEH; -.
DR   OrthoDB; 5472443at2759; -.
DR   Proteomes; UP000084051; Unplaced.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF951; CATHEPSIN-RELATED; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000084051};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..376
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5016955460"
FT   DOMAIN          52..109
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          142..357
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   376 AA;  42451 MW;  C06EA16FE8F7D66E CRC64;
     MAKTIITTLL FALFSSLSYA IDMSIIDYKN NQHVGNMKKW TWQNDEEVKG IYELWLAEHG
     KAYNALGEKE KRFEIFKDNL KFIEEHNNSG NRTYKVGLNQ FADLTNEEYR TMYLGTRSDA
     RRRFVKSKNP SHRYASRPNE LMPHSVDWRK RGAVAPIKNQ GSCGSCWAFS TVAAVEGINQ
     IVTGEMITLS EQELVDCDRV QNSGCNGGLM DYAFEFIISN GGIDTENHYP YCGVEGRCDP
     VRKNYKVVSI DGYEDVPRNE RALQKAVAHQ PVCVAIEASG RAFQHYSSGV FTGECGEQVD
     HGVVVVGYDS EDGVDYWIVR NSWGTKWGEN GYVKMERNVK KSHLGKCGIM TEASYPTKDS
     GINKRTTSNE EKISSI
//
DBGET integrated database retrieval system