GenomeNet

Database: UniProt
Entry: A0A5D2S931_GOSMU
LinkDB: A0A5D2S931_GOSMU
Original site: A0A5D2S931_GOSMU 
ID   A0A5D2S931_GOSMU        Unreviewed;       355 AA.
AC   A0A5D2S931;
DT   13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2019, sequence version 1.
DT   13-SEP-2023, entry version 11.
DE   RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=E1A91_D13G230200v1 {ECO:0000313|EMBL:TYI48204.1};
OS   Gossypium mustelinum (Cotton).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX   NCBI_TaxID=34275 {ECO:0000313|EMBL:TYI48204.1, ECO:0000313|Proteomes:UP000323597};
RN   [1] {ECO:0000313|EMBL:TYI48204.1, ECO:0000313|Proteomes:UP000323597}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=1408120.09 {ECO:0000313|EMBL:TYI48204.1};
RA   Chen Z.J., Sreedasyam A., Ando A., Song Q., De L., Hulse-Kemp A., Ding M.,
RA   Ye W., Kirkbride R., Jenkins J., Plott C., Lovell J., Lin Y.-M., Vaughn R.,
RA   Liu B., Li W., Simpson S., Scheffler B., Saski C., Grover C., Hu G.,
RA   Conover J., Carlson J., Shu S., Boston L., Williams M., Peterson D.,
RA   Mcgee K., Jones D., Wendel J., Stelly D., Grimwood J., Schmutz J.;
RT   "WGS assembly of Gossypium mustelinum.";
RL   Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM017661; TYI48204.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A5D2S931; -.
DR   Proteomes; UP000323597; Chromosome d13.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000323597};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..355
FT                   /note="Peptidase C1A papain C-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5023090616"
FT   DOMAIN          56..112
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          138..353
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   355 AA;  39090 MW;  DB73539335BF47CE CRC64;
     MARLTLVSSI ILMLCCVAAA STFEDSNPIR MVSDGLRGYE SSVLRVIGHT RHAISFARFA
     YKHGRKYETV EEMKLRFQIF KENLDLIRST NKKGLSYTLA VNRFADWSWD EFQKHRLGAA
     QNCSATTKGN HQLTDVVLPE SKDWREADIV SPVKEQGSCG SCWTFSTTGA LEAAYHQAFG
     KGISLSEQQL VDCAGAFDNF GCHGGLPSQA FEYIKYNGGL DTEEAYPYTA KDGECKFSPE
     NVGVQVIDSV NITLGAEDEL KHAVALVRPV SVAFQVITSF RFYKTGVFTS DKCGTTSQDV
     NHAVLAVGYG VENGVPYWLI KNSWGAQWGD NGYFKMEMGK NMCGVATCAS YPVVA
//
DBGET integrated database retrieval system