ID A0A1U8LPY5_GOSHI Unreviewed; 352 AA.
AC A0A1U8LPY5;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Xylem cysteine proteinase 1-like {ECO:0000313|RefSeq:XP_016716651.1};
GN Name=LOC107929669 {ECO:0000313|RefSeq:XP_016716651.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016716651.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016716651.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016716651.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016716651.1; XM_016861162.1.
DR AlphaFoldDB; A0A1U8LPY5; -.
DR STRING; 3635.A0A1U8LPY5; -.
DR PaxDb; 3635-A0A1U8LPY5; -.
DR GeneID; 107929669; -.
DR KEGG; ghi:107929669; -.
DR OrthoDB; 5472443at2759; -.
DR Proteomes; UP000189702; Chromosome 4.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF357; OS01G0971400 PROTEIN; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000189702};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..352
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018685089"
FT DOMAIN 48..104
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 134..349
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 352 AA; 39495 MW; CBBC82D0D6C992F7 CRC64;
MVVSFLSKLS ILTFTASVLV VSALAHDFSI VGYSPEDLSS RDKLIELFES WVSKHAKFYE
SFEEKLLRFE VFKDNLKHID KRNKEISSYW LGLNEFADLT HEEFKNKYLG LKPEVFKKNR
SPPEEFTFRD DVDLPKSVDW RQKGAVTPVK NQRSCGSCWA FSAVAAVEGI NKIVTGNLTS
LSEQELIDCD TSFNNGCNGG LMDYAFEFIV ANGGLHKEED YPYLMEQGTC EEKKEEMDVV
TISGYKDVPE NDEKSLLKAL AHQPLSVAIE ASGRDFQFYS GGVFNGPCGT DLDHGVAAVG
YGTWKGSDYI IVKNSWGPKW GEKGYIRMKR NTGKPEGLCG INKIASYPTK NK
//