ID A0A0D2TCZ4_GOSRA Unreviewed; 272 AA.
AC A0A0D2TCZ4;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN ORFNames=B456_009G036000 {ECO:0000313|EMBL:KJB54489.1};
OS Gossypium raimondii (New World cotton).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=29730 {ECO:0000313|EMBL:KJB54489.1, ECO:0000313|Proteomes:UP000032304};
RN [1] {ECO:0000313|EMBL:KJB54489.1, ECO:0000313|Proteomes:UP000032304}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23257886; DOI=10.1038/nature11798;
RA Paterson A.H., Wendel J.F., Gundlach H., Guo H., Jenkins J., Jin D.,
RA Llewellyn D., Showmaker K.C., Shu S., Udall J., Yoo M.J., Byers R.,
RA Chen W., Doron-Faigenboim A., Duke M.V., Gong L., Grimwood J., Grover C.,
RA Grupp K., Hu G., Lee T.H., Li J., Lin L., Liu T., Marler B.S., Page J.T.,
RA Roberts A.W., Romanel E., Sanders W.S., Szadkowski E., Tan X., Tang H.,
RA Xu C., Wang J., Wang Z., Zhang D., Zhang L., Ashrafi H., Bedon F.,
RA Bowers J.E., Brubaker C.L., Chee P.W., Das S., Gingle A.R., Haigler C.H.,
RA Harker D., Hoffmann L.V., Hovav R., Jones D.C., Lemke C., Mansoor S.,
RA ur Rahman M., Rainville L.N., Rambani A., Reddy U.K., Rong J.K.,
RA Saranga Y., Scheffler B.E., Scheffler J.A., Stelly D.M., Triplett B.A.,
RA Van Deynze A., Vaslin M.F., Waghmare V.N., Walford S.A., Wright R.J.,
RA Zaki E.A., Zhang T., Dennis E.S., Mayer K.F., Peterson D.G., Rokhsar D.S.,
RA Wang X., Schmutz J.;
RT "Repeated polyploidization of Gossypium genomes and the evolution of
RT spinnable cotton fibres.";
RL Nature 492:423-427(2012).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001748; KJB54489.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0D2TCZ4; -.
DR MEROPS; C01.163; -.
DR EnsemblPlants; KJB54489; KJB54489; B456_009G036000.
DR Gramene; KJB54489; KJB54489; B456_009G036000.
DR OMA; NCNNTSK; -.
DR Proteomes; UP000032304; Chromosome 9.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000032304};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT DOMAIN 55..270
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 272 AA; 29418 MW; 1F4FFEA0838BBC7B CRC64;
MDLIRSANKK GLPYSLSVNQ FADLTWDEFR KHRIGAAQNC SATRKGNHQL TDVVLPESKD
WRESGIVSPV KNQGSCGSCW AFSTTGALEA AYHQAFGKGI SLSEQQLVDC AGAFNNFGCN
GGLPSQAFEY IKYNGGLDTE EAYPYTAKDG QCTFSSENVG VQVIDAVNIT LGSEDELKHA
VAMVRPVSVA FEVVPSFNFY KSGVYTSDKC GNTSSDVNHA VLAVGYGIEN GVPYWLIKNS
WGAEWGDKGY FKMEMGKNMC GVATCASYPV VA
//