ID A0A061EVR0_THECC Unreviewed; 404 AA.
AC A0A061EVR0;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Gag protease polyprotein {ECO:0000313|EMBL:EOY08512.1};
GN ORFNames=TCM_023016 {ECO:0000313|EMBL:EOY08512.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY08512.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY08512.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY08512.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061EVR0; -.
DR EnsemblPlants; EOY08512; EOY08512; TCM_023016.
DR Gramene; EOY08512; EOY08512; TCM_023016.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_026677_1_0_1; -.
DR InParanoid; A0A061EVR0; -.
DR OMA; RTESTHK; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008233; F:peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR34482; DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE; 1.
DR PANTHER; PTHR34482:SF36; DUF4283 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:EOY08512.1};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Protease {ECO:0000313|EMBL:EOY08512.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 371..387
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 1..65
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 289..346
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..63
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 293..321
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 404 AA; 44797 MW; AB91A26C5001B470 CRC64;
MPPRRGRPPL TRSVGRGRGR SQRHQPDTVE EESAASTIRA TPAAEQADSP PHPPSPQPPT
GIPSMPTEAA QALAAFFAAI VGQAQTGQVP PVVPPTTPLV PPPIQDVSIS KKLKEARQLG
CVSFTGELDA TVAKDWINQV SETLSDMGLD DDMKLMVATR LLEKRARTWW NSVKSRSATP
QTWSDFLREF DGQYFTYFHQ KEKKREFLSL KQGNLTVEEY ETRFNELMLY VPDLVKSEQD
QASYFEEGLR NEIRERMTVI GREPHKEVVQ MALRAEKLAT ENRRIRTKFA KRRNLGMSSS
QPVKRGKDSA TSGSTTSISV TSPRPPFPPS QQRPSRFSRS AMTGSGKSLG GFDRCRNCGN
YHSGLCRGPT RCFQCGQTGH IRSNCPQLGR ATVAASSPPT RTDI
//