Database: UniProt
Entry: A0A061EVR0_THECC
ID   A0A061EVR0_THECC        Unreviewed;       404 AA.
AC   A0A061EVR0;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 36.
DE   SubName: Full=Gag protease polyprotein {ECO:0000313|EMBL:EOY08512.1};
GN   ORFNames=TCM_023016 {ECO:0000313|EMBL:EOY08512.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY08512.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY08512.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001883; EOY08512.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061EVR0; -.
DR   EnsemblPlants; EOY08512; EOY08512; TCM_023016.
DR   Gramene; EOY08512; EOY08512; TCM_023016.
DR   eggNOG; KOG0017; Eukaryota.
DR   HOGENOM; CLU_026677_1_0_1; -.
DR   InParanoid; A0A061EVR0; -.
DR   OMA; RTESTHK; -.
DR   Proteomes; UP000026915; Chromosome 5.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0008233; F:peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR   InterPro; IPR005162; Retrotrans_gag_dom.
DR   InterPro; IPR001878; Znf_CCHC.
DR   PANTHER; PTHR34482; DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE; 1.
DR   PANTHER; PTHR34482:SF36; DUF4283 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF03732; Retrotrans_gag; 1.
DR   Pfam; PF00098; zf-CCHC; 1.
DR   SMART; SM00343; ZnF_C2HC; 1.
DR   PROSITE; PS50158; ZF_CCHC; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000313|EMBL:EOY08512.1};
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Protease {ECO:0000313|EMBL:EOY08512.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT   DOMAIN          371..387
FT                   /note="CCHC-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50158"
FT   REGION          1..65
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          289..346
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        47..63
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        293..321
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   404 AA;  44797 MW;  AB91A26C5001B470 CRC64;
     MPPRRGRPPL TRSVGRGRGR SQRHQPDTVE EESAASTIRA TPAAEQADSP PHPPSPQPPT
     GIPSMPTEAA QALAAFFAAI VGQAQTGQVP PVVPPTTPLV PPPIQDVSIS KKLKEARQLG
     CVSFTGELDA TVAKDWINQV SETLSDMGLD DDMKLMVATR LLEKRARTWW NSVKSRSATP
     QTWSDFLREF DGQYFTYFHQ KEKKREFLSL KQGNLTVEEY ETRFNELMLY VPDLVKSEQD
     QASYFEEGLR NEIRERMTVI GREPHKEVVQ MALRAEKLAT ENRRIRTKFA KRRNLGMSSS
     QPVKRGKDSA TSGSTTSISV TSPRPPFPPS QQRPSRFSRS AMTGSGKSLG GFDRCRNCGN
     YHSGLCRGPT RCFQCGQTGH IRSNCPQLGR ATVAASSPPT RTDI
//