ID A0A061DMJ7_THECC Unreviewed; 546 AA.
AC A0A061DMJ7;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 28-JUN-2023, entry version 34.
DE SubName: Full=DNA/RNA polymerases superfamily protein {ECO:0000313|EMBL:EOX94044.1};
GN ORFNames=TCM_003127 {ECO:0000313|EMBL:EOX94044.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX94044.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX94044.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX94044.1; -; Genomic_DNA.
DR EnsemblPlants; EOX94044; EOX94044; TCM_003127.
DR Gramene; EOX94044; EOX94044; TCM_003127.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_037075_0_0_1; -.
DR InParanoid; A0A061DMJ7; -.
DR Proteomes; UP000026915; Chromosome 1.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR35046:SF10; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35046; ZINC KNUCKLE (CCHC-TYPE) FAMILY PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 260..276
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..252
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..252
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 546 AA; 62817 MW; 3033E6F7C4A12AC8 CRC64;
MENPEGDHNP LEIHDLEDDD EFENENPFHE DGPXXXSLEN YFEWKPMAEN RKVLFVKLKL
KGTALQWWKR VEEQRARQGK LKISTWEHMK SKLRKQFLPA DYTMELYEKF HCLKQNNMTV
EEYTSEFNNL SIRVGLAESN EQITSRYLAG LNHSIRDEMG VVRLYNIEDA RQYALSAEKR
VLRYGARKPL YGTHWQNNSE ARRGYPTSQQ NYQGAATINK TNKGATNVEK NDKGKSIMPY
GGQNSSGSST NKGGSNSHIR CFTCGEKGHI SFACPQRRVN LAELGEELEP VYDEYEEEVE
EIDVYPAQGE SLVVRRVMTT TVNEEAEDWK RRSIFRTRVV CEGKVCDLVI DGGSMENIIS
KEAVNKLKLP TNKHPYPYKI GWLKKGHEVP VTTQCLVKFT MGNNLDDEAL CDVVPMDVGH
ILVGRPWLYD HDMVHKTKPN TYSFYKNNKR YTLYPLREET KKSANNKISK ITGYLSAENF
EAEGSEMGIT YALVTKHLKS DQMSKSPQYP TEIQQLLKEF GELFNEDLPK SLPPLRSIQH
AIDLVP
//