ID A0A061GBD0_THECC Unreviewed; 441 AA.
AC A0A061GBD0;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 {ECO:0000313|EMBL:EOY24364.1};
GN ORFNames=TCM_015986 {ECO:0000313|EMBL:EOY24364.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY24364.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY24364.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Livingstone D. III,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY24364.1; -; Genomic_DNA.
DR EMBL; CM001881; EOY24365.1; -; Genomic_DNA.
DR EMBL; CM001881; EOY24366.1; -; Genomic_DNA.
DR EMBL; CM001881; EOY24368.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GBD0; -.
DR EnsemblPlants; EOY24364; EOY24364; TCM_015986.
DR EnsemblPlants; EOY24365; EOY24365; TCM_015986.
DR EnsemblPlants; EOY24366; EOY24366; TCM_015986.
DR EnsemblPlants; EOY24368; EOY24368; TCM_015986.
DR Gramene; EOY24364; EOY24364; TCM_015986.
DR Gramene; EOY24365; EOY24365; TCM_015986.
DR Gramene; EOY24366; EOY24366; TCM_015986.
DR Gramene; EOY24368; EOY24368; TCM_015986.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 3.30.60.220; -; 1.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR InterPro; IPR007529; Znf_HIT.
DR PANTHER; PTHR48365; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR48365:SF1; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF04438; zf-HIT; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:EOY24364.1};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 247..261
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 55..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 119..155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 119..134
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 135..152
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 441 AA; 50147 MW; 9D0EBC65606E90BF CRC64;
MGTRSNFYKN PSLSYKKDLS LSSALQNLKA YNIATGDAPP SVELEAYPPV DDKIACKKRS
RERKPFSMPD RRREIEENDG PMSHQDYILK RRREVSSSHG YEELSVDILQ ASSSSVNLVD
YGSDGNASSE CKESQDPPDS GHVNEVDQVK SRSEQRFSLP GEPICVVCGR YGEYICDKTD
DDICSMECKS DLLQSLQITE KSLSNQNSLL SSSEPTSISL LPELAEDTWD YNNHRWSKKS
SSLCTYKCWK CQRPGHLAED CLVTTTEQVT MRQSKLTSIS RDLLELYRRC HQIGKNLSSA
SCNACRSSIA LATCLDCSTV LCDNAGHLNE HIQTHPSHQQ YYSHKLKRLV KCCKSTCKVT
NFRDLLVCHY CFDKAFDKFY DMYTATWKGA GLSIIWGSIC CDDHFTWHRM NCLNADVEDR
AYIMSRDTER ETHVQLSDFI F
//