ID A0A061F029_THECC Unreviewed; 571 AA.
AC A0A061F029;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Zinc ion binding,DNA binding, putative isoform 2 {ECO:0000313|EMBL:EOY10042.1};
GN ORFNames=TCM_025432 {ECO:0000313|EMBL:EOY10042.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY10042.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY10042.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY10042.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F029; -.
DR EnsemblPlants; EOY10042; EOY10042; TCM_025432.
DR Gramene; EOY10042; EOY10042; TCM_025432.
DR HOGENOM; CLU_019956_1_0_1; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15568; PHD5_NSD; 1.
DR CDD; cd10567; SWIB-MDM2_like; 1.
DR Gene3D; 3.90.70.200; Plus-3 domain; 1.
DR Gene3D; 1.10.245.10; SWIB/MDM2 domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR045894; At5g08430-like.
DR InterPro; IPR004343; Plus-3_dom.
DR InterPro; IPR036128; Plus3-like_sf.
DR InterPro; IPR036885; SWIB_MDM2_dom_sf.
DR InterPro; IPR003121; SWIB_MDM2_domain.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46851; OS01G0884500 PROTEIN; 1.
DR PANTHER; PTHR46851:SF5; ZINC ION BINDING _ DNA BINDING PROTEIN; 1.
DR Pfam; PF03126; Plus-3; 1.
DR Pfam; PF02201; SWIB; 1.
DR SMART; SM00249; PHD; 1.
DR SMART; SM00719; Plus3; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF159042; Plus3-like; 1.
DR SUPFAM; SSF47592; SWIB/MDM2 domain; 1.
DR PROSITE; PS51360; PLUS3; 1.
DR PROSITE; PS51925; SWIB_MDM2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 214..295
FT /note="DM2"
FT /evidence="ECO:0000259|PROSITE:PS51925"
FT DOMAIN 352..480
FT /note="Plus3"
FT /evidence="ECO:0000259|PROSITE:PS51360"
FT REGION 537..571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..561
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 571 AA; 64686 MW; B2206A2E05EC7091 CRC64;
MSKNKGKKKV KAEKEDEAED WCFVCKDGGK LLLCDFKGCG KAYHPVCVGK KNSVLKSEGR
WTCCRHSCSV CGGPPRFYCL CCPDAVCRLC ARSAEFVSVK LKKGLCKTCI EVTLLAENNA
EFNSQGVKMD FEDPDTEEFM FKGYLEIIME QEDLTFDDLH RAALKKENYD SSSDSDKIED
EDAVVTISDG DSDTDFAVID NSLGKRKKSE VRDYVGWGSK PLINFLKSVG IDATEKLSKF
QVDIIISKYI LEKNLFREEG KKKTVLCDEK LYSLFQKKQV HKNKIYDLLE AHFVDTLGQS
NSDENENDSG SCSGNEDEHI IAVCKKQRTL STDKVPLEEK VDYAVQKNCY ASIVAENIKL
VYLRRSLVEE LLMQSDNFED KVVGSFVRVK RMHGNCSIRT SFQLLQVTGI KKTSNAKVDR
GILLEVSCMP VDICIDMLND GDISEEECED LRQRMKDGLL RKPTVVELEQ KAKSLHEDIT
KNWIRRQLVS LQNKIDFAHE KGRRYMLERF LDEREMLKKS SEQQRLLLKL PRVIAEEIEP
DPTARDSSEN NCSNGSGKLQ CPAVNQADGG A
//