ID A0A061E0T5_THECC Unreviewed; 450 AA.
AC A0A061E0T5;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=HVA22 A, putative isoform 2 {ECO:0000313|EMBL:EOX98634.1};
GN ORFNames=TCM_007349 {ECO:0000313|EMBL:EOX98634.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX98634.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX98634.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001880; EOX98634.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061E0T5; -.
DR EnsemblPlants; EOX98634; EOX98634; TCM_007349.
DR Gramene; EOX98634; EOX98634; TCM_007349.
DR HOGENOM; CLU_034800_0_0_1; -.
DR Proteomes; UP000026915; Chromosome 2.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 2.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR004345; TB2_DP1_HVA22.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR12300:SF43; HVA22-LIKE PROTEIN; 1.
DR PANTHER; PTHR12300; HVA22-LIKE PROTEINS; 1.
DR Pfam; PF03134; TB2_DP1_HVA22; 1.
DR Pfam; PF12874; zf-met; 2.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SMART; SM00451; ZnF_U1; 2.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 240..274
FT /note="U1-type"
FT /evidence="ECO:0000259|SMART:SM00451"
FT DOMAIN 243..267
FT /note="C2H2-type"
FT /evidence="ECO:0000259|SMART:SM00355"
FT DOMAIN 407..441
FT /note="U1-type"
FT /evidence="ECO:0000259|SMART:SM00451"
FT DOMAIN 410..434
FT /note="C2H2-type"
FT /evidence="ECO:0000259|SMART:SM00355"
FT REGION 276..404
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..330
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..362
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 378..404
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 450 AA; 50336 MW; FB55BA91025BFD1B CRC64;
MGCLAFVKFA LIRLDALAWP LFALGYPLRA SIQAIEADSS SDSKKLVTYW VIFSLISLFE
HAFMGLLQWL PFWPYMKLTL VGWLMIPRFD GALYVYDNFV HPCLYVDMQT IINWFRKQQE
FFLNDNFLAE ADKYVKANGP EALEKLIPTE SRDREPSMLQ KEIKPVRVTQ EKETAAVNLS
KDREPSTLQK EIKPVRVAEI PETEPSAAQT QVKMLAVARP EIKEATGWDL PELPSDKQVQ
KEWTCAMCQV TTSSEKNLNM HLQGRRHRAA CEGLMKAKNQ PSKGKVAPAS AVKDSKKEPE
KRASSSSTQA SPKMQQPSNG QVSAASVGKN SDLLKNEPEK CATSNGTPTS SKAVNPKTGI
SNGSKPDLPK EEPKNSLPKN KAGNQQKSRE KVQGQQQSGK KHAKVNNPQF RCTICNISCG
RSEDLNCHLW GRKHLARIQE LNRLGQSELA
//