Database: UniProt
Entry: A0A061E0T5_THECC
ID   A0A061E0T5_THECC        Unreviewed;       450 AA.
AC   A0A061E0T5;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 31.
DE   SubName: Full=HVA22 A, putative isoform 2 {ECO:0000313|EMBL:EOX98634.1};
GN   ORFNames=TCM_007349 {ECO:0000313|EMBL:EOX98634.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX98634.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOX98634.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001880; EOX98634.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061E0T5; -.
DR   EnsemblPlants; EOX98634; EOX98634; TCM_007349.
DR   Gramene; EOX98634; EOX98634; TCM_007349.
DR   HOGENOM; CLU_034800_0_0_1; -.
DR   Proteomes; UP000026915; Chromosome 2.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   Gene3D; 3.30.160.60; Classic Zinc Finger; 2.
DR   InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR   InterPro; IPR004345; TB2_DP1_HVA22.
DR   InterPro; IPR036236; Znf_C2H2_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   PANTHER; PTHR12300:SF43; HVA22-LIKE PROTEIN; 1.
DR   PANTHER; PTHR12300; HVA22-LIKE PROTEINS; 1.
DR   Pfam; PF03134; TB2_DP1_HVA22; 1.
DR   Pfam; PF12874; zf-met; 2.
DR   SMART; SM00355; ZnF_C2H2; 2.
DR   SMART; SM00451; ZnF_U1; 2.
DR   SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          240..274
FT                   /note="U1-type"
FT                   /evidence="ECO:0000259|SMART:SM00451"
FT   DOMAIN          243..267
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|SMART:SM00355"
FT   DOMAIN          407..441
FT                   /note="U1-type"
FT                   /evidence="ECO:0000259|SMART:SM00451"
FT   DOMAIN          410..434
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|SMART:SM00355"
FT   REGION          276..404
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        304..330
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        337..362
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        378..404
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   450 AA;  50336 MW;  FB55BA91025BFD1B CRC64;
     MGCLAFVKFA LIRLDALAWP LFALGYPLRA SIQAIEADSS SDSKKLVTYW VIFSLISLFE
     HAFMGLLQWL PFWPYMKLTL VGWLMIPRFD GALYVYDNFV HPCLYVDMQT IINWFRKQQE
     FFLNDNFLAE ADKYVKANGP EALEKLIPTE SRDREPSMLQ KEIKPVRVTQ EKETAAVNLS
     KDREPSTLQK EIKPVRVAEI PETEPSAAQT QVKMLAVARP EIKEATGWDL PELPSDKQVQ
     KEWTCAMCQV TTSSEKNLNM HLQGRRHRAA CEGLMKAKNQ PSKGKVAPAS AVKDSKKEPE
     KRASSSSTQA SPKMQQPSNG QVSAASVGKN SDLLKNEPEK CATSNGTPTS SKAVNPKTGI
     SNGSKPDLPK EEPKNSLPKN KAGNQQKSRE KVQGQQQSGK KHAKVNNPQF RCTICNISCG
     RSEDLNCHLW GRKHLARIQE LNRLGQSELA
//