ID A0A061GXU4_THECC Unreviewed; 712 AA.
AC A0A061GXU4;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 22-FEB-2023, entry version 34.
DE SubName: Full=C2H2 zinc-finger protein SERRATE isoform 2 {ECO:0000313|EMBL:EOY34286.1};
GN ORFNames=TCM_042011 {ECO:0000313|EMBL:EOY34286.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY34286.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY34286.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the ARS2 family.
CC {ECO:0000256|ARBA:ARBA00005407}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY34286.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GXU4; -.
DR EnsemblPlants; EOY34286; EOY34286; TCM_042011.
DR Gramene; EOY34286; EOY34286; TCM_042011.
DR HOGENOM; CLU_021946_0_0_1; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR039727; SE/Ars2.
DR InterPro; IPR007042; SERRATE/Ars2_C.
DR InterPro; IPR021933; SERRATE/Ars2_N.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR13165; ARSENITE-RESISTANCE PROTEIN 2; 1.
DR PANTHER; PTHR13165:SF0; SERRATE RNA EFFECTOR MOLECULE HOMOLOG; 1.
DR Pfam; PF04959; ARS2; 1.
DR Pfam; PF12066; SERRATE_Ars2_N; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000313|EMBL:EOY34286.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000313|EMBL:EOY34286.1};
KW Zinc-finger {ECO:0000313|EMBL:EOY34286.1}.
FT DOMAIN 479..502
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS00028"
FT REGION 1..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 258..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 522..604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 654..688
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..63
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 77..102
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..276
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 279..296
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..579
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..594
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EOY34286.1"
SQ SEQUENCE 712 AA; 80437 MW; AEDF1681724B3284 CRC64;
NNNNNNNNRQ PPSSDDPNSS PPPLPPPRRR DRDSRERRDR EYYDRNRSPP PPPPRERDYK
RRSSVSPPPP PLNYRDRRHS PPPRRSPPYK RSRREDGGYE GRRGSPRGGF GPGDRRFGYD
YGGGYDREMM GRPGYPEERP HGRYFGRTSD WDSSRGYGDA ANSGSTQREG LMSYKQFIQE
LEDDILPAEA ERRYQEYKSE YISTQKRAFF DAHKDEEWLR DKYHPTNLVT VIERRNELAR
KVAKDFLLDL QSGTLELSPG VNALSSNKSG QISDPNSEDE ADIGGKRRRH GRGPAKETDL
SAAPKAHPVS SEPRRIQIDI EQAQGLVRKL DSEKGIEENI LSGSDNDKIN RDKSHGGLTG
PVIIVRGLAS VKGLEGVELL DTLITYLWRV HGLDYYGMIE TSEAKGLRHV RAEGKNSDVT
NNGSEWEKKL DSRWQERLRG QDPLVLMTAK DKIDAAAVEA LDPYVRKIRD EKYGWKYGCG
AKGCTKLFHA AEFVHKHLKL KHPELVMELT SKVREELYFQ NYMNDPDAPG GTPVMQQSVP
KDKPQRRKIL ENRLKDERGP RRERDNRANG SDRYDRSENP QSSDFTSNND GPDGGNRDDT
MFDAFGGQGM RVAAPFSSDI APPPVLMPVP GAGPLGPFVP APPELAMQVF RERGGPPPFE
GNSRGGRPGP NLSGPAPFLL PPGFRQDPRR LRSYQDLDAP EDEVTVIDYR SL
//