ID A0A061GB66_THECC Unreviewed; 964 AA.
AC A0A061GB66;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE SubName: Full=Golgin candidate 5 isoform 1 {ECO:0000313|EMBL:EOY26816.1};
GN ORFNames=TCM_028772 {ECO:0000313|EMBL:EOY26816.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY26816.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY26816.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001884; EOY26816.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GB66; -.
DR STRING; 3641.A0A061GB66; -.
DR EnsemblPlants; EOY26816; EOY26816; TCM_028772.
DR Gramene; EOY26816; EOY26816; TCM_028772.
DR eggNOG; KOG4673; Eukaryota.
DR InParanoid; A0A061GB66; -.
DR OMA; IEAVCRM; -.
DR Proteomes; UP000026915; Chromosome 6.
DR GO; GO:0005794; C:Golgi apparatus; IEA:UniProtKB-KW.
DR InterPro; IPR022092; TMF_DNA-bd.
DR InterPro; IPR022091; TMF_TATA-bd.
DR PANTHER; PTHR47347; GOLGIN CANDIDATE 5; 1.
DR PANTHER; PTHR47347:SF2; GOLGIN CANDIDATE 5; 1.
DR Pfam; PF12329; TMF_DNA_bd; 1.
DR Pfam; PF12325; TMF_TATA_bd; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Golgi apparatus {ECO:0000256|ARBA:ARBA00023034};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 850..951
FT /note="TATA element modulatory factor 1 TATA binding"
FT /evidence="ECO:0000259|Pfam:PF12325"
FT REGION 72..266
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 295..337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 343..428
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 453..501
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 543..616
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 646..775
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 924..951
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 90..135
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 151..186
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..243
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..323
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 964 AA; 108282 MW; A41B6237B96F3AC8 CRC64;
MAWFSGKVSL GGFPDLAGAV NKLQESVKNI EKNFDTALGF EEKSESSSNE GSGLWSSDRK
ALFDPVMALM GHKSEETAVE SSGKLESSQA PPEVEEKEEA ETDRSLHSPD QTTAEEDKSA
VQVEKDDEHS EVVESSDNVF PDPGKTEPES EPVSVQPSES TFQNVESSDS PDNEQQKESS
GLVPSESADS KEAKLEAAEI DQVEDAMAVP AESSNVVDMH ESTDEQKPQT EDALEKGSPV
KSEESRDSQA SAGGGPDELE FLRSHSITVE ETKSAHEFLL PSVVPSDEAQ GMVSESVFFE
NDANTKRVEV DQRTNDSETD AKEEQCLSSA TTMSDSADSM HELEKVKMEM KMMESALQGA
ARQAQAKADE IAKLMNENEQ LKVVIEDLKR KSNEAEIESL REEYHQRVAT LERKVYALTK
ERDTLRREQN KKSDAAALLK EKDEIINQVM AEGEELSKKQ AAQEAQIRKL RAQIRELEEE
KKGLTTKLQV EENKVESIKK DKTATEKLLQ ETIEKHQAEL AGQKEFYTNA LNAAKEAEAL
AEARANSEAR TELESRLREA EEREAMLVQT LEELRQTLSR KEQQAVFRED MLRRDVEDLQ
KRYQASERRC EELITQVPES TRPLLRQIEA MQETTSRRAE AWAAVERSLN SRLQEAEAKA
AAAEERERSV NERLSQTLSR INVLEAQISC LRAEQTQLSK SIEKERQRAA ENRQEYLAAK
EEADTQEGRA NQLEEEIREL RRKHKQELHD ALVHRELLQQ EVEREKAARL DLERTARVHS
VAVSEQASIS RHNSALENGS LSRKLSTASS MGSMEESYFL QASLDSSDGF AEKRNIGEAT
LSPLYMKSMT PSAFESALRQ KEGELASYMS RLTSMESIRD SLAEELVKMT EQCEKLKAEA
ATLPGIRAEL EALRRRHSAA LELMGERDEE LEELRADIVD LKEMYREQVN LLVNKIQIMS
SSNG
//