ID A0A061FU76_THECC Unreviewed; 1583 AA.
AC A0A061FU76;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 22-FEB-2023, entry version 36.
DE SubName: Full=BAH domain,TFIIS helical bundle-like domain isoform 5 {ECO:0000313|EMBL:EOY20638.1};
GN ORFNames=TCM_012003 {ECO:0000313|EMBL:EOY20638.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY20638.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY20638.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00649}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY20638.1; -; Genomic_DNA.
DR EnsemblPlants; EOY20638; EOY20638; TCM_012003.
DR Gramene; EOY20638; EOY20638; TCM_012003.
DR HOGENOM; CLU_001647_0_0_1; -.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR CDD; cd00183; TFIIS_I; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR003617; TFIIS/CRSP70_N_sub.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46548; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR46548:SF1; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF08711; Med26; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM00509; TFS2N; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 4..119
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 294..371
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 144..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 372..406
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 418..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 682..709
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 757..834
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 871..917
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 929..987
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1000..1096
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1181..1221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1234..1254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1499..1583
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 154..182
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 183..221
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 378..406
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 511..567
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 595..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 622..640
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..655
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 692..709
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 759..783
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 786..807
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 813..834
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 881..905
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 941..955
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 956..977
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1000..1040
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1052..1068
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1197..1211
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1235..1254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1536..1553
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1583 AA; 167950 MW; C00E963F608550C3 CRC64;
MDGRKISVGD CALFKPPQDS PPFIGIIRCL IAGKENKLRL GVNWLYRPAE VKLGKGILLE
AAPNEIFYSF HKDEIPAASL LHPCKVAFLP KDVELPSGIC SFVCRRVYDI TNKCLWWLTD
QDYINERQEE VDQLLDKTRL EMHATVQPGG RSPKPMNGPT STSQIKPGSD SVQNSASSFP
SQGKGKKRER GDQGSEPVKR ERTSKMDDGD SGHGRPEINL KSEIAKITEK GGLEDSEGVE
KLVQLMVPER NEKKIDLVSR SMLAGVIAAT DKFDCLSRFV QLRGLPVFDE WLQEVHKGKI
GDGSGSKDDR SVDDFLLTLL RALDKLPVNL TALQMCNIGK SVNHLRSHKN LEIQKKARGL
VDTWKKRVEA EMDAKSGSNQ AVPWSARPRI SEVSHSGSKH SGSSEVAVKS SVTQFSASKT
GSVKLAQGET PTKSASASPG SMKAATSPVS ASTNLKDGQA RNATAVGTSD PQTTARDEKS
SSSSQSHNNS QSCSSDHAKT GGVSGKEEAR SSAAGSGTVT KISGSSSRHR KSINGFPGSS
GVQRETGSSK NSSLHRNPAS EKISQSGLTC EKAVDAPMAE GNSHKFIVKI PNRGRSPAQS
VSGGSLEDLS VMNSRASSPV LSEKHEQSDR NTKEKSETYR ANVTTDVNTE SWQSNDFKDV
LTGSDEGDGS PAAVPDEEHC RIGEDARKTT EVTKTASSSS GNELKSGKLQ EASFSSINAL
IDSCVKYSEA NACMPVGDDA GMNLLASVAA GEISKSDVAS PIDSPQRNTP VVEHSSTGND
TRLKPSAGDD VVRDRHQSVE GADDEHLKQG TVAGNSWAKN ADCKTGSSQE KSGGELNEHL
ISSSMGLPQT ADQCLENGKL KEIVAAALVN LPSGSTVEKT TDVGDSKEHL EKKAGGVDDD
SSLDTKQKGS TSLVNEDKVV DPGVKVEKEA VDGSSSVPSM EVDVEDKKNV TEGLDRSLQT
HENSAAVTGN STKGADKEAS PPGSAKDIVL EKVGEVKLEK DVETDARSHV AHTEKQKPEW
ETVTARKGEQ VEENLECSEV HEPRGGPSPC RASSTVMETE QPTRSRGSKL TVAEADEAEE
RTSTTSDAPA TGGADADAKV EFDLNEGFNA DEAKFGEPNN LTAPGCSPPV QLISPLPFPV
SSVSSSLPAS ITVAAAAKGP FVPPDDLLRT KGVLGWKGSA ATSAFRPAEP RKSLDMPLGT
SNASMPDATT CKQSRPPLDI DLNVPDERVL EDLASRSSAQ GTDSAPDLTN NRDLTCGLMG
SAPIRSSGGL DLDLNRVDEP IDLGNHSTGS SRRLDVPMQP LKSSSGGILN GEASVRRDFD
LNNGPAVDEV SAEPSLFSQH NRSSNVPSQP PVSSLRINNT EMANFSSWFP TGNTYSAVTI
PSILPDRGEQ PFPIVATGGP PRVLGPPTAA TPFNPDVYRG PVLSSSPAVP FPSAPFQYPV
FPFGTTFPLP STSFSGGSTT YVDSSPSGRL CFPPVSQLLG PAGAVPSHYA RPYVVSLPDG
SNNSGAESGR KWGRQGLDLN AGPGGPDIEG RDETSPLASR QLSVASSQAL AEEQARMYQV
PGGILKRKEP EGGWDGYKQS SWQ
//