ID K7KVT9_SOYBN Unreviewed; 1613 AA.
AC K7KVT9;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE RecName: Full=BAH domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=100788512 {ECO:0000313|EnsemblPlants:KRH54431};
GN ORFNames=GLYMA_06G184600 {ECO:0000313|EMBL:KRH54431.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH54431.1};
RN [1] {ECO:0000313|EMBL:KRH54431.1, ECO:0000313|EnsemblPlants:KRH54431}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH54431};
RC TISSUE=Callus {ECO:0000313|EMBL:KRH54431.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRH54431}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH54431};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRH54431.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRH54431.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00649}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000839; KRH54431.1; -; Genomic_DNA.
DR RefSeq; XP_006581932.1; XM_006581869.2.
DR AlphaFoldDB; K7KVT9; -.
DR SMR; K7KVT9; -.
DR STRING; 3847.K7KVT9; -.
DR PaxDb; 3847-GLYMA06G19640-2; -.
DR EnsemblPlants; KRH54431; KRH54431; GLYMA_06G184600.
DR GeneID; 100788512; -.
DR Gramene; KRH54431; KRH54431; GLYMA_06G184600.
DR KEGG; gmx:100788512; -.
DR eggNOG; KOG1886; Eukaryota.
DR HOGENOM; CLU_001647_0_0_1; -.
DR InParanoid; K7KVT9; -.
DR OMA; MPDRGDQ; -.
DR OrthoDB; 1219039at2759; -.
DR Proteomes; UP000008827; Chromosome 6.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR CDD; cd00183; TFIIS_I; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR003617; TFIIS/CRSP70_N_sub.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46548; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR46548:SF1; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF08711; Med26; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM00509; TFS2N; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT DOMAIN 49..164
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 340..417
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 191..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 422..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 516..624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 645..684
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 837..873
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 912..936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1009..1054
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1077..1110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1283..1303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1531..1580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1592..1613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..17
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..42
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 196..226
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 227..260
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 528..552
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 560..617
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 645..669
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 845..859
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1017..1052
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1283..1297
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1613 AA; 171973 MW; D418BDF91C58C645 CRC64;
MHGCGGEKGK GTRHMWKAPV RGDSSLNADV SSSSSSSSST VKSFCKDGRK ISVGECALFK
PSEDRPPFIG IIHCLTFGKE KKLKLGVSWL YRSIEVKLNK GVPLEAAPNE IFYTFHKDET
DAESLLHPCK VAFLRKGAEL PSGFSSFVCR RVYDIANKCL WWLNDQDYIN DCQEEVDQLL
YRTCVRMHAT VQPGGRSPKP MSSPTSTSQL KSVSDSVQNN TSSFPSHIKG RKRERADQGS
EPVKRERSIK TEDGDSGHFR HDNILKTEIA KITEKGGLVD NEGVEKLVQL MVPDRNEKKI
DLASRSLLAA VIAATEKLDC LSQFVQLRGL PVFDEWLQEV HKGKIGDGVG SRDGDKSVEE
FLLVLLRALD KLPVNLQALQ TCNIGKSVNH LRTHKNTEIQ RKARGLVDTW KKRVEAEMNI
KDAKSGSGPT VHWPAKSRSS DVGHGGNRHS GASSDIAMKS SVTQLSASKT ASVKIVQGEN
TIRSASTSTF PGPAKSVLSP ASVTANLKDG QPCIAAVSGG SDLPMVNARD EKSSSSSQSH
NNSQSCSSDH AKTGGHSGKE DARSSTAMSV NKISGGSSRH RKSINGFPGS TPSGGQRETG
SSRNSSLHKN LTSEKISQPG LMDKALDGTS LEGVTCKLIV KIPSQGRSPA QSASAGSFDD
PTIMNSRASS PVLPEKHDQF DHCSKEKSDL YRANIGSDIN TESWQSNDFK DVLTGSDEAD
GSPAAVTDEE RCRIVNDCKK TFEVPKAASS SSGNENKAGN LQDASYSSIN ALIEGVKYSE
ADDVGMNLLA SVAAGEILKS ELLTPTGSPE RNTAAVEQSC TGNDMVKSSE ENLVRDECHS
NNGLDGEHKN QGSVTDDLGA NDESDSDFRA SGEKAARELN KSVNACSMDL QQVSEIILES
KGKLNEKSVS TALRGLSESS VQEARDGDRS KQLQEVGRGV NGGEIVDVKV SSVAEVEAEA
TEKLSHIAVK VDVQSDNCTA EGSSGGGRTA AVLVPSDLAR GKDENVLHSS AYSVDKVPED
LTERESEKAD DVDAENLPSQ SKKERNECES DTLTMPENRG LCSIVTGIAA EHVEENLETK
EVHDQPAREE LPKDSPSVRS QEMDKHLDSK GSKLTAMEAE EAEECTSTTA DASSVSAAAV
SDADAKVEFD LNEGLNADDE KCGEFNSSAP AGRLVSPVPF PASSMSCGIP APVTGAAAAK
GRFVPPEDLL RSKGEIGWKG SAATSAFRPA ELRKVMEMPF GALTSSIPDA PAGKQSRAPL
DIDLNVADER ILDDISSQPC ARHTDSVSLT TDGHDPVSSK MASPVRCSGG LGLDLNQVDE
ASDVGNCLSS NHKIDVPIMK VKSSLGGPPN REVNVHRDFD LNNGPSVDEV TTESSLFSQH
ARSSVPSQPP VSGLRVSTAE PVNFSWLPSS GNTYSAVTIS SIMPDRGDQP FSIVAPNGPQ
RLLTPAAGGN PFGPDVYKGP VLSSPFEYPV FPFNSSFPLP SASFSAGSTT YVYPTSGNRL
CFPVVNSQLM GPAGAVSSHY PRPYVVGLTE GSNSGSAETS RKWARQGLDL NAGPGGSDME
GRDDNSPLPS RQLSVASSQA LAEEQARIQL AGSVCKRKEP DGGWDGYNQS SWQ
//