ID D7LRI5_ARALL Unreviewed; 1613 AA.
AC D7LRI5;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 13-SEP-2023, entry version 57.
DE RecName: Full=BAH domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ARALYDRAFT_485166 {ECO:0000313|EMBL:EFH53853.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00649}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348717; EFH53853.1; -; Genomic_DNA.
DR RefSeq; XP_002877594.1; XM_002877548.1.
DR STRING; 81972.D7LRI5; -.
DR EnsemblPlants; fgenesh2_kg.5__1158__AT3G48060.1; fgenesh2_kg.5__1158__AT3G48060.1; fgenesh2_kg.5__1158__AT3G48060.1.
DR Gramene; fgenesh2_kg.5__1158__AT3G48060.1; fgenesh2_kg.5__1158__AT3G48060.1; fgenesh2_kg.5__1158__AT3G48060.1.
DR eggNOG; KOG1886; Eukaryota.
DR HOGENOM; CLU_001647_0_0_1; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR CDD; cd00183; TFIIS_I; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR003617; TFIIS/CRSP70_N_sub.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46548; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR46548:SF1; BAH AND TFIIS DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF08711; Med26; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM00509; TFS2N; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000008694}.
FT DOMAIN 49..164
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 326..412
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 188..261
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 412..651
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 718..746
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 786..874
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1076..1123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1340..1360
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1591..1613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..222
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 223..248
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 510..533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 540..615
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..746
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..817
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 827..855
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 856..874
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1082..1109
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1613 AA; 171984 MW; C9E2247776BE6609 CRC64;
MHGRVCERRH KSRRRHMLIS SSRVIATVEG GGSSCLSLSS STSFSKDGRK ISVGDCALFK
PPQDCPPFIG IIRLIIAEKE GKLKLGVNWL YRPTELKLGK GTLLEAEPNE LFYSFHEDNI
PAASLLHPCK VAFLPRGVEL PSGICSFVCR RVYDVTNERL WWLTDQDYID DRQLEVDKLL
CKTRSEMHTT LQQGGRSPKS MNSPTTSQAK DGIQNNNSFL SQGKGRKRER MDHGSESVKR
ERSSRVDDSG SGPLRTESGL KSEISKFTEK GGLVDSEGVE KLLQLMLPER NEKKIDLIGR
AILAGVVAAT DKFDCLSRFV QLRGLPVFDE WLQEVHKGKV GDGGSPKDSD RLVDDFLLVL
LRALDKLPVN LNALQTCNIG KSVNHLRSHK NSEIGKKARS LVDTWKKRVE AEMDAKSGSN
QGVSWPGRLS HGGRHSGGSA EANKTSSSHL HASKSVSVKQ QVENNLKCVA TSPGSTRSAP
SPGSGGTISK DGQQRNAGAG GVSEVLAAVK DEKSSSSSQS HNNSQSCSSE HAKTGNLCGK
EDARSSTAGS TLKKCSGGSS RHRKSNNVFQ GSSSSASPRE AGFSRSFSSQ RNVPSEKISQ
SSLTSEKTLE VPLTESSGNK LIVKLPNRGR SPAQSVSGGS LEDPAPVNSR VSSPVHAVKQ
ELCDNNVREK NHSYRANVSS VLNAESWQSN ELKDILTGSQ EAAGSPLVVA GDERGGALKD
SDKAAGNVKG TSSLGNDFKS GERHGGTLSS MNALIESCVR YSETNASLAG SDDVGINLLA
SVAADEMSKS PVASPSVSQP PNSLMNENST VGNNTKLIAS DGLPHEQHQA ARTTVSNEQG
EQHVSSSGTQ LESEIKNESK TGDRDKSSNS ETEDLQRLVD KRLENNDNSD GAVASPVLPT
KAIKEKILDD SDSGEVKDIK ADVKSEADCT SDSTKRVASS MLTECRDVSQ KVDSVAVEHT
PLDRVDDKKE EKPPTALSSE LVKKVEEDVP VSSGISRGMD AVSIDRPITE MVNNMAVNHI
DQKDIKKIKQ DCDAFVGAIK DASAGLDSSV TKGKVEPVEG NLENIKVKER CLGLKATPGV
SPKDAEDLKR PNGPKTSDAD GDEAEECTSA ARDASSVSAA ASAGSEMDAR VEFDLNEGFD
GDDAKHGDSN NFSGSVFLTP TPLQPVNTLP FPVAPVSSGI PASITVAAAA KGPFVPPEDL
LRNKGAVGWR GSAATSAFRP AEPRKAQDVL LSINNTSTSD ASTSAGKQTR TFLDFDLNVP
DERVLEDLAS QRTGIATNCT SGITNSFDQV RSGVMGSALD HSSGGLDLDL NKVDDSTDMN
NYNMSSSHRL DSSFQHVKLP STGGRRDFDL NDGPAGDDAA VEPSMVLNQH SRSGLPSQPS
LSGIRVNGEN MASFSTWFPA ANAYSAVSIP PIMPERGDQP FPMIANRGPQ RMLGPTTGVS
SFAPEGYRGP VLSSSPAMPF QSTTFQYPVF PFGNSFPITS ANFSGASTTH MDSSSSGRAC
FPGVNSQILG PGVPVPSNYP RPYIVGLPNG GSNGGVLDNS AKWFRSGLDL NSGPGGHETE
GRDESTLVAR QLSSSASLPL KEDQARMYQM SGGVLKRKEP EGGWDGYRQS SWQ
//