ID D7LVX0_ARALL Unreviewed; 439 AA.
AC D7LVX0;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Basic helix-loop-helix family protein {ECO:0000313|EMBL:EFH54433.1};
GN ORFNames=ARALYDRAFT_486233 {ECO:0000313|EMBL:EFH54433.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348717; EFH54433.1; -; Genomic_DNA.
DR RefSeq; XP_002878174.1; XM_002878128.1.
DR AlphaFoldDB; D7LVX0; -.
DR STRING; 81972.D7LVX0; -.
DR EnsemblPlants; fgenesh2_kg.5__2225__AT3G57800.1; fgenesh2_kg.5__2225__AT3G57800.1; fgenesh2_kg.5__2225__AT3G57800.1.
DR Gramene; fgenesh2_kg.5__2225__AT3G57800.1; fgenesh2_kg.5__2225__AT3G57800.1; fgenesh2_kg.5__2225__AT3G57800.1.
DR eggNOG; ENOG502QS36; Eukaryota.
DR HOGENOM; CLU_053042_0_0_1; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:EnsemblPlants.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR024097; bHLH_ZIP_TF.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR PANTHER; PTHR12565; STEROL REGULATORY ELEMENT-BINDING PROTEIN; 1.
DR PANTHER; PTHR12565:SF409; TRANSCRIPTION FACTOR BHLH60; 1.
DR Pfam; PF00010; HLH; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008694};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 210..320
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 115..199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 376..411
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 115..141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 173..199
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 439 AA; 47885 MW; 133A7784C62C667C CRC64;
MDLTGGFGAR SGGIGPCREP IGLESLHLGD EFRQLVTTLP PENAGGSFTA LLELPPTQAV
ELLHFTDSSS SQVAAVTGIG GENAPPLHSF GGTLAFPSNS VLMERAARFS VIATEQQNGN
VSGETPTSSV PSNSSANLDR VKTEPAETDS SQRLISDSAI ENQIPCPSQN NRNGKRKDFE
KKVKSSTKKN KSSEENEKLP YVHVRARRGQ ATDSHSLAER ARREKINARM KLLQELVPGC
DKGTDFGGKI KIKVCFGVHL LMISGKKAVN FLWKVSCEDL IDCSFNPLGF RLTRHSLAAS
FTIQGTALVL DEIINHVQSL QRQVEMLSMR LAAVNPRIDF NLDTILASEN GSLMDGSFNG
TPMQLAWPHQ AIETEQSFHH RQLPPPPTQQ WPFDGLNQPV WGREEDQADG NDNSNLMAVS
ENVMVASANL HPNQVKMEL
//