ID A0A445HMU5_GLYSO Unreviewed; 860 AA.
AC A0A445HMU5;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 22-FEB-2023, entry version 11.
DE SubName: Full=Transcription factor LHW isoform B {ECO:0000313|EMBL:RZB75098.1};
GN ORFNames=D0Y65_033828 {ECO:0000313|EMBL:RZB75098.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB75098.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB75098.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB75098.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB75098.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000012; RZB75098.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445HMU5; -.
DR Proteomes; UP000289340; Chromosome 12.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR CDD; cd18915; bHLH_AtLHW_like; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR043561; LHW-like.
DR InterPro; IPR025610; MYC/MYB_N.
DR PANTHER; PTHR46196; TRANSCRIPTION FACTOR BHLH155-LIKE ISOFORM X1-RELATED; 1.
DR PANTHER; PTHR46196:SF4; TRANSCRIPTION FACTOR LHW; 1.
DR Pfam; PF14215; bHLH-MYC_N; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 647..696
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 122..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..661
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..138
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 637..661
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 860 AA; 93638 MW; 655324FBFC265324 CRC64;
MLTRIVGRAA FTGNYQWILL NNFTRDAYPP EVYPELHYQF SAGMQTVAVI PVLPHGVVQL
GSFSPIMEDI GFVNDVKNFI LQLGCVPGAL LSEDYSAKVS NEKFAGPVTV DPPVITSNCT
PSVANGSNQL TNSPLASRPV AQPPHPLRGG INNYQGSLLT PQAYNPNQVF DGICQPKAHS
MIKTNVCGQP KKTIVEAEAK VIPTNFDSCL QQHSVYNARS AFNELSSFNQ SNLSDGSLKY
MEQQTSGVGR QSQVIPNVNP SSALNMPRLK IDGGKILEQN QSSSGSSLLG GIPICSGSNL
LRTNMINCSL SNPPKVSTNT SDFSGMYKVG FGLQSNNTTT NAVLCSVPNF TNQSVTNHMN
LEGSGQKSLS IDLKQVWDAF ASTDQRIDDD LLQAALKIPS LHLEEHVPMG DHISGFVQDC
LSKDLTSQHM MKMNVKHAEA DAQLPSGDDL FDVLGVDLKR RLLNGNRNEL LATDSDAITE
HLDKKATHMN LQGVGPNNSY SVNEAISESG IFSGTDTDHL LDAVVLKAQS AAKQNSDEMS
CRTTLTRIST ASIPSPVCKQ VMPDHVAPRG LFDFPKTGVK TASAETSSLR SGCSKDDAGN
CSQTTSIYGS KLSSWVENSS NFKRESSVST GYSKRPDEVC KSNRKRLKPG ENPRPRPKDR
QMIQDRVKEL REIVPNGAKC SIDALLEKTI KHMLFLQSVT KHADKLKQTG ESKIVSKEGG
LLLKDNFEGG ATWAYEVGAQ SMVCPIIVED LNPPRQMLVE MLCEECGFFL EIADLIRGLG
LTILKGVMEA RNDKIWARFA VEANRDVTRM EIFMSLVRLL DQTVKGGASS SNAIDNNMML
YHSFPQATQI PATGRPSSLQ
//