ID A0A445F3I6_GLYSO Unreviewed; 577 AA.
AC A0A445F3I6;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=Transcription factor ABORTED MICROSPORES {ECO:0000313|EMBL:RZB43320.1};
GN ORFNames=D0Y65_053752 {ECO:0000313|EMBL:RZB43320.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB43320.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB43320.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB43320.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB43320.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000020; RZB43320.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445F3I6; -.
DR SMR; A0A445F3I6; -.
DR Proteomes; UP000289340; Chromosome 20.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0031326; P:regulation of cellular biosynthetic process; IEA:UniProt.
DR GO; GO:0080090; P:regulation of primary metabolic process; IEA:UniProt.
DR CDD; cd04873; ACT_UUR-ACR-like; 1.
DR CDD; cd11443; bHLH_AtAMS_like; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR025610; MYC/MYB_N.
DR PANTHER; PTHR31945:SF11; TRANSCRIPTION FACTOR ABORTED MICROSPORES; 1.
DR PANTHER; PTHR31945; TRANSCRIPTION FACTOR SCREAM2-RELATED; 1.
DR Pfam; PF14215; bHLH-MYC_N; 1.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW Signal {ECO:0000256|SAM:SignalP};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..577
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019275777"
FT DOMAIN 338..387
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 222..283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 308..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 377..407
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 222..264
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..279
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 577 AA; 65588 MW; 8C51A489031FE149 CRC64;
MDISFCFFCF FWSVFSSILL LLLVKGSNHS HFAFVSSKRK FHYHHPGDMN IIMQNLVERL
RPLVGLNGWD YCIYWKLSED QRFLEWLGCC CAGTESNQNA GEEHIFPVSS VASCRDSTYP
HPRTKPCDLL SQLSTSIPID NSGIHAQTLL TNQPNWVNYS NGMDPNILEE TIGTQVLISV
PGGLVELFVT KQVPEDHQLI DYVINQCIEA VNHSMSFHID ENSMSNMQSN PLIGDENEGN
NNSRDTSTLQ NMSSQWTSAV LQTNQEDQEH EHEHDTYQKS LMTTTDSQYV EPLEAKEKQE
EDKDLLKNVV GRSDSMSDCS DQNEEEEDGK YRRRNGKGNQ SKNLVAERKR RKKLNDRLYN
LRSLVPRISK LDRASILGDA IEYVKDLQKQ VKELQDELEE NADTESNCMN CVSELGPNAE
HDKAQTGLHV GTSGNGYVSK QKQEGTTVID KQTQQMEPQV EVALIDGNEY FVKVFCEHRP
DGFVKLMEAL NTIGMDVVHA TVTSHTGLVS NVFKVEKKDS ETVEAEDVRD SLLELMRNRY
RGWTHEMTAT SGNSVESDQH QLHNHNQMGA YPHEFHS
//