ID A0A1A6GFX3_NEOLE Unreviewed; 323 AA.
AC A0A1A6GFX3;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Transcription factor MafB {ECO:0000256|ARBA:ARBA00016894};
GN ORFNames=A6R68_06701 {ECO:0000313|EMBL:OBS64759.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS64759.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS64759.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS64759.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS64759.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the bZIP family. Maf subfamily.
CC {ECO:0000256|ARBA:ARBA00008500}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS64759.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01097138; OBS64759.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GFX3; -.
DR STRING; 56216.A0A1A6GFX3; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR CDD; cd14718; bZIP_Maf_large; 1.
DR Gene3D; 1.20.5.170; -; 1.
DR InterPro; IPR004827; bZIP.
DR InterPro; IPR004826; bZIP_Maf.
DR InterPro; IPR046347; bZIP_sf.
DR InterPro; IPR013592; Maf_TF_N.
DR InterPro; IPR008917; TF_DNA-bd_sf.
DR InterPro; IPR024874; Transcription_factor_Maf_fam.
DR PANTHER; PTHR10129; TRANSCRIPTION FACTOR MAF; 1.
DR PANTHER; PTHR10129:SF10; TRANSCRIPTION FACTOR MAFB; 1.
DR Pfam; PF03131; bZIP_Maf; 1.
DR Pfam; PF08383; Maf_N; 1.
DR SMART; SM00338; BRLZ; 1.
DR SUPFAM; SSF47454; A DNA-binding domain in eukaryotic transcription factors; 1.
DR SUPFAM; SSF57959; Leucine zipper domain; 1.
DR PROSITE; PS50217; BZIP; 1.
PE 3: Inferred from homology;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 238..301
FT /note="BZIP"
FT /evidence="ECO:0000259|PROSITE:PS50217"
FT REGION 34..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 116..210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 263..297
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 50..78
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 129..143
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..188
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 323 AA; 35748 MW; C8EC67E8BA52F5B6 CRC64;
MAAELSMGPE LPTSPLAMEY VNDFDLLKFD VKKEPLGRAE RPGRPCTRLQ PAGSVSSTPL
STPCSSVPSS PSFSPTEQKT HLEDLYWMAS NYQQMNPEAL NLTPEDAVEA LIGSHPVPQP
LQSFDGFRSA HHHHHHHHPH PHHGYPGAGV AHDELGPHAH PHHHHHHQAS PPPSSAASPA
QQLPTSHPGP GPHSAAAATA AGGNGSVEDR FSDDQLVSMS VRELNRHLRG FTKDEVIRLK
QKRRTLKNRG YAQSCRYKRV QQKHHLENEK TQLIQQVEQL KQEVSRLARE RDAYKVKCEK
LANSGFREAG STSDSPSSPE FFL
//