ID A0A2K5WI01_MACFA Unreviewed; 2862 AA.
AC A0A2K5WI01;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=MAX dimerization protein MGA {ECO:0000313|Ensembl:ENSMFAP00000036769.1};
GN Name=MGA {ECO:0000313|Ensembl:ENSMFAP00000036769.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000036769.1, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000036769.1, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000036769.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00201}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00201}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005559313.1; XM_005559256.2.
DR Ensembl; ENSMFAT00000011018.2; ENSMFAP00000036769.1; ENSMFAG00000037653.2.
DR VEuPathDB; HostDB:ENSMFAG00000037653; -.
DR GeneTree; ENSGT00940000156269; -.
DR OrthoDB; 5323209at2759; -.
DR Proteomes; UP000233100; Chromosome 7.
DR Bgee; ENSMFAG00000037653; Expressed in thymus and 13 other cell types or tissues.
DR GO; GO:0071339; C:MLL1 complex; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd18911; bHLHzip_MGA; 1.
DR CDD; cd20195; T-box_MGA-like; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 2.60.40.820; Transcription factor, T-box; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR037935; MAX_gene-associated_bHLHzip.
DR InterPro; IPR032060; MGA_dom.
DR InterPro; IPR008967; p53-like_TF_DNA-bd_sf.
DR InterPro; IPR046360; T-box_DNA-bd.
DR InterPro; IPR036960; T-box_sf.
DR InterPro; IPR001699; TF_T-box.
DR InterPro; IPR018186; TF_T-box_CS.
DR PANTHER; PTHR11267:SF32; MAX GENE-ASSOCIATED PROTEIN; 1.
DR PANTHER; PTHR11267; T-BOX PROTEIN-RELATED; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF16059; MGA_dom; 1.
DR Pfam; PF00907; T-box; 1.
DR PRINTS; PR00937; TBOX.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00425; TBOX; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF49417; p53-like transcription factors; 1.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS01264; TBOX_2; 1.
DR PROSITE; PS50252; TBOX_3; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00201};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00201}; Reference proteome {ECO:0000313|Proteomes:UP000233100};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 79..260
FT /note="T-box"
FT /evidence="ECO:0000259|PROSITE:PS50252"
FT DOMAIN 2220..2271
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 259..322
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 600..653
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 878..911
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 972..993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1250..1291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1308..1336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1384..1433
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1492..1520
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1635..1654
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1700..1722
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1762..1856
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1904..1964
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2057..2110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2172..2193
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2327..2352
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2372..2393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2467..2505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2743..2778
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1115..1146
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 262..276
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 277..294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 632..646
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 972..991
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1258..1280
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1315..1332
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1496..1518
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1767..1793
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1800..1815
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1816..1835
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1950..1964
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2096..2110
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2374..2393
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2472..2502
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2752..2778
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2862 AA; 315957 MW; 7EE5119914CE0187 CRC64;
MEEKQQIILA NQDGGTVAGA APTFFVILKQ PGNGKTDQGI LVTNQDACAL ASSVSSPVKS
KGKICLPADC TVGGITVTLD NNSMWNEFYH RSTEMILTKQ GRRMFPYCRY WITGLDSNLK
YILVMDISPV DNHRYKWNGR WWEPSGKAEP HVLGRVFIHP ESPSTGHYWM HQPVSFYKLK
LTNNTLDQEG HIILHSMHRY LPRLHLVPAE KAVEVIQLNG PGVHTFTFPQ TEFFAVTAYQ
NIQITQLKID YNPFAKGFRD DGLNSKPQRD GKQKNSSDQE GNNISSSSGH RVRLTEGQGS
EIQPGDLDPL SRGHETSGKG LEKTSLNIKR DFLGFMDTDS ALSEVPQLKQ EISECLIASN
FEDDSHVASP LDQNGSFNVV IKEEPLDDYD YELGECPEGV TVKQEETDEE TDVYSNSDDD
PILEKQLKRH NKVDNPEADH LSSKWLPSSP SGVAKAKMFK LDTGKMPVVY LEPCAVTRST
VKISELPDNM LSTSRKDKSS MLAELEYLPT YIENSNETAF CLGKESENGL RKHSSDLRVV
QKYPLLKDSQ WKYPDISDSI NTERILDSSK GSVGDLLSGK EDLGRKRTTM LKIATPPKVV
NANQNASPNV PGKRGRPRKL KLCKAGRPPK NTGKSLISTK NTPVGPGSTF PDVKPDLEDV
DGVLFVSFES KEALDIHAVD GTTEESSSLQ ASTTNDSGYR ARISQLEKEL IEDLKSLRHK
QVIHPGLQEV GLKLNSVDPT MSIDLKYLGV QLPLAPATSF PFWNLSGTNP ASPDAGFPFV
SRTGKTNDFT KIKGWRGKFH SASASRNEGG NSESSLKNRS AFCSDKLDEY LENEGKLMET
SMGFSSNAPT SPVVYQLPTK STSYVRTLDS VLKKQSTISP STSYSLKPHS VPPASRKAKS
QNRQATFSGR TKSSYKSILP YPVSPKQKYS HMILGDKVTK NSSGIISENQ ANNFVVPTLD
ENVFPKQISL RQAQQQQQQQ QQQQGSRPPG LSKSQVKLMD LEDCALWEGK PRTYITEERA
DVSLTTLLTA QASLKTKPIH TIIRKRAPPC NNDFCRLGCV CSSLALEKRQ PAHCRRPDCM
FGCTCLKRKV VLVKGGSKTK HFQRKAAHPR DPVFYDTLGE EAREEEEGIR EEEEQLKEKK
KRKKLEYTIC ETEPEQPVRH YPLWVKVEGE VDPEPVYIPT PSVIEPMKPL LLPQPEVLSP
TVKGKLLTGI KSARSYIPRP NPVIREEDKD PVYLYFESMM TCARVRVYER KKEDQRQPSS
SSSPSPSFQQ QSLCHSSPEN HSNAKEPDCE QQPLKQLTCD LEDDSDKLQE KSWKSPCNEG
ESSSTSYMHQ RSPGGPTKLI EIISDCNWEE DRNKILSILS QHINSNMPQS LKVGSFIIEL
ASQRKSRGEK NPPVYSSRVK ISMPSCQDQD DMAEKSGSET PDGPLSPGKM EDISPVQTDA
LDSVRERLHG GKGLPFYAGL SPAGKLVAYK RKPSSSTSGL IQVASNAKVA ASRKPRTLLP
STSNSKMASS SGTATNRPGK NLKAFVPAKR PIENAAQIPV ATPQVSPNTV KRAGPRLLLI
PVQQGSPTLR PVSNTQLQGH RMVLQPVRSP SGMNLFRHPN GQIVQLLPLH QLRGSNTQPN
LQPVMFRNPG SVMGIRLPAP SKPSETPPSS TSSSAFSVVN PVIQAVGSSS AVNVITQAPS
LLSSGASFVS QAGTLTLRIS PPEPQSFASK TGSETKITYS SGGQPVGTAS LIPLQSGSFA
LLQLPGQKPV PSSILQHVAS LQMKRESQNP DQKDETNSIK REQETKKVLQ SEGEAVDSEA
NIIKQNSGAA TSEETLNDSL EDRGDHLDEE SLPEEGSATV KPSEHSCITG SHTDQDYKDV
NEEYGARNRN SSKEKVAVLE VRTISEKASN KTVQNLSKVQ HQKLGDVKVE QQKGLDNPEE
NSSEFPVTFK EESKFELSGS KVTEQQSNPQ PEAKEKECGD SLEKDSIRER WRKHLKGPLT
RKCVGSSQEC KKEVDEQLIK ETKTCQENSD VFQQEQGICD LLGKSGITED ARVLKTECDS
WSRISNPSAF SIVPRRAAKS SRGNGHFQGH LLLPGEQIQP KQEKKGGRSS ADFTVLDLEE
DDDDENEKTD DSIDEIVDVV SDYQSEEVDD VEKNNCVEYI EDDEEHVDIE TVEELSEEIN
VAHLKTTVAH TQSFKQPSRT HISADEKAAE RSRKAPPIPL KLKPDYWSDK LQKEAEAFAY
YRRTHTANER RRRGEMRDLF EKLKITLGLL HSSKVSKSLI LTRAFSEIQG LTDQADKLIG
QKNLLTRKRN ILIRKVSSLS GKTEEVVLKK LEYIYAKQQA LEAQKRKKKM GSDEFDMSPR
ISKQQEGSST SSVDLGQMFI NNRRGKPLIL SRKKDQATEN TSPSNTPHTS ANLVMTPQGQ
LLTLKGPLFS GPVVTVSPDL LESDLKPQVA SSAVALPEND DLFMMPRIVN VTSLATEGGL
VDMGGSKYPH EVPDGKPSDH LKDTVRNEDN SLEDKSRISS RGNRDGRVML GPTQVFLANK
DSVYPQIVDV SSMQKAQEFL PKKISGDMRG IQYKWKESES RGERVKSKES SFHKLKMKDL
KDSSIEMELR KVTSAIEEAA LDSSELLTNM EDEDDTDETL TSLLNEIAFL NQQLNDDSVS
LAELPSSMDT EFPGDARRAF ISKVTSGNRA AFQVEHLGTG LKELPDVQGE SDSISPLLLH
LEDDDFSENE KQLAEPASEP DVLKIVIDSE IKDSLLSNKK AIDGGKNTSG LLAEPESVSS
PPTLHMKTGL ENSNSTDTLW RPMPKLAPLG LKVANPSSDA DGQSLKVMPC LAPIAAKVGS
VGHKMNLTGN DPEGRESKVM PTLAPVVAKL GNSGASPSSA GK
//