ID G1P307_MYOLU Unreviewed; 1249 AA.
AC G1P307;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Arginine-glutamic acid dipeptide repeats {ECO:0000313|Ensembl:ENSMLUP00000004278.2};
GN Name=RERE {ECO:0000313|Ensembl:ENSMLUP00000004278.2};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000004278.2, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000004278.2, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000004278.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02050533; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAPE02050534; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAPE02050535; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAPE02050536; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G1P307; -.
DR STRING; 59463.ENSMLUP00000004278; -.
DR Ensembl; ENSMLUT00000004702.2; ENSMLUP00000004278.2; ENSMLUG00000004696.2.
DR eggNOG; KOG2133; Eukaryota.
DR GeneTree; ENSGT00940000153615; -.
DR HOGENOM; CLU_005292_1_0_1; -.
DR InParanoid; G1P307; -.
DR OMA; VMYLRAA; -.
DR TreeFam; TF328554; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd04709; BAH_MTA; 1.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR000679; Znf_GATA.
DR PANTHER; PTHR13859:SF12; ARGININE-GLUTAMIC ACID DIPEPTIDE REPEATS PROTEIN; 1.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR Pfam; PF03154; Atrophin-1; 2.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF00320; GATA; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 1..175
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 176..279
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 283..335
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 355..378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 434..675
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 719..752
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 867..929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..463
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 469..517
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 518..561
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 562..578
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 579..598
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..630
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..664
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 719..748
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 867..899
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1249 AA; 137895 MW; 240C6BAA8E4FA114 CRC64;
DCVYIESRRP NTPYFICSIQ DFKLVHNSQA CCRSPTPALC DPPACSLPVA SQPPQHLSEA
GRGPVGSKRD HLLMNVKWYY RQSEVPDSVY QHLVQDRHNE NDSGRELVIT DPVIKNRELF
ISDYVDTYHA AALRGKCNIS HFSDIFAARE FKARVDSFFY ILGYNPETRR LNSTQGEIRV
GPSHQAKLPD LQPFPSPDGD TVTQHEELVW MPGVNDCDLL MYLRAARSMA AFAGMCDGGS
TEDGCVAASR DDTTLNALNT LHESGYDAGK ALQRLVKKPV PKLIEKCWTE DEVKRFVKGL
RQYGKNFFRI RKELLPNKET GELITFYYYW KKTPEAASSR AHRRHRRQAV FRRIKTRTAS
TPVNTPSRPP SSEFLDLSSA SEDDFDSEAS DQELKGYPCR HCFSTTSKDW HHGGRENILL
CTDCRIHFKK YGELPPIEKP VDPPPFMFKP VKEEDDGLSG KHSMRTRRSR GSVSTLRSGR
KKQPASPDGR TSPNEDIRSS GRNSPSAAST SSNDSKAETV KKLAKKVKEG VSSPLKSTKR
QREKVASDTE EADRTSSKKT KTQEISRPNS PSEGEGESSD SRSVNDEGSS DPKDIDQDNR
STSPSIPSPQ DNESDSDSSA QQQTLQAQPP ALQAPPGAAP APSTAPPGAP QLPTPGPTPS
AVPPQGSPSA STAVQPGSAG FCFPDTGHLL CVRLFVLLSW QLEDPVLLSQ DPLFRDCCSP
PPFPGPPPPE AHRAPPPFPG PPPSPRGSPC TSSLALAGSD WLTSSLGTAE KLKVGRARHW
GVDLPHPRPL AALAGHSRGL GHPAPALVGG PLPCSALSHR YMNPFRTQGK LQAQRRPSRG
VWARSLSQAV AEAARCWGGG TKLTGQCRMA TGQSSPPQDT RQLVSPTPQK ASSSAHEGRL
SDPQLGGPSH MRPSFEPPPT TIAAVPPYIG PDTPALRTLS EYARPHVMSP TNRNHPFYMP
LNPTDPLLAY HMPGLYNVDP TIRERELRER EIREREIRER ELRERMKPGF EVKPPELDPL
HPATNPMEHF ARHSALTIPP TAGPHPFASF HPGLNPLERE RLALAGPQLR PEMSYPDRLA
AERIHAERMA SLTSDPLARL QMFNVTPHHH QHSHIHSHLH LHQQDPLHQG SAGPVHPLVD
PLTAGPHLAR FPYPPGTLPN PLLGQPPHEH EMLRHPVFGT PYPRDLPGAI PPPMSAAHQL
QAMHAQSAEL QRLAMEQQWL HAHPHMHGSH LPSQEDYYSR LKKEGDKQL
//