ID H2LLW6_ORYLA Unreviewed; 1537 AA.
AC H2LLW6;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=Arginine-glutamic acid dipeptide repeats {ECO:0000313|Ensembl:ENSORLP00000007033.2};
GN Name=RERE {ECO:0000313|Ensembl:ENSORLP00000007033.2};
OS Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000007033.2, ECO:0000313|Proteomes:UP000001038};
RN [1] {ECO:0000313|Ensembl:ENSORLP00000007033.2, ECO:0000313|Proteomes:UP000001038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000007033.2,
RC ECO:0000313|Proteomes:UP000001038};
RX PubMed=17554307; DOI=10.1038/nature05846;
RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT "The medaka draft genome and insights into vertebrate genome evolution.";
RL Nature 447:714-719(2007).
RN [2] {ECO:0000313|Ensembl:ENSORLP00000007033.2}
RP IDENTIFICATION.
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000007033.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8090.ENSORLP00000007033; -.
DR Ensembl; ENSORLT00000007034.2; ENSORLP00000007033.2; ENSORLG00000005593.2.
DR eggNOG; KOG2133; Eukaryota.
DR GeneTree; ENSGT00940000153615; -.
DR HOGENOM; CLU_005292_1_0_1; -.
DR InParanoid; H2LLW6; -.
DR TreeFam; TF328554; -.
DR Proteomes; UP000001038; Chromosome 7.
DR Bgee; ENSORLG00000005593; Expressed in pharyngeal gill and 15 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0003714; F:transcription corepressor activity; IBA:GO_Central.
DR CDD; cd04709; BAH_MTA; 1.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR000679; Znf_GATA.
DR PANTHER; PTHR13859:SF38; ARGININE-GLUTAMIC ACID DIPEPTIDE REPEATS PROTEIN ISOFORM X1; 1.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR Pfam; PF03154; Atrophin-1; 2.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF00320; GATA; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 115..287
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 288..391
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 395..447
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 468..498
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 547..1106
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1136..1219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1511..1537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..45
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 468..490
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 560..575
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 600..627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 628..715
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 716..759
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 760..775
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 783..815
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 816..860
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 861..879
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 880..938
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 947..975
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 985..999
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1014..1032
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1033..1048
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1136..1183
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1537 AA; 170746 MW; 6584C2F891A4B63C CRC64;
MTADKEKERE KERDRDRDRD KREAGKSRRQ EGDRGDRESE SSRPRRSCTL EGGAKNYAES
EHSDDDDNDN GSTGGGSGTA EEAGKKGKKK MPKKKSRYER TENGEITSFI TEDDVVYRPG
DCVYIESRRP NTPYFICSIQ DFKLVSTYSL YLTDSPPLCP FSATSQVDEC IEQSGGKKYK
RDHLLMNVKW YYRQSEVPDS VYQHLVQDRN NENDSGRELV ITDPVVRSRE LFISDYVDTY
HAAALRGKCN ISHFSDIFAA REFRARIDSF FYILGYNPET RRLNSTQGEI RVGPSHQAKL
PELQPFPSPG GHAVTENEEL VWMPGVNDCD LLMYLRAARS MAAFAGMCDG GSTEDGCLAA
SRDDTTLNAL NTLHESSYDA GKALQRLVKK PVPKLIEKCW SEDEVKRFIK GLRQFGKNFF
RIRKELLPNK ETGELITFYY YWKKTPEAAS CRAHRRHRRQ PVFRRIKTRT ASTPVNTPSR
PPSSEFLDLS SASEDDFDSE DSEQELKGYA CRHCFSTTSK DWHHGGRENI LLCTDCRIHF
KKYAELPPIE KPVDPPPFMF KPVKEEEDGL SGKHSMRTRR NRGSMSTLRS GRKKQPASPD
GRASPTNEDL RSSGRTSPSA ASTDSTDSKT DSMKKPSKKI KEEAPSPMKS TKRQREKGTS
DSEETERATA KKSKTQELSR PDSPSECEGE GEGESSDGRS INEELSSDPK DIDQDNRSSS
PSIPSPRDNE SDSDSSAQQQ QLLQSQHPPV IQCQPGTSTA SAAPPPPPTS NPLLPPQVPT
AAASASLPPQ SLAQAGPMSL IQSGASLHPQ RLPSPHSPLT QAPPSGPTVP PQSLPSPHHG
PLPPVPHPLQ PVPPHFQSQQ RPHSPPSQSQ SSSQSGGQPP REQPLPPATM SVPHIKPPPT
TPIPQMPTPQ SHKHPPHLPA PPFLPMPSNL PPPPALKPLS SLSNHHPPSA HPPPLQLMPQ
GQQLQPPPAQ PPVLTQSQSL PPSANHQPPA APPLPHTVSH PTAGPTQPPF SSHPFSTVLP
PTGPPPSSSN SMPSLQPPSS SSSISMPLPA SVPCAGPGPS IPPVNIKEEP LDEPEEPESP
PPPQRSPSPE PTVVNTPSHA SQSARFYKHL DRGYNSCART DFYFTPLASS KLAKKREEAL
EKAKREAEQK AREEKERERE REKERERERE REKEVERAAK ASSSSHESRM GEPQMAGPAH
MRPPFDGPPT TIAAVPPYIG PDTPALRTLS EYARPHVMSP SNRNHPFFVS LNPADPLLAY
HMPGLYNADP AMRERELRER EMREREIRER ELRERMKPGF EVKPPEMETL HPSTNPMEHF
VRHGAITLPP MPSPHPFASY HPSLNHLERE RLALVGPQLR PEMSYPERVA AERLHAERMA
TVANDPIARL QMFNVTPHHH QHSHIHSHLH LHQQDPLHQG SGSHPLAVDP LGAGPHLARF
PYPPGSIPNP LLGQPPHEHE MLRHPVFGAP YPRDLPGGIP PPMSAAHQLQ AMHAQSAELQ
RLAMEQQWLH GHHMHGGPLP GQEDYYSRLK KESDKQL
//