ID A0A437D6U6_ORYJA Unreviewed; 1539 AA.
AC A0A437D6U6;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Arginine-glutamic acid dipeptide repeats protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=OJAV_G00064400 {ECO:0000313|EMBL:RVE70431.1};
OS Oryzias javanicus (Javanese ricefish) (Aplocheilus javanicus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=123683 {ECO:0000313|EMBL:RVE70431.1, ECO:0000313|Proteomes:UP000283210};
RN [1] {ECO:0000313|EMBL:RVE70431.1, ECO:0000313|Proteomes:UP000283210}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RS831 {ECO:0000313|EMBL:RVE70431.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:RVE70431.1};
RA Lopez-Roques C., Donnadieu C., Bouchez O., Klopp C., Cabau C., Zahm M.;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:RVE70431.1, ECO:0000313|Proteomes:UP000283210}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RS831 {ECO:0000313|EMBL:RVE70431.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:RVE70431.1};
RA Herpin A., Takehana Y., Naruse K., Ansai S., Kawaguchi M.;
RT "A chromosome length genome reference of the Java medaka (Oryzias
RT javanicus).";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM012443; RVE70431.1; -; Genomic_DNA.
DR Ensembl; ENSOJAT00000013939; ENSOJAP00000013080; ENSOJAG00000006799.
DR Proteomes; UP000283210; Chromosome 7.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd04709; BAH_MTA; 1.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR000679; Znf_GATA.
DR PANTHER; PTHR13859:SF38; ARGININE-GLUTAMIC ACID DIPEPTIDE REPEATS PROTEIN ISOFORM X1; 1.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR Pfam; PF03154; Atrophin-1; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF00320; GATA; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000283210};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 115..253
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 254..357
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 361..413
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 434..464
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 513..1098
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1128..1211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1513..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..45
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 434..456
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..541
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 566..593
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..681
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 682..725
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..741
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 762..789
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..827
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 835..850
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..870
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 871..929
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 938..966
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 976..1036
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1128..1175
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1189..1207
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1539 AA; 170449 MW; 4DEA7E475E62E3F3 CRC64;
MTADKEKERE KERDRDRDRD KREAGKSRRQ DGDRGDRESE SSRPRRSCTL EGGAKNYAES
EHSDDEDNDN GSTGGGSGTA EEAGKKGKKK MPKKKSRYER TENGEITSFI TEDDVVYRPG
DCVYIESRRP NTPYFICSIQ DFKLSKRDHL LMNVKWYYRQ SEVPDSVYQH LVQDRNNEND
SGRELVITDP VVRSRELFIS DYVDTYHAAA LRGKCNISHF SDIFAAREFR ARIDSFFYIL
GYNPETRRLN STQGEIRVGP SHQAKLPELQ PFPSPGGHAV TENEELVWMP GVNDCDLLMY
LRAARSMAAF AGMCDGGSTE DGCLAASRDD TTLNALNTLH ESSYDAGKAL QRLVKKPVPK
LIEKCWSEDE VKRFIKGLRQ FGKNFFRIRK ELLPNKETGE LITFYYYWKK TPEAASCRAH
RRHRRQPVFR RIKTRTASTP VNTPSRPPSS EFLDLSSASE DDFDSEDSEQ ELKGYACRHC
FSTTSKDWHH GGRENILLCT DCRIHFKKYA ELPPIEKPVD PPPFMFKPVK EEEDGLSGKH
SMRTRRNRGS MSTLRSGRKK QPASPDGRAS PTNEDLRSSG RTSPSAASTD STDSKTDSMK
KPSKKIKEEA PSPMKSTKRQ REKGASDSEE TERATAKKSK TQELSRPDSP SECEGEGEGE
SSDGRSINEE LSSDPKDIDQ DNRSSSPSIP SPRDNESDSD SSAQQQQLLQ SQHPAVIQCQ
PGTSTANAAP PPPPTSNPLL PPQVPTAAAS ASLPPQPLAQ AGPMSLIQSG ASLHPQRLPS
PHSPLTQAPS SGPTVPPQSL PSPHHGPLPP VPHPLQPAPP HLPHPHSMTP QGFPVGPSQV
PPPPVSSQSQ QRPHSPPSQS QSSSQSGGQP PREQPLPPAT MSVPHIKPPP TTPIPQMPTP
QSHKHPPHVP PPPFLPMPSN LPPPPALKPL SSLSNHHPPS AHPPPLQLMP QGQQLQPPPA
QPPVLTQSQS LPPSASHQPP AAPPLPHTVS HPTAGPPQPP FSSHPFSTVL PPTGPPPSSS
NSMPSLQPPP PSSSSISMPL PASVPCAGPG PSIPPMNIKE EPLDEPEEPE SPPPPQRSPS
PEPTVVNTPS HASQSARFYK HLDRGYNSCA RTDFYFTPLA SSKLAKKREE ALEKAKREAE
QKAREEKERE REREKERERE REREKEVERA AKASSSSHES RMGEPPMAGP AHMRPPFDGP
PTTIAAVPPY IGPDTPALRT LSEYARPHVM SPSNRNHPFF VSLNPADPLL AYHMPGLYNA
DPAMRERELR EREMREREIR ERELRERMKP GFEVKPPEME SLHPSTNPME HFVRHGAITL
PPMPSPHPFA SFHPSLNPLE RERLALVGPQ LRPEMSYPER VAAERLHAER MATVANDPIA
RLQMFNVTPH HHQHSHIHSH LHLHQQDPLH QGGGECLVCP PGSGSHPLAV DPLAAGPHLA
RFPYPPGTIP NPLLGQPPHE HEMLRHPVFG APYPRDLPGG IPPPMSAAHQ LQAMHAQSAE
LQRLAMEQQW LHGHHMHGGP LPGQEDYYSR LKKESDKQL
//