ID A0A484BWE2_DRONA Unreviewed; 2103 AA.
AC A0A484BWE2;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Arginine-glutamic acid dipeptide repeats protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AWZ03_000537 {ECO:0000313|EMBL:TDG52994.1};
OS Drosophila navojoa (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila.
OX NCBI_TaxID=7232 {ECO:0000313|EMBL:TDG52994.1, ECO:0000313|Proteomes:UP000295192};
RN [1] {ECO:0000313|EMBL:TDG52994.1, ECO:0000313|Proteomes:UP000295192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Navoj_Jal97 {ECO:0000313|EMBL:TDG52994.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:TDG52994.1};
RX PubMed=30423125; DOI=.1093/jhered/esy059;
RA Vanderlinde T., Dupim E.G., Nazario-Yepiz N.O., Carvalho A.B.;
RT "An Improved Genome Assembly for Drosophila navojoa, the Basal Species in
RT the mojavensis Cluster.";
RL J. Hered. 110:118-123(2019).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TDG52994.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRL02000002; TDG52994.1; -; Genomic_DNA.
DR STRING; 7232.A0A484BWE2; -.
DR OMA; PPYPRPN; -.
DR Proteomes; UP000295192; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR PANTHER; PTHR13859:SF11; GRUNGE, ISOFORM J; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000295192};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 7..113
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 117..169
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 168..455
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 646..665
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 680..772
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 865..1092
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1107..1132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1153..1298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1334..1415
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1433..1472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1501..1530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1615..1641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1683..1847
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1876..1927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 547..623
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 171..186
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..232
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..262
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 263..284
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..322
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 358..379
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 391..408
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..449
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 740..754
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 891..909
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 984..1009
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1010..1029
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1047..1061
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1156..1170
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1188..1210
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1218..1241
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1342..1415
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1433..1461
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1683..1709
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1752..1768
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1814..1830
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1832..1847
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1876..1898
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1899..1917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2103 AA; 223531 MW; A7DF7713CE51ABB2 CRC64;
MSMYVHNSHN SEIQLTAKLP DYNPISSFPV DKETDERELE ETRWSPGVVA DGDLLMFLRA
ARSMAAFQGM CDGGLEDGCL AASRDDTTIN ALDVLHDSGY DPGKALQALV KCPVSKGIDK
KWTEDETKKF IKGLRQFGKN FFRIHKDLLP HKDTPELVEF YYLWKKTPGA NNNRPHRRRR
QSALRRNRVT RANNTPPKKE DTPEPQAATT ATAAASAAET ANRSSPAVSK EENSSLTEDD
VSECDSDSSL TNKRDESPSR MRTRNKQQNN NNNNNSSSAA ASGGGGNSAA AGAASVNASG
NSSGKDQSSG SANNNSNAVA NGKRPKRGSE TPDAAAAAAA AACDSPKTPT KTVAEGSGTK
RKGGKQDTPN KKKRTEAETN SNSSEQAINN SEDNVKEKPR KRPDSPVESM NSDSRPDSVL
DDGESNTTDT DGRTIEQQSS KDSKEINCKE ENAVTLSGDV DSKSDVVNEK SIKAETACAD
DSKDTIKNMD EETNIQAPSS IQPLSIKPAH VDGLLKESSS LEAPPAVVPV APIAMKVPTI
ATVEALNASV DRDRKEAIEK MEICENEAAA RDPELLKKLA TIKQETLPQQ QQQQQQQQQQ
QQQQQQQQQQ QQQQQQQQQQ QQQMVGVPPV SAASGPMQEA VYIKKEPMED SMDATCNQNS
NEPQDLKVKI EIKNEDLKIN ASGLPPVSSA GPPPNAQLGG LHHGSSEGSN PEPLHLQHMP
HGPTPQAPAG YIIDGQLKYG PPGQPPPQPP PQLHSDPGSG GVSAPAPQKY PGDMEMKYNE
AAVKFEPSAG KFAPQELKYP VPAPLDALKY SQEMQAAAAA AAAVGKYDMK YMIEQQGKYP
VELAPPKPGY QEALKIPDVK PGFAHLPHSI GSSLDGPGPP HKYAPAGQSG PPLDQQPPGA
TPPPGIAMPK PHYQHDVQTP PLGRPFEPTG LMLKYGDPLA AKYGPPQPQD LKYPMPPVSS
SAGGENLIKA SAYGPPPESP IDASARSTPG QDSQGSNSNS NSQPPSSQPQ QFQSPHPSPH
MPSPAGGGLP PGMHPQNLIS PHSHGPPPNS GSGPGPQPPT SLHQPISSMA GQGPPGLQHG
LPPGPGGPHA QISIANSLTG VVTSLGAPTM STMAPSHPMH PHMHPHQHAH LQSLQALHRH
PDLGAAMHPH APMAMSLQAP GPPPPHSHAH PLGPSHQQQP QPQQQPGPSP AGTVRTPSPA
QQQQPPRSMH EPLPTSREPP ASHTSTAPTG SMSSINSGPG PGQGPGQGPG PGPMPHQSPH
AHRTSPLPGL AHPSGLIGHP MPIHPHLAHL PPGHPAHAAL AHPGHHLLSH SIAGLSHGGG
PIALLAGPGG LGGLPESALS RRTPPSHLSH PHSSSGPSTP HSVAISTSMS LSTTPNTVPS
SAFSRASPSV QLSSGAPPAG GPGGNSNSGT PNNSSAAAAA AAAAAAHRAA SPASSVGSLS
RQSPLHPVPQ SPLSHHPSSS ALSAAAAAVA ERDRHALLRQ QSPHMTPPPV SSASGLMASP
LSKMYAPQPG QRGLGTSPPP HLRPGASPPV IRHPQMPLPL PLIAPGGGIP QIGVHPGQSP
YPHPLLHPSV FYSPHHHNPF NSPYGYAPYG PGFPAYMKPP PPSGPLDPAA VMAAHHAGLS
GPPPPSRQDE QNAAAAAAVA AEKQHAAAVA AAQQQQQQQQ QHKTPQQQQQ QQQQQQQQQQ
QQQQQQQQQQ QQQQQQQQQQ QQQQQQQPGG PPQNKPPTPK TPQGPGGVGV GVGVGLGGPG
TPTGLPPGAY PGAHIPGYPP PPHASPFAPQ DGQPHGLKPT SHMDALRAHA HSANSAGLGS
AHHPTEPLPI DIEPDPEPEI PSPTHNIPRG PSPEAKPDDT ECHRSQSAIF VRHIDRGDYN
SCTRTDLIFK PVTDSKLARK REERDRKLAE KERERRQQQQ QQQQQQQQQQ AAAAQQAAQQ
AKMKAELKPP YADTPALRQL SEYARPHVAF SPVEQMVPYH HPMSPMYSRE RELEEIKNAQ
AAAASQSRLD PHWMEYYRRG IHPSQFPLYA NPAISQMERE RLGIPPPHHV GMDPGEHMPQ
PPEAGFQLPP NVGQYPRPNM LIPREPHSDV LLRMSYADQL QYLQAAEFQR QSLHDQYFRQ
RPR
//