ID A0A671EYY9_RHIFE Unreviewed; 1273 AA.
AC A0A671EYY9;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Arginine-glutamic acid dipeptide repeats {ECO:0000313|EMBL:KAF6345508.1, ECO:0000313|Ensembl:ENSRFEP00010018554.1};
GN Name=RERE {ECO:0000313|Ensembl:ENSRFEP00010018554.1};
GN ORFNames=mRhiFer1_014256 {ECO:0000313|EMBL:KAF6345508.1};
OS Rhinolophus ferrumequinum (Greater horseshoe bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Rhinolophidae;
OC Rhinolophinae; Rhinolophus.
OX NCBI_TaxID=59479 {ECO:0000313|Ensembl:ENSRFEP00010018554.1, ECO:0000313|Proteomes:UP000472240};
RN [1] {ECO:0000313|Ensembl:ENSRFEP00010018554.1, ECO:0000313|Proteomes:UP000472240}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25689317; DOI=10.1146/annurev-animal-090414-014900;
RA Koepfli K.P., Paten B., O'Brien S.J., Koepfli K.P., Paten B., Antunes A.,
RA Belov K., Bustamante C., Castoe T.A., Clawson H., Crawford A.J.,
RA Diekhans M., Distel D., Durbin R., Earl D., Fujita M.K., Gamble T.,
RA Georges A., Gemmell N., Gilbert M.T., Graves J.M., Green R.E., Hickey G.,
RA Jarvis E.D., Johnson W., Komissarov A., Korf I., Kuhn R., Larkin D.M.,
RA Lewin H., Lopez J.V., Ma J., Marques-Bonet T., Miller W., Murphy R.,
RA Pevzner P., Shapiro B., Steiner C., Tamazian G., Venkatesh B., Wang J.,
RA Wayne R., Wiley E., Yang H., Zhang G., Haussler D., Ryder O., O'Brien S.J.;
RT "The Genome 10K Project: a way forward.";
RL Annu Rev Anim Biosci 3:57-111(2015).
RN [2] {ECO:0000313|Ensembl:ENSRFEP00010018554.1, ECO:0000313|Proteomes:UP000472240}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=29166127; DOI=10.1146/annurev-animal-022516-022811;
RA Teeling E.C., Vernes S.C., Davalos L.M., Ray D.A., Gilbert M.T.P.,
RA Myers E.;
RT "Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level
RT Genomes for All Living Bat Species.";
RL Annu Rev Anim Biosci 6:23-46(2018).
RN [3] {ECO:0000313|Ensembl:ENSRFEP00010018554.1, ECO:0000313|Proteomes:UP000472240}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Teeling E., Myers G., Vernes S., Pippel M., Winkler S., Fedrigo O.,
RA Rhie A., Koren S., Phillippy A., Lewin H., Damas J., Howe K.,
RA Mountcastle J., Jarvis E.D.;
RT "G10K-VGP greater horseshoe bat female genome, primary haplotype.";
RL Submitted (DEC-2018) to the EMBL/GenBank/DDBJ databases.
RN [4] {ECO:0000313|EMBL:KAF6345508.1, ECO:0000313|Proteomes:UP000585614}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MRhiFer1 {ECO:0000313|EMBL:KAF6345508.1};
RC TISSUE=Lung {ECO:0000313|EMBL:KAF6345508.1};
RX PubMed=32699395;
RA Jebb D., Huang Z., Pippel M., Hughes G.M., Lavrichenko K., Devanna P.,
RA Winkler S., Jermiin L.S., Skirmuntt E.C., Katzourakis A., Burkitt-Gray L.,
RA Ray D.A., Sullivan K.A.M., Roscito J.G., Kirilenko B.M., Davalos L.M.,
RA Corthals A.P., Power M.L., Jones G., Ransome R.D., Dechmann D.K.N.,
RA Locatelli A.G., Puechmaille S.J., Fedrigo O., Jarvis E.D., Hiller M.,
RA Vernes S.C., Myers E.W., Teeling E.C.;
RT "Six reference-quality genomes reveal evolution of bat adaptations.";
RL Nature 583:578-584(2020).
RN [5] {ECO:0000313|Ensembl:ENSRFEP00010018554.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JACAGC010000009; KAF6345508.1; -; Genomic_DNA.
DR Ensembl; ENSRFET00010020219.1; ENSRFEP00010018554.1; ENSRFEG00010012385.1.
DR GeneTree; ENSGT00940000153615; -.
DR Proteomes; UP000472240; Chromosome 9.
DR Proteomes; UP000585614; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR000679; Znf_GATA.
DR PANTHER; PTHR13859:SF12; ARGININE-GLUTAMIC ACID DIPEPTIDE REPEATS PROTEIN; 1.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR Pfam; PF03154; Atrophin-1; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF00320; GATA; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022771};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000472240};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022771};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 16..119
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 123..175
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..227
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 274..842
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 871..953
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 196..218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..358
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..402
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 403..419
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..439
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 440..474
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 495..515
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..622
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 623..660
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 691..719
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 734..775
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 871..917
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1273 AA; 138618 MW; A0C6758152830C5F CRC64;
MDEPFSPCRR LNSTQGEIRV GPSHQAKLPD LQPFPSPDGD TVTQHEELVW MPGVNDCDLL
MYLRAARSMA AFAGMCDGGS TEDGCVAASR DDTTLNALNT LHESSYDAGK ALQRLVKKPV
PKLIEKCWTE DEVKRFVKGL RQYGKNFFRI RKELLPNKET GELITFYYYW KKTPEAASSR
AHRRHRRQAV FRRIKTRTAS TPVNTPSRPP SSEFLDLSSA SEDDFDSEDS EQELKGYACR
HCFSTTSKDW HHGGRESILL CTDCRIHFKK YGELPPIEKP VDPPPFMFKP VKEEDDGLSG
KHSMRTRRTR GSMSTLRSGR KKQPASPDGR ASPINEDIRS SGRNSPSAAS TSSNDSKAET
VKKSAKKVKE EASSPLKNTK RQREKVASDT EEADRTSSKK TKTQEISRPN SPSEGEGESS
DSRSVNDEGS SDPKDIDQDN RSTSPSIPSP QDNESDSDSS AQQQTLQAQP PALQASSAAA
AAPAVPPGAP QLPTAGPTPS ATAGPAQGSP SASQPPSQAP SAPPHAHVQQ APALHPQRLP
SPHAPLQPLT GPAGQTSAPP HTQPPLHSQG PAGPHSLQAG SLLPHPGPPQ PFGLPPQASQ
AAAHPHTSLQ PPASQSALQP QQPPREQPLP PAPLAMPHIK PPPTTPIPQL PAPQAHKHPP
HLSGPSPFSM NANLPPPPAL KPLSSLSTHH PPSAHPPPLQ LMPQSQPLPS SPAQPPVLTQ
SLPPGAASHP PTGLHQVPPQ PPFTQHPFVP GGPPPITPPT CPSTSTPPAG PGPSAQPPCS
AAVSSGGSAP GGAACPLPTV QIKEEALDDA EEPESPPPPP RSPSPEPTVV DTPSHASQSA
RFYKHLDRGY NSCARTDLYF MPLAGSKLAK KREEAIEKAK REAEQKAREE REREKEKEKE
REREREREAE RAAKASSSAH EGRLSDPQLG GPGHMRPSFE PPPTTIAAVP PYIGPDTPAL
RTLSEYARPH VMSPTNRNHP FYVPLNPTDP LLAYHMPGLY NVDPTIRERE LRERELRERE
IRERELRERM KPGFEVKPPE LDPLHPATNP MEHFARHSAL TLPPTAGPHP FASFHPGLNP
LERERLALAG PQLRPEMSYP DRLAAERIHA ERMASLTSDP LARLQMFNVT PHHHQHSHIH
SHLHLHQQDP LHQGSAGPVH PLVDPLTAGP HLARFPYPPG TLPNPLLGQP PHEHEMLRHP
VFGTPYPRDL PGAIPPPMSA AHQLQAMHAQ SAELQRLAME QQWLHGHPHM HGGHLPSQED
YYSRLKKEGD KQL
//