ID A0A672UXL4_STRHB Unreviewed; 1283 AA.
AC A0A672UXL4;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=Arginine-glutamic acid dipeptide repeats {ECO:0000313|Ensembl:ENSSHBP00005019020.1};
GN Name=RERE {ECO:0000313|Ensembl:ENSSHBP00005019020.1};
OS Strigops habroptila (Kakapo).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Psittaciformes; Psittacidae; Strigops.
OX NCBI_TaxID=2489341 {ECO:0000313|Ensembl:ENSSHBP00005019020.1, ECO:0000313|Proteomes:UP000472266};
RN [1] {ECO:0000313|Ensembl:ENSSHBP00005019020.1, ECO:0000313|Proteomes:UP000472266}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Jarvis E.D., Howard J., Rhie A., Phillippy A., Korlach J., Digby A.,
RA Iorns D., Eason D., Robertson B., Raemaekers T., Howe K., Lewin H.,
RA Damas J., Hastie A., Tracey A., Chow W., Fedrigo O.;
RT "Strigops habroptila (kakapo) genome, bStrHab1, primary haplotype, v2.";
RL Submitted (NOV-2019) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSHBP00005019020.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSSHBT00005022725.1; ENSSHBP00005019020.1; ENSSHBG00005016332.1.
DR GeneTree; ENSGT00940000153615; -.
DR Proteomes; UP000472266; Chromosome 19.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR000679; Znf_GATA.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR PANTHER; PTHR13859:SF11; GRUNGE, ISOFORM J; 1.
DR Pfam; PF03154; Atrophin-1; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF00320; GATA; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022771};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000472266};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022771};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 10..113
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 117..169
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..35
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 190..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 268..860
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 903..973
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..212
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..297
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 331..352
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..396
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 397..411
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 412..435
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 436..476
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 491..506
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..553
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 554..581
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..640
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..678
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 721..736
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 764..787
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..810
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 903..937
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1283 AA; 140827 MW; 3E8757CE74CA5C83 CRC64;
LDERLNSTQG EIRVGPSHQA KLPDLQPFPS PDGDTVTQHE ELVWMPGVND CDLLMYLRAA
RSMAAFAGMC DGGSTEDGCV AASRDDTTLN ALNTLHESNY DAGKALQRLV KKPVPKLIEK
CWTEDEVKRF IKGLRQYGKN FFRIRKELLP NKETGELITF YYYWKKTPEA ASSRAHRRHR
RQAVFRRIKT RTASTPVNTP SRPPSSEFLD LSSASEDDFD SEDSEQELKG YACRHCFTTT
SKDWHHGGRE NILLCTDCRI HFKKYGELPP IEKPVDPPPF MFKPVKEEDD GLSGKHSMRT
RRSRGSMSTL RSGRKKQPAS PDGRASPINE DIRSSGRNSP SAASTSSNDS KADSVKKSAK
KIKEEVSSPL KNSKRQREKA ASDTEEPDRS NAKKSKTQEI SRPNSPSEGE GEGESSDSRS
VNDEGSSDPK DIDQDNRSTS PSIPSPQDNE SDSDSSAQQQ VLQAQPQVLQ AQSGSGQAPP
PTPPISAQLP ASLPAASSAT SAPPQVSPSA SQPPSQPQAP APPPPHSHIQ QAPALHPPRL
PSPHPPLQPL SMPQSQPAPA SSSQPHSQPP LHSQAQPAPH SLQAQPLLPH PVPSQPFSLP
TQSSQSQVPL QTQAPSHSHS TLQVTQPVLP TATSLQQAQP PREQPLPPAP MAMPHIKPPP
TTPIPQLPTA PSHKHPPHLS GPSPFSMNSN LPPPPALKPL SSLSTHHPPS AHPPPLQLMP
QSQPLQSSQA QPPVLTQSQS LPPPANHPPS GLHQVSSQPP FSQHPFVPGG PPSITPPSCP
STSTPPTVPG IPLQTSISTS AASSGNVPVV TACTLPPIQI KEEVPDEAEE PESPPPPPRS
PSPEPTVVDT PSHASQSARF YKHLDRGYNS CSRADLYFMP LAGSKLAKKR EEAIEKAKRE
AEQKAREERE REKEKEKERE REREREREAE RAAKVSSSSH EGRLGESQLS GPAHMRPSFE
PPPTTIAAVP PYIGPDTPAL RTLSEYARPH VMSPTNRNHP FFVPLNPTDP LLAYHMPGLY
NVDPTIRERE LREREIRERE IRERELRERM KPGFEVKPPE LDALHPATNP MEHFARHGAL
TIPPTAGPHP FASFHPGLNP LERERLALAG PQLRPEMSYP DRLAAERIHA ERMASLTNDP
LARLQMFNVT PHHHQHSHIH SHLHLHQQDP LHQGSAGPVH PLVDPLAAGP HLARFPYPPG
TIPNPLLGQP PHEHEMLRHP VFGTPYPRDL PGAIPPPMSA AHQLQAMHAQ SAELQRLAME
QQWLHGHPHM HGGHLPSQED YYR
//