ID A0A452IG48_9SAUR Unreviewed; 714 AA.
AC A0A452IG48;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=Groucho/TLE N-terminal Q-rich domain-containing protein {ECO:0000259|Pfam:PF03920};
OS Gopherus agassizii (Agassiz's desert tortoise).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Testudinidae; Gopherus.
OX NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000026447.1, ECO:0000313|Proteomes:UP000291020};
RN [1] {ECO:0000313|Proteomes:UP000291020}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28562605;
RA Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT "The Agassiz's desert tortoise genome provides a resource for the
RT conservation of a threatened species.";
RL PLoS ONE 12:e0177708-e0177708(2017).
RN [2] {ECO:0000313|Ensembl:ENSGAGP00000026447.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the WD repeat Groucho/TLE family.
CC {ECO:0000256|ARBA:ARBA00005969}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452IG48; -.
DR Ensembl; ENSGAGT00000030062.1; ENSGAGP00000026447.1; ENSGAGG00000019267.1.
DR Proteomes; UP000291020; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR005617; Groucho/TLE_N.
DR InterPro; IPR009146; Groucho_enhance.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR10814; TRANSDUCIN-LIKE ENHANCER PROTEIN; 1.
DR PANTHER; PTHR10814:SF29; TRANSDUCIN-LIKE ENHANCER PROTEIN 1; 1.
DR Pfam; PF03920; TLE_N; 1.
DR Pfam; PF00400; WD40; 6.
DR PRINTS; PR01850; GROUCHOFAMLY.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 2.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 1..77
FT /note="Groucho/TLE N-terminal Q-rich"
FT /evidence="ECO:0000259|Pfam:PF03920"
FT REPEAT 514..555
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 556..597
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 72..101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 122..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..98
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..141
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..194
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 195..209
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..245
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 255..275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 714 AA; 76672 MW; 53D21E3A32CC469D CRC64;
MQRHYVMYYE MSYGLNIEMH KQTEIAKRLN TICAQVIPFL SQEHQQQVAQ AVERAKQVTM
AELNAIIGQQ QLQAQHLSHG HGPPVPLTPH PSGLQPPGIP PLGSSAGLLA LSSALGGQSH
LAIKDDKKHH DAEHHRDREP GTSNSLLVPD SLRNTDKRRN GPEYSNDVKK RKVDDKDSSH
YDSDGDKSDD NLVVDVSNED PSSPRASPAH SPRENGIDKN RLLKKDASSS PASTASSGSS
TSLKSKEMSL HEKASTPVLK SSTPTPRSDV PTPGTSATPG LRPGLGKPPT MDPLVNQAAA
GLRTPLAVSG PYPTPFGMVP HAGMNGELTS PGAAYASLHN MSPQMSAAAA AAAVVAYGRS
PMVGFDPPPH MRVPGIPPNL AGIPGGKPAY SFHVTADGQM QPVPFPPDAL IGPGIPRHAR
QINTLNHGEV VCAVTISNPT RHVYTGGKGC VKVWDISQPG NKSPVSQLDC LNRDNYIRSC
KLLPDGCTLI VGGEASTLSI WDLAAPTPRI KAELTSSAPA CYALAISPDS KVCFSCCSDG
NIAVWDLHNQ TLVRQFQGHT DGASCIDISN DGTKLWTGGL DNTVRSWDLR EGRQLQQHDF
TSQIFSLGYC PTGEWLAVGM ESSNVEVLHV NKPDKYQLHL HESCVLSLKF AYCGKWFVST
GKDNLLNAWR TPYGASIFQS KESSSVLSCD ISVDDKYIVT GSGDKKATVY EVIY
//