ID G5C2K9_HETGA Unreviewed; 398 AA.
AC G5C2K9;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 24-JAN-2024, entry version 47.
DE RecName: Full=Forkhead box protein G1 {ECO:0000256|ARBA:ARBA00034868};
GN ORFNames=GW7_04499 {ECO:0000313|EMBL:EHB15770.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB15770.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB15770.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC -!- FUNCTION: Transcription repression factor which plays an important role
CC in the establishment of the regional subdivision of the developing
CC brain and in the development of the telencephalon.
CC {ECO:0000256|ARBA:ARBA00034686}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00089}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH173087; EHB15770.1; -; Genomic_DNA.
DR AlphaFoldDB; G5C2K9; -.
DR STRING; 10181.G5C2K9; -.
DR eggNOG; KOG2294; Eukaryota.
DR InParanoid; G5C2K9; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd20021; FH_FOXG; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR047208; FOXG1.
DR InterPro; IPR018122; TF_fork_head_CS_1.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR46617; FORKHEAD BOX PROTEIN G1; 1.
DR PANTHER; PTHR46617:SF3; FORKHEAD BOX PROTEIN G1; 1.
DR Pfam; PF00250; Forkhead; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00657; FORK_HEAD_1; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00089};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813}.
FT DOMAIN 90..184
FT /note="Fork-head"
FT /evidence="ECO:0000259|PROSITE:PS50039"
FT DNA_BIND 90..184
FT /note="Fork-head"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00089"
FT REGION 54..88
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 336..364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 58..88
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 398 AA; 42790 MW; 8BFBEB97E2312210 CRC64;
MAQHNEKDAD SVSYLSALGE GVRWNGADGA QGVAGCRAGL RSGCSPEYPC AAPRVRAPRA
EGAEKKGAGE GGKDGEGGKE GEKKNGKYEK PPFSYNALIM MAIRQSPEKR LTLNGIYEFI
MKNFPYYREN KQGWQNSIRH NLSLNKCFVK VPRHYDDPGK GNYWMLDPSS DDVFIGGTTG
KLRRRSTTSR AKLAFKRGAR LTSTGLTFMD RAGSLYWPMS PFLSLHHPRA SSTLSYNGTT
SAYPSHPMPY SSVLTQNSLG NNHSFSTANG LSVDRLVNGE IPYATHHLTA AALAASVPCG
LSVPCSGTYS LNPCSVNLLA GQTSYFFPHV PHPSMTSQSS TSMSARAASS STSPQAPSTL
PCESLRPSLP SFTTGLSGGL SDYFTHQNQG SSSNPLIH
//