ID G5BI69_HETGA Unreviewed; 1507 AA.
AC G5BI69;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 24-JAN-2024, entry version 47.
DE RecName: Full=Fork-head domain-containing protein {ECO:0000259|PROSITE:PS50039};
DE Flags: Fragment;
GN ORFNames=GW7_16874 {ECO:0000313|EMBL:EHB08980.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB08980.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB08980.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00089}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH170413; EHB08980.1; -; Genomic_DNA.
DR STRING; 10181.G5BI69; -.
DR eggNOG; ENOG502QVBK; Eukaryota.
DR InParanoid; G5BI69; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0044599; C:AP-5 adaptor complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd20054; FH_FOXK1; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR028222; AP5Z1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR047394; FH_FOXK1.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR46488; AP-5 COMPLEX SUBUNIT ZETA-1; 1.
DR PANTHER; PTHR46488:SF1; AP-5 COMPLEX SUBUNIT ZETA-1; 1.
DR Pfam; PF00250; Forkhead; 1.
DR Pfam; PF14764; SPG48; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00089};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813}.
FT DOMAIN 272..367
FT /note="Fork-head"
FT /evidence="ECO:0000259|PROSITE:PS50039"
FT DNA_BIND 272..367
FT /note="Fork-head"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00089"
FT REGION 254..273
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 380..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 653..686
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..406
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EHB08980.1"
SQ SEQUENCE 1507 AA; 163452 MW; 8E7C9D749EB39558 CRC64;
RCTFRFPSTA IKIQFTSLYH KEEAPASPLR PLYPQISPLK IHIPEPDLRS LVSPIPSPTG
TISMFPWNHG VNSPSQVFSW GAILWTALQY RSAPRPPNWS SGLELALPDG EAWVGRSCHA
SVGRRAAARP ACNLGRGRPW PTAEAAAGRR RVAIAGILFP RCGTSTLGVP RLLKRVALHS
AAVCSGDSSE SPSSRWSWSL ALHRVENLPV SFRLCSVPNS CPASPRGAGS SGYRYVQNVT
SDLQLAAEFA AKAASEQQAD TSGGDSPKDE SKPPYSYAQL IVQAVSSAPD RQLTLSGIYA
HITRHYPYYR TADKGWQNSI RHNLSLNRYF IKVPRSQEEP GKGSFWRIDP ASEAKLVEQA
FRKRRQRGVS CFRTPFGPLS SRSAPASPTH PGLMSPRSSG LQTPECLSRE GSPIPHDPEL
GSKLASVPEY RYSQSAPGSP VSAQPVIMAV PPRPPSLVAK PVAYMPASIV TSQQPSGHAI
HVVQQAPTVT MVRVVTTSAS SANGYILTSQ GSTGSSHDTA GAAVLDLGSE GRGLEEKPTI
AFATIPAASR VIQTVASQMA PGVPGHTVTI LQPATPVTIG QHHLPVRAVT QNGKHAVPTN
SLATSAYALT SPLQLLAAQA SSSTPVVVTR VCEVGPEEPA AVVTAATSAT PTVATSTATS
ASSIGEPEVK RSRVEEPTPD LGGEALAGCP MWTLRPRGGE MFSAGEESLL HQAREIQDEE
LQRFCSRVSK LLQEDLGPAT VDALQRLFLI ISATKYVRRL EKTCVDLLQA TLSLPTCSEQ
VQVLCAAILR EMSPSDSLTL SCDHTQTPHQ LSLVASVLLA QGDRREEVRR VSQHVFTVLE
SRQPEGPNLR PLLPVLSRVM GLAPGTLQED QATLLSKRLV DWLRYARVQQ GVPHSGGFFS
TPRARQPGPI TEVDGAVASD FFTVLSTGQH FTEDQWLNVQ AFSMLQAWLL LSGPKGPGLP
DAEDKSELEG STLSVLSAAS AGRQLLPQER LREVAFQYCK RLIEQSNRRA LRKGDSDLQK
ACLVEAVRVL DVLCRQDPSF LYRSLSCLKA LRGQLGQDPG SERALLPLAQ FFLNHGAAAA
VDSEAIYQHL FTRLPSERFY SPMLAFEVTR FCRNNLPLFD PQLLGLLKLS FPNLFKLLAW
NSPPLTAEFV ALLPALVDAS TAVEMLHSLL DLPCLTAALD LHLRSSQTPS ERPLWDPSLR
TPGSLEAFRD PQPGRSCGHL VGGHSAAVCV PRLVPLYQLL QPMAGCARVA QCAQAVPTLL
QAFFSTVTQV ADGALTNQLA LLVLERSDAL YHVPQYQAHV HRVLSSQFLA LCKQKPSLVL
ELARELLELV GSVSNIQSRA GMFTCVVWAI GEYLSVTCDR RCTVELINKF FEALEALLFE
VTQSRPSADL PWCPPQVVTV LMTTLTKLAS RSQDLIPRVS LFLSKMRTLG QNPATSSVYR
EEDAEAIRTR ATELRALLRM PSVAQFVLSP SAEVCQPRYH RDTNTALPLA LRTVSRLVER
EAGLLPG
//