ID I3KR15_ORENI Unreviewed; 381 AA.
AC I3KR15;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 11-JUL-2012, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Hepatocyte nuclear factor 3-beta {ECO:0000313|Ensembl:ENSONIP00000023560.1};
GN Name=FOXA2 {ECO:0000313|Ensembl:ENSONIP00000023560.1};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000023560.1, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000023560.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00089}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005452472.1; XM_005452415.1.
DR AlphaFoldDB; I3KR15; -.
DR STRING; 8128.ENSONIP00000023560; -.
DR Ensembl; ENSONIT00000023581.2; ENSONIP00000023560.1; ENSONIG00000018714.2.
DR GeneID; 100705944; -.
DR KEGG; onl:100705944; -.
DR eggNOG; KOG3563; Eukaryota.
DR GeneTree; ENSGT00940000155999; -.
DR HOGENOM; CLU_027910_4_0_1; -.
DR InParanoid; I3KR15; -.
DR OMA; NNLMSEP; -.
DR OrthoDB; 5385885at2759; -.
DR TreeFam; TF316127; -.
DR Proteomes; UP000005207; Linkage group LG15.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0019904; F:protein domain specific binding; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR013638; Fork-head_N.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR018533; Forkhead_box_C.
DR InterPro; IPR018122; TF_fork_head_CS_1.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR11829; FORKHEAD BOX PROTEIN; 1.
DR PANTHER; PTHR11829:SF380; PROTEIN FORK HEAD; 1.
DR Pfam; PF00250; Forkhead; 1.
DR Pfam; PF08430; Forkhead_N; 1.
DR Pfam; PF09354; HNF_C; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; "Winged helix" DNA-binding domain; 1.
DR PROSITE; PS00657; FORK_HEAD_1; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00089};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT DOMAIN 124..218
FT /note="Fork-head"
FT /evidence="ECO:0000259|PROSITE:PS50039"
FT DNA_BIND 124..218
FT /note="Fork-head"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00089"
FT REGION 224..283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..259
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 381 AA; 41608 MW; 426E82AE925EA53E CRC64;
MLSAVKMEGH EHPDWSSSYY YAEPECYPPA ANMNSMSTYM GTPGMTGTGH MNAHYVSHPV
GVSGSSVASG MTQSAGAAAL PPSMSSMSPP PYGNVPVMSP VYGQACAIRP RESKAYRRSY
THAKPPYSYI SLITMAIQQS GSKMLTLNEI YQWIMDLFPF YRQNQQRWQN SIRHSLSFND
CFIKVPRAPD KPGKGSFWTL HPDSGNMFEN GCYLRRQKRF KCGKKADSEP GCEAGAPGRS
SDSPPSSSSS SSSSPPSSDG KGADLKPHRG EPPASSPVRV PSPLTHTQHL LSHHPHPLLL
HEAAHLKTDP YHTHYSFNHP FSINNLMSEP QHHKLEPVVQ YGGYGCPVSG ALMAAKPGLE
PAHSDSGYYH GVYGRPIMNS S
//