GenomeNet

Database: UniProt
Entry: I3J4L6_ORENI
ID   I3J4L6_ORENI            Unreviewed;       511 AA.
AC   I3J4L6;
DT   11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 2.
DT   27-MAR-2024, entry version 61.
DE   SubName: Full=Homeobox-containing protein 1 {ECO:0000313|Ensembl:ENSONIP00000003806.2};
GN   Name=LOC100699346 {ECO:0000313|Ensembl:ENSONIP00000003806.2};
OS   Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX   NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000003806.2, ECO:0000313|Proteomes:UP000005207};
RN   [1] {ECO:0000313|Proteomes:UP000005207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Broad Institute Genome Assembly Team;
RG   Broad Institute Sequencing Platform;
RA   Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL   Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSONIP00000003806.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_013127683.1; XM_013272229.2.
DR   RefSeq; XP_013127725.1; XM_013272271.2.
DR   AlphaFoldDB; I3J4L6; -.
DR   STRING; 8128.ENSONIP00000003806; -.
DR   Ensembl; ENSONIT00000003807.2; ENSONIP00000003806.2; ENSONIG00000003040.2.
DR   GeneID; 100699346; -.
DR   eggNOG; ENOG502QQSR; Eukaryota.
DR   GeneTree; ENSGT00940000154928; -.
DR   HOGENOM; CLU_052355_1_0_1; -.
DR   InParanoid; I3J4L6; -.
DR   OMA; THKPSYM; -.
DR   OrthoDB; 5399075at2759; -.
DR   TreeFam; TF320327; -.
DR   Proteomes; UP000005207; Linkage group LG1.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:InterPro.
DR   GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   CDD; cd00093; HTH_XRE; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR   InterPro; IPR001387; Cro/C1-type_HTH.
DR   InterPro; IPR040363; HMBOX1.
DR   InterPro; IPR006899; HNF-1_N.
DR   InterPro; IPR044869; HNF-1_POU.
DR   InterPro; IPR044866; HNF_P1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR   PANTHER; PTHR14618:SF4; HOMEOBOX-CONTAINING PROTEIN 1 ISOFORM X1-RELATED; 1.
DR   PANTHER; PTHR14618; HOMEODOX-CONTAINING PROTEIN 1 HMBOX1; 1.
DR   Pfam; PF04814; HNF-1_N; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR   PROSITE; PS51937; HNF_P1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS51936; POU_4; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          6..37
FT                   /note="HNF-p1"
FT                   /evidence="ECO:0000259|PROSITE:PS51937"
FT   DOMAIN          142..238
FT                   /note="POU-specific atypical"
FT                   /evidence="ECO:0000259|PROSITE:PS51936"
FT   DOMAIN          263..338
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        265..339
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          44..95
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          443..511
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        54..95
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        443..468
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   511 AA;  55727 MW;  9CC44D198D4BB274 CRC64;
     MFQQCEEPRF TIEQIDLLQR LRRTGITQAE VLHALDTLDH LDRQHGHKLT HKPSYVPPSS
     SSTVAASSSM TSTATQTSFP NNRLSLSPNN NFDTTSPPLP VPVASPNTIT AVVPNGLVAV
     TNGKLSPPRF PLGVVSATVT TPGYAFEASE EDIDVDEKVE DLMRRDSAVI KEEIKSFLAN
     RRISQAVVAQ VTGISQSRIS HWLLQQGSDL SEQKKRAFFR WYQLEKTNPG ATLAMRATPL
     ALEEVMDWNQ APPTFGSAPG GFRLRRGSRF TWRKECLAVM EGYFNDNQYP DEAKREEIAT
     ACNAVIQKPG KKLSDLERVT SLKVYNWFAN RRKDIKRRAN IEAAILESHG IEVQSPGGQS
     NSDDVDGNDF PDQGCEVSLF DKRASARQFS FSRADLSSPT QVPTLLPSWF SALGRGGVSG
     QRATSLIGRS LISGAVPQME GSRLTGVSWS PPSPSLQDEP TVHSALSEPQ DPVSLEKAAA
     VSDSSPAVTN QGDGAGCSMG NDIKTETLED D
//