ID I3KIK4_ORENI Unreviewed; 296 AA.
AC I3KIK4;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Homeobox protein otx5 {ECO:0000313|Ensembl:ENSONIP00000020949.2};
GN Name=LOC100700919 {ECO:0000313|Ensembl:ENSONIP00000020949.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000020949.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000020949.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003449350.1; XM_003449302.4.
DR AlphaFoldDB; I3KIK4; -.
DR Ensembl; ENSONIT00000020968.2; ENSONIP00000020949.2; ENSONIG00000016630.2.
DR GeneID; 100700919; -.
DR KEGG; onl:100700919; -.
DR GeneTree; ENSGT00940000164977; -.
DR InParanoid; I3KIK4; -.
DR OMA; YQNTPRK; -.
DR OrthoDB; 2913621at2759; -.
DR Proteomes; UP000005207; Linkage group LG14.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003025; Otx_TF.
DR InterPro; IPR013851; Otx_TF_C.
DR PANTHER; PTHR45793:SF22; CONE-ROD HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45793; HOMEOBOX PROTEIN; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03529; TF_Otx; 1.
DR PRINTS; PR01255; OTXHOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 37..97
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 39..98
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 91..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..108
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..144
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 296 AA; 32054 MW; 5138F4B328430D28 CRC64;
MMSYIKQPHY AVNGLTLSGP GMDLLHTAAV GYPSTPRKQR RERTTFTRAQ LDILEALFAK
TRYPDIFMRE EVALKINLPE SRVQVWFKNR RAKCRQQQQQ STGQSKPRPP KKKASPARES
NTEASANPTN GPYSPPPPPP GTAITPSSSS ASATVSIWSP ASISPLPDPL SASTTPCMQR
TTTYPMTYTQ APAYSQSYAG SSSYFTGLDC SSYLSPMHPQ LSGTGGALSP ITASSMGGPL
SQSPASLSSQ GYTAASLGFG AVDCLDYKDQ TAWKLNFNAA DCLDYKDQNS WKFQVL
//