ID W5M5S8_LEPOC Unreviewed; 268 AA.
AC W5M5S8;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 24-JAN-2024, entry version 53.
DE SubName: Full=Homeobox protein AKR-like {ECO:0000313|Ensembl:ENSLOCP00000003736.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000003736.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000003736.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01004271; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006633943.1; XM_006633880.2.
DR AlphaFoldDB; W5M5S8; -.
DR STRING; 7918.ENSLOCP00000003736; -.
DR Ensembl; ENSLOCT00000003743.1; ENSLOCP00000003736.1; ENSLOCG00000003162.1.
DR GeneID; 102685535; -.
DR KEGG; loc:102685535; -.
DR CTD; 7050; -.
DR eggNOG; KOG0773; Eukaryota.
DR GeneTree; ENSGT00940000155230; -.
DR HOGENOM; CLU_034318_2_0_1; -.
DR InParanoid; W5M5S8; -.
DR OMA; SAPGMNC; -.
DR OrthoDB; 3180467at2759; -.
DR Proteomes; UP000018468; Linkage group LG9.
DR Bgee; ENSLOCG00000003162; Expressed in ovary and 13 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048854; P:brain morphogenesis; IEA:Ensembl.
DR GO; GO:0021797; P:forebrain anterior/posterior pattern specification; IEA:Ensembl.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0046530; P:photoreceptor cell differentiation; IEA:Ensembl.
DR GO; GO:0048385; P:regulation of retinoic acid receptor signaling pathway; IEA:Ensembl.
DR GO; GO:0072091; P:regulation of stem cell proliferation; IEA:Ensembl.
DR GO; GO:0017015; P:regulation of transforming growth factor beta receptor signaling pathway; IEA:Ensembl.
DR GO; GO:0070654; P:sensory epithelium regeneration; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR PANTHER; PTHR11850:SF59; HOMEOBOX PROTEIN TGIF1; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000018468}.
FT DOMAIN 33..96
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 35..97
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 120..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 268 AA; 29071 MW; 306F8F2D2A40313B CRC64;
MKAKKAMPSL SGSETEDEDS MDAPLDLSSS TGSGKRKRRG NLPKESVQIL REWLYEHRYN
AYPSEQEKAL LSKQTRLSTL QVCNWFINAR RRLLPEMLRK DGKDPNQFTI SRKGSKVCEA
GATESSLSPK SLVSEGVDKR SFHPSSMDPV LAKKSPSPKG PSPGPALPRP SVICHTTITA
VQGGPVHYYQ AISTDRPTDL HSSAAPGLHK CHGAGALEGN SSPQGGLFNT PPPTPPDLTQ
DFSGFQLLVD VALKRAAEME LQAKQLVS
//