ID A0A087XJF1_POEFO Unreviewed; 281 AA.
AC A0A087XJF1;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=Homeobox protein Nkx-6.2-like {ECO:0000313|Ensembl:ENSPFOP00000005881.1};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000005881.1, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000005881.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01011600; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01011601; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007566853.1; XM_007566791.2.
DR RefSeq; XP_007566855.1; XM_007566793.2.
DR RefSeq; XP_007566856.1; XM_007566794.2.
DR STRING; 48698.ENSPFOP00000005881; -.
DR Ensembl; ENSPFOT00000005890.2; ENSPFOP00000005881.1; ENSPFOG00000005993.2.
DR Ensembl; ENSPFOT00000005946.2; ENSPFOP00000005937.1; ENSPFOG00000006051.2.
DR GeneID; 103148151; -.
DR GeneID; 103148154; -.
DR GeneID; 103148157; -.
DR KEGG; pfor:103148151; -.
DR KEGG; pfor:103148154; -.
DR KEGG; pfor:103148157; -.
DR CTD; 565782; -.
DR eggNOG; KOG0847; Eukaryota.
DR GeneTree; ENSGT00940000161547; -.
DR OMA; MQGAPWR; -.
DR OrthoDB; 3234377at2759; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR000047; HTH_motif.
DR PANTHER; PTHR24340; HOMEOBOX PROTEIN NKX; 1.
DR PANTHER; PTHR24340:SF21; HOMEOBOX PROTEIN NKX-6.2; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 151..211
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 153..212
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 211..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 217..238
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 281 AA; 30907 MW; 86ED578B335B7B83 CRC64;
MLAVGQMDAN RQSAFVLGST PLAALHNMTE MKTSLFPYAL QQSPAGFKAP SLSNLSSQIS
GGTPHGISDI LGRPITTAGQ LLSGFPRING LAATTAAGMY FSPAVSRYPK PLAELPGRAP
IFWPGVMQGA PWRDPRVPCP SQANLMVDKD GKKKHSRPTF SGQQIFALEK TFEQTKYLAG
PERARLAYSL GMTESQVKVW FQNRRTKWRK RHAAEMASAK KKHDSETEKM KESSDNEEDD
EYNKPLDPNS DDEKITRLLK KHKATTNLAL ISPCSNSSDT L
//