ID A0A0P7U8T3_SCLFO Unreviewed; 269 AA.
AC A0A0P7U8T3;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE SubName: Full=Homeobox protein MSH-C-like {ECO:0000313|EMBL:KPP70644.1};
DE SubName: Full=Homeobox protein MSX-2-like {ECO:0000313|Ensembl:ENSSFOP00015026818.1};
GN Name=LOC108942703 {ECO:0000313|Ensembl:ENSSFOP00015026818.1};
GN ORFNames=Z043_110512 {ECO:0000313|EMBL:KPP70644.1};
OS Scleropages formosus (Asian bonytongue) (Osteoglossum formosum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Osteoglossocephala;
OC Osteoglossomorpha; Osteoglossiformes; Osteoglossidae; Scleropages.
OX NCBI_TaxID=113540 {ECO:0000313|EMBL:KPP70644.1, ECO:0000313|Proteomes:UP000034805};
RN [1] {ECO:0000313|EMBL:KPP70644.1, ECO:0000313|Proteomes:UP000034805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Aro1 {ECO:0000313|EMBL:KPP70644.1};
RA Tan M.H., Gan H.M., Croft L.J., Austin C.M.;
RT "The genome of the Asian arowana (Scleropages formosus).";
RL Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSFOP00015026818.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the Msh homeobox family.
CC {ECO:0000256|ARBA:ARBA00038425}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARO02003363; KPP70644.1; -; Genomic_DNA.
DR RefSeq; XP_018621689.1; XM_018766173.1.
DR Ensembl; ENSSFOT00015027123.2; ENSSFOP00015026818.1; ENSSFOG00015017234.2.
DR GeneID; 108942703; -.
DR KEGG; sfm:108942703; -.
DR CTD; 17703; -.
DR GeneTree; ENSGT00940000164982; -.
DR OrthoDB; 4848801at2759; -.
DR Proteomes; UP000034805; Unassembled WGS sequence.
DR Proteomes; UP000694397; Chromosome 8.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR PANTHER; PTHR24338; HOMEOBOX PROTEIN MSX; 1.
DR PANTHER; PTHR24338:SF9; HOMEOBOX PROTEIN MSX-3; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000034805}.
FT DOMAIN 145..205
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 147..206
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 116..154
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 16..40
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 119..141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 269 AA; 30155 MW; 3B63E24DBCD44A3F CRC64;
MAPYSLIMNS VQGCNSREKE PHEREDSEGE DLKSNDMATS KGCKEKTISL PFSVEALISD
RSACRTQCAT SDPTLTADQL TQPSARTVYT EKGISLEKVT HLTDCKKKES EDLRDSDQRT
WFQTSSYSSS SRPSSPPPCT LRKHKNNRKP RTPFTTSQLL ALERKFRQKQ YLSIAERAEF
SNSLNLTETQ VKIWFQNRRA KAKRLQEAEM EKLKLASKPL VPAFAFPFPL GTAVGSPSLY
GPSQAFPRPT LPVPGLFAGP VTYGMYYLS
//