ID A0A1S3R8S2_SALSA Unreviewed; 484 AA.
AC A0A1S3R8S2;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Homeobox protein unc-4 homolog {ECO:0000313|RefSeq:XP_014048531.1};
GN Name=LOC106601117 {ECO:0000313|RefSeq:XP_014048531.1};
OS Salmo salar (Atlantic salmon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Salmo.
OX NCBI_TaxID=8030 {ECO:0000313|Proteomes:UP000087266, ECO:0000313|RefSeq:XP_014048531.1};
RN [1] {ECO:0000313|RefSeq:XP_014048531.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_014048531.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. Unc-4 subfamily.
CC {ECO:0000256|ARBA:ARBA00038351}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_014048531.1; XM_014193056.1.
DR AlphaFoldDB; A0A1S3R8S2; -.
DR STRING; 8030.ENSSSAP00000038228; -.
DR PaxDb; 8030-ENSSSAP00000038228; -.
DR GeneID; 106601117; -.
DR KEGG; sasa:106601117; -.
DR OMA; NSHPTSC; -.
DR OrthoDB; 2902937at2759; -.
DR Proteomes; UP000087266; Chromosome ssa03.
DR Bgee; ENSSSAG00000043469; Expressed in brain and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR46799; HOMEOBOX PROTEIN UNC-4 HOMOLOG; 1.
DR PANTHER; PTHR46799:SF1; HOMEOBOX PROTEIN UNC-4 HOMOLOG; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000087266};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 97..157
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 99..158
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 155..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 306..326
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 352..394
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 407..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..293
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 354..394
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 460..476
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 484 AA; 53478 MW; ABA7F83FF8E63CD3 CRC64;
MMDSRILDTP HAQFGGSLGG MVGFPYHLSH HHVYELAGHQ LQSAAAVPFS IDGLLNGSCT
ASVVNSNPLL SGCGMNGDNQ QYKLTDSCDP DKDSPGCKRR RTRTNFTGWQ LEELEKAFNE
SHYPDVFMRE ALALRLDLIE SRVQVWFQNR RAKWRKKENT KKGPGRPAHN SHPTTCSGEP
MDAEEIARRE LERMEKKKRK QERRLLRSQN KLLSGDLFHT PGSDSDSGVS QVTDSEQNPH
CDSVGRNQTQ SSCDQTPQTL QNQRHLNQDA GGSELDSSDS SQQSSMCANS RASTLQKLNP
FSVESLLSDS RPRRKPNIDG FSALPSSRPL IGKGHFLLYP ITQPLGFIVP QTALKTTPPD
SDTDNSPQRT PVNDSSLSAN TGHKNHKESN NLNDATAIKV QVNFTAKNTS SPQNSPGHVT
FSGRNCSQPS ASCSATCFQD DDDSKYLPLD KKEQSVKDYP ETSESVTTNC PPETKTETQD
VDME
//