ID A0A1S3NKT4_SALSA Unreviewed; 1548 AA.
AC A0A1S3NKT4;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=Homeobox protein cut-like {ECO:0000256|RuleBase:RU361129};
GN Name=cux2 {ECO:0000313|RefSeq:XP_014015875.1};
OS Salmo salar (Atlantic salmon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Salmo.
OX NCBI_TaxID=8030 {ECO:0000313|Proteomes:UP000087266, ECO:0000313|RefSeq:XP_014015875.1};
RN [1] {ECO:0000313|RefSeq:XP_014015875.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_014015875.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family.
CC {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_014015875.1; XM_014160400.1.
DR GeneID; 106579952; -.
DR OrthoDB; 74668at2759; -.
DR Proteomes; UP000087266; Chromosome ssa20.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 3.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14043; CCAAT DISPLACEMENT PROTEIN-RELATED; 1.
DR PANTHER; PTHR14043:SF5; HOMEOBOX PROTEIN CUT-LIKE 2; 1.
DR Pfam; PF02376; CUT; 3.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 3.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 3.
DR PROSITE; PS51042; CUT; 3.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000087266};
KW Transcription {ECO:0000256|RuleBase:RU361129};
KW Transcription regulation {ECO:0000256|RuleBase:RU361129}.
FT DOMAIN 553..640
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 926..1013
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 1090..1177
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 1219..1279
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1221..1280
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 142..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 275..304
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 423..484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 538..559
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 671..700
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 821..879
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1010..1082
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1288..1359
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1417..1521
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..201
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 423..438
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 442..478
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 676..700
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 828..848
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1024..1038
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1447..1484
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1548 AA; 170220 MW; 35DED1E4DB160E55 CRC64;
MAADVGSMFQ YWKKFNLRRL QRELNSIASE LAGRQEESEH SHKHLVELSR EFKRNVPEEV
REMVAPVLKS FQAQVVALNK RSKEAESAFL GIYKQLIEAP DPVPVLEATH TLEERLQQLQ
SSAPDSEALV RELSGHWKRH LECLDKPGDT GEDPAAAESG PGEASSWTAG VMTPNSKQTL
RVTGSPQQNH QGQNQTEGET EDTDATLADR LGEAEERIKV LHSSLSTAQT DLLDLRCKYD
QEKANKAEEV GVIMMNLEKA NQKAEMAQRE VERLKEQLAS PRSATPGSCH PPEGTSGERG
DVSPSKLEAK LMAKDREILR LLENVQRLQF ALQEVQNISA NQIIELESQL AYKTEAIERL
EAKLQSQLDY EEIKTELRIL KVMKLASTNG NSSQDTAEAF LRDKEAFLPS PKYLMDKARI
LHNNDEDRSE ESGRESGREW GRPSGSFSSV SQVDERTSAS PGPPTMDVSS SSHDLPRPFS
VSPCSRDRLS ADHVVHKQLL SSPLFKKESG SPLMAFPTAL YAAKATLMSA NQGPAGAGGI
EAGLPSDQSE SGSSTAGDDD QLETAEIAFQ VKEQLLKHNI GQRVFGHYVL GLSQGSVSEI
LARPKPWRKL TVKGKEPFIK MKQFLSDDQN ILALRTIQVR QRGSITPRIR TPETGSDDAI
KNILEQAKKE IQSQRGGDIK SSLGSPSGLS SNGAGGSSDD TIKNILEQAR REMQAQQQAL
MEMDVCGRAS STSAGPQVER LGLPEPYKVL PLPTFIKQEE GGTVTVCMAN PISSPQTPLS
VLSPAAFVQN IIRKVKSEIG EAGTYFDQHW SVERGPIGMV GGVGGGSSRP FTSVSPSLSS
SSSSGPSALP RPWPRLENGE CLPNSEEASA AEEDTGSGMV RSVEVKVESD VSVSGESPGP
GRGLSYYPSY LPRTLKPTVP PLTPEQYEMY MYREVDTIEL TRQVKEKLAK NGICQRIFGE
KVLGLSQGSV SDMLSRPKPW SKLTQKGREP FIRMQLWLLD QLGQALTQLP SQSLSQDKSP
VTAQSSPSPP PSPEESHPSP LVEPVSLTLE SSKENQQPDG LGLMPPHPEG GKSTPSLLSL
HQPHTPLGIQ ELVAMSPELD TYAITKKVKE VLTDNNLGQR LFGESILGLT QGSVSDLLSR
PKPWHKLSLK GREPFVRMQL WLNDPHNVDK LRDMKKMEKK AYLKRRYGLL STGSDSDSPS
ARSECVSPAL ASIDLCPYSQ VKKPRVVLGA EEKEALRKAY LLEPYPSQHT IEMLACQLNL
KTNTVINWFH NYRSRMRREV LMEGLQDNDT DTEHHSYSPS AIQSPISDGD ERRRLHPAGR
TPHPIRPLSA NTPLPHVKQE ASELEDEGEE EERNGYIKQP KIQCFSMGVQ FPQLKTEHED
QMGGCRELHL APHYPQSLGQ EEGKSSQAQG LFQGALSLDG PQRTSQSRHE VDDSSKSPVD
PVSFKASSEP CRSSLEVSLN SPSAASSPGL MMSVSPVSSS SAPISPSLPN PPTTSTNHGL
DPNPMPPIQS PKPNRSVQRR TEKIANLNNI IHRLERAANR EETLEWEF
//