ID   A0A060X736_ONCMY        Unreviewed;       487 AA.
AC   A0A060X736;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   24-JAN-2024, entry version 36.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN   ORFNames=GSONMT00063389001 {ECO:0000313|EMBL:CDQ75438.1};
OS   Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC   Salmonidae; Salmoninae; Oncorhynchus.
OX   NCBI_TaxID=8022 {ECO:0000313|EMBL:CDQ75438.1, ECO:0000313|Proteomes:UP000193380};
RN   [1] {ECO:0000313|EMBL:CDQ75438.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24755649; DOI=10.1038/ncomms4657;
RA   Berthelot C., Brunet F., Chalopin D., Juanchich A., Bernard M., Noel B.,
RA   Bento P., Da Silva C., Labadie K., Alberti A., Aury J.M., Louis A.,
RA   Dehais P., Bardou P., Montfort J., Klopp C., Cabau C., Gaspin C.,
RA   Thorgaard G.H., Boussaha M., Quillet E., Guyomard R., Galiana D., Bobe J.,
RA   Volff J.N., Genet C., Wincker P., Jaillon O., Roest Crollius H.,
RA   Guiguen Y.;
RT   "The rainbow trout genome provides novel insights into evolution after
RT   whole-genome duplication in vertebrates.";
RL   Nat. Commun. 5:3657-3657(2014).
RN   [2] {ECO:0000313|EMBL:CDQ75438.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Genoscope - CEA;
RL   Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC   -!- SIMILARITY: Belongs to the TALE/MEIS homeobox family.
CC       {ECO:0000256|ARBA:ARBA00009661}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FR905055; CDQ75438.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A060X736; -.
DR   STRING; 8022.A0A060X736; -.
DR   PaxDb; 8022-A0A060X736; -.
DR   Proteomes; UP000193380; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR008422; Homeobox_KN_domain.
DR   InterPro; IPR032453; PKNOX/Meis_N.
DR   PANTHER; PTHR11850:SF53; HOMEOBOX PROTEIN PKNOX2; 1.
DR   PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR   Pfam; PF05920; Homeobox_KN; 1.
DR   Pfam; PF16493; Meis_PKNOX_N; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000193380}.
FT   DOMAIN          284..347
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        286..348
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..56
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          349..404
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          420..459
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        8..56
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        384..404
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        425..448
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   487 AA;  54274 MW;  D9CF05728D7C53B8 CRC64;
     MMQHVSPALT MMATQSVPPP SYQESQQMTG TTQQNSKAQQ VHISASSETG NTPINVTLDP
     QAQLESDKRA VYRHPLFPLL ALLFEKCEQA TQGSECITSA SFDVDIENFV HQQEQDHKPF
     FSEDPDLDNL MVKAIQVLRI HLLELEKVNE LCKDFCNRYI TCLKTKMHSD NLLRNDLGGP
     YSPSHTSLSL QQDLLQTSSP SMTSVSSSVN SSGIMMPTGT LQQGNIAMTT INSQVVSGGT
     LYQPVAMVTS QGQVLTQGLS QGTIQIQNSQ VNLDLASLLD SDDKKHKNKR GVLPKHATNI
     MRSWLFQHLM HPYPTEDEKR QIAGQTNLTL LQVNNWFINA RRRILQPMLD ASNPDPAPKA
     KKMKSQHRPT QRFWPDSIVA GVLQTHGPHS NNADNPLGLD SLQPLSSDSA TLAMQQAMLG
     GTDDSMDGTE EEEEEEEEEE EEEEEEESEG GRRISGGFEE TVKTNFNHLF VLNEIHKIKP
     PFIRCDV
//