ID A0A2U9BLP1_SCOMX Unreviewed; 416 AA.
AC A0A2U9BLP1;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Putative iroquois-class homeodomain protein IRX-1 {ECO:0000313|EMBL:AWP04881.1};
GN ORFNames=SMAX5B_016338 {ECO:0000313|EMBL:AWP04881.1};
OS Scophthalmus maximus (Turbot) (Psetta maxima).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Carangaria; Pleuronectiformes; Pleuronectoidei; Scophthalmidae;
OC Scophthalmus.
OX NCBI_TaxID=52904 {ECO:0000313|EMBL:AWP04881.1, ECO:0000313|Proteomes:UP000246464};
RN [1] {ECO:0000313|EMBL:AWP04881.1, ECO:0000313|Proteomes:UP000246464}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Martinez P.;
RT "Integrating genomic resources of turbot (Scophthalmus maximus) in depth
RT evaluation of genetic and physical mapping variation across individuals.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP026249; AWP04881.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U9BLP1; -.
DR STRING; 52904.ENSSMAP00000033731; -.
DR Proteomes; UP000246464; Chromosome 7.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR PANTHER; PTHR11211:SF13; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-1; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000246464};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 75..128
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 77..129
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 128..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 340..416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 128..153
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 154..173
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..195
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..390
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 416 AA; 45064 MW; 58F52D05585FFE39 CRC64;
MLGMYGSPWV AHNYSAFLPY SGATDLALIS QMGSQYELKD SPGPHPASLP VHAAQSFYPY
GQYPYGDPSR AKTATRETTS TLKAWLQEHQ KNPYPTKGEK IMLAIITRMT LTQVSTWFAN
ARRRLKKENK VTWGRSAEDR DGRIFSSDNE DEPGKNGSDD DDEDEEIDLE TVDIERPEEQ
RAAAAEEEEE QGSGKVEGEA GLSAREQQAS EPKSPLSAEG LRGVEAPISL NKSPVVKLAV
DHSPSRQECQ RPAQSKPKIW SLAETATAPD SVHKHSPAVH AHHPALASGG HPALLPGHGI
YTCQIGKLHN WANAAFLNAN SLLNMRSLLG GAPAGHLPLH GAVPSARHDA RPAAAAGLGA
SGTEDESDAE SSGSFSPKRD DEESDHRPDS LKSPFQLITD RPHHRTAPQR VLTTTL
//