ID A0A401PLT1_SCYTO Unreviewed; 452 AA.
AC A0A401PLT1;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 03-MAY-2023, entry version 15.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=scyTo_0003142 {ECO:0000313|EMBL:GCB74058.1};
OS Scyliorhinus torazame (Cloudy catshark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Carcharhiniformes; Scyliorhinidae;
OC Scyliorhinus.
OX NCBI_TaxID=75743 {ECO:0000313|EMBL:GCB74058.1, ECO:0000313|Proteomes:UP000288216};
RN [1] {ECO:0000313|EMBL:GCB74058.1, ECO:0000313|Proteomes:UP000288216}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCB74058.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFAA01000838; GCB74058.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A401PLT1; -.
DR STRING; 75743.A0A401PLT1; -.
DR OMA; NQKPKIW; -.
DR Proteomes; UP000288216; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR PANTHER; PTHR11211:SF14; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-3; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000288216}.
FT DOMAIN 113..166
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 115..167
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 167..222
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 263..338
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..191
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..222
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..326
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 452 AA; 49509 MW; 3A180C2FD01D654D CRC64;
MSFPQLGYQY IAATRSLYPD RQGISSSRAG SDLTAPGAAS TVLTMYGSPY AAQGYGAFLP
YAADLPMFPQ LGAQYELKDS PGVQHPAFPH HHAAYYPYGQ YQFGDPSRPK NATRESTSTL
KAWLNEHRKN PYPTKGEKIM LAIITKMTLT QVSTWFANAR RRLKKENKMT WTPRNRTDEE
GNSYGSEPDL EGEKKEDDEE IDLENIDTEN MESKEDEDEQ DADLNLDCKL DGRSDSEMSD
AFDELNGSED GFLKSVVKGS RLAEEKRGDL SPEISGQSSG DQGKNGAERK SPPVSPPPES
SLSSANQKPK IWSLAETATT PDSPRRSPHI SSPPAGAAAS SFLGQHRLFA CPMGKFQNWT
NRTYSHHPLA LINSNHFLGL SAGQAIPAAG VPAFCTVRGS EERAQSTEPT VTDRSSALEI
EKKLVKTAFQ PVQRRPQNQL DAAMVLSALS SA
//