ID A0A401QDD9_SCYTO Unreviewed; 803 AA.
AC A0A401QDD9;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=scyTo_0023671 {ECO:0000313|EMBL:GCB83376.1};
OS Scyliorhinus torazame (Cloudy catshark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Carcharhiniformes; Scyliorhinidae;
OC Scyliorhinus.
OX NCBI_TaxID=75743 {ECO:0000313|EMBL:GCB83376.1, ECO:0000313|Proteomes:UP000288216};
RN [1] {ECO:0000313|EMBL:GCB83376.1, ECO:0000313|Proteomes:UP000288216}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCB83376.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFAA01032188; GCB83376.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A401QDD9; -.
DR STRING; 75743.A0A401QDD9; -.
DR OMA; PLYLLEC; -.
DR Proteomes; UP000288216; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 2.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF1; ZINC FINGER HOMEOBOX PROTEIN 2; 1.
DR Pfam; PF00046; Homeodomain; 2.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00389; HOX; 2.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 2.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000288216};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 64..92
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 361..421
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 569..629
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 363..422
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 571..630
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 110..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 306..327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 342..363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 484..579
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..23
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..143
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 237..254
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 255..288
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..327
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 547..561
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:GCB83376.1"
SQ SEQUENCE 803 AA; 85913 MW; 489095F740778996 CRC64;
RGGEAEGGPP PAQPLPPPAL PAEAPRDATR ALLAHIGFEL IAQYNEGRQC VQTQAARLGL
AGRLRCDGCS KLFSNTLILK SHEEQVHGRL FPSEELDRYA RHYRESYDNL YPDQAASPGR
TSPVNPVPAP SPSLPPPDSS PVPSELPQAQ SEGDEEGPPR SWPLTGEEEG ALLQGPELLP
LGPWPETDPQ AVEIQAPGAL DALLPPGSEQ RGDAPSGEGE VAGTEPAAPR DCPTPTELEP
EEEEEEEEEE EPPASPPAPE PPLPPCSPSG ANPASLPDGP PLGPPAPDPQ HSHYLALRSH
LLGAQHPQPP RALFQPGANP PLPHPAPQAL KRKLEVERPP VGGLGHGGSE LTFGPGGEEH
PRDKRLRTTI MPEQLDVLYR RYLCDSNPTR KALEQISTQV GLKKRVVQVW FQNTRARERK
GQFRFVGGPP GAGAPPFPLG RYPSQSEAEA QLSPLYLLEC VPGPSPEALA PEEWRRDPFA
CQAGRIALPP SPPSPPPSSA GGRHDWSEGE GSDAAGEGGP SDGGPGPGRP RGDLSESSSQ
ADPASPCGYP PTGPPDAWLP LPAEPSDRQA PRRNRTQMSS LQLKVMKTCF QDYRTPTMLE
CQLLGDEIGL PKRVIQVWFQ NARAKDKKFK LHNAKAASSG APKGGCDLCG VKYGAYLAVR
NHVFSLPHIA KMKELVRGQL RRDKKYFAQK APAAPLLEEK EEAATPRAPF SSRPLAGQCP
LNEPPPFGNL AFPGLPGLPP ILLPGASGTP SGFTTLTAGE WLSIRSTYTW RTGAKLWAWK
CGCVVGALGI WEAVFRLANI LGS
//