ID A0A267G7U3_9PLAT Unreviewed; 502 AA.
AC A0A267G7U3;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=BOX15_Mlig022963g2 {ECO:0000313|EMBL:PAA82103.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA82103.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA82103.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA82103.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA82103.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/MEIS homeobox family.
CC {ECO:0000256|ARBA:ARBA00009661}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA82103.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01000494; PAA82103.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267G7U3; -.
DR STRING; 282301.A0A267G7U3; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR032453; PKNOX/Meis_N.
DR PANTHER; PTHR11850:SF102; HOMEOBOX PROTEIN HOMOTHORAX; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF16493; Meis_PKNOX_N; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000215902}.
FT DOMAIN 335..398
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 337..399
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..47
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 264..338
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 400..427
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..278
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..337
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 400..415
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 502 AA; 53766 MW; BD0481C54578DFB6 CRC64;
MTAPQQQQTV DRVDVPQQQQ PRQENSSSSQ TASVDASSSA SAESGVQSQH IELSHNLICS
HPLYPLLSMV FEKCELATCQ TNRAKPGSSA DDGSSASAAA SSDLQGAAAA GGDGSEGDVC
SSLSFDEDIA AFARELNTAN GGPLTADEEL DSLMIQAIQV LRLHLLELEK VHELCDNFCT
RYITCLKGKM TVSEVDLNSA PPPQEQQQQQ QMKFEAEQQA APPLEAAHLH IQQHQPPLSN
SEQPPIVLPM YHHQPPVPLL AVVPPPPPPP PPLLQGATPA ARPMPAETGP MLAESFEASL
GSGSGAASQD DETGLDVDCE KRGGDSGGEE AEARRRRQRK RGIFPKCATN IMRAWLFQNL
SHPYPSEEQK KQLSGDTGLT ILQVNNWFIN ARRRIVQPMV DQSGQSGTSD SSADAAANMP
PLMEHGQTPR QLRYPASLPA LHAAAASCFP GYPHPGQAMP AAAAGFQPQP HLPELHRQPQ
HILLSTDSVG QYETSATAKF AN
//