ID Q9Y070_PERAM Unreviewed; 427 AA.
AC Q9Y070;
DT 01-NOV-1999, integrated into UniProtKB/TrEMBL.
DT 01-NOV-1999, sequence version 1.
DT 27-MAR-2024, entry version 128.
DE RecName: Full=Homeobox protein engrailed-like {ECO:0000256|RuleBase:RU510713};
GN Name=Pa-en2 {ECO:0000313|EMBL:CAB51042.1};
OS Periplaneta americana (American cockroach) (Blatta americana).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Polyneoptera; Dictyoptera; Blattodea; Blattoidea; Blattidae;
OC Blattinae; Periplaneta.
OX NCBI_TaxID=6978 {ECO:0000313|EMBL:CAB51042.1};
RN [1] {ECO:0000313|EMBL:CAB51042.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Whole embryo {ECO:0000313|EMBL:CAB51042.1};
RX PubMed=10712910; DOI=10.1016/S0960-9822(00)00361-4;
RA Marie B., Bacon J.P., Blagburn J.M.;
RT "Double-stranded RNA interference shows that Engrailed controls the
RT synaptic specificity of identified sensory neurons.";
RL Curr. Biol. 10:289-292(2000).
RN [2] {ECO:0000313|EMBL:CAB51042.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Whole embryo {ECO:0000313|EMBL:CAB51042.1};
RX PubMed=11180849; DOI=10.1007/s004270000082;
RA Marie B., Bacon J.P.;
RT "Two engrailed-related genes in the cockroach: cloning, phylogenetic
RT analysis, expression and isolation of splice variants.";
RL Dev. Genes Evol. 210:436-448(2000).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the Engrailed homeobox family.
CC {ECO:0000256|RuleBase:RU510713}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ243884; CAB51042.1; -; mRNA.
DR AlphaFoldDB; Q9Y070; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0007399; P:nervous system development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR019549; Homeobox-engrailed_C-terminal.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000747; Homeobox_engrailed.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR019737; Homoebox-engrailed_CS.
DR InterPro; IPR000047; HTH_motif.
DR PANTHER; PTHR24341; HOMEOBOX PROTEIN ENGRAILED; 1.
DR PANTHER; PTHR24341:SF6; HOMEOBOX PROTEIN INVECTED-RELATED; 1.
DR Pfam; PF10525; Engrail_1_C_sig; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00026; ENGRAILED.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00033; ENGRAILED; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}.
FT DOMAIN 325..385
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 327..386
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 164..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 305..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..36
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 427 AA; 47126 MW; 3B4505C6364F57BB CRC64;
MATTTLLLHN SHHHHPHSHH EHHKRQPEEE NSSCDGSDVS RHPNSPTSRA AIADDMAVTC
GDPRPPGLVD TSRIPRNSCS DDEDCCSNND EDELLSVGSE TPPPVGPITD HLISLTPSRI
HLQEDDDGDD DSDDKTEVGS TSDSCSRSSI SRCSLMTDCY STSSSITSPV PSKPEEIHPH
RQHSPSLFSR QIYRPPLQPQ QVSSLNHSAA PATRQNNNAV NNATNNVSMR ALKFSIDNIL
KPDFGRQTTI YNIPSLKKGS SGGSLSVRCS SGGPLQDAAI GSASSGSGSS QLLWPAWVYC
TRYSDRPSSG RSPRSRRMKR KDKKPEEKRP RTAFSGEQLA RLKHEFTENR YLTERRRTEL
ARELGLNEAQ IKIWFQNKRA KIKKASGQKN PLALQLMAQG LYNHSTIPMT REEEEQAAAA
EANAKKT
//