ID J9D9F0_EDHAE Unreviewed; 952 AA.
AC J9D9F0;
DT 31-OCT-2012, integrated into UniProtKB/TrEMBL.
DT 31-OCT-2012, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=EDEG_01594 {ECO:0000313|EMBL:EJW04114.1};
OS Edhazardia aedis (strain USNM 41457) (Microsporidian parasite).
OC Eukaryota; Fungi; Fungi incertae sedis; Microsporidia; Edhazardia.
OX NCBI_TaxID=1003232 {ECO:0000313|EMBL:EJW04114.1, ECO:0000313|Proteomes:UP000003163};
RN [1] {ECO:0000313|EMBL:EJW04114.1, ECO:0000313|Proteomes:UP000003163}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=USNM 41457 {ECO:0000313|EMBL:EJW04114.1,
RC ECO:0000313|Proteomes:UP000003163};
RA Liu Z.J., Shi F.L., Lu J.Q., Li M., Wang Z.L.;
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000003163}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=USNM 41457 {ECO:0000313|Proteomes:UP000003163};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Cuomo C.A., Sanscrainte N.D., Goldberg J.M., Heiman D., Young S., Zeng Q.,
RA Becnel J.J., Birren B.W.;
RT "Contrasting host-pathogen interactions and genome evolution in two
RT generalist and specialist microsporidian pathogens of mosquitoes.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJW04114.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFBI03000023; EJW04114.1; -; Genomic_DNA.
DR AlphaFoldDB; J9D9F0; -.
DR STRING; 1003232.J9D9F0; -.
DR VEuPathDB; MicrosporidiaDB:EDEG_01594; -.
DR HOGENOM; CLU_309494_0_0_1; -.
DR InParanoid; J9D9F0; -.
DR OrthoDB; 2908004at2759; -.
DR Proteomes; UP000003163; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR24208:SF166; LIM HOMEOBOX TRANSCRIPTION FACTOR 1 ALPHA, ISOFORM B; 1.
DR PANTHER; PTHR24208; LIM/HOMEOBOX PROTEIN LHX; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000003163}.
FT DOMAIN 31..87
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 33..88
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ SEQUENCE 952 AA; 109624 MW; B328AB2D5C465962 CRC64;
MKMDKRDSYF ESFQQTPEGL KNTYYNPFMV KHRRRTSKMQ LRVLEKTFET NVRPDANLRK
ILGEQLGMTP RSVQVWFQNR RAKIKKRKNE CEQRHPEKSR YSSTNISAIN GDNCVNSIVN
IGGSYSMNNN SVGSRMNDNH GYTEDGRAYN LSAESEYLYN NNIIQKNKIS NGIIDWKEMN
EQNDIEEWGQ MDFISSYNKN NTISSIDNIN MDNGMGINSV DNVYMENPMN TNNMGNYSST
INSYSNNINA FNSINSIPGN EYGNNLNANI NNMYNQGFDD GKMKNDNNSI GMNNKQMNYI
SSNNIMYNGY ASNIKNNDVS YNSSNFIRFD KRISGNTSSF KHHLNAQNRI IKSTINNFNA
DFTNSAYSSY EHVIGHKNSN SNNNIDTSNN ISDTAGSINN ESPQKIERCC SENADIRNTN
KLALSTSLDQ GLLNKLELND HSNNKIDSKV NTRESINDIN KDTYYSMISP KEKEDSNDYI
ISTTNENIQS NLNYNFKNID DESHNDIFSK QYNILSNHND NNHLFNSSPV MNASTRANNN
SLYRNRFPMY NQNDKSKYSY NNGNSIEEVF FRNTFKNQNY TDFDKEFGKN RNNIICNEED
KNYYTFSIDN NSSINTNNID IQNGNLNHDV YHINKIESRT DSFNNYSNIE SLEYSNLNNE
YHRRNANIFD TKEFAHPYKN NGNDTNRNVD DYLYTKDQSV NTNIDDDINV YFDNMNSNNI
QLCNNNMVDQ EYINTINTNS SIKKKQGKNN STKNVIGKKS YEEEIDLLKL SNDILCDSNN
LNNDSNINDF SNINSNTSDC ASKENVQKIE NYTQYFQNIN NHNKLHETSN QSSNINKQIP
FIENQSSLNN NNIYSTNITN NNIEESMKIN TLNINNNNKT TIGNVYETVS LYSLENTDIH
GSNEALDKLS TNSNVLINNN INVTDIKENN KYTSIQNLNN NFNGYKNDKN NS
//