ID A0A433U0U0_ELYCH Unreviewed; 583 AA.
AC A0A433U0U0;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EGW08_004795 {ECO:0000313|EMBL:RUS87420.1};
OS Elysia chlorotica (Eastern emerald elysia) (Sea slug).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Heterobranchia; Euthyneura; Panpulmonata; Sacoglossa; Placobranchoidea;
OC Plakobranchidae; Elysia.
OX NCBI_TaxID=188477 {ECO:0000313|EMBL:RUS87420.1, ECO:0000313|Proteomes:UP000271974};
RN [1] {ECO:0000313|EMBL:RUS87420.1, ECO:0000313|Proteomes:UP000271974}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=EC2010 {ECO:0000313|EMBL:RUS87420.1};
RC TISSUE=Whole organism of an adult {ECO:0000313|EMBL:RUS87420.1};
RA Cai H., Li Q., Fang X., Li J., Curtis N.E., Altenburger A., Shibata T.,
RA Feng M., Maeda T., Schwartz J.A., Shigenobu S., Lundholm N., Nishiyama T.,
RA Yang H., Hasebe M., Li S., Pierce S.K., Wang J.;
RT "A draft genome assembly of the solar-powered sea slug Elysia chlorotica.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family.
CC {ECO:0000256|ARBA:ARBA00005733}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RUS87420.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RQTK01000111; RUS87420.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A433U0U0; -.
DR STRING; 188477.A0A433U0U0; -.
DR Proteomes; UP000271974; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636:SF5; EYEGONE, ISOFORM A-RELATED; 1.
DR PANTHER; PTHR45636; PAIRED BOX PROTEIN PAX-6-RELATED-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00292; PAX; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Paired box {ECO:0000256|ARBA:ARBA00022724};
KW Reference proteome {ECO:0000313|Proteomes:UP000271974};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 18..136
FT /note="Paired"
FT /evidence="ECO:0000259|PROSITE:PS51057"
FT DOMAIN 288..348
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 290..349
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 27..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 380..406
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 436..484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 497..583
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..47
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..399
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 529..544
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 548..583
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 583 AA; 62754 MW; F5FFB611B8F7CAD8 CRC64;
MMDDTPLMLP YPRPLHTLHG GAPLLNFPAP HGAPVPPPLP PGPASGEGPP QGMATRDCLQ
RVWNCLYDPL SFRCSTIGGS KPKVATPSVV CKIEQFKNNS PTMFAWEIRD KLLAEGICNH
SNVPSVSSIN RILRNRAAER AANEYAKVAS QVLQPLYTSW WTGQAPPSTV STPLAPPKIS
LTPPTTPLIA SVGYRPVPPA YGATCLSPGG HAPSSTPLEM MAAAAVGMTP PLPLSRASFS
SPCSMPVSVY TNRISPTEAE EERAKMDSPD SDGAINISDP YLDCDGVQKL RRNRTTFSPS
QLEILEREFV KTHYPGVTTR EVLASKTGLS EARVQVWFSN RRAKWRRHQR LKLLQSTNPF
ALHYPPSMQH AIAEVAEREP MGRSRLSHGE AHADHRGHGA HSPGTAVGPH GFPIYLGYPR
VAYDMAAAPP SGFGHDLHNS SMSSGCSNQS PSPPPLPVRV ASSTPPHHPH HRATPEYSSV
TTTPEAISSQ LQITAASYQT SPPHPAAPLA IASPPMTIDA PPSSLPVVSP QEKRASSKPS
FSISEHAAFQ PDKRRGEQGG EGKQGEDDSE CNRCLIISHV DEN
//