GenomeNet

Database: UniProt
Entry: W6UBD5_ECHGR
LinkDB: W6UBD5_ECHGR
Original site: W6UBD5_ECHGR 
ID   W6UBD5_ECHGR            Unreviewed;       273 AA.
AC   W6UBD5;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   SubName: Full=Homeobox protein EgHBX4 {ECO:0000313|EMBL:EUB57866.1};
GN   ORFNames=EGR_07228 {ECO:0000313|EMBL:EUB57866.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC   Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB57866.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB57866.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EUB57866.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APAU02000073; EUB57866.1; -; Genomic_DNA.
DR   AlphaFoldDB; W6UBD5; -.
DR   STRING; 6210.W6UBD5; -.
DR   EnsemblMetazoa; XM_024496477.1; XP_024349062.1; GeneID_36342943.
DR   OMA; MPYRQSS; -.
DR   OrthoDB; 5400206at2759; -.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR000047; HTH_motif.
DR   PANTHER; PTHR45793; HOMEOBOX PROTEIN; 1.
DR   PANTHER; PTHR45793:SF5; HOMEOTIC PROTEIN OCELLILESS; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00031; HTHREPRESSR.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT   DOMAIN          206..266
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        208..267
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..56
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..17
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   273 AA;  30715 MW;  5345D23F583C6968 CRC64;
     MRQFASQQRS PYRPNGGFVA SSQYDAVGAP DFLTSRESPQ STPPESSAPE MDMPYRQSSG
     SYCGDDGRNE MLPGAVLGTS FQLPHSSPPL VDPPPLPKST WNTVQEVSRT RSGSMPRIQS
     AFPNHVVHSF NSSLSHAPRP TLSHAAIIHH LQQQYRHCQH QQLIPFMSVT SMPSQQMQHP
     LTKSGMESEA GISQDVACNA ESRAQAPSRR ERTIYTPEQL EAMEEVFGVN RYPDVSMREE
     LASRLGINES KIQVWFKNRR AKLRNLERSR RRE
//
DBGET integrated database retrieval system