GenomeNet

Database: UniProt
Entry: W6UNN2_ECHGR
ID   W6UNN2_ECHGR            Unreviewed;      1051 AA.
AC   W6UNN2;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   SubName: Full=Zinc finger protein 2 {ECO:0000313|EMBL:EUB62843.1};
GN   ORFNames=EGR_02284 {ECO:0000313|EMBL:EUB62843.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC   Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB62843.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB62843.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EUB62843.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APAU02000010; EUB62843.1; -; Genomic_DNA.
DR   AlphaFoldDB; W6UNN2; -.
DR   STRING; 6210.W6UNN2; -.
DR   EnsemblMetazoa; XM_024491533.1; XP_024354039.1; GeneID_36337999.
DR   OMA; TICECEN; -.
DR   OrthoDB; 4180951at2759; -.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 3.
DR   Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR036236; Znf_C2H2_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR   PANTHER; PTHR45891:SF3; ZINC FINGER PROTEIN 2; 1.
DR   Pfam; PF00046; Homeodomain; 3.
DR   SMART; SM00389; HOX; 3.
DR   SMART; SM00355; ZnF_C2H2; 5.
DR   SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 3.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 3.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT   DOMAIN          339..364
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          624..684
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          721..781
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          905..965
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        626..685
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        723..782
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        907..966
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          104..128
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          687..724
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          836..865
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          881..908
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        693..708
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        850..865
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1051 AA;  118077 MW;  4A863F6ECA352AD7 CRC64;
     MNLPMVEPSV ANFNAERAHV MVSPPAIPEM ETGNSGCIYC LTKVLHPRLG RGESYACGYK
     PYRCEICNYS TVTKGNLAIH EQSDRHLNNV QEYDQHQKLQ CVHKPSEQDS VNSSVSHPPA
     SANQSSPPTM SLFPQTYLAF LEVFSQKGGT CPMDDFSTDL LEVADHTFFC QICGVFGTDS
     VEELISHAEQ NRIPKNLDLA NQQLTTHTSG AWHCNLCSYR SPLKANFQLH CKTEKHAQRL
     SLLLHVWEGK EGRLGASLPT NDLLLSSTSK NSVYCQLRCL PCGFFTCSVH KMRVHCQTPM
     HGFLASAFGA VVRRRSQLKL SLMTASDGGS VMNGARVIYA CRKCEMTFTS LTGLMQHFQS
     ADFHETVSKK FNLRRFNSIV LNTSLDDEGS RIRRHSAPPT INAEFSSTSD QSRKRSIEYD
     NGGNVSAMND AQMPQSLLAG ATCQTLTDGE NAHCRMHITT SPLSSWLNCV NLQNSELLQG
     LARFIDEQKE QTSAPQTCDS LFAWFGADNT LCSFLISWLR HQFPQMDGNM SDEFVAFILE
     LIAQQPNVLG RFLDLRELQQ RQAFSTCHQC SPPRSFLLPS STSLHFNIHH NDELPALVLE
     AVKKMENTVI EVVRALQPQQ FQLPLMKGGR TRLSVNHLEV FRSSFCSSNQ ITEATIAEIC
     KKTGLEEKAV KHWFRGTLFK GRQRFKEDSS NLDIPQPPIN QASATEHSDE VRSVGRSTKP
     TSTKRFRTPI SSIQQAVLLQ YFQADQNPSR RQMDIISSEV NLPKRVVQVW FQNARSRERR
     MFIKEASAQE SPFNELKCLV SGANSTNTLP EQTPVDKLQQ IFNQILTQMP PLGIIQPPAS
     APIQPPVTSI PQQLPSQSSE NYEMDSPLDL STVSYRSNDY QSSTLQNLSP HAGSAASANE
     TRSDSFCRRN RTSISSTQAR FMQWFFQHHK TPTICECENI GRAIGLSRRV VQVWFQNQRA
     KKKKLARSTA ICSEASFHQP ESESQFLLEG NECKLCNVKI RRMSEEDTAA VTEHIFSTVH
     IDRLFATICQ GDLWSAAVER QQNCPPEPKH E
//