ID W6UNN2_ECHGR Unreviewed; 1051 AA.
AC W6UNN2;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Zinc finger protein 2 {ECO:0000313|EMBL:EUB62843.1};
GN ORFNames=EGR_02284 {ECO:0000313|EMBL:EUB62843.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB62843.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB62843.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB62843.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000010; EUB62843.1; -; Genomic_DNA.
DR AlphaFoldDB; W6UNN2; -.
DR STRING; 6210.W6UNN2; -.
DR EnsemblMetazoa; XM_024491533.1; XP_024354039.1; GeneID_36337999.
DR OMA; TICECEN; -.
DR OrthoDB; 4180951at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 3.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF3; ZINC FINGER PROTEIN 2; 1.
DR Pfam; PF00046; Homeodomain; 3.
DR SMART; SM00389; HOX; 3.
DR SMART; SM00355; ZnF_C2H2; 5.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 3.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 3.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 339..364
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 624..684
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 721..781
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 905..965
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 626..685
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 723..782
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 907..966
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 104..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 687..724
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 836..865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 881..908
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 693..708
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 850..865
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1051 AA; 118077 MW; 4A863F6ECA352AD7 CRC64;
MNLPMVEPSV ANFNAERAHV MVSPPAIPEM ETGNSGCIYC LTKVLHPRLG RGESYACGYK
PYRCEICNYS TVTKGNLAIH EQSDRHLNNV QEYDQHQKLQ CVHKPSEQDS VNSSVSHPPA
SANQSSPPTM SLFPQTYLAF LEVFSQKGGT CPMDDFSTDL LEVADHTFFC QICGVFGTDS
VEELISHAEQ NRIPKNLDLA NQQLTTHTSG AWHCNLCSYR SPLKANFQLH CKTEKHAQRL
SLLLHVWEGK EGRLGASLPT NDLLLSSTSK NSVYCQLRCL PCGFFTCSVH KMRVHCQTPM
HGFLASAFGA VVRRRSQLKL SLMTASDGGS VMNGARVIYA CRKCEMTFTS LTGLMQHFQS
ADFHETVSKK FNLRRFNSIV LNTSLDDEGS RIRRHSAPPT INAEFSSTSD QSRKRSIEYD
NGGNVSAMND AQMPQSLLAG ATCQTLTDGE NAHCRMHITT SPLSSWLNCV NLQNSELLQG
LARFIDEQKE QTSAPQTCDS LFAWFGADNT LCSFLISWLR HQFPQMDGNM SDEFVAFILE
LIAQQPNVLG RFLDLRELQQ RQAFSTCHQC SPPRSFLLPS STSLHFNIHH NDELPALVLE
AVKKMENTVI EVVRALQPQQ FQLPLMKGGR TRLSVNHLEV FRSSFCSSNQ ITEATIAEIC
KKTGLEEKAV KHWFRGTLFK GRQRFKEDSS NLDIPQPPIN QASATEHSDE VRSVGRSTKP
TSTKRFRTPI SSIQQAVLLQ YFQADQNPSR RQMDIISSEV NLPKRVVQVW FQNARSRERR
MFIKEASAQE SPFNELKCLV SGANSTNTLP EQTPVDKLQQ IFNQILTQMP PLGIIQPPAS
APIQPPVTSI PQQLPSQSSE NYEMDSPLDL STVSYRSNDY QSSTLQNLSP HAGSAASANE
TRSDSFCRRN RTSISSTQAR FMQWFFQHHK TPTICECENI GRAIGLSRRV VQVWFQNQRA
KKKKLARSTA ICSEASFHQP ESESQFLLEG NECKLCNVKI RRMSEEDTAA VTEHIFSTVH
IDRLFATICQ GDLWSAAVER QQNCPPEPKH E
//