GenomeNet

Database: UniProt
Entry: P28167
ID   ZFH2_DROME              Reviewed;        3005 AA.
AC   P28167; Q95SN9; Q9V4D7;
DT   01-OCT-1994, integrated into UniProtKB/Swiss-Prot.
DT   07-JUN-2004, sequence version 2.
DT   16-APR-2014, entry version 125.
DE   RecName: Full=Zinc finger protein 2;
DE   AltName: Full=Zinc finger homeodomain protein 2;
GN   Name=zfh2; Synonyms=zfh-2; ORFNames=CG1449;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC   Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC   Ephydroidea; Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RX   PubMed=1680376; DOI=10.1016/0925-4773(91)90048-B;
RA   Fortini M.E., Lai Z., Rubin G.M.;
RT   "The Drosophila zfh-1 and zfh-2 genes encode novel proteins containing
RT   both zinc-finger and homeodomain motifs.";
RL   Mech. Dev. 34:113-122(1991).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA   Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,
RA   Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,
RA   Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,
RA   Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA   Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA   Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA   Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA   Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA   de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA   Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA   Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA   Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA   Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J.,
RA   Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,
RA   Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA   Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z.,
RA   Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X.,
RA   Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA   Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA   Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA   Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA   Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA   Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA   Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA   Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA   Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,
RA   Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA   Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X.,
RA   Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [3]
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley;
RX   PubMed=12537572;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,
RA   Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,
RA   Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a
RT   systematic review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4]
RP   NUCLEOTIDE SEQUENCE OF 2146-2353.
RC   STRAIN=Crete24, Crete26, Crete30, Crete31, Crete35, Crete40, Crete42,
RC   Crete43, Crete44, Crete8, FTF1, FTF105, FTF14, FTF2, FTF20, FTF23,
RC   FTF26, FTF28, FTF5, FTF6, Zim11, Zim2, Zim30, and Zim53;
RA   Sheldahl L.A., Weinreich D.M., Rand D.M.;
RT   "Recombination, dominance and selection on amino acid polymorphism in
RT   the Drosophila genome: contrasting patterns on the X and 4th
RT   chromosomes.";
RL   Submitted (JUN-2003) to the EMBL/GenBank/DDBJ databases.
RN   [5]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 2830-3005.
RC   STRAIN=Berkeley; TISSUE=Head;
RX   PubMed=12537569;
RA   Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M.,
RA   George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H.,
RA   Rubin G.M., Celniker S.E.;
RT   "A Drosophila full-length cDNA resource.";
RL   Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN   [6]
RP   PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-1337; SER-2361;
RP   SER-2450; SER-2451; SER-2749 AND SER-2751, AND IDENTIFICATION BY MASS
RP   SPECTROMETRY.
RC   TISSUE=Embryo;
RX   PubMed=18327897; DOI=10.1021/pr700696a;
RA   Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT   "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL   J. Proteome Res. 7:1675-1682(2008).
CC   -!- FUNCTION: Involved in the development of the embryonic central
CC       nervous system.
CC   -!- SUBCELLULAR LOCATION: Nucleus (Probable).
CC   -!- TISSUE SPECIFICITY: Largely restricted to the CNS of late embryo.
CC   -!- SIMILARITY: Contains 16 C2H2-type zinc fingers.
CC   -!- SIMILARITY: Contains 3 homeobox DNA-binding domains.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAL28229.1; Type=Erroneous initiation;
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; M63450; AAA29051.1; -; mRNA.
DR   EMBL; AE014135; AAF59339.2; -; Genomic_DNA.
DR   EMBL; AY312816; AAQ67630.1; -; Genomic_DNA.
DR   EMBL; AY312817; AAQ67631.1; -; Genomic_DNA.
DR   EMBL; AY312818; AAQ67632.1; -; Genomic_DNA.
DR   EMBL; AY312819; AAQ67633.1; -; Genomic_DNA.
DR   EMBL; AY312820; AAQ67634.1; -; Genomic_DNA.
DR   EMBL; AY312821; AAQ67635.1; -; Genomic_DNA.
DR   EMBL; AY312822; AAQ67636.1; -; Genomic_DNA.
DR   EMBL; AY312823; AAQ67637.1; -; Genomic_DNA.
DR   EMBL; AY312824; AAQ67638.1; -; Genomic_DNA.
DR   EMBL; AY312825; AAQ67639.1; -; Genomic_DNA.
DR   EMBL; AY312826; AAQ67640.1; -; Genomic_DNA.
DR   EMBL; AY312827; AAQ67641.1; -; Genomic_DNA.
DR   EMBL; AY312828; AAQ67642.1; -; Genomic_DNA.
DR   EMBL; AY312829; AAQ67643.1; -; Genomic_DNA.
DR   EMBL; AY312830; AAQ67644.1; -; Genomic_DNA.
DR   EMBL; AY312831; AAQ67645.1; -; Genomic_DNA.
DR   EMBL; AY312832; AAQ67646.1; -; Genomic_DNA.
DR   EMBL; AY312833; AAQ67647.1; -; Genomic_DNA.
DR   EMBL; AY312834; AAQ67648.1; -; Genomic_DNA.
DR   EMBL; AY312835; AAQ67649.1; -; Genomic_DNA.
DR   EMBL; AY312836; AAQ67650.1; -; Genomic_DNA.
DR   EMBL; AY312837; AAQ67651.1; -; Genomic_DNA.
DR   EMBL; AY312838; AAQ67652.1; -; Genomic_DNA.
DR   EMBL; AY312839; AAQ67653.1; -; Genomic_DNA.
DR   EMBL; AY060681; AAL28229.1; ALT_INIT; mRNA.
DR   PIR; S33642; S33642.
DR   RefSeq; NP_524623.2; NM_079884.5.
DR   UniGene; Dm.3906; -.
DR   ProteinModelPortal; P28167; -.
DR   SMR; P28167; 556-669, 1067-1094, 1207-1233, 1432-1569, 1794-1859, 2159-2211, 2758-2818.
DR   BioGrid; 68628; 2.
DR   IntAct; P28167; 1.
DR   MINT; MINT-904703; -.
DR   PaxDb; P28167; -.
DR   PRIDE; P28167; -.
DR   EnsemblMetazoa; FBtr0089070; FBpp0088139; FBgn0004607.
DR   GeneID; 43795; -.
DR   KEGG; dme:Dmel_CG1449; -.
DR   CTD; 43795; -.
DR   FlyBase; FBgn0004607; zfh2.
DR   eggNOG; NOG301069; -.
DR   GeneTree; ENSGT00530000063717; -.
DR   InParanoid; P28167; -.
DR   KO; K09380; -.
DR   OrthoDB; EOG74J96X; -.
DR   PhylomeDB; P28167; -.
DR   GenomeRNAi; 43795; -.
DR   NextBio; 835886; -.
DR   PRO; PR:P28167; -.
DR   Bgee; P28167; -.
DR   GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR   GO; GO:0003677; F:DNA binding; TAS:FlyBase.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0000977; F:RNA polymerase II regulatory region sequence-specific DNA binding; IDA:FlyBase.
DR   GO; GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0007476; P:imaginal disc-derived wing morphogenesis; IMP:FlyBase.
DR   GO; GO:0007399; P:nervous system development; IEP:FlyBase.
DR   GO; GO:0035220; P:wing disc development; IMP:FlyBase.
DR   Gene3D; 1.10.10.60; -; 3.
DR   Gene3D; 3.30.160.60; -; 1.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR009057; Homeodomain-like.
DR   InterPro; IPR007087; Znf_C2H2.
DR   InterPro; IPR015880; Znf_C2H2-like.
DR   InterPro; IPR013087; Znf_C2H2/integrase_DNA-bd.
DR   Pfam; PF00046; Homeobox; 3.
DR   Pfam; PF00096; zf-C2H2; 3.
DR   SMART; SM00389; HOX; 3.
DR   SMART; SM00355; ZnF_C2H2; 15.
DR   SUPFAM; SSF46689; SSF46689; 3.
DR   PROSITE; PS00027; HOMEOBOX_1; 2.
DR   PROSITE; PS50071; HOMEOBOX_2; 3.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 8.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.
PE   1: Evidence at protein level;
KW   Complete proteome; DNA-binding; Homeobox; Metal-binding; Nucleus;
KW   Phosphoprotein; Reference proteome; Repeat; Zinc; Zinc-finger.
FT   CHAIN         1   3005       Zinc finger protein 2.
FT                                /FTId=PRO_0000047242.
FT   ZN_FING     133    156       C2H2-type 1.
FT   ZN_FING     559    582       C2H2-type 2.
FT   ZN_FING     614    638       C2H2-type 3.
FT   ZN_FING     732    756       C2H2-type 4.
FT   ZN_FING     897    916       C2H2-type 5; degenerate.
FT   ZN_FING     940    964       C2H2-type 6.
FT   ZN_FING     999   1023       C2H2-type 7.
FT   ZN_FING    1074   1098       C2H2-type 8.
FT   ZN_FING    1210   1233       C2H2-type 9.
FT   ZN_FING    1341   1365       C2H2-type 10.
FT   ZN_FING    1438   1462       C2H2-type 11.
FT   ZN_FING    1477   1500       C2H2-type 12; degenerate.
FT   ZN_FING    1513   1535       C2H2-type 13.
FT   ZN_FING    1541   1564       C2H2-type 14.
FT   DNA_BIND   1797   1856       Homeobox 1.
FT   DNA_BIND   2154   2213       Homeobox 2.
FT   ZN_FING    2234   2256       C2H2-type 15.
FT   ZN_FING    2371   2393       C2H2-type 16.
FT   DNA_BIND   2760   2819       Homeobox 3.
FT   MOD_RES    1337   1337       Phosphothreonine.
FT   MOD_RES    2361   2361       Phosphoserine.
FT   MOD_RES    2450   2450       Phosphoserine.
FT   MOD_RES    2451   2451       Phosphoserine.
FT   MOD_RES    2749   2749       Phosphoserine.
FT   MOD_RES    2751   2751       Phosphoserine.
FT   CONFLICT    230    230       V -> D (in Ref. 1; AAA29051).
FT   CONFLICT    401    401       N -> H (in Ref. 1; AAA29051).
FT   CONFLICT   3001   3005       VIGGK -> GKSFGKIQKSST (in Ref. 5).
SQ   SEQUENCE   3005 AA;  332019 MW;  C3C60179AB1B2184 CRC64;
     MSSFDVETFN GKIVYNLDGS AHIIATDNTN GGGSGSGQNC YGSTTNSLKN LSKDKGRGQE
     EKDIEHPSQY HREQSDNKRQ EEAVDNRPGV ESLGSACYKS SPKIHSFRVV SAQDANSTCQ
     DQIRAFKIQK PILMCFICKL SFGNVKSFSL HANTEHRLNL EELDQQLLNR EYSSAIIQRN
     MDEKPQISFL QPLANNDASA DTNDTEKLQT ATEGSDATLP SSPQPVFRNV SELEPENKQE
     TEQNRLLNQD REQEPESDQH TSSSKMAAPS AYIPLSSPKV AGKLTVKFGS LNSATAKTNN
     LSKVSSTSSP PSTYASGEVL SPSTDNISNH KSTHCNQETE PPSSSSSEVE MKIGSMSTSP
     QTNDSDVPCS GFLQMQHMTT GGAYTPQVSS FHASLAALAA NESNDNRVKL ITEFLQQQLQ
     QHQSSLFPSP CPDHPDLNGV DCKTCELLDI QQRSKSPSSS HHQFSQSLPQ LQIQSQPQQT
     PHRSPCSNSV ALPVSPSASS VASVGNASTA TSSFTIGACS EHINGRPQGV DCARCEMLLN
     SARLNSGVQM STRNSCKTLK CPQCNWHYKY QETLEIHMRE KHPDGESACG YCLAGQQHPR
     LARGESYSCG YKPYRCEICN YSTTTKGNLS IHMQSDKHLN NMQELNSSQN MVAAAAAAAV
     TGKLLLSSSS PQVTAACPSN SGSGAGSGSS NIVGGTASLS GNATPSVTGA NSSNANAGSN
     TNNAGTKPKP SFRCDICSYD TSVARNLRIH MTSEKHTHNM AVLQNNIKHI QAFNFLQQQQ
     QSGTGNIASH SSGSFMPEVA LADLAYNQAL MIQLLHQQQQ HQQSANTKLS PSSSPVSTPD
     QFSFSPKPIK LNHGTGAAMG IGMAMGMGMS HSNEVSCELS GDPHPLTKTD KWPMAFYSCL
     VCDCYSTNNL DDLNQHLLLD RSRQSSSASS EIMVIHNNNY ICRLCNYKTN LKANFQLHSK
     TDKHLQKLNF INHIREGGPQ NEYKMQYQQQ QLAANVVQLK CNCCDFHTNS IQKLSLHTQQ
     MRHDTMRMIF QHLLYIVQQS EMHNKSSGSA EDDPQCACPD EDQQLQLQSS KKLLLCQLCN
     FTAQNIHEMV QHVKGIRHLQ VEQFICLQRR SENQEIPALN EVFKVTEWVM ENEDVSLAPG
     LNLARTTTND ATTDASYAAA SSAAVPAIPD VSMFSPTSPS SCATSCDKNL SQIVLPNVNN
     LGSGVPTTVF KCNLCEYFVQ SKSEIAAHIE TEHSCAESDE FITIPTNTAA LQAFQTAVAA
     AALAAVHQRC AVINPPTQDT VDEDKDLDTN VSDGPVGIKQ ERLEQEVDRT TSMDVTKDLA
     SQATDFGAPE SPKVAETEVG VQCPLCLENH FREKQYLEDH LTSVHSVTRD GLSRLLLLVD
     QKALKKESTD IACPTDKAPY ANTNALERAP TPIENTCNVS LIKSTSANPS QSVSLQGLSC
     QQCEASFKHE EQLLKHAQQN QHFSLQNGEY LCLAASHISR PCFMTFRTIP TMISHFQDLH
     MSLIISERHV YKYRCKQCSL AFKTQEKLTT HMLYHSMRDA TKCSFCQRNF RSTQALQKHM
     EQAHAEDGTP STRTNSPQTP MLSTEETHKH LLAESHAVER EVSGSDVSPI ELETHLNKET
     RHLSPTPMSL DSQSHQKHLA TFAALLKQQQ CNSDAGGLHP EALSMSTGEM PPQLQGLQNL
     QHIQQHFGAV AAAAGLPINP VDMLNIMQFH HLMSLNFMNL APPLVFGANA AGNAVSGPSA
     LNNSITTSTA TSASGLGDTH LTSGVSSIPV DSGKATAVPP QTQLNANANS QQLASNQKRA
     RTRITDDQLK ILRAHFDINN SPSEESIMEM SQKANLPMKV VKHWFRNTLF KERQRNKDSP
     YNFNNPPSTT LNLEEYERTG QAKVTPLNDT CSVAVTGPMT SSTISLPPSG NINLSSKENA
     TSKVLAAGKA NASGPVTFSA TVPVSTPLSR PESTNSSGNI SDYIGNNIFF GQLGSKEQIL
     PYSLDGQIKS EPQDDMIGAT DFAYQTKQHS SFSFLKQQQD LVDPPEQCLT NQNADTAQDQ
     SLLAGSSLAS NCQSQQQINI FETKSESGSS DVLSRPPSPN SGAAGNVYGS MNDLLNQQLE
     NMGSNMGPPK KMQIVGKTFE KNVAPMVTSG SVSTQFESNS SNSSSSSSST SGGKRANRTR
     FTDYQIKVLQ EFFENNSYPK DSDLEYLSKL LLLSPRVIVV WFQNARQKQR KIYENQPNNT
     LFENEETKKQ NINYACKKCN LVFQRYYELI RHQKNHCFKE ENNKKSAKAQ IAAAQIAQNL
     SSEDSNSSMD IHHVGICPPG SAVASHTLST PGSAAPLPGQ YTQHSFGALP SPQHLFAKSS
     SLTDFSPSTT PTPPQRERSN SLDQIQRPPK FDCDKCELNF NQLEKLREHQ LLHLMNPGNI
     CSDVGQNSNP EANFGPFGSI LQSLQQAAAQ QQQQHHQQPP TKKRKYSDCS SNADEMQSLS
     ELEASQKKHE YLYKYFMQNE TSQEVKQQFL MQQQQKKLEQ GNECDFELDF LTNFYQQNEL
     KKVSNYDFLL QYYRTHEEAK SSQQHTFSSS KKPTIEFLLQ YYQLNESKKF FQLVASPQII
     PDVPGYKPSL RIPKSTSDEA PYIGETSLEQ ATELQREKQD EQLRIDRPSE ENDLSMNKNK
     VENINNNNIN VDQSNLTETN GGVPSVETKE ECTQESSLIA MDDENKYLCT RSKQKDDKEK
     SHYLHNLEDF LDATMIENNS QTLTFNDDEK ACQKDELTQN SNAIEKRSSV SPVNVSSKQN
     KRLRTTILPE QLNFLYECYQ SESNPSRKML EEISKKVNLK KRVVQVWFQN SRAKDKKSRN
     QRHYAHISDD NSYDGSSGKE VYSDLRSNGI TVDTDLETNL QDCQLCQVTQ VNIRKHAFSV
     EHISKMKKLL EQTTELYAQS NGSGSEDNDS DREKRFYNLS KAFLLQHVVT NATSHAIHTA
     RQDSDVIAEG NCILNYDTNG GDSKSHVQHN LPNEVVSEDA RKIAGNQELM QQLFNRNHIT
     VIGGK
//