ID B4G2B2_DROPE Unreviewed; 1466 AA.
AC B4G2B2;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 24-JAN-2024, entry version 105.
DE SubName: Full=GL23881 {ECO:0000313|EMBL:EDW23957.1};
GN Name=Dper\GL23881 {ECO:0000313|EMBL:EDW23957.1};
GN ORFNames=Dper_GL23881 {ECO:0000313|EMBL:EDW23957.1};
OS Drosophila persimilis (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744};
RN [1] {ECO:0000313|EMBL:EDW23957.1, ECO:0000313|Proteomes:UP000008744}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MSH-3 / Tucson 14011-0111.49
RC {ECO:0000313|Proteomes:UP000008744};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH479179; EDW23957.1; -; Genomic_DNA.
DR RefSeq; XP_002012971.1; XM_002012935.1.
DR STRING; 7234.B4G2B2; -.
DR EnsemblMetazoa; FBtr0189496; FBpp0187988; FBgn0161471.
DR eggNOG; KOG1081; Eukaryota.
DR HOGENOM; CLU_252531_0_0_1; -.
DR OMA; MHADCLE; -.
DR PhylomeDB; B4G2B2; -.
DR Proteomes; UP000008744; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0005721; C:pericentric heterochromatin; IEA:EnsemblMetazoa.
DR GO; GO:0005700; C:polytene chromosome; IEA:EnsemblMetazoa.
DR GO; GO:0031490; F:chromatin DNA binding; IEA:EnsemblMetazoa.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15565; PHD2_NSD; 1.
DR CDD; cd15566; PHD3_NSD; 1.
DR CDD; cd15567; PHD4_NSD; 1.
DR CDD; cd15489; PHD_SF; 1.
DR CDD; cd20144; PWWP_NSD_rpt1; 1.
DR CDD; cd05838; PWWP_NSD_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 3.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 2.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008744};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 400..473
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1086..1148
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1217..1267
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1261..1386
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1394..1410
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 1..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 147..186
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 229..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 542..562
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 583..643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1416..1466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..26
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..161
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 234..271
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..318
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 627..643
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1449..1466
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1466 AA; 163109 MW; 5F579805027F8DAB CRC64;
MKISAGAHSE TDGVGAQSNL SFNADGTATG DAAAGNAESV ASQITEHPEK VKEKRRKRSK
IVAHDGDSVT EFTENDLNGY LSPKNAPFAI KKQQRLSGLN RKRDLINLQS ALSPKYIGYA
NANSPTPLSD SDDTVRTTRR RFTQAAALNS SNNNNHCELQ PRDNDAATES PKTLEGTPLL
DRSDSKNYIS PIDKLLMKNG ASSPNSTGFE TDPEDVAPLA ARLHLRRKLK RAAKTKSTSE
TVEPPEKVPS VKEEPMDITD LQPKEVSDAK IEDTVPFSTD VNSESESRAS SCDVVSSSRI
SLSNENAQDL PGEPTDTSRK TIESTDEPND PDPDPDPDSE SGFALPAATL TDMKSPGELS
PTAFRQPEDN GYVKSPSSVS LDSAKGLSIV TDPGWGTYQV GDLYWGKVFS YCFWPCMVCP
DPYGQIVGNL PAHPQRTSTD NAPLPIQVHV RFFADSGRRN WIKPENLLSF AGLKAYEEQR
EEVRIKYGNR SGKYRTMVPK RTKLDVWRQA IDEAQLVAQV PYSERLEKFY QIYESVVTIN
KQKRKRTNSM TTQDTSDVGS SLYDSTDNLH AKAHLQLMNV KRERSVTPVS PAFSPVKKSQ
MRTKRRKLSK GAEAETGGGT VPSESVAGAE KNITPSNSGG TYENPEFRQL FDAIQEYVMV
YRHDDKVEKV LRAVVGNIWS LKQIQLRELE RELAYGEMDQ EQLGSAVVER KRGTGTTQRL
SNRLKTLVER RSRTPVVTPS GTPTPTATST IGGAEMRLSE ISKPKKPVNR PIEEVVEDIF
QLDSKYLFRG LPRGTVCKYC FKTGSNLLQC SRTCHSWMHA DCLESKRNGL AASKKEGLKL
KALEDVSTSS STTPPVPAAE HITGDAPDAL ATIATAAPTA PTFEVICHEC NVNGEASCCI
CLQVLPPAYP SPSPGSPLKE GSSEAKEATP VDGTMIMCGQ NLCGRQYHAG CCKYWPQAIA
SSSLTRCPLH VCHTCVSDNP RKFLPVGSSK LTRCVKCPAS YHQDSRCIPA GSRMLTTTNL
ICPRHNVAKT DAHLNVLWCF ICVKGGELLC CETCPIAVHA GCRKVPIEKH ENYICEECES
GRLPLYGEIV WAKFNNFRWW PAIILPPTEI PNNILKKAHG ESEFVVRFFG THDHGWIPRR
RVYLYIEGDT GEKLKPRSQL HRKFYNGIEE ATRFLKITKA RRHEQMVARG IKVNPPPYVK
IKINKAVPPV KFVTNSEEHS TCDCRPEDEH PCGANSNCLN RMLFNECHPE YCRCGDRCEN
RMFETRKSPR MDVVYMNARG FGLVCREPIA EGDFIIEYVG EVINQEEFQR RMLRKQKDRD
ENFYFLGVEK EFIIDAGPKG NLARFMNHSC EPNCTSQKWT VNCTHRVGLF AIQDIPAETE
LTFNYLWDDL LNDKKKACHC GSERCSGEIG GKLKDKSSNV KLKQQQRSRG AAIRIHVPPA
KKAKKSVPKI KHVSADDELE EAKKDQ
//