ID B4KTZ9_DROMO Unreviewed; 1442 AA.
AC B4KTZ9;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 27-MAR-2024, entry version 97.
DE RecName: Full=Peregrin {ECO:0008006|Google:ProtNLM};
GN Name=Dmoj\GI20042 {ECO:0000313|EMBL:EDW08576.1};
GN ORFNames=Dmoj_GI20042 {ECO:0000313|EMBL:EDW08576.1};
OS Drosophila mojavensis (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila.
OX NCBI_TaxID=7230 {ECO:0000313|EMBL:EDW08576.1, ECO:0000313|Proteomes:UP000009192};
RN [1] {ECO:0000313|EMBL:EDW08576.1, ECO:0000313|Proteomes:UP000009192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tucson 15081-1352.22 {ECO:0000313|Proteomes:UP000009192};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH933808; EDW08576.1; -; Genomic_DNA.
DR RefSeq; XP_002004641.1; XM_002004605.2.
DR SMR; B4KTZ9; -.
DR EnsemblMetazoa; FBtr0170767; FBpp0169259; FBgn0142778.
DR GeneID; 6578733; -.
DR KEGG; dmo:Dmoj_GI20042; -.
DR eggNOG; KOG0955; Eukaryota.
DR HOGENOM; CLU_003589_1_0_1; -.
DR InParanoid; B4KTZ9; -.
DR OMA; TRHKMRE; -.
DR OrthoDB; 163389at2759; -.
DR PhylomeDB; B4KTZ9; -.
DR Proteomes; UP000009192; Unassembled WGS sequence.
DR GO; GO:0070776; C:MOZ/MORF histone acetyltransferase complex; IEA:EnsemblMetazoa.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0010698; F:acetyltransferase activator activity; IEA:EnsemblMetazoa.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15670; ePHD_BRPF; 1.
DR CDD; cd15572; PHD_BRPF; 1.
DR CDD; cd05839; PWWP_BRPF; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 2.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR019542; Enhancer_polycomb-like_N.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR013087; Znf_C2H2_type.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR13793:SF164; BROMODOMAIN-CONTAINING PROTEIN, 140KD, ISOFORM A; 1.
DR PANTHER; PTHR13793; PHD FINGER PROTEINS; 1.
DR Pfam; PF00439; Bromodomain; 1.
DR Pfam; PF10513; EPL1; 1.
DR Pfam; PF13831; PHD_2; 1.
DR Pfam; PF00855; PWWP; 1.
DR Pfam; PF13832; zf-HC5HC2H_2; 1.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00297; BROMO; 1.
DR SMART; SM00249; PHD; 2.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF47370; Bromodomain; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS50812; PWWP; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|ARBA:ARBA00023117, ECO:0000256|PROSITE-
KW ProRule:PRU00035}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000009192};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 23..54
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 271..321
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 325..445
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT DOMAIN 616..686
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 1317..1390
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 43..108
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 787..932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1002..1037
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1079..1293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..854
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 861..894
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 915..932
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1083..1112
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1123..1146
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1147..1172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1173..1196
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1202..1216
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1261..1288
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1442 AA; 159246 MW; C494D343F50D98CF CRC64;
MGLDFDVVEY CKGVKSKQSQ PPFACPVRDC DRSYKTLVGL QGHLTNYDHD NPQPLTPILK
PNRKRARSSR ALHSTPKDNG SGGGGGGGGN SEGENGCGNG RSTTNPESLV SYNEQEGTVT
FNIDGKSVRL GIDEPLPLVD DEEFAELVER GCILNADAPP LEENAAWGQV QVPVANVREI
NDYNVPDAPP RPLAYYRFIE KSAEELDGEI EYDVDEEDTA WLEHMNELRA KQNLNAVSID
TMELLMDRLE KESHFQAAAN GTPTGVEVDD DAVCCICLDG ECQNTNVILF CDMCNLAVHQ
DCYGVPYIPE GQWLCRRCLQ SPSKAVNCVL CPNAGGAFKQ TDHGQWAHVV CALWIPEVRF
ANTVFLEPID SIETIPAARW RLTCYVCKEK GLGACIQCHR NSCYAAFHVT CAQQAGLYMT
MDTVKDGHND SSVHVQKFAY CHAHTPADAK LKTNVPDFED TRLKMREARK ALAKKRSTAP
VVLIPTIPPD RVHEIAGMVH MQKKKEFLDR IIAYWTLKRH YRNGVPLLRR LQSQGNNHGV
IQRNGIEGSP DTKELYRQLK YWQCLRQDLE RARLLCELVR KREKLKVAFV KISEEVVMLQ
LNPLESALSK LLDALETRDT MEIFREPVNT NEVPDYMDIV KQPMDLGTMR AKLKDCRYTK
LEQLEADFDL MIQNCLAYNN KDTVFYRAGI RMRDQAAPLF VQLRKELQRD GLLERGQRIH
VDHVEGEVEQ ELRQLLSAPP SEDVVQKLLI LADKSQVLKN PSYRTKKIKQ IRLEINRMRK
TLQKARFAAR YASHANSQSD DDNEETRVKP SKKRMRKRLN SSAMDLDESQ DVPQSQQHDE
DEDDEDDSDD DSMADDNVSK DTGHMVQTPP CSPVKSLNNS SSPVGINRRT AILLTRKSQA
ALKRPSEPLT TPVKEEHHNS QSLSTQSTSA SSSSTITAAT AASSAMNASL HAATATPTAM
SSFALTPHNS TASAVGMGVA GSSLTSAALS MSNKLNLSLG AGLGMSKSPK RPPRFRRLPS
TSPKKSPNPA AAAAAAAAAA TTMTTPTTVQ SIPALASADP ALPFERIPDS FRVYRANNQR
DVSESDDAPS QSSSPCSSCS DFSMSGSCSD FDSDEASDGD AHSDDGDSDS TKAHAMRDDS
QEDCTTDAMD LQHASLNNAQ GNGNMAISSS DSSSDSDDDD DDDEEEDDDE EDEEEQQQRQ
LDAKNARVTR STPTPLQGRG GAMAAARGRG KRRNNLSEST SSTATPPPLL RKAGKLRSAT
PNASPLVNSI KSRRSTTTAT TVTNNNKASK AHHELEETAT EVHVRHNNSN QKAALEPLQL
VWAKCRGYPW YPALILDPKT PKGFVYNGVP LPAPPTDVLA LRKNCLDDVV FLVLFFDVKR
TWQWLPANKL DILGIDKQLD QQKLVESRKP AERKAVKKAY QDALLYQSQV SDLEGQGPDP
IM
//