GenomeNet

Database: UniProt
Entry: Q9VVQ6_DROME
LinkDB: Q9VVQ6_DROME
Original site: Q9VVQ6_DROME 
ID   Q9VVQ6_DROME            Unreviewed;       515 AA.
AC   Q9VVQ6;
DT   01-MAY-2000, integrated into UniProtKB/TrEMBL.
DT   26-JUN-2013, sequence version 3.
DT   27-MAR-2024, entry version 161.
DE   RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE            EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN   Name=Dmel\CG18233 {ECO:0000313|EMBL:AAF49254.3};
GN   ORFNames=CG18233 {ECO:0000313|EMBL:AAF49254.3,
GN   ECO:0000313|FlyBase:FBgn0036795}, Dmel_CG18233
GN   {ECO:0000313|EMBL:AAF49254.3};
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803};
RN   [1] {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA   An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA   Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA   Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA   Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA   Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA   Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA   Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [2] {ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537568;
RA   Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA   Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA   George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA   Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA   Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA   Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT   "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT   euchromatic genome sequence.";
RL   Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN   [3] {ECO:0000313|Proteomes:UP000000803}
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4] {ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537573;
RA   Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA   Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA   Celniker S.E.;
RT   "The transposable elements of the Drosophila melanogaster euchromatin: a
RT   genomics perspective.";
RL   Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN   [5] {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537574;
RA   Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA   Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA   Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA   Karpen G.H.;
RT   "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL   Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN   [6] {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA   Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA   Ashburner M., Anxolabehere D.;
RT   "Combined evidence annotation of transposable elements in genome
RT   sequences.";
RL   PLoS Comput. Biol. 1:166-175(2005).
RN   [7] {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569856; DOI=10.1126/science.1139815;
RA   Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT   "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL   Science 316:1586-1591(2007).
RN   [8] {ECO:0000313|EMBL:AAF49254.3, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569867; DOI=10.1126/science.1139816;
RA   Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA   Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA   Dimitri P., Karpen G.H., Celniker S.E.;
RT   "Sequence finishing and mapping of Drosophila melanogaster
RT   heterochromatin.";
RL   Science 316:1625-1628(2007).
CC   -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC       hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC       proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC   -!- COFACTOR:
CC       Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC         Evidence={ECO:0000256|ARBA:ARBA00001961};
CC   -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC       {ECO:0000256|ARBA:ARBA00004319}.
CC   -!- SIMILARITY: Belongs to the P4HA family.
CC       {ECO:0000256|ARBA:ARBA00006511}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AE014296; AAF49254.3; -; Genomic_DNA.
DR   RefSeq; NP_649044.3; NM_140787.2.
DR   AlphaFoldDB; Q9VVQ6; -.
DR   SMR; Q9VVQ6; -.
DR   IntAct; Q9VVQ6; 1.
DR   STRING; 7227.FBpp0293245; -.
DR   PaxDb; 7227-FBpp0293245; -.
DR   EnsemblMetazoa; FBtr0304702; FBpp0293245; FBgn0036795.
DR   GeneID; 40025; -.
DR   KEGG; dme:Dmel_CG18233; -.
DR   UCSC; CG18233-RA; d. melanogaster.
DR   AGR; FB:FBgn0036795; -.
DR   FlyBase; FBgn0036795; CG18233.
DR   VEuPathDB; VectorBase:FBgn0036795; -.
DR   eggNOG; KOG1591; Eukaryota.
DR   GeneTree; ENSGT00940000163795; -.
DR   HOGENOM; CLU_024155_2_0_1; -.
DR   InParanoid; Q9VVQ6; -.
DR   OMA; TITKWLH; -.
DR   OrthoDB; 2899308at2759; -.
DR   PhylomeDB; Q9VVQ6; -.
DR   BioGRID-ORCS; 40025; 0 hits in 1 CRISPR screen.
DR   GenomeRNAi; 40025; -.
DR   Proteomes; UP000000803; Chromosome 3L.
DR   Bgee; FBgn0036795; Expressed in male reproductive gland and 6 other cell types or tissues.
DR   ExpressionAtlas; Q9VVQ6; baseline and differential.
DR   Genevisible; Q9VVQ6; DM.
DR   GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR   GO; GO:0016491; F:oxidoreductase activity; ISM:FlyBase.
DR   GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR   GO; GO:0019953; P:sexual reproduction; IEP:FlyBase.
DR   Gene3D; 6.10.140.1460; -; 1.
DR   Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR   InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR   InterPro; IPR045054; P4HA-like.
DR   InterPro; IPR006620; Pro_4_hyd_alph.
DR   InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR   InterPro; IPR013547; Pro_4_hyd_alph_N.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   PANTHER; PTHR10869:SF207; P4HA_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR   Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR   Pfam; PF08336; P4Ha_N; 1.
DR   SMART; SM00702; P4Hc; 1.
DR   PROSITE; PS51471; FE2OG_OXY; 1.
PE   3: Inferred from homology;
KW   Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW   Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002,
KW   ECO:0000313|EMBL:AAF49254.3};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..515
FT                   /note="procollagen-proline 4-dioxygenase"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004335338"
FT   DOMAIN          391..499
FT                   /note="Fe2OG dioxygenase"
FT                   /evidence="ECO:0000259|PROSITE:PS51471"
SQ   SEQUENCE   515 AA;  59192 MW;  094795016239D941 CRC64;
     MLNVVRAAAV LILVQSIHST DPNDFLNRKY YSSSVLGLVK LLKMEQEFMV NFSIYANILQ
     EKVDNLNIFL DALKRPNHKT HNEREKFVSN PLNAFGLIRR LNQDWPKLQN YTQKPLGLEQ
     LTAMQDIVSA APESFDMNEK LKAMHRIETT YDLQPKDIAK GLLQRTEFNY RLLFRDCLAL
     AYHKFEIGEF KRSLLWFQEA LKLSTDGSLE IMNRLKEIEL KGFATAVAKR SIYLSNQGLT
     NETIDDMANS QLQDAKLMDL EQLISSELKQ WINDDSTTPP TDHNLGCRGL FPKKSNLVCR
     YNSSTNAFLK LAPLKMEEIS RDPYIVMFHE VISDKDIEEM KGEITEMENG WTSLGDPKEI
     VSRVYWIRKE SSFSKRINQR ISDMTGFKLE EFPAIQLANF GVGGYFKPHY DFYTDRLKEV
     DVNNTLGDRI GSIIFYAGEV SQGGQTVFPD LKVAVEPKKG NALFWFNAFD DSTPDPRSLH
     SVCPVLVGSR WTITKWLHYA PQLFVKPCSP RVHLE
//
DBGET integrated database retrieval system