ID Q0KHY6_DROME Unreviewed; 535 AA.
AC Q0KHY6;
DT 03-OCT-2006, integrated into UniProtKB/TrEMBL.
DT 03-OCT-2006, sequence version 1.
DT 27-MAR-2024, entry version 135.
DE RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN Name=Dmel\CG31524 {ECO:0000313|EMBL:ABI31221.1};
GN ORFNames=CG31524 {ECO:0000313|EMBL:ABI31221.1,
GN ECO:0000313|FlyBase:FBgn0051524}, Dmel_CG31524
GN {ECO:0000313|EMBL:ABI31221.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC {ECO:0000256|ARBA:ARBA00004319}.
CC -!- SIMILARITY: Belongs to the P4HA family.
CC {ECO:0000256|ARBA:ARBA00006511}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; ABI31221.1; -; Genomic_DNA.
DR RefSeq; NP_001036777.1; NM_001043312.2.
DR AlphaFoldDB; Q0KHY6; -.
DR SMR; Q0KHY6; -.
DR DNASU; 318781; -.
DR EnsemblMetazoa; FBtr0110860; FBpp0110159; FBgn0051524.
DR GeneID; 318781; -.
DR KEGG; dme:Dmel_CG31524; -.
DR UCSC; CG31524-RB; d. melanogaster.
DR AGR; FB:FBgn0051524; -.
DR FlyBase; FBgn0051524; CG31524.
DR VEuPathDB; VectorBase:FBgn0051524; -.
DR HOGENOM; CLU_024155_1_1_1; -.
DR OrthoDB; 2899308at2759; -.
DR BioGRID-ORCS; 318781; 0 hits in 1 CRISPR screen.
DR GenomeRNAi; 318781; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0051524; Expressed in male reproductive gland and 6 other cell types or tissues.
DR ExpressionAtlas; Q0KHY6; baseline and differential.
DR Genevisible; Q0KHY6; DM.
DR GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR Gene3D; 6.10.140.1460; -; 1.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR InterPro; IPR045054; P4HA-like.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR InterPro; IPR013547; Pro_4_hyd_alph_N.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR10869:SF207; P4HA_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR Pfam; PF08336; P4Ha_N; 1.
DR SMART; SM00702; P4Hc; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51471; FE2OG_OXY; 1.
DR PROSITE; PS50005; TPR; 1.
PE 3: Inferred from homology;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Iron {ECO:0000256|ARBA:ARBA00023004};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002,
KW ECO:0000313|EMBL:ABI31221.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW Signal {ECO:0000256|SAM:SignalP};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..535
FT /note="procollagen-proline 4-dioxygenase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004174929"
FT REPEAT 215..248
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT DOMAIN 401..508
FT /note="Fe2OG dioxygenase"
FT /evidence="ECO:0000259|PROSITE:PS51471"
SQ SEQUENCE 535 AA; 61615 MW; 64F1516FB4BBB647 CRC64;
MPVLKLLFVV IFFLSLSMGQ IEATQPRFAR SVVNMDDLLN MEDDLVSKLE GYAEKLSYKA
NTIRWGIQQM REQLDKSKKE QSFDLFNRYS FIRHMQADWL MWKQYLDKPV IHELNYKQMD
NLRMPQELDL FDASEAIRRM QATYAMLSND IAEGFLDGVQ YTSKLSPIDC LAMGRHLMNQ
SRWTIAEQWI LAGIKAQDRK GPQTEMILLR GPTKAELFRT LGKVRFERRN EEGALKAYQA
ALKHSPHDLE IFQEYQNLKR RVLTLSPSEP IREEPNDDIE EMELPPCCSG RCEGPRKLNR
LYCVYNCVTA PFLRLAPIKT EILSVDPFVI LLHDMVSHKE GALIRSSSKN QILPSETVNA
ANEFEIAKFR TSKSVWFDSD ANEATLKLTQ RLGEATGLDM KHSEPFQVIN YGIGGVFESH
FDTSLADEDR FVNGYIDRLA TTLFYLNDVP QGGATHFPGL NITVFPKFGT VLMWYNLHTE
GMLHVRTMHT GCPVIVGSKW VVSKWIDDKG QEFRRPCLRS RLDSKYLSSI EKIII
//