GenomeNet

Database: UniProt
Entry: Q0KHY6_DROME
LinkDB: Q0KHY6_DROME
Original site: Q0KHY6_DROME 
ID   Q0KHY6_DROME            Unreviewed;       535 AA.
AC   Q0KHY6;
DT   03-OCT-2006, integrated into UniProtKB/TrEMBL.
DT   03-OCT-2006, sequence version 1.
DT   27-MAR-2024, entry version 135.
DE   RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE            EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN   Name=Dmel\CG31524 {ECO:0000313|EMBL:ABI31221.1};
GN   ORFNames=CG31524 {ECO:0000313|EMBL:ABI31221.1,
GN   ECO:0000313|FlyBase:FBgn0051524}, Dmel_CG31524
GN   {ECO:0000313|EMBL:ABI31221.1};
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227 {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803};
RN   [1] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA   An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA   Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA   Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA   Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA   Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA   Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA   Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [2] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537568;
RA   Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA   Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA   George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA   Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA   Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA   Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT   "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT   euchromatic genome sequence.";
RL   Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN   [3] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537573;
RA   Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA   Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA   Celniker S.E.;
RT   "The transposable elements of the Drosophila melanogaster euchromatin: a
RT   genomics perspective.";
RL   Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN   [5] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537574;
RA   Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA   Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA   Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA   Karpen G.H.;
RT   "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL   Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN   [6] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA   Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA   Ashburner M., Anxolabehere D.;
RT   "Combined evidence annotation of transposable elements in genome
RT   sequences.";
RL   PLoS Comput. Biol. 1:166-175(2005).
RN   [7] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569856; DOI=10.1126/science.1139815;
RA   Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT   "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL   Science 316:1586-1591(2007).
RN   [8] {ECO:0000313|EMBL:ABI31221.1, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569867; DOI=10.1126/science.1139816;
RA   Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA   Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA   Dimitri P., Karpen G.H., Celniker S.E.;
RT   "Sequence finishing and mapping of Drosophila melanogaster
RT   heterochromatin.";
RL   Science 316:1625-1628(2007).
CC   -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC       hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC       proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC   -!- COFACTOR:
CC       Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC         Evidence={ECO:0000256|ARBA:ARBA00001961};
CC   -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC       {ECO:0000256|ARBA:ARBA00004319}.
CC   -!- SIMILARITY: Belongs to the P4HA family.
CC       {ECO:0000256|ARBA:ARBA00006511}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AE014297; ABI31221.1; -; Genomic_DNA.
DR   RefSeq; NP_001036777.1; NM_001043312.2.
DR   AlphaFoldDB; Q0KHY6; -.
DR   SMR; Q0KHY6; -.
DR   DNASU; 318781; -.
DR   EnsemblMetazoa; FBtr0110860; FBpp0110159; FBgn0051524.
DR   GeneID; 318781; -.
DR   KEGG; dme:Dmel_CG31524; -.
DR   UCSC; CG31524-RB; d. melanogaster.
DR   AGR; FB:FBgn0051524; -.
DR   FlyBase; FBgn0051524; CG31524.
DR   VEuPathDB; VectorBase:FBgn0051524; -.
DR   HOGENOM; CLU_024155_1_1_1; -.
DR   OrthoDB; 2899308at2759; -.
DR   BioGRID-ORCS; 318781; 0 hits in 1 CRISPR screen.
DR   GenomeRNAi; 318781; -.
DR   Proteomes; UP000000803; Chromosome 3R.
DR   Bgee; FBgn0051524; Expressed in male reproductive gland and 6 other cell types or tissues.
DR   ExpressionAtlas; Q0KHY6; baseline and differential.
DR   Genevisible; Q0KHY6; DM.
DR   GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR   GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR   Gene3D; 6.10.140.1460; -; 1.
DR   Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR   InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR   InterPro; IPR045054; P4HA-like.
DR   InterPro; IPR006620; Pro_4_hyd_alph.
DR   InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR   InterPro; IPR013547; Pro_4_hyd_alph_N.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR019734; TPR_repeat.
DR   PANTHER; PTHR10869:SF207; P4HA_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR   Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR   Pfam; PF08336; P4Ha_N; 1.
DR   SMART; SM00702; P4Hc; 1.
DR   SUPFAM; SSF48452; TPR-like; 1.
DR   PROSITE; PS51471; FE2OG_OXY; 1.
DR   PROSITE; PS50005; TPR; 1.
PE   3: Inferred from homology;
KW   Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW   Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002,
KW   ECO:0000313|EMBL:ABI31221.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..535
FT                   /note="procollagen-proline 4-dioxygenase"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004174929"
FT   REPEAT          215..248
FT                   /note="TPR"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT   DOMAIN          401..508
FT                   /note="Fe2OG dioxygenase"
FT                   /evidence="ECO:0000259|PROSITE:PS51471"
SQ   SEQUENCE   535 AA;  61615 MW;  64F1516FB4BBB647 CRC64;
     MPVLKLLFVV IFFLSLSMGQ IEATQPRFAR SVVNMDDLLN MEDDLVSKLE GYAEKLSYKA
     NTIRWGIQQM REQLDKSKKE QSFDLFNRYS FIRHMQADWL MWKQYLDKPV IHELNYKQMD
     NLRMPQELDL FDASEAIRRM QATYAMLSND IAEGFLDGVQ YTSKLSPIDC LAMGRHLMNQ
     SRWTIAEQWI LAGIKAQDRK GPQTEMILLR GPTKAELFRT LGKVRFERRN EEGALKAYQA
     ALKHSPHDLE IFQEYQNLKR RVLTLSPSEP IREEPNDDIE EMELPPCCSG RCEGPRKLNR
     LYCVYNCVTA PFLRLAPIKT EILSVDPFVI LLHDMVSHKE GALIRSSSKN QILPSETVNA
     ANEFEIAKFR TSKSVWFDSD ANEATLKLTQ RLGEATGLDM KHSEPFQVIN YGIGGVFESH
     FDTSLADEDR FVNGYIDRLA TTLFYLNDVP QGGATHFPGL NITVFPKFGT VLMWYNLHTE
     GMLHVRTMHT GCPVIVGSKW VVSKWIDDKG QEFRRPCLRS RLDSKYLSSI EKIII
//
DBGET integrated database retrieval system