GenomeNet

Database: UniProt
Entry: Q9VA61_DROME
LinkDB: Q9VA61_DROME
Original site: Q9VA61_DROME 
ID   Q9VA61_DROME            Unreviewed;       542 AA.
AC   Q9VA61;
DT   01-MAY-2000, integrated into UniProtKB/TrEMBL.
DT   01-OCT-2002, sequence version 2.
DT   27-MAR-2024, entry version 177.
DE   RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE            EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN   Name=PH4alphaNE2 {ECO:0000313|EMBL:AAF57061.2,
GN   ECO:0000313|FlyBase:FBgn0039783};
GN   Synonyms=Dmel\CG9720 {ECO:0000313|EMBL:AAF57061.2}, PH4-alpha-NE2
GN   {ECO:0000313|FlyBase:FBgn0039783};
GN   ORFNames=CG9720 {ECO:0000313|EMBL:AAF57061.2,
GN   ECO:0000313|FlyBase:FBgn0039783}, Dmel_CG9720
GN   {ECO:0000313|EMBL:AAF57061.2};
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803};
RN   [1] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA   An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA   Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA   Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA   Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA   Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA   Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA   Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [2] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537568;
RA   Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA   Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA   George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA   Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA   Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA   Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT   "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT   euchromatic genome sequence.";
RL   Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN   [3] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537573;
RA   Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA   Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA   Celniker S.E.;
RT   "The transposable elements of the Drosophila melanogaster euchromatin: a
RT   genomics perspective.";
RL   Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN   [5] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537574;
RA   Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA   Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA   Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA   Karpen G.H.;
RT   "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL   Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN   [6] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA   Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA   Ashburner M., Anxolabehere D.;
RT   "Combined evidence annotation of transposable elements in genome
RT   sequences.";
RL   PLoS Comput. Biol. 1:166-175(2005).
RN   [7] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569856; DOI=10.1126/science.1139815;
RA   Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT   "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL   Science 316:1586-1591(2007).
RN   [8] {ECO:0000313|EMBL:AAF57061.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569867; DOI=10.1126/science.1139816;
RA   Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA   Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA   Dimitri P., Karpen G.H., Celniker S.E.;
RT   "Sequence finishing and mapping of Drosophila melanogaster
RT   heterochromatin.";
RL   Science 316:1625-1628(2007).
CC   -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC       hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC       proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC   -!- COFACTOR:
CC       Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC         Evidence={ECO:0000256|ARBA:ARBA00001961};
CC   -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC       {ECO:0000256|ARBA:ARBA00004319}.
CC   -!- SIMILARITY: Belongs to the P4HA family.
CC       {ECO:0000256|ARBA:ARBA00006511}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AE014297; AAF57061.2; -; Genomic_DNA.
DR   RefSeq; NP_733378.1; NM_170499.2.
DR   AlphaFoldDB; Q9VA61; -.
DR   SMR; Q9VA61; -.
DR   IntAct; Q9VA61; 1.
DR   STRING; 7227.FBpp0085047; -.
DR   PaxDb; 7227-FBpp0085047; -.
DR   EnsemblMetazoa; FBtr0085685; FBpp0085047; FBgn0039783.
DR   GeneID; 43628; -.
DR   KEGG; dme:Dmel_CG9720; -.
DR   UCSC; CG9720-RA; d. melanogaster.
DR   AGR; FB:FBgn0039783; -.
DR   CTD; 43628; -.
DR   FlyBase; FBgn0039783; PH4alphaNE2.
DR   VEuPathDB; VectorBase:FBgn0039783; -.
DR   eggNOG; KOG1591; Eukaryota.
DR   GeneTree; ENSGT00940000163795; -.
DR   HOGENOM; CLU_024155_1_1_1; -.
DR   InParanoid; Q9VA61; -.
DR   OMA; GFFETHL; -.
DR   OrthoDB; 2899308at2759; -.
DR   PhylomeDB; Q9VA61; -.
DR   BioGRID-ORCS; 43628; 0 hits in 1 CRISPR screen.
DR   GenomeRNAi; 43628; -.
DR   Proteomes; UP000000803; Chromosome 3R.
DR   Bgee; FBgn0039783; Expressed in male reproductive gland and 5 other cell types or tissues.
DR   ExpressionAtlas; Q9VA61; baseline and differential.
DR   Genevisible; Q9VA61; DM.
DR   GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR   GO; GO:0005615; C:extracellular space; ISM:FlyBase.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR   GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR   GO; GO:0016477; P:cell migration; IMP:FlyBase.
DR   GO; GO:0019953; P:sexual reproduction; IEP:FlyBase.
DR   Gene3D; 6.10.140.1460; -; 1.
DR   Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR   InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR   InterPro; IPR045054; P4HA-like.
DR   InterPro; IPR006620; Pro_4_hyd_alph.
DR   InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR   InterPro; IPR013547; Pro_4_hyd_alph_N.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR019734; TPR_repeat.
DR   PANTHER; PTHR10869:SF207; P4HA_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR   Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR   Pfam; PF08336; P4Ha_N; 1.
DR   SMART; SM00702; P4Hc; 1.
DR   SUPFAM; SSF48452; TPR-like; 1.
DR   PROSITE; PS51471; FE2OG_OXY; 1.
DR   PROSITE; PS50005; TPR; 1.
PE   3: Inferred from homology;
KW   Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW   Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002,
KW   ECO:0000313|EMBL:AAF57061.2};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..542
FT                   /note="procollagen-proline 4-dioxygenase"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004335596"
FT   REPEAT          220..253
FT                   /note="TPR"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT   DOMAIN          410..515
FT                   /note="Fe2OG dioxygenase"
FT                   /evidence="ECO:0000259|PROSITE:PS51471"
FT   REGION          351..372
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   542 AA;  62120 MW;  280F2A9AC4A99177 CRC64;
     MLFWRLVLLG MLYLSLSFGQ LRENNAQQRF ARSVVNMDDM LNLEDDLVSN VEKLAEALAR
     KAKTIKWGVF KMMKRRQEYK SSMEIFANPI DTFSLIRHMQ SNWLMWLLYL ETPVGQEELF
     FVDSRMPLLP KYFDFIDAAE GIRKMQATYQ MFSSDIAKGL LDGVQYNSSL KPIDCLAIGL
     HLMNNSRWYA AEQWISASIE AYDQKSSQTD MELLRGPKLA DLCRILGQVQ MKQRNHEGAL
     QAYQVALKLS PHDPEIYEEY RILEKRDLTL SDIEPIEQDK DNSHERLVLP PCCSGRCQVP
     RNLSNLYCVY NHVTSPFLQL APIKTEILSI DPFVVLLHDM ISQKESTLIR TSSKEHMLPS
     ATTDPDASDD ETQVDTYRTS KSVWYSSDFN DTTKKITERL GDATGLDMNS TEFYQVINYG
     LGGFFETHLD MLLSEKNRFN GTSDRIATTL FYLNEVRQGG GTYFPRLNLT VFPQPGSALF
     WYNLDTKGND HMGSLHTGCP VIVGSKWVMS KWINDMGQEF TRPCVESSLS SNEVLSAERL
     II
//
DBGET integrated database retrieval system