ID K7MI44_SOYBN Unreviewed; 301 AA.
AC K7MI44;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 78.
DE RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN Name=100795761 {ECO:0000313|EnsemblPlants:KRH08931};
GN ORFNames=GLYMA_16G182000 {ECO:0000313|EMBL:KRH08931.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH08931.1};
RN [1] {ECO:0000313|EMBL:KRH08931.1, ECO:0000313|EnsemblPlants:KRH08931}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH08931};
RC TISSUE=Callus {ECO:0000313|EMBL:KRH08931.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRH08931}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH08931};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRH08931.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRH08931.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=2-oxoglutarate + L-prolyl-[collagen] + O2 = CO2 + succinate +
CC trans-4-hydroxy-L-prolyl-[collagen]; Xref=Rhea:RHEA:18945, Rhea:RHEA-
CC COMP:11676, Rhea:RHEA-COMP:11680, ChEBI:CHEBI:15379,
CC ChEBI:CHEBI:16526, ChEBI:CHEBI:16810, ChEBI:CHEBI:30031,
CC ChEBI:CHEBI:50342, ChEBI:CHEBI:61965; EC=1.14.11.2;
CC Evidence={ECO:0000256|ARBA:ARBA00024151};
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004648}; Single-pass type II membrane protein
CC {ECO:0000256|ARBA:ARBA00004648}. Membrane
CC {ECO:0000256|ARBA:ARBA00004606}; Single-pass type II membrane protein
CC {ECO:0000256|ARBA:ARBA00004606}.
CC -!- SIMILARITY: Belongs to the P4HA family.
CC {ECO:0000256|ARBA:ARBA00006511}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000849; KRH08931.1; -; Genomic_DNA.
DR RefSeq; XP_003548177.2; XM_003548129.3.
DR AlphaFoldDB; K7MI44; -.
DR SMR; K7MI44; -.
DR STRING; 3847.K7MI44; -.
DR PaxDb; 3847-GLYMA16G30130-2; -.
DR EnsemblPlants; KRH08931; KRH08931; GLYMA_16G182000.
DR GeneID; 100795761; -.
DR Gramene; KRH08931; KRH08931; GLYMA_16G182000.
DR KEGG; gmx:100795761; -.
DR eggNOG; KOG1591; Eukaryota.
DR InParanoid; K7MI44; -.
DR OrthoDB; 5488227at2759; -.
DR Proteomes; UP000008827; Chromosome 16.
DR ExpressionAtlas; K7MI44; baseline and differential.
DR GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR GO; GO:0005789; C:endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR InterPro; IPR045054; P4HA-like.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR InterPro; IPR003582; ShKT_dom.
DR PANTHER; PTHR10869:SF102; PROLYL 4-HYDROXYLASE 12-RELATED; 1.
DR PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR SMART; SM00702; P4Hc; 1.
DR SMART; SM00254; ShKT; 1.
DR PROSITE; PS51670; SHKT; 1.
PE 3: Inferred from homology;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Iron {ECO:0000256|ARBA:ARBA00023004};
KW Membrane {ECO:0000256|ARBA:ARBA00022989};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW Signal {ECO:0000256|SAM:SignalP};
KW Signal-anchor {ECO:0000256|ARBA:ARBA00022968};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..301
FT /note="procollagen-proline 4-dioxygenase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014581801"
FT DOMAIN 261..301
FT /note="ShKT"
FT /evidence="ECO:0000259|PROSITE:PS51670"
SQ SEQUENCE 301 AA; 33398 MW; D5E0EB4DA800EBA4 CRC64;
MASISLLLAL FVFFLIATSL TESSRKELRN KQETALQMLE RSIHFSNRIN PSRVVQISWQ
PRVFLYKGFL SDKECDYLVS LAYAVKEKSS GNGGLSEGVE TSLDMEDDIL ARIEERLSVW
AFLPKEYSKP LQVMHYGPEQ NGRNLDYFTN KTQLELSGPL MATIILYLSN DVTQGGQILF
PESVPGSSSW SSCSNSSNIL QPVKGNAILF FSLHPSASPD KSSFHARCPV LEGDMWSAIK
YFYAKPISRG KVSATLDGGE CTDEDDSCPA WAAVGECQRN PVFMIGSPDY YGTCRKSCNA
C
//