GenomeNet

Database: UniProt
Entry: K7MI44_SOYBN
LinkDB: K7MI44_SOYBN
Original site: K7MI44_SOYBN 
ID   K7MI44_SOYBN            Unreviewed;       301 AA.
AC   K7MI44;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 78.
DE   RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE            EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN   Name=100795761 {ECO:0000313|EnsemblPlants:KRH08931};
GN   ORFNames=GLYMA_16G182000 {ECO:0000313|EMBL:KRH08931.1};
OS   Glycine max (Soybean) (Glycine hispida).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH08931.1};
RN   [1] {ECO:0000313|EMBL:KRH08931.1, ECO:0000313|EnsemblPlants:KRH08931}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH08931};
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH08931.1};
RX   PubMed=20075913; DOI=10.1038/nature08670;
RA   Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA   Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA   Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA   Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA   Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA   Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA   Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA   Stacey G., Shoemaker R.C., Jackson S.A.;
RT   "Genome sequence of the palaeopolyploid soybean.";
RL   Nature 463:178-183(2010).
RN   [2] {ECO:0000313|EnsemblPlants:KRH08931}
RP   IDENTIFICATION.
RC   STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH08931};
RG   EnsemblPlants;
RL   Submitted (FEB-2018) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:KRH08931.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH08931.1};
RA   Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA   Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA   Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA   Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA   Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA   Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA   Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA   Jackson S.;
RT   "WGS assembly of Glycine max.";
RL   Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=2-oxoglutarate + L-prolyl-[collagen] + O2 = CO2 + succinate +
CC         trans-4-hydroxy-L-prolyl-[collagen]; Xref=Rhea:RHEA:18945, Rhea:RHEA-
CC         COMP:11676, Rhea:RHEA-COMP:11680, ChEBI:CHEBI:15379,
CC         ChEBI:CHEBI:16526, ChEBI:CHEBI:16810, ChEBI:CHEBI:30031,
CC         ChEBI:CHEBI:50342, ChEBI:CHEBI:61965; EC=1.14.11.2;
CC         Evidence={ECO:0000256|ARBA:ARBA00024151};
CC   -!- COFACTOR:
CC       Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC         Evidence={ECO:0000256|ARBA:ARBA00001961};
CC   -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004648}; Single-pass type II membrane protein
CC       {ECO:0000256|ARBA:ARBA00004648}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004606}; Single-pass type II membrane protein
CC       {ECO:0000256|ARBA:ARBA00004606}.
CC   -!- SIMILARITY: Belongs to the P4HA family.
CC       {ECO:0000256|ARBA:ARBA00006511}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000849; KRH08931.1; -; Genomic_DNA.
DR   RefSeq; XP_003548177.2; XM_003548129.3.
DR   AlphaFoldDB; K7MI44; -.
DR   SMR; K7MI44; -.
DR   STRING; 3847.K7MI44; -.
DR   PaxDb; 3847-GLYMA16G30130-2; -.
DR   EnsemblPlants; KRH08931; KRH08931; GLYMA_16G182000.
DR   GeneID; 100795761; -.
DR   Gramene; KRH08931; KRH08931; GLYMA_16G182000.
DR   KEGG; gmx:100795761; -.
DR   eggNOG; KOG1591; Eukaryota.
DR   InParanoid; K7MI44; -.
DR   OrthoDB; 5488227at2759; -.
DR   Proteomes; UP000008827; Chromosome 16.
DR   ExpressionAtlas; K7MI44; baseline and differential.
DR   GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR   GO; GO:0005789; C:endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR   GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IBA:GO_Central.
DR   Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR   InterPro; IPR045054; P4HA-like.
DR   InterPro; IPR006620; Pro_4_hyd_alph.
DR   InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR   InterPro; IPR003582; ShKT_dom.
DR   PANTHER; PTHR10869:SF102; PROLYL 4-HYDROXYLASE 12-RELATED; 1.
DR   PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR   Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR   SMART; SM00702; P4Hc; 1.
DR   SMART; SM00254; ShKT; 1.
DR   PROSITE; PS51670; SHKT; 1.
PE   3: Inferred from homology;
KW   Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW   Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW   Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Membrane {ECO:0000256|ARBA:ARBA00022989};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Signal-anchor {ECO:0000256|ARBA:ARBA00022968};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..301
FT                   /note="procollagen-proline 4-dioxygenase"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014581801"
FT   DOMAIN          261..301
FT                   /note="ShKT"
FT                   /evidence="ECO:0000259|PROSITE:PS51670"
SQ   SEQUENCE   301 AA;  33398 MW;  D5E0EB4DA800EBA4 CRC64;
     MASISLLLAL FVFFLIATSL TESSRKELRN KQETALQMLE RSIHFSNRIN PSRVVQISWQ
     PRVFLYKGFL SDKECDYLVS LAYAVKEKSS GNGGLSEGVE TSLDMEDDIL ARIEERLSVW
     AFLPKEYSKP LQVMHYGPEQ NGRNLDYFTN KTQLELSGPL MATIILYLSN DVTQGGQILF
     PESVPGSSSW SSCSNSSNIL QPVKGNAILF FSLHPSASPD KSSFHARCPV LEGDMWSAIK
     YFYAKPISRG KVSATLDGGE CTDEDDSCPA WAAVGECQRN PVFMIGSPDY YGTCRKSCNA
     C
//
DBGET integrated database retrieval system