ID A0A1D2MU57_ORCCI Unreviewed; 679 AA.
AC A0A1D2MU57;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN ORFNames=Ocin01_10141 {ECO:0000313|EMBL:ODM96546.1};
OS Orchesella cincta (Springtail) (Podura cincta).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM96546.1, ECO:0000313|Proteomes:UP000094527};
RN [1] {ECO:0000313|EMBL:ODM96546.1, ECO:0000313|Proteomes:UP000094527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Mixed pool {ECO:0000313|EMBL:ODM96546.1};
RX PubMed=27289101; DOI=10.1093/gbe/evw134;
RA Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA Smit S., van Straalen N.M., Roelofs D.;
RT "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT in the Genome of the Collembolan Orchesella cincta.";
RL Genome Biol. Evol. 8:2106-2117(2016).
CC -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC {ECO:0000256|ARBA:ARBA00004319}.
CC -!- SIMILARITY: Belongs to the P4HA family.
CC {ECO:0000256|ARBA:ARBA00006511}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODM96546.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIJ01000528; ODM96546.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1D2MU57; -.
DR STRING; 48709.A0A1D2MU57; -.
DR OrthoDB; 2899308at2759; -.
DR Proteomes; UP000094527; Unassembled WGS sequence.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IEA:UniProtKB-EC.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR InterPro; IPR045054; P4HA-like.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR InterPro; IPR013547; Pro_4_hyd_alph_N.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR10869:SF207; P4HA_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR Pfam; PF08336; P4Ha_N; 1.
DR SMART; SM00702; P4Hc; 1.
DR PROSITE; PS51471; FE2OG_OXY; 1.
PE 3: Inferred from homology;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Iron {ECO:0000256|ARBA:ARBA00023004};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW Reference proteome {ECO:0000313|Proteomes:UP000094527};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..679
FT /note="procollagen-proline 4-dioxygenase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008904540"
FT DOMAIN 553..664
FT /note="Fe2OG dioxygenase"
FT /evidence="ECO:0000259|PROSITE:PS51471"
FT REGION 49..70
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 679 AA; 78378 MW; 3996BF03F2E621DB CRC64;
MSFSGSVWVI VPFYLLLTFV VTAASEQCGS EQGDKPTLQT LSEELNKTEG LGSVGGQACK
AKRETGDEES EENWRIETGL DFIRENTMLR IDEDLYFGLD EYSKFRRDEL SLLELVRTFI
REVKGVRALP LGASATVLSG DDTGGNKPAW CFYPYQLKNL AQTEVIIYEV LRDFYWSVLK
KERKGQFLAL DSYLKDIEES SLRAITYVEY TNEKKNSTFE DREDPYLYIA THPLLLYRMV
RRFAEDLQPI LLELEEYEDL ISDLYDKANP DLVVFPNHDD LTDALESIKR IQEGTNLNTL
EFATGKMDQL QTDIHVNALH ALEIGAYYFT AGHFAYAVEW MYYALNTVSQ SSVIEKYPII
ERFFRKSFDH LVEVHNQNWE PTSKNYWNLG VFHTKVTAPF EPKSYVLMRQ IMKDEAAGLR
KTLDWNWSVN VNFMHVCAGK NLQTEEERKG LKCWLQRKHN PYWSINPLKM EILNRDPPVY
QIYDFIGDKL IEKMKKAAVD GLERSTVVDR NDHSRSVVDT TRTSSQVWLY DDRENGKFLL
PLTYRMELLS QLTIAPPQAC DAYQVASYGP SHHYAPHIDS FEDTRRAVAK GNRIVTIMGY
MSHVKAGGRT GFPLLGVAAE PIKGSVVMWY NLKKNGAERD MRTWHGGCPV VFGVKWITNK
WVYYNENFRN FPCGLNQDE
//