ID R7UGZ6_CAPTE Unreviewed; 796 AA.
AC R7UGZ6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE RecName: Full=procollagen-proline 3-dioxygenase {ECO:0000256|ARBA:ARBA00012262};
DE EC=1.14.11.7 {ECO:0000256|ARBA:ARBA00012262};
DE Flags: Fragment;
GN ORFNames=CAPTEDRAFT_167288 {ECO:0000313|EMBL:ELU03063.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELU03063.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELU03063.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELU03063.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP167288}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- COFACTOR:
CC Name=Fe cation; Xref=ChEBI:CHEBI:24875;
CC Evidence={ECO:0000256|ARBA:ARBA00001962};
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- SIMILARITY: Belongs to the leprecan family.
CC {ECO:0000256|ARBA:ARBA00006487}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01008594; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB303505; ELU03063.1; -; Genomic_DNA.
DR AlphaFoldDB; R7UGZ6; -.
DR STRING; 283909.R7UGZ6; -.
DR EnsemblMetazoa; CapteT167288; CapteP167288; CapteG167288.
DR HOGENOM; CLU_017820_0_0_1; -.
DR OMA; NEDTECR; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA:UniProtKB-KW.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0019797; F:procollagen-proline 3-dioxygenase activity; IEA:UniProtKB-EC.
DR GO; GO:0032963; P:collagen metabolic process; IEA:InterPro.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR InterPro; IPR039575; P3H.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR14049; LEPRECAN 1; 1.
DR PANTHER; PTHR14049:SF9; PROCOLLAGEN-PROLINE 3-DIOXYGENASE; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR SMART; SM00702; P4Hc; 1.
DR PROSITE; PS51471; FE2OG_OXY; 1.
DR PROSITE; PS50005; TPR; 1.
PE 3: Inferred from homology;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Iron {ECO:0000256|ARBA:ARBA00023004};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW Reference proteome {ECO:0000313|Proteomes:UP000014760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..796
FT /note="procollagen-proline 3-dioxygenase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008788011"
FT REPEAT 289..322
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT DOMAIN 541..652
FT /note="Fe2OG dioxygenase"
FT /evidence="ECO:0000259|PROSITE:PS51471"
FT NON_TER 796
FT /evidence="ECO:0000313|EMBL:ELU03063.1"
SQ SEQUENCE 796 AA; 90305 MW; E4805105E6C3A112 CRC64;
MWNIINFSFL LGLLSFRLSS VSAESHEYDA LYNSAVEAYL QQRWYECKAE MENAINAYNT
LKQDLVQCRV TCNYITSEID AVQYLELSFL DTALQRSNCL RRCFEKHDVL GVSDEVKDAF
ARRLPYDYLQ ICAFKTGDLE LAVSSAYTFI VANPDHEIMQ SNIKFYQEQN GVDPTWFKDL
ESKLYQKYYS SAVASYSAFQ WHDVIQFMEE AVQDFLQEEG RCQRSCEGTY DHESLTHFYI
AIAGLNYESR NCSTLVLFTD HWINVLQCQV NCEDEISIVN GLQIKNMLAE MYHYLQFAYY
KVHDLTSAAA CTETALALKP QDEAMLKNKA FYQKQGMDKE DFSTRKEISS YIEARSLLLS
QLNFIRTKYK FSEADLDVKD IEQKDEEESP IFMNNAGVVI SGSSYPNPLT NASLFGTRDS
AFLAQLGDFS SKRGLTIVME NEELEGKYRV ATDGFLTSEQ CHSLMNLAGV RATKGDGYQN
ESPHTKFEKY EGLTINRAAQ LAIKDSVSLT EAQQFLNITE TARTFLETYY NLQIPLHFHY
THLVCRTALS DEAQNGRHDL SHPVHSDNCY MQKDGTCLRE APAYVQRDFS GIIYLNDDFE
GGNFFFANRD DSVQKTIRPK CGRMVAFESS DFHGVLPVTK GRRCAIAVWY TLDPLAKEGS
HEAAQQLLAA KVSRDQKLAN ADLRHLLNEH DFSLVKTEVE LNGPERFAAD GILNEQQCKA
LMKLANQGAI VGDGYNAGKK PTAQTSPHTN HEYFAGLRID RAAELAADGT VDASAVDLYL
NSSAMSLDFV KKYFDM
//