ID Q4CWZ0_TRYCC Unreviewed; 318 AA.
AC Q4CWZ0;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 24-JAN-2024, entry version 62.
DE RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN ORFNames=Tc00.1047053507057.40 {ECO:0000313|EMBL:EAN84793.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN84793.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN84793.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN84793.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- COFACTOR:
CC Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC Evidence={ECO:0000256|ARBA:ARBA00001961};
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN84793.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01001589; EAN84793.1; -; Genomic_DNA.
DR RefSeq; XP_806644.1; XM_801551.1.
DR AlphaFoldDB; Q4CWZ0; -.
DR PaxDb; 353153-Q4CWZ0; -.
DR EnsemblProtists; EAN84793; EAN84793; Tc00.1047053507057.40.
DR GeneID; 3536707; -.
DR KEGG; tcr:507057.40; -.
DR eggNOG; ENOG502QWTF; Eukaryota.
DR InParanoid; Q4CWZ0; -.
DR OMA; INDRIMV; -.
DR OrthoDB; 166711at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0051213; F:dioxygenase activity; IEA:UniProtKB-KW.
DR GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR GO; GO:0016705; F:oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen; IEA:InterPro.
DR Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR InterPro; IPR045054; P4HA-like.
DR InterPro; IPR006620; Pro_4_hyd_alph.
DR InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR PANTHER; PTHR10869:SF207; PROLYL 4-HYDROXYLASE SUBUNIT ALPHA-2; 1.
DR Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR SMART; SM00702; P4Hc; 1.
PE 4: Predicted;
KW Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296}.
FT DOMAIN 41..263
FT /note="Prolyl 4-hydroxylase alpha subunit"
FT /evidence="ECO:0000259|SMART:SM00702"
SQ SEQUENCE 318 AA; 35405 MW; 8BFCEE8368C73DC7 CRC64;
MYSYNNIVDK GEPLKKDVID RVPLEIAPTE VVLQSRVDER VEVLVVENFL SAAECDRIIA
ACEEVGYTFW RQKDANAACD ARQRSGETEA RAFRVVDTIE ACFPQLTRAL SERIQRVVHL
EPKLFGPSVT DSEDMFARDL AGTWVPLSLS SNLLLGKYGP GGHFSPHIDG STVVDLNTRS
LYTLLIYLNH CACGGETAIF MGEQADVLEL DPQTGKYVGK KEKRVGAVYP KKGSAAFFFC
DVLHEGTPVG EGCCKYILRG DFLYRRDPPI LTTENDKKAF DLYEQARVAE SNGNAMLACE
LFQRVRKLSS GVAELYQL
//