GenomeNet

Database: UniProt
Entry: A0A151M743_ALLMI
LinkDB: A0A151M743_ALLMI
Original site: A0A151M743_ALLMI 
ID   A0A151M743_ALLMI        Unreviewed;       577 AA.
AC   A0A151M743;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 19.
DE   RecName: Full=procollagen-proline 4-dioxygenase {ECO:0000256|ARBA:ARBA00012269};
DE            EC=1.14.11.2 {ECO:0000256|ARBA:ARBA00012269};
GN   Name=P4HA1 {ECO:0000313|EMBL:KYO20260.1};
GN   ORFNames=Y1Q_0010806 {ECO:0000313|EMBL:KYO20260.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO20260.1};
RN   [1] {ECO:0000313|EMBL:KYO20260.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO20260.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- FUNCTION: Catalyzes the post-translational formation of 4-
CC       hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other
CC       proteins. {ECO:0000256|ARBA:ARBA00002035}.
CC   -!- COFACTOR:
CC       Name=L-ascorbate; Xref=ChEBI:CHEBI:38290;
CC         Evidence={ECO:0000256|ARBA:ARBA00001961};
CC   -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC       {ECO:0000256|ARBA:ARBA00004319}.
CC   -!- SIMILARITY: Belongs to the P4HA family.
CC       {ECO:0000256|ARBA:ARBA00006511}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO20260.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03006437; KYO20260.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151M743; -.
DR   STRING; 8496.A0A151M743; -.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0031418; F:L-ascorbic acid binding; IEA:InterPro.
DR   GO; GO:0004656; F:procollagen-proline 4-dioxygenase activity; IEA:UniProtKB-EC.
DR   Gene3D; 6.10.140.1460; -; 1.
DR   Gene3D; 2.60.120.620; q2cbj1_9rhob like domain; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR   InterPro; IPR005123; Oxoglu/Fe-dep_dioxygenase.
DR   InterPro; IPR045054; P4HA-like.
DR   InterPro; IPR006620; Pro_4_hyd_alph.
DR   InterPro; IPR044862; Pro_4_hyd_alph_FE2OG_OXY.
DR   InterPro; IPR013547; Pro_4_hyd_alph_N.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   PANTHER; PTHR10869; PROLYL 4-HYDROXYLASE ALPHA SUBUNIT; 1.
DR   PANTHER; PTHR10869:SF240; PROLYL 4-HYDROXYLASE SUBUNIT ALPHA-2; 1.
DR   Pfam; PF13640; 2OG-FeII_Oxy_3; 1.
DR   Pfam; PF08336; P4Ha_N; 1.
DR   SMART; SM00702; P4Hc; 1.
DR   SUPFAM; SSF48452; TPR-like; 1.
DR   PROSITE; PS51471; FE2OG_OXY; 1.
PE   3: Inferred from homology;
KW   Dioxygenase {ECO:0000256|ARBA:ARBA00022964};
KW   Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..41
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           42..577
FT                   /note="procollagen-proline 4-dioxygenase"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5007584825"
FT   DOMAIN          434..562
FT                   /note="Fe2OG dioxygenase"
FT                   /evidence="ECO:0000259|PROSITE:PS51471"
SQ   SEQUENCE   577 AA;  66425 MW;  543AD09E0045C77A CRC64;
     MMGRPHNHLQ QSLRIGAFLM MKPWMLLLLL TCISCTWPTQ AEFFTSIGQM TDLVYAEKDL
     VRSLKEYIKE EESKLSKIKS WAEKMEAVTS KSTSDPEGYL AHPVNAYKLV KRLNTEWLEL
     ENLVLQDTTN GFIANLTIQR QFFPTEEDET GAAKALMRLQ DTYKLDPETI SKGVLPGTKY
     RSSLTVDDCF GMGKTAYNDG DYYHTVLWMQ QALKQHEEGE ESSITKAEIL DYLSYAVFQL
     GDLRRAMELT RRLVSLDSTH ERAGSNLRYF EKLLEKERMA TLLNETSTRT EPVMQGGVYE
     RPPDYLPERD IYEGLCRGEG VKMTPRRQKR LFCRYHDGNR NPHLLIAPFK EEDEWDSPHI
     VRYYDVMSDE EIEKIKELAK PRLARATVRD PKTGVLTVAS YRVSKSSWLE EYDDPIVAKV
     NHRMQHITGL TVKTAELLQV ANYGLGGQYE PHFDFSRRPF DNTLKTEGNR LATFLNYKDE
     PDAFKRLGTG NRVATFLNYM SDVEAGGATV FPDFGAAIWP KKGTAVFWYN LFRSGEGDYR
     TRHAACPVLV GCKWVSNKWF HERGNEFLRP CGRTEVD
//
DBGET integrated database retrieval system