ID C1FDC1_MICCC Unreviewed; 1026 AA.
AC C1FDC1;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Peptidase family U32 {ECO:0000313|EMBL:ACO68760.1};
GN Name=PEPU32 {ECO:0000313|EMBL:ACO68760.1};
GN ORFNames=MICPUN_113244 {ECO:0000313|EMBL:ACO68760.1};
OS Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) (Picoplanktonic
OS green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=296587 {ECO:0000313|EMBL:ACO68760.1, ECO:0000313|Proteomes:UP000002009};
RN [1] {ECO:0000313|EMBL:ACO68760.1, ECO:0000313|Proteomes:UP000002009}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RCC299 / NOUM17 {ECO:0000313|Proteomes:UP000002009};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP001574; ACO68760.1; -; Genomic_DNA.
DR RefSeq; XP_002507502.1; XM_002507456.1.
DR AlphaFoldDB; C1FDC1; -.
DR STRING; 296587.C1FDC1; -.
DR GeneID; 8250044; -.
DR KEGG; mis:MICPUN_113244; -.
DR eggNOG; ENOG502QVE5; Eukaryota.
DR InParanoid; C1FDC1; -.
DR OMA; ACRLPYG; -.
DR OrthoDB; 316355at2759; -.
DR Proteomes; UP000002009; Chromosome 1.
DR GO; GO:0044249; P:cellular biosynthetic process; IEA:UniProt.
DR GO; GO:1901576; P:organic substance biosynthetic process; IEA:UniProt.
DR InterPro; IPR020988; Pept_U32_collagenase.
DR InterPro; IPR001539; Peptidase_U32.
DR PANTHER; PTHR30217:SF10; 23S RRNA 5-HYDROXYCYTIDINE C2501 SYNTHASE; 1.
DR PANTHER; PTHR30217; PEPTIDASE U32 FAMILY; 1.
DR Pfam; PF12392; DUF3656; 1.
DR Pfam; PF01136; Peptidase_U32; 2.
DR SUPFAM; SSF51395; FMN-linked oxidoreductases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002009}.
FT DOMAIN 520..648
FT /note="Peptidase U32 collagenase"
FT /evidence="ECO:0000259|Pfam:PF12392"
FT REGION 31..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 36..72
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1026 AA; 111622 MW; D52EF2FBA8674138 CRC64;
MPVPRPGAKK NRHCDKIPRA RICNSRFCKG SKKTGCGTRP SRNSNSAAPH TVKNSQRAFA
TSERNKNPDL NDVNTLVHPA FKRRQLSRKP EILAPAGGWP QLRAAVEAGA DAVYFGLEAL
NARARASNFA VEEVADVIGY LHERGVRGFV AINVLVFDEE LAQAEKLVRA VARAGADAVI
VQDVGLVSLI KGLAPDLVIH GSTQMSITSA EGAEFARELG CERVVVGREL SIREIAAVRE
GTTAEVEAFV HGALCVSYSG QCFSSEAWGG RSANRGQCAQ ACRLPYGLVV DGEVRDMGDV
KYLLSPQDLM AVELVPQLID AGVSCFKIEG RLKGPEYVGL TTSVYRRAVD EAWEARCCES
VDGHAPSGAE WDLPDAVRTD LAQVFARGQD DNYDGLTKGF LEGPNHQRLV RGRSPRHRGV
LLGKVVDSFI HSGGGVVVLL SGKVSVKRGD GVVFDRGMPD KAEAGGSVWE VLDSSGKSVA
RSAHEAVISG EYELMFASAT MNMWASEHGG LQEPKPGDLV WRSSDTSLTA RLRKMYGKDV
DQRLPQKKEP VVVHMTSEGI GSPLRISLID AQGRRGVGIT RGSFSVAQRR PLTFASLATA
VGQLGDTYFC VGELNIESIQ GLNSLPGLFI SVGEIKAARR EAVEALMSFR STSGIAEADN
NSESLALQES MEISWWDNRA TEKETKILQP KAVFTKTSDQ ECKLTVLCRT REQTFAAMNI
PWLQEIILDF LEVHGLQDMV DRVKASGRRA VVATPRVLKP NEERLWRFYL KLGADALLVR
SAGLMRILKN LRENGEDRFI PELRGDFSLN AANAVGASVL FRHGGLLTLT PTHDLNAAQH
VNMARALGAE GAAKLEVIVH QHLPIFYTEH CVFCRFLSEG NSYRDCGHPC ETTRIHLRDG
GGSDHLVLAD MGCRNTVFNA KAQSGAEYVH ELINAGIKRF RVELVDEPAE TVSPLLEAYK
DCLLGLKRGR DVVRLVGSFP DANGRSHGAG RGSFEVKKEV DRASMKQTAA ARSTAHVPAR
SVIEDN
//