ID C1MYC2_MICPC Unreviewed; 565 AA.
AC C1MYC2;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Glycoside hydrolase family 13 protein {ECO:0000313|EMBL:EEH55026.1};
DE Flags: Fragment;
GN ORFNames=MICPUCDRAFT_34447 {ECO:0000313|EMBL:EEH55026.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH55026.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH55026.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 13 family.
CC {ECO:0000256|ARBA:ARBA00008061}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663742; EEH55026.1; -; Genomic_DNA.
DR RefSeq; XP_003060257.1; XM_003060211.1.
DR AlphaFoldDB; C1MYC2; -.
DR STRING; 564608.C1MYC2; -.
DR GeneID; 9685989; -.
DR KEGG; mpp:MICPUCDRAFT_34447; -.
DR eggNOG; KOG0471; Eukaryota.
DR OrthoDB; 318869at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:2001070; F:starch binding; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd05467; CBM20; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013784; Carb-bd-like_fold.
DR InterPro; IPR002044; CBM_fam20.
DR InterPro; IPR006047; Glyco_hydro_13_cat_dom.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR10357; ALPHA-AMYLASE FAMILY MEMBER; 1.
DR PANTHER; PTHR10357:SF210; MALTODEXTRIN GLUCOSIDASE; 1.
DR Pfam; PF00128; Alpha-amylase; 1.
DR Pfam; PF00686; CBM_20; 1.
DR SMART; SM00642; Aamy; 1.
DR SMART; SM01065; CBM_2; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49452; Starch-binding domain-like; 1.
DR PROSITE; PS51166; CBM20; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:EEH55026.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000001876}.
FT DOMAIN 1..97
FT /note="CBM20"
FT /evidence="ECO:0000259|PROSITE:PS51166"
FT NON_TER 565
FT /evidence="ECO:0000313|EMBL:EEH55026.1"
SQ SEQUENCE 565 AA; 64121 MW; C70C750ADA00B273 CRC64;
METNFGDHLA IVGKHPLLGE WVPSKGVPMD WVEGTRWTAT VYLPEFSLLQ YKYVVRQGWN
PNGEARWQGG PDRIVATGNH HTEQSIRDVW TDGDWGAENP TCKALNAAIA AFDEVRPGDL
GYGAESSTIK NTATLPEWAR NAVFYQIFPL GYFGAPTVND QKSKVAPRLK QIREHYKHLE
ELGVDAVYFS PLFESGTHGY DTFDYFEIDR RLGDVKLFKQ IVKELHEHGI KVVLDGVFNH
TGTGHFAFKD LTINGPNSQY AGWYHLGARK ADFEGYCVVD DMSDSGFSYD CWEGHPVLPR
LNLENHDVKQ HIFDVARFWV DEVGIDGWRL DVAHEIEPDF WREFRAVCDN TRLGKDCLLV
GEMIHGNYNY WVGDDRLHSG TNYQMSHATW FSLNDRNYEY FYNALLRENC LFSGLTLVNF
LGNHDVPRVA SNITNPKHYL HAMVVLLLMK GIPCLYYGDE FGMEGTPADG TDEHSGGDDA
MRRPMLDCAK PATWPAVGVS RVALNKKLIK IRREHPVFGI DGTQDIEDIQ LLSNDQIIVT
RYVKETDPKT KKEEVTAVGC LVFNC
//