ID Q7PI98_ANOGA Unreviewed; 545 AA.
AC Q7PI98;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2007, sequence version 3.
DT 27-MAR-2024, entry version 128.
DE SubName: Full=AGAP006422-PA {ECO:0000313|EMBL:EAA44227.3};
GN Name=1277034 {ECO:0000313|EnsemblMetazoa:AGAP006422-PA};
GN ORFNames=AgaP_AGAP006422 {ECO:0000313|EMBL:EAA44227.3};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA44227.3};
RN [1] {ECO:0000313|EMBL:EAA44227.3, ECO:0000313|Proteomes:UP000007062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44227.3,
RC ECO:0000313|Proteomes:UP000007062};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA44227.3}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44227.3};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA44227.3}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44227.3};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA44227.3}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44227.3};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA44227.3}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44227.3};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [6] {ECO:0000313|EnsemblMetazoa:AGAP006422-PA}
RP IDENTIFICATION.
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP006422-PA};
RG EnsemblMetazoa;
RL Submitted (JAN-2021) to UniProtKB.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 1 family.
CC {ECO:0000256|RuleBase:RU003690}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008960; EAA44227.3; -; Genomic_DNA.
DR RefSeq; XP_316460.3; XM_316460.4.
DR AlphaFoldDB; Q7PI98; -.
DR STRING; 7165.Q7PI98; -.
DR PaxDb; 7165-AGAP006422-PA; -.
DR EnsemblMetazoa; AGAP006422-RA; AGAP006422-PA; AGAP006422.
DR GeneID; 1277034; -.
DR KEGG; aga:AgaP_AGAP006422; -.
DR VEuPathDB; VectorBase:AGAP006422; -.
DR eggNOG; KOG0626; Eukaryota.
DR HOGENOM; CLU_001859_1_3_1; -.
DR InParanoid; Q7PI98; -.
DR OMA; DPSWPIC; -.
DR OrthoDB; 3373839at2759; -.
DR Proteomes; UP000007062; Chromosome 2L.
DR GO; GO:0008422; F:beta-glucosidase activity; IBA:GO_Central.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR001360; Glyco_hydro_1.
DR InterPro; IPR018120; Glyco_hydro_1_AS.
DR InterPro; IPR033132; Glyco_hydro_1_N_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR10353; GLYCOSYL HYDROLASE; 1.
DR PANTHER; PTHR10353:SF36; KLOTHO (MAMMALIAN AGING-ASSOCIATED PROTEIN) HOMOLOG; 1.
DR Pfam; PF00232; Glyco_hydro_1; 1.
DR PRINTS; PR00131; GLHYDRLASE1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS00572; GLYCOSYL_HYDROL_F1_1; 1.
DR PROSITE; PS00653; GLYCOSYL_HYDROL_F1_2; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|RuleBase:RU004468};
KW Hydrolase {ECO:0000256|RuleBase:RU004468};
KW Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..545
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014587735"
FT ACT_SITE 409
FT /note="Nucleophile"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10055"
SQ SEQUENCE 545 AA; 61562 MW; 295D16769E6DE905 CRC64;
MRSSVVALAF AAFGLLLVTG AIGQDAITRR FPDGFEFGVG TSAYQIEGGW NEDGKGESIW
DHLSHTVPSK IVDGSTGDVA CDSYHQWKRD VEMVNELGVQ YYRFSISWPR LMPTGLSNSV
NEKGIEYYNK LIDELLRNGI KPMVTLYHWD LPQRLQELGG WLNPAIVEYF REYVRVAFSS
FGDRVKLWTT INEPWHICEN GYGREEMAPG YDFPGVPAYM CGHHILLAHG EAVRLYRSTF
ESVQQGKIGI SLDARWPEPA HILSEDDREA SDWQLQFHLG WFAHPIFSAE GDYPSIMKER
IGNLSEAQGF PQSRLPVFTA REINLLRGSS DFFALNTYTT SLVSKNDANN TAGYPVPSYL
HDMGVVESAD PDWPVAEETS WIKIVPFGLH KLLLWIKDNY NSPVIYITEN GIGSGPGTKD
LQRVHYLNFY LNSVLVAIED GCDVRLYVAW SLMDNFEWRD GYTQKFGLYY VDFDDPARTR
YGKVSSKVFA RIVKTRTIDF NYQPEPDVLL TGIKANGTAV LPNVVVLAST VLLMVVGRRI
YATFG
//