GenomeNet

Database: UniProt
Entry: Q7PYQ3_ANOGA
ID   Q7PYQ3_ANOGA            Unreviewed;       630 AA.
AC   Q7PYQ3;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 5.
DT   16-JAN-2019, entry version 111.
DE   SubName: Full=AGAP002058-PA {ECO:0000313|EMBL:EAA01064.5};
DE   SubName: Full=Beta-galactosidase {ECO:0000313|VectorBase:AGAP002058-PA};
GN   Name=4576940 {ECO:0000313|VectorBase:AGAP002058-PA};
GN   ORFNames=AgaP_AGAP002058 {ECO:0000313|EMBL:EAA01064.5};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC   Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea;
OC   Culicidae; Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA01064.5, ECO:0000313|Proteomes:UP000007062};
RN   [1] {ECO:0000313|EMBL:EAA01064.5, ECO:0000313|Proteomes:UP000007062, ECO:0000313|VectorBase:AGAP002058-PA}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA01064.5,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B.,
RA   Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P.,
RA   Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V.,
RA   Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S.,
RA   Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M.,
RA   Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I.,
RA   Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z.,
RA   Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R.,
RA   Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E.,
RA   Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F.,
RA   Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R.,
RA   Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C.,
RA   Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V.,
RA   Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D.,
RA   Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H.,
RA   Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A.,
RA   Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F.,
RA   Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S.,
RA   Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C.,
RA   Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA01064.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA01064.5};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA01064.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA01064.5};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA01064.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA01064.5};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5-R5(2007).
RN   [5] {ECO:0000313|EMBL:EAA01064.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA01064.5};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|VectorBase:AGAP002058-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|VectorBase:AGAP002058-PA};
RG   VectorBase;
RL   Submitted (FEB-2017) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the glycosyl hydrolase 35 family.
CC       {ECO:0000256|RuleBase:RU003679, ECO:0000256|SAAS:SAAS00534244}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; AAAB01008987; EAA01064.5; -; Genomic_DNA.
DR   RefSeq; XP_320991.5; XM_320991.5.
DR   ProteinModelPortal; Q7PYQ3; -.
DR   STRING; 7165.AGAP002058-PA; -.
DR   EnsemblMetazoa; AGAP002058-RA; AGAP002058-PA; AGAP002058.
DR   GeneID; 4576940; -.
DR   KEGG; aga:AgaP_AGAP002058; -.
DR   VectorBase; AGAP002058-RA; AGAP002058-PA; AGAP002058.
DR   eggNOG; KOG0496; Eukaryota.
DR   eggNOG; COG1874; LUCA.
DR   HOGENOM; HOG000221607; -.
DR   InParanoid; Q7PYQ3; -.
DR   KO; K12309; -.
DR   OrthoDB; 179316at2759; -.
DR   PhylomeDB; Q7PYQ3; -.
DR   Proteomes; UP000007062; Chromosome 2R.
DR   GO; GO:0005773; C:vacuole; IBA:GO_Central.
DR   GO; GO:0004565; F:beta-galactosidase activity; IBA:GO_Central.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   Gene3D; 2.60.120.260; -; 2.
DR   InterPro; IPR026283; B-gal_1-like.
DR   InterPro; IPR025300; BetaGal_jelly_roll_dom.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR031330; Gly_Hdrlase_35_cat.
DR   InterPro; IPR001944; Glycoside_Hdrlase_35.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   PANTHER; PTHR23421; PTHR23421; 1.
DR   Pfam; PF13364; BetaGal_dom4_5; 1.
DR   Pfam; PF01301; Glyco_hydro_35; 1.
DR   PIRSF; PIRSF006336; B-gal; 1.
DR   PRINTS; PR00742; GLHYDRLASE35.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   SUPFAM; SSF51445; SSF51445; 1.
PE   3: Inferred from homology;
KW   Complete proteome {ECO:0000313|Proteomes:UP000007062};
KW   Glycosidase {ECO:0000256|SAAS:SAAS00108888};
KW   Hydrolase {ECO:0000256|SAAS:SAAS00108869};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     18       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        19    630       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5014588005.
FT   DOMAIN       41    357       Glyco_hydro_35. {ECO:0000259|Pfam:
FT                                PF01301}.
FT   DOMAIN      536    609       BetaGal_dom4_5. {ECO:0000259|Pfam:
FT                                PF13364}.
FT   ACT_SITE    189    189       Proton donor. {ECO:0000256|PIRSR:
FT                                PIRSR006336-1}.
FT   ACT_SITE    267    267       Nucleophile. {ECO:0000256|PIRSR:
FT                                PIRSR006336-1}.
SQ   SEQUENCE   630 AA;  71756 MW;  D14ECE51A927836B CRC64;
     MIAYRVLPLL ALCLSGWAAT IEAAAQQPPR KFDIDFQNDT FTKDGQPFQF ISGSFHYFRA
     LPESWRHILR SMRAAGLNTV MTYIEWSLHE PMPGQYQWEG IANLEEFIEI AQSENLFVIL
     RPGPYICAER DMGGFPHWLL TKYPSIKLRT YDTDYLREVQ NWYNQLMPRL VRYLYGNGGP
     VIMVSIENEY GSFKACDGQY MQFLKNLTVH FVQDKAVLFT NDGPELLKCG SIPGILPTLD
     FGITNNPNAF WQQLRKYLPK GPLVNAEYYP GWLTHWMEPT ARVDAGMVVN TLKLMLNQKA
     NVNFYMFFGG TNFGFTAGAN DVGPGKYSAD ITSYDYDAPL DEAGDPTPKY FAIRKVLVEY
     FGDPGVPAPV KLPKMTLETV WLERRGSMLS KHGRTMLAQK IVTSVTPVSF EALNQHSGFV
     LYETQLPAGY NRDPYTLKVE NLHDRAYVHI DGTFAGILSR ETNTNTIPLS VGLGTRLQLL
     VESQGRINYN IPNDFKGILG TVTVDAKPLY NWTITSFPLD SYRYLENFLT QQPTEQEDLD
     GAGAQVYYGT FTISSDTIYD TYLYPSVWGK GLVFINGFNL GRYWPLAGPQ ITLYVPRHIL
     KKGNNQIVMI EYQQHIQHPY VQFIDKPIFM
//