GenomeNet

Database: UniProt/TrEMBL
Entry: Q7Q8I5_ANOGA
ID   Q7Q8I5_ANOGA            Unreviewed;       596 AA.
AC   Q7Q8I5;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 4.
DT   27-MAR-2024, entry version 134.
DE   RecName: Full=Polypeptide N-acetylgalactosaminyltransferase {ECO:0000256|RuleBase:RU361242};
DE            EC=2.4.1.- {ECO:0000256|RuleBase:RU361242};
DE   AltName: Full=Protein-UDP acetylgalactosaminyltransferase {ECO:0000256|RuleBase:RU361242};
DE   Flags: Fragment;
GN   ORFNames=AgaP_AGAP008613 {ECO:0000313|EMBL:EAA10180.4};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA10180.4};
RN   [1] {ECO:0000313|EMBL:EAA10180.4}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA10180.4};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA10180.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA10180.4};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA10180.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA10180.4};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA10180.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA10180.4};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA10180.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA10180.4};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- COFACTOR:
CC       Name=Mn(2+); Xref=ChEBI:CHEBI:29035;
CC         Evidence={ECO:0000256|RuleBase:RU361242};
CC   -!- PATHWAY: Protein modification; protein glycosylation.
CC       {ECO:0000256|RuleBase:RU361242}.
CC   -!- SUBCELLULAR LOCATION: Golgi apparatus membrane
CC       {ECO:0000256|ARBA:ARBA00004323, ECO:0000256|RuleBase:RU361242}; Single-
CC       pass type II membrane protein {ECO:0000256|ARBA:ARBA00004323,
CC       ECO:0000256|RuleBase:RU361242}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004606}; Single-pass type II membrane protein
CC       {ECO:0000256|ARBA:ARBA00004606}.
CC   -!- SIMILARITY: Belongs to the glycosyltransferase 2 family. GalNAc-T
CC       subfamily. {ECO:0000256|ARBA:ARBA00005680,
CC       ECO:0000256|RuleBase:RU361242}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAA10180.4}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008944; EAA10180.4; -; Genomic_DNA.
DR   RefSeq; XP_314708.4; XM_314708.4.
DR   AlphaFoldDB; Q7Q8I5; -.
DR   STRING; 7165.Q7Q8I5; -.
DR   PaxDb; 7165-AGAP008613-PA; -.
DR   GeneID; 1275461; -.
DR   KEGG; aga:AgaP_AGAP008613; -.
DR   VEuPathDB; VectorBase:AGAP008613; -.
DR   eggNOG; KOG3736; Eukaryota.
DR   HOGENOM; CLU_013477_0_1_1; -.
DR   PhylomeDB; Q7Q8I5; -.
DR   UniPathway; UPA00378; -.
DR   GO; GO:0000139; C:Golgi membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR   GO; GO:0016757; F:glycosyltransferase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006486; P:protein glycosylation; IEA:UniProtKB-UniPathway.
DR   CDD; cd02510; pp-GalNAc-T; 1.
DR   CDD; cd00161; RICIN; 1.
DR   Gene3D; 2.80.10.50; -; 1.
DR   InterPro; IPR045885; GalNAc-T.
DR   InterPro; IPR001173; Glyco_trans_2-like.
DR   InterPro; IPR029044; Nucleotide-diphossugar_trans.
DR   InterPro; IPR035992; Ricin_B-like_lectins.
DR   InterPro; IPR000772; Ricin_B_lectin.
DR   PANTHER; PTHR11675; N-ACETYLGALACTOSAMINYLTRANSFERASE; 1.
DR   PANTHER; PTHR11675:SF140; POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 5; 1.
DR   Pfam; PF00535; Glycos_transf_2; 1.
DR   Pfam; PF00652; Ricin_B_lectin; 1.
DR   SMART; SM00458; RICIN; 1.
DR   SUPFAM; SSF53448; Nucleotide-diphospho-sugar transferases; 1.
DR   SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR   PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW   ECO:0000256|RuleBase:RU361242};
KW   Glycosyltransferase {ECO:0000256|RuleBase:RU361242};
KW   Golgi apparatus {ECO:0000256|ARBA:ARBA00023034,
KW   ECO:0000256|RuleBase:RU361242};
KW   Lectin {ECO:0000256|ARBA:ARBA00022734, ECO:0000256|RuleBase:RU361242};
KW   Manganese {ECO:0000256|RuleBase:RU361242};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136};
KW   Signal-anchor {ECO:0000256|ARBA:ARBA00022968};
KW   Transferase {ECO:0000256|RuleBase:RU361242};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT   DOMAIN          455..578
FT                   /note="Ricin B lectin"
FT                   /evidence="ECO:0000259|SMART:SM00458"
FT   REGION          1..71
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:EAA10180.4"
SQ   SEQUENCE   596 AA;  68241 MW;  1766DDA996795769 CRC64;
     VFHFQVPNAA RQSQQQRRAA APEDDDPIVD AEEIDTNSNN FGDPGGYGGG GGGRDADSSM
     PRTYRPQELK KWRQAPTVAE NYGRPGEMGK PVKIPANQQE LMKEKFKENQ FNLLASDMIW
     LNRSLTDVRH HDCKKKHYPA KLPTTSIVIV FHNEAWSTLL RTIWSVINRS PRPLLKEIIL
     VDDASEREHL GRQLEEYVRT LPVPTFVLRT GKRSGLIRAR LLGAKHVKGQ VITFLDAHCE
     CTEGWLEPLL ARIVLDRKTV VCPIIDVISD ETFEYVTASD QTWGGFNWKL NFRWYRVPAR
     EMQRRNHDRT APLRTPTMAG GLFSIDRDYF YEIGSYDEGM DIWGGENLEM SFRIWQCGGI
     LEISPCSHVG HVFRDKSPYT FPGGVANIVL KNAARVAEVW LDEWKEFYYQ MSPGARKASA
     GDVSERRALR ERLKCKSFRW YLENIYPESQ MPLDYYFLGE IRNVKTHNCL DTMGRKSNEK
     IGSSYCHGLG GNQVFAYTKR HQIMSDDNCL DASNALGPVN LVRCHGMGGN QEWIYDDEEK
     TIKHVNSGNC LTRASEDDPS TPLLRPCNYS EGQQWLMQSK FKWQAHHGDN RIGEDR
//