GenomeNet

Database: UniProt
Entry: A0A1Y9GLM8_ANOAR
LinkDB: A0A1Y9GLM8_ANOAR
Original site: A0A1Y9GLM8_ANOAR 
ID   A0A1Y9GLM8_ANOAR        Unreviewed;      3589 AA.
AC   A0A1Y9GLM8;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   27-MAR-2024, entry version 33.
DE   RecName: Full=Mucin-5AC {ECO:0008006|Google:ProtNLM};
OS   Anopheles arabiensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1, ECO:0000313|Proteomes:UP000075840};
RN   [1] {ECO:0000313|Proteomes:UP000075840}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1}
RP   IDENTIFICATION.
RC   STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1};
RG   EnsemblMetazoa;
RL   Submitted (AUG-2022) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the glycosyl hydrolase 18 family. Chitinase
CC       class II subfamily. {ECO:0000256|ARBA:ARBA00009121}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APCN01003924; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EnsemblMetazoa; AARA016405-RA; AARA016405-PA; AARA016405.
DR   VEuPathDB; VectorBase:AARA016405; -.
DR   VEuPathDB; VectorBase:AARA21_005994; -.
DR   Proteomes; UP000075840; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR   GO; GO:0008061; F:chitin binding; IEA:UniProtKB-KW.
DR   GO; GO:0004568; F:chitinase activity; IEA:UniProt.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   GO; GO:0006032; P:chitin catabolic process; IEA:UniProt.
DR   CDD; cd02872; GH18_chitolectin_chitotriosidase; 1.
DR   Gene3D; 3.10.50.10; -; 1.
DR   Gene3D; 2.170.140.10; Chitin binding domain; 2.
DR   Gene3D; 3.20.20.80; Glycosidases; 1.
DR   InterPro; IPR002557; Chitin-bd_dom.
DR   InterPro; IPR036508; Chitin-bd_dom_sf.
DR   InterPro; IPR011583; Chitinase_II.
DR   InterPro; IPR029070; Chitinase_insertion_sf.
DR   InterPro; IPR001223; Glyco_hydro18_cat.
DR   InterPro; IPR001579; Glyco_hydro_18_chit_AS.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   PANTHER; PTHR11177; CHITINASE; 1.
DR   PANTHER; PTHR11177:SF400; CHITINASE 6, ISOFORM C; 1.
DR   Pfam; PF01607; CBM_14; 2.
DR   Pfam; PF00704; Glyco_hydro_18; 1.
DR   SMART; SM00494; ChtBD2; 2.
DR   SMART; SM00636; Glyco_18; 1.
DR   SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR   SUPFAM; SSF54556; Chitinase insertion domain; 1.
DR   SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 2.
DR   PROSITE; PS50940; CHIT_BIND_II; 2.
DR   PROSITE; PS01095; GH18_1; 1.
DR   PROSITE; PS51910; GH18_2; 1.
PE   3: Inferred from homology;
KW   Chitin-binding {ECO:0000256|ARBA:ARBA00022669};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
SQ   SEQUENCE   3589 AA;  388107 MW;  154A09A4742B18F1 CRC64;
     MFPLHKGAEL FLAIAILALW SGNESAAADN NQRVVCYYTN WSVYRPGTAK FTPQNINPYL
     CTHLIYSFGG FTKENTLKPY DKYQDIEQGG FAKFTGLKTY NKNLKTLLAI GGWNEGSSRF
     SPLVADAERR NQFVKNTIKF LRQNHFDGLD LDWEYPAFRD GSKPKDRENY AQLVQELREE
     FERESSKTGR PRLLLTMAVP AGIEYVEKGY DVPKLNKYLD WFNLLSYDYH SAYEPAVNHH
     SPLFSLEEAS EYNFDSELNI DYSVKFYLKA GADRDKLVLG IPTYGRSYTL YNPDATDIGA
     PADGPGEQGD ATREKGYLAY YEICSSLKDS NDWTVVQPNP KAMGPYAYKG NQWVGYDDEA
     IARRKAKYVA ENGLGGIMFW SIDNDDFRGT CHGKPYPIIE AAKEALLAST EVGINDVASS
     GRPRKPSRSR SRPGSATRNR VNTDNNNEIK ASFKTSQGRK VARPVRVTTT STTTTTTTED
     SLYIGGRTTT PQPPTTPDPG ADFKCEDEGF FPHPRDCKKY FWCLDSPSLG LVAHQFTCPS
     GLVFNKLADS CDYARNVICS KTAPSTTSST TSTTTSTTTT AAAPTTSTTA RPRITVATNR
     NNFFNRGIFT TTTTTTTAEP TVEADYSDED NTETQPEEDP KVIKELIALI KKVGGIEELE
     KQLQAQEDGS ILLKGAEADP VSTTAPTISK SLYERVLSRT GNVGQKFRPA ITFTQQQDRA
     GSLPENKYSS VVRNSYTNSR VGPQNEGLDQ LPEFEGVFRE KPKYVTLQRA RPTKAASIED
     RYDDDEDIDE EENQPSTFTT GRPTAVPKYV SIRRQRPTTV AEQPQPEVDA EEPAEDEQEE
     EEEEPQTVNR NWYRKPTEAS QPTESEPERS SPNRYNTIDR RRTTSVPYTS SRLQETERDD
     VGTDSSLLVP STDFQTIYTT ASTTTTTLEP SPQSSEPNLI PYSTRQYSSV ERVSGNTISD
     DGDDDDVDGD DRSGNPSSSS SSDSSNDSDS STSAGPKPTT AGLDSDRASD AEVTDTTVPI
     GPTLHRADDT GSSSDGSSAS GWPADTTLEV TTAVPNTGES VTGSSVLDAG LVGGSDTFAN
     PDRRFGSPYE GPEDDETVAP SVTTTASSSS SSSSRATATT ATTTPSVPST TVATSPPTND
     ATSNSVFKFT RTRKPATTTT TTTIATTDTP LEPVTNPSRR IGGAAPVRAA PRPFVAGARR
     TRPTRVPDAA TTTAAAAVPD PTTLRTVKVS FSNAQRFDLS ASRQRRVGSA DSAEQQPPQE
     GPAGEQPAAR APAARGNRGR VRFRTAPEPA PAARSTPLSY DEPSRPPVPF AGSSSSTTDI
     TTTTTQTVSR RRTRPTAASE QLAANRFALG RTTTLEPLTT VTERGSLRRV NFSLYSGRRS
     SESPVPSPTT STEEPRIETE PVTPEIRSYT RIADDPLMTQ ERILLTLGTA LRYGFSSRNE
     LTTGGTTAAQ AFTTVQDSFS TVAPAPARST FRGRPFGRSE SVDSTVNSLL AESLNDMSTE
     LPSQAEDATD SRLLNEITTL YNVNAAEVDT TTPQPRKLAS VRYSPPSTTE AVDTTLNVVV
     SEVNTNKTFI PGRTRPTRPA RLRTTAADTS AQTPDSTDAT TRAPSRRPFA PRRPAQEGSL
     AARLTPTEEP LGNDVTERAT GISTTKRQRA NRRRTTTAAP STTTPAAAST VEDRSPLPRS
     RASFNTEQRR QRLPGNRLLK PASTTPTTTT ESFTVEPTFP DSNVANSDFT TVITDTSNEI
     APGGYDLLNS DNLLSLPNLR TGEQADEDED SATETLSTLT ASATEPNPSS NLVAGGPAKA
     EPPRPNDSQQ IESESIPARA STRGFGATRR KFTGKFVTTE APEPTEPIKK FVPTRRNRPS
     FSLARSGRLS TTTEEPASVG SADGDSSAPP AGPSPFVPSR KRKPLPLGGR ASSTTERSSG
     ESGAGDDTAP PAARPTRRRS AYSLNRPARP QAFENSFDKA TVAAKPVEQK SINRTSVLLP
     KRPNLFVRTT TTTTTSTTAA PPTAYGGDAT DRTTLADRFG SSEEQRYSGE ENEGDNEIQR
     QGKRLFGQLG SDASNSLLLP GEVETERPRP EPTFGGVRRP GGRVGGITPT RPQPSSGSNR
     FIFKNNQPAT ASGNEGSAGS STTQRTFTTR TRRPFGNVGG YKPTAQNGTG LVSGDGAGPV
     AGSVTTPRAR FTPTTRTRPT FGRLRSTTAA LGSRLNVEDE RAAPTLNELS GEQGGGFATR
     RYQPRKPTGG RRNFTAPAAG GRNVTLPGVL LLTTREPAEG AAQTTPDANG ETTLPATVEY
     ATDDTKQSSA TESDALAPTT TPTATDTITT NSITTTTTTT TTTKTTTNNN NNNNEAENTT
     SYEDTTPNTF RTTTEYYTAS DETHYTTLNT ADTIYTINDN IDIDDVNVLQ DRGRTSTTTV
     KPTTLYHVFS IDKENESVPS STEIYRTEEQ EIELPAANRT DKLVKIHRVV EIYTKNASNP
     DELPVMQKLG EINRKIIIRL VEPNRANRTA GGHGSTSSSY TTTTLGGAAG GAAAPVTERV
     EEDESEDGRG RPMVLSSNSV FTVETSTIPL EGLFDPEKGA GFPTTELPQG AGEESQTVAP
     YPEIQRIASA DFSADRPLDQ DDEPTAVTQQ QQQQPAVEQV TSLRPVDFSD GTDGDSAPPH
     TTNYYGAVEE EAEEEEQTTR VTLEADDTTL RSSSTARTTP VVSSSTQREI VHKYTLPAEF
     SFQEPSSETT TLKTTTTTTT TTTTTMTPDT TTTSTSTPTT TTAAAPELLT TTVDDVPDLP
     TVPDRVQSSL VLPSRESKKK YTKRPDGPAV RVPSDFAYSR DSSAEADGER ATEPSYQTEP
     PVQTTTKATR HYTNNRKYVS SLSLRPNLNG PEPAQAAPSY RKSKDIPVVP LPRTVSSKSS
     TQRAQSDTTT STTTAASSTS TTTTTTTTPA PITKIDFNIK YVLPVSLPDA AAHELKIESK
     NDKYSVPLSP ILRPDFVTRE ISGERQDRQF SPVYRPELDN DELTRQLTSD ADPVALEAEH
     LDREILGGGE DGADAGQLSE ESVGPVSEQL VGGGVLPSSA AYFLRRTAEV RPQYRPSLRR
     PSGSGKPSGL LQASREQLDT TTERTKSTRR SEVPSASGEQ ATSDPATTTT ATTPPSLLTR
     RRRPTTASNY LAKIANRQRP AAATAVMSST EETTTTTSRP DGSPSRGSRP NLTSNGRRKY
     GPANRTGYTG SSPTAGSESS TTASIPSASS ETSPEVSTRP VNSLFKRRNN GSPLTRTTTP
     TPLESIPSSS AAPVSSKFFK KNKYRTGALS TGPGAERSEQ PVPPASQSLD AGPGVDQNIE
     PSIRQNSKLY NAKNRYQQIG EPEEDGAEPI EQELLDALTT LSRAPLPVVS TTTVRNNGYE
     SVVDKEWVRQ QQKQPAQQQY AAEVNGRATT HRYNVTHDYP NAVGYGSVPA QATSTHRLPP
     PVQRPRIATH SEYYYQQTTP GLIYTSPIAT ATISPRQDTT VLSQQFPRNH QPAHYERQSA
     SSGRRKQKDD VTGRTYRPSP LDYDYYDDGD TRKVGKSSSQ VKVIMHGPGI IECLDQGNFP
     HPLSCKKFIS CAKMEIGGVI GWEYTCPKGL SYDPVGGICN WSAGLGCNE
//
DBGET integrated database retrieval system