ID A0A1Y9GLM8_ANOAR Unreviewed; 3589 AA.
AC A0A1Y9GLM8;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=Mucin-5AC {ECO:0008006|Google:ProtNLM};
OS Anopheles arabiensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1, ECO:0000313|Proteomes:UP000075840};
RN [1] {ECO:0000313|Proteomes:UP000075840}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1}
RP IDENTIFICATION.
RC STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA016405-PA.1};
RG EnsemblMetazoa;
RL Submitted (AUG-2022) to UniProtKB.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 18 family. Chitinase
CC class II subfamily. {ECO:0000256|ARBA:ARBA00009121}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APCN01003924; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EnsemblMetazoa; AARA016405-RA; AARA016405-PA; AARA016405.
DR VEuPathDB; VectorBase:AARA016405; -.
DR VEuPathDB; VectorBase:AARA21_005994; -.
DR Proteomes; UP000075840; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:UniProtKB-KW.
DR GO; GO:0004568; F:chitinase activity; IEA:UniProt.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR GO; GO:0006032; P:chitin catabolic process; IEA:UniProt.
DR CDD; cd02872; GH18_chitolectin_chitotriosidase; 1.
DR Gene3D; 3.10.50.10; -; 1.
DR Gene3D; 2.170.140.10; Chitin binding domain; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR InterPro; IPR011583; Chitinase_II.
DR InterPro; IPR029070; Chitinase_insertion_sf.
DR InterPro; IPR001223; Glyco_hydro18_cat.
DR InterPro; IPR001579; Glyco_hydro_18_chit_AS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR11177; CHITINASE; 1.
DR PANTHER; PTHR11177:SF400; CHITINASE 6, ISOFORM C; 1.
DR Pfam; PF01607; CBM_14; 2.
DR Pfam; PF00704; Glyco_hydro_18; 1.
DR SMART; SM00494; ChtBD2; 2.
DR SMART; SM00636; Glyco_18; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF54556; Chitinase insertion domain; 1.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 2.
DR PROSITE; PS50940; CHIT_BIND_II; 2.
DR PROSITE; PS01095; GH18_1; 1.
DR PROSITE; PS51910; GH18_2; 1.
PE 3: Inferred from homology;
KW Chitin-binding {ECO:0000256|ARBA:ARBA00022669};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
SQ SEQUENCE 3589 AA; 388107 MW; 154A09A4742B18F1 CRC64;
MFPLHKGAEL FLAIAILALW SGNESAAADN NQRVVCYYTN WSVYRPGTAK FTPQNINPYL
CTHLIYSFGG FTKENTLKPY DKYQDIEQGG FAKFTGLKTY NKNLKTLLAI GGWNEGSSRF
SPLVADAERR NQFVKNTIKF LRQNHFDGLD LDWEYPAFRD GSKPKDRENY AQLVQELREE
FERESSKTGR PRLLLTMAVP AGIEYVEKGY DVPKLNKYLD WFNLLSYDYH SAYEPAVNHH
SPLFSLEEAS EYNFDSELNI DYSVKFYLKA GADRDKLVLG IPTYGRSYTL YNPDATDIGA
PADGPGEQGD ATREKGYLAY YEICSSLKDS NDWTVVQPNP KAMGPYAYKG NQWVGYDDEA
IARRKAKYVA ENGLGGIMFW SIDNDDFRGT CHGKPYPIIE AAKEALLAST EVGINDVASS
GRPRKPSRSR SRPGSATRNR VNTDNNNEIK ASFKTSQGRK VARPVRVTTT STTTTTTTED
SLYIGGRTTT PQPPTTPDPG ADFKCEDEGF FPHPRDCKKY FWCLDSPSLG LVAHQFTCPS
GLVFNKLADS CDYARNVICS KTAPSTTSST TSTTTSTTTT AAAPTTSTTA RPRITVATNR
NNFFNRGIFT TTTTTTTAEP TVEADYSDED NTETQPEEDP KVIKELIALI KKVGGIEELE
KQLQAQEDGS ILLKGAEADP VSTTAPTISK SLYERVLSRT GNVGQKFRPA ITFTQQQDRA
GSLPENKYSS VVRNSYTNSR VGPQNEGLDQ LPEFEGVFRE KPKYVTLQRA RPTKAASIED
RYDDDEDIDE EENQPSTFTT GRPTAVPKYV SIRRQRPTTV AEQPQPEVDA EEPAEDEQEE
EEEEPQTVNR NWYRKPTEAS QPTESEPERS SPNRYNTIDR RRTTSVPYTS SRLQETERDD
VGTDSSLLVP STDFQTIYTT ASTTTTTLEP SPQSSEPNLI PYSTRQYSSV ERVSGNTISD
DGDDDDVDGD DRSGNPSSSS SSDSSNDSDS STSAGPKPTT AGLDSDRASD AEVTDTTVPI
GPTLHRADDT GSSSDGSSAS GWPADTTLEV TTAVPNTGES VTGSSVLDAG LVGGSDTFAN
PDRRFGSPYE GPEDDETVAP SVTTTASSSS SSSSRATATT ATTTPSVPST TVATSPPTND
ATSNSVFKFT RTRKPATTTT TTTIATTDTP LEPVTNPSRR IGGAAPVRAA PRPFVAGARR
TRPTRVPDAA TTTAAAAVPD PTTLRTVKVS FSNAQRFDLS ASRQRRVGSA DSAEQQPPQE
GPAGEQPAAR APAARGNRGR VRFRTAPEPA PAARSTPLSY DEPSRPPVPF AGSSSSTTDI
TTTTTQTVSR RRTRPTAASE QLAANRFALG RTTTLEPLTT VTERGSLRRV NFSLYSGRRS
SESPVPSPTT STEEPRIETE PVTPEIRSYT RIADDPLMTQ ERILLTLGTA LRYGFSSRNE
LTTGGTTAAQ AFTTVQDSFS TVAPAPARST FRGRPFGRSE SVDSTVNSLL AESLNDMSTE
LPSQAEDATD SRLLNEITTL YNVNAAEVDT TTPQPRKLAS VRYSPPSTTE AVDTTLNVVV
SEVNTNKTFI PGRTRPTRPA RLRTTAADTS AQTPDSTDAT TRAPSRRPFA PRRPAQEGSL
AARLTPTEEP LGNDVTERAT GISTTKRQRA NRRRTTTAAP STTTPAAAST VEDRSPLPRS
RASFNTEQRR QRLPGNRLLK PASTTPTTTT ESFTVEPTFP DSNVANSDFT TVITDTSNEI
APGGYDLLNS DNLLSLPNLR TGEQADEDED SATETLSTLT ASATEPNPSS NLVAGGPAKA
EPPRPNDSQQ IESESIPARA STRGFGATRR KFTGKFVTTE APEPTEPIKK FVPTRRNRPS
FSLARSGRLS TTTEEPASVG SADGDSSAPP AGPSPFVPSR KRKPLPLGGR ASSTTERSSG
ESGAGDDTAP PAARPTRRRS AYSLNRPARP QAFENSFDKA TVAAKPVEQK SINRTSVLLP
KRPNLFVRTT TTTTTSTTAA PPTAYGGDAT DRTTLADRFG SSEEQRYSGE ENEGDNEIQR
QGKRLFGQLG SDASNSLLLP GEVETERPRP EPTFGGVRRP GGRVGGITPT RPQPSSGSNR
FIFKNNQPAT ASGNEGSAGS STTQRTFTTR TRRPFGNVGG YKPTAQNGTG LVSGDGAGPV
AGSVTTPRAR FTPTTRTRPT FGRLRSTTAA LGSRLNVEDE RAAPTLNELS GEQGGGFATR
RYQPRKPTGG RRNFTAPAAG GRNVTLPGVL LLTTREPAEG AAQTTPDANG ETTLPATVEY
ATDDTKQSSA TESDALAPTT TPTATDTITT NSITTTTTTT TTTKTTTNNN NNNNEAENTT
SYEDTTPNTF RTTTEYYTAS DETHYTTLNT ADTIYTINDN IDIDDVNVLQ DRGRTSTTTV
KPTTLYHVFS IDKENESVPS STEIYRTEEQ EIELPAANRT DKLVKIHRVV EIYTKNASNP
DELPVMQKLG EINRKIIIRL VEPNRANRTA GGHGSTSSSY TTTTLGGAAG GAAAPVTERV
EEDESEDGRG RPMVLSSNSV FTVETSTIPL EGLFDPEKGA GFPTTELPQG AGEESQTVAP
YPEIQRIASA DFSADRPLDQ DDEPTAVTQQ QQQQPAVEQV TSLRPVDFSD GTDGDSAPPH
TTNYYGAVEE EAEEEEQTTR VTLEADDTTL RSSSTARTTP VVSSSTQREI VHKYTLPAEF
SFQEPSSETT TLKTTTTTTT TTTTTMTPDT TTTSTSTPTT TTAAAPELLT TTVDDVPDLP
TVPDRVQSSL VLPSRESKKK YTKRPDGPAV RVPSDFAYSR DSSAEADGER ATEPSYQTEP
PVQTTTKATR HYTNNRKYVS SLSLRPNLNG PEPAQAAPSY RKSKDIPVVP LPRTVSSKSS
TQRAQSDTTT STTTAASSTS TTTTTTTTPA PITKIDFNIK YVLPVSLPDA AAHELKIESK
NDKYSVPLSP ILRPDFVTRE ISGERQDRQF SPVYRPELDN DELTRQLTSD ADPVALEAEH
LDREILGGGE DGADAGQLSE ESVGPVSEQL VGGGVLPSSA AYFLRRTAEV RPQYRPSLRR
PSGSGKPSGL LQASREQLDT TTERTKSTRR SEVPSASGEQ ATSDPATTTT ATTPPSLLTR
RRRPTTASNY LAKIANRQRP AAATAVMSST EETTTTTSRP DGSPSRGSRP NLTSNGRRKY
GPANRTGYTG SSPTAGSESS TTASIPSASS ETSPEVSTRP VNSLFKRRNN GSPLTRTTTP
TPLESIPSSS AAPVSSKFFK KNKYRTGALS TGPGAERSEQ PVPPASQSLD AGPGVDQNIE
PSIRQNSKLY NAKNRYQQIG EPEEDGAEPI EQELLDALTT LSRAPLPVVS TTTVRNNGYE
SVVDKEWVRQ QQKQPAQQQY AAEVNGRATT HRYNVTHDYP NAVGYGSVPA QATSTHRLPP
PVQRPRIATH SEYYYQQTTP GLIYTSPIAT ATISPRQDTT VLSQQFPRNH QPAHYERQSA
SSGRRKQKDD VTGRTYRPSP LDYDYYDDGD TRKVGKSSSQ VKVIMHGPGI IECLDQGNFP
HPLSCKKFIS CAKMEIGGVI GWEYTCPKGL SYDPVGGICN WSAGLGCNE
//