GenomeNet

Database: UniProt
Entry: W5JK44_ANODA
LinkDB: W5JK44_ANODA
Original site: W5JK44_ANODA 
ID   W5JK44_ANODA            Unreviewed;       797 AA.
AC   W5JK44;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 47.
DE   RecName: Full=GH18 domain-containing protein {ECO:0000259|Pfam:PF00704};
GN   ORFNames=AND_003499 {ECO:0000313|EMBL:ETN64747.1};
OS   Anopheles darlingi (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN64747.1};
RN   [1] {ECO:0000313|EMBL:ETN64747.1, ECO:0000313|Proteomes:UP000000673}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA   Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT   "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT   the genome of the newly sequenced Anopheles darlingi.";
RL   BMC Genomics 11:529-529(2010).
RN   [2] {ECO:0000313|EMBL:ETN64747.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:ETN64747.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=23761445;
RA   Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA   Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA   Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA   Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA   de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA   Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA   Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA   Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA   Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA   de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA   Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA   Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA   Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA   Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA   Camargo E.P., de Vasconcelos A.T.;
RT   "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL   Nucleic Acids Res. 41:7387-7400(2013).
RN   [4] {ECO:0000313|EnsemblMetazoa:ADAC003499-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADMH02000889; ETN64747.1; -; Genomic_DNA.
DR   AlphaFoldDB; W5JK44; -.
DR   STRING; 43151.W5JK44; -.
DR   EnsemblMetazoa; ADAC003499-RA; ADAC003499-PA; ADAC003499.
DR   VEuPathDB; VectorBase:ADAC003499; -.
DR   VEuPathDB; VectorBase:ADAR2_005837; -.
DR   eggNOG; ENOG502RTKF; Eukaryota.
DR   HOGENOM; CLU_352746_0_0_1; -.
DR   OrthoDB; 3442517at2759; -.
DR   Proteomes; UP000000673; Unassembled WGS sequence.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   Gene3D; 3.10.50.10; -; 1.
DR   Gene3D; 3.20.20.80; Glycosidases; 2.
DR   InterPro; IPR029070; Chitinase_insertion_sf.
DR   InterPro; IPR001223; Glyco_hydro18_cat.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   PANTHER; PTHR43907:SF6; AGAP000789-PA; 1.
DR   PANTHER; PTHR43907; SLEI FAMILY PROTEIN; 1.
DR   Pfam; PF00704; Glyco_hydro_18; 2.
DR   SUPFAM; SSF51445; (Trans)glycosidases; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..32
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           33..797
FT                   /note="GH18 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5010155526"
FT   DOMAIN          72..273
FT                   /note="GH18"
FT                   /evidence="ECO:0000259|Pfam:PF00704"
FT   DOMAIN          547..650
FT                   /note="GH18"
FT                   /evidence="ECO:0000259|Pfam:PF00704"
FT   REGION          693..762
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   797 AA;  85579 MW;  6B2D2BAC4B7DF5A0 CRC64;
     MVTVAGDIAG RRRVSVAAVA LVVLSLLLSA TGVQSGKASG LVCIGTMGNF TANTAIGFCT
     SAVYIAAKPG ANGDVAYVNA NSATLLSGLT SFCSRKQAYP YVDLYVGVRA AGTDTNVATM
     LSDATVRVSF ITKLIQFVKT YTGCSGIYID LIGLTAAQSA NYGLFMDKLL TDATAATVKV
     ASALPWNAVK SVDIYYNPTL PKLAFNLLLT YEQTYTTIPT TVRPIAQLFT MDAPLDQIDQ
     TIFSNLFRWV IKGLNPKVII LGLPMYSQKY TVSGATGFGA LGSTAPVADT YCNLLPLLQE
     LLRLHQELLP LRQELLRLHL ELLPLRQELL PLRQELLPLL QELLPLHQEL LPLLQERLPL
     LQELLPLRQE LLPLHQELLP LHLELLPLLQ ERLPLHQELL PQPQPQQRLQ RLCEQLCHTA
     ICIDEQRVCL ARIADYGTAT AFGFCNQAVY LALRLNTNAA ITYVNTAMTS AAVEAGIRTY
     AGYKTTYPQV DFYLGVDSLP GSYETWITNA RATAISAITA QLRTYPTVAG VYIDIVNVPA
     NRGYNNVFRW VLSGIPTLKL ALGLPMYGLR FTAASATALG ATSTVVNPDT YCNALIFGVT
     NTAGAAQAGE GFAYSSSAMM VYNTFNSVID KLNFASATNL FGVGLYSMDQ AGSTNAELLR
     YVTSVLAPVP PAGVVYPAAE PATCQVPITF PVPPTTTTTP TTTTPTTTTP TTTTPTTTTP
     TTTTPTTTTP TTTTPTTTTP TTTTPTTTTP TTTTPTTTTY DYDNDRSEYG NLVSQYCSGI
     QNVTAFVVLG GTCGTAS
//
DBGET integrated database retrieval system