
Database: UniProt
Entry: A0A0M8KEY9_9BACT
ID   A0A0M8KEY9_9BACT        Unreviewed;       655 AA.
AC   A0A0M8KEY9;
DT   09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT   09-DEC-2015, sequence version 1.
DT   27-SEP-2017, entry version 10.
DE   SubName: Full=Alpha-L-arabinofuranosidase {ECO:0000313|EMBL:GAP72675.1};
GN   ORFNames=SAMD00024442_4_21 {ECO:0000313|EMBL:GAP72675.1};
OS   Candidatus Symbiothrix dinenymphae.
OC   Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales;
OC   Candidatus Symbiothrix.
OX   NCBI_TaxID=467085 {ECO:0000313|EMBL:GAP72675.1, ECO:0000313|Proteomes:UP000050180};
RN   [1] {ECO:0000313|Proteomes:UP000050180}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B4-10h {ECO:0000313|Proteomes:UP000050180};
RX   PubMed=26079531;
RA   Yuki M., Kuwahara H., Shintani M., Izawa K., Sato T., Starns D.,
RA   Hongoh Y., Ohkuma M.;
RT   "Dominant ectosymbiotic bacteria of cellulolytic protists in the
RT   termite gut also have the potential to digest lignocellulose.";
RL   Environ. Microbiol. 0:0-0(2015).
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:GAP72675.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; BBRT01000208; GAP72675.1; -; Genomic_DNA.
DR   EnsemblBacteria; GAP72675; GAP72675; SAMD00024442_4_21.
DR   Proteomes; UP000050180; Unassembled WGS sequence.
DR   GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR   Gene3D; 2.60.120.260; -; 1.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   SUPFAM; SSF51445; SSF51445; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000050180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050180};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     22       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        23    655       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5005817764.
FT   DOMAIN      110    206       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   655 AA;  72507 MW;  768B967C941EDE09 CRC64;
     MKTNKFFLGL IVLAVSTANP LAAQKIDLGK VTDTKIEETL FGHNLEHTRS AVYQGLSAQL
     LRNRKFAGKP AAHFGEPAEW YRVGAANAYI TVEPHGAYVK HVGNTWKNFN NSNEINSAVI
     QNPQPNQKAG IGQRGLALKG GAAYTAKFIA RVRGDELPLK VTLTVTGSDG KVQGEKTFSL
     EGGDWRICEF TFTSQNDDTN ASFEISCSQQ AELKLGVASL MPTDNFRGMR KDAVALMDEI
     GISLLRWPGG NFSGEYRWKD GLLDVDERAP LFSYLPAETQ PHSGGYDFHE IGIDDFIALC
     REIGAEPYLT INLAWDTPEE AAQWVEYCNG STDTEWGKKR AERGYTEPYN VKYWSLGNEF
     GHGHMEGLNT PEDYAKKANS CAEAIKKVDP SIKFFSSGPY FPGGRPADWI TKGLAPMAKH
     ISYLSFHAYQ WDFVHGVDFA TDKGLKESYE RVTGAPAGWL NGLRKSRAFL DSQNEDIKKI
     AISFDEWNVF YAWFHDPCVI EGVFTALTLE MICKEYKALN MPVCMYFQPV NEGAIMIHPL
     TSELTANGQV FALMKKHRGG TLVDIRSSDA DLHCLGSVDK GKRFTVTLIN KSYDKPIPFI
     PGKGLKKIQN AALLDGSGSI FHGSKFVQTD GLKAKNAAGE FVIPPRSILQ IEGVW
//
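The FT lines above give 1-based, inclusive residue ranges into the SQ block (SIGNAL 1..22, CHAIN 23..655, DOMAIN 110..206). The following is a minimal sketch of how those ranges could be read and applied, assuming the older column-style FT layout shown in this entry. The function names parse_flatfile and feature_seq and the constant RECORD_EXCERPT are hypothetical helpers, not part of any UniProt tooling, and the excerpt keeps only the first sequence line of the record.

```python
import re

# Excerpt of the entry above: the FT lines plus only the FIRST sequence line.
# The real record has 655 residues; this truncated copy is for illustration.
RECORD_EXCERPT = """\
FT   SIGNAL        1     22       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        23    655       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5005817764.
FT   DOMAIN      110    206       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   655 AA;  72507 MW;  768B967C941EDE09 CRC64;
     MKTNKFFLGL IVLAVSTANP LAAQKIDLGK VTDTKIEETL FGHNLEHTRS AVYQGLSAQL
//
"""


def parse_flatfile(text):
    """Return (features, sequence) for one flat-file record.

    features is a list of (key, start, end) tuples with 1-based,
    inclusive coordinates, exactly as written on the FT lines.
    """
    features = []
    seq_parts = []
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):  # record terminator
            break
        if line.startswith("SQ"):  # sequence header; residues follow
            in_seq = True
            continue
        if in_seq:
            # Sequence lines are grouped in blocks of 10; drop the spacing.
            seq_parts.append(re.sub(r"\s+", "", line))
            continue
        # Matches "FT   KEY   start   end"; continuation lines such as
        # "/FTId=..." have no coordinates and are skipped.
        m = re.match(r"FT\s+(\S+)\s+(\d+)\s+(\d+)", line)
        if m:
            features.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return features, "".join(seq_parts)


def feature_seq(sequence, start, end):
    """Slice a 1-based inclusive feature range out of the sequence."""
    return sequence[start - 1:end]


features, sequence = parse_flatfile(RECORD_EXCERPT)
signal_peptide = feature_seq(sequence, 1, 22)
```

On the full record, the same slice with (23, 655) would give the predicted mature chain, and (110, 206) the CBM-cenC domain. Note that the SIGNAL and CHAIN boundaries here are SignalP predictions, not experimentally confirmed cleavage sites.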