GenomeNet

Database: UniProt
Entry: A0A0P7WUD6_9BACT
LinkDB: A0A0P7WUD6_9BACT
Original site: A0A0P7WUD6_9BACT 
ID   A0A0P7WUD6_9BACT        Unreviewed;       876 AA.
AC   A0A0P7WUD6;
DT   20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT   20-JAN-2016, sequence version 1.
DT   22-NOV-2017, entry version 8.
DE   SubName: Full=Extracellular carbohydrate-binding protein {ECO:0000313|EMBL:KPP94679.1};
GN   ORFNames=HLUCCA01_08465 {ECO:0000313|EMBL:KPP94679.1};
OS   Bacteroidetes bacterium HLUCCA01.
OC   Bacteria; Bacteroidetes.
OX   NCBI_TaxID=1666909 {ECO:0000313|EMBL:KPP94679.1, ECO:0000313|Proteomes:UP000050310};
RN   [1] {ECO:0000313|EMBL:KPP94679.1, ECO:0000313|Proteomes:UP000050310}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HLUCCA01 {ECO:0000313|EMBL:KPP94679.1};
RA   Nelson W.C., Romine M.F., Lindemann S.R.;
RT   "Identification and resolution of microdiversity through metagenomic
RT   sequencing of parallel consortia.";
RL   Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:KPP94679.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; LIHN01000021; KPP94679.1; -; Genomic_DNA.
DR   EnsemblBacteria; KPP94679; KPP94679; HLUCCA01_08465.
DR   Proteomes; UP000050310; Unassembled WGS sequence.
DR   GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR   Gene3D; 2.60.120.260; -; 1.
DR   Gene3D; 2.60.40.10; -; 1.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR026444; Secre_tail.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   TIGRFAMs; TIGR04183; Por_Secre_tail; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000050310};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050310};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     29       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        30    876       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5006144805.
FT   DOMAIN       33    166       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   876 AA;  92720 MW;  36967371BFBE957A CRC64;
     MKQNVQRAAL MFFVALGFVF TGVTSTAQAQ TADNILINGD FASGELAPWT NFQAENVNAS
     FNIVDGEVAV TGISGAGGAV WHIQLNQILT QQQIDALTIG QAYKVSFDAR SNVAERQVRL
     FFGENNGNFA VQSERNVELT TTMETYEVTF TLSAKYPEMK VGFEMGLSND DVYIDNVVLT
     ETEAVASGLD LPVTFEDADL NYALADFGGN FSEIITDPTD SGNLVVSSTK GTSGVPSEGW
     AGATVGEPSG FATPIPFSSG NTTMSVRVWS PVADIPVRLK VENSNDNTMT VETEARTTVA
     GEWETLVFNF ASQASGTAAL NFASTYNKAS IFFDFNTPGT GQTYYWDDMV FGGEGSATGP
     AAPEGFAADG APAAVPMNPG DIFLAVGPNN VGQGGIEYRL FYAPEADNVE DPLTATEYSF
     GTTAGDGGGV AAFGFVLGGL EPATSYTFWL YQYDTNAEEY SAPAAVTATT AGDGDGGDNG
     GGDGGDDTNT LDLPVTFEDT EQDYKLADFG GAASEIVEDP TNSANRVVRT VKGTDAAPSE
     GWAGTTVGED EGPDGFANAI PFDQQNTTMS LRVWSPTADI PVRLKVENSD DPTVTVETEA
     RTTVAGEWET LTFDFANQAA GTAALNLASR YNKASVFFDF NTPGTGETYY WDDMTFGGEA
     AVTAPGTPIG FQASNTIGET PVGAGEAFLA AGPNNAEQAN IVYRLFYAKT ADALENPLEG
     TEYEFGTTAG DGDGNAAFGF IIASLDPATE YNFWLYQYDT ETELISEPAL ATVVSGGEGT
     SVDDLTGMPV EFALSQNFPN PFNPTTSIRF DLPESGQVQL EVFNMMGQRV ATLVNETRPA
     GSHSVTFDAS ALSSGMYLYR LQAGNNTMMK KMTLIK
//
DBGET integrated database retrieval system