GenomeNet

Database: UniProt
Entry: W4LB43_9BACT
LinkDB: W4LB43_9BACT
Original site: W4LB43_9BACT 
ID   W4LB43_9BACT            Unreviewed;       174 AA.
AC   W4LB43;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-SEP-2017, entry version 12.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ETW94940.1};
GN   ORFNames=ETSY1_32725 {ECO:0000313|EMBL:ETW94940.1};
OS   Candidatus Entotheonella sp. TSY1.
OC   Bacteria; Nitrospinae/Tectomicrobia group; Candidatus Tectomicrobia;
OC   Candidatus Entotheonella.
OX   NCBI_TaxID=1429438 {ECO:0000313|EMBL:ETW94940.1, ECO:0000313|Proteomes:UP000019141};
RN   [1] {ECO:0000313|EMBL:ETW94940.1, ECO:0000313|Proteomes:UP000019141}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=TSY1 {ECO:0000313|Proteomes:UP000019141};
RX   PubMed=24476823; DOI=10.1038/nature12959;
RA   Wilson M.C., Mori T., Ruckert C., Uria A.R., Helf M.J., Takada K.,
RA   Gernert C., Steffens U.A., Heycke N., Schmitt S., Rinke C.,
RA   Helfrich E.J., Brachmann A.O., Gurgui C., Wakimoto T., Kracht M.,
RA   Crusemann M., Hentschel U., Abe I., Matsunaga S., Kalinowski J.,
RA   Takeyama H., Piel J.;
RT   "An environmental bacterial taxon with a large and distinct metabolic
RT   repertoire.";
RL   Nature 506:58-62(2014).
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:ETW94940.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AZHW01000982; ETW94940.1; -; Genomic_DNA.
DR   EnsemblBacteria; ETW94940; ETW94940; ETSY1_32725.
DR   Proteomes; UP000019141; Unassembled WGS sequence.
DR   GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR   Gene3D; 2.60.120.260; -; 1.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000019141};
KW   Reference proteome {ECO:0000313|Proteomes:UP000019141}.
FT   DOMAIN       21    147       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   174 AA;  19899 MW;  BF7F13D3182C1E47 CRC64;
     MQLHAVRSLE VIANFARPSE LRVNGDFTHG AHGWNRLQLR PGHAAATGTV RDRAYHADIM
     FGGLKAFDIQ LSHEGMRVVQ GKTYRLTFDA RADARRTIEA AINTAATPQT TYHHETFDIT
     PQYHTYSFEF TMKEPTDTNA RVDFNPGVYD HNDVYLDNIQ FIETVWTDQQ SKTR
//
DBGET integrated database retrieval system