Database: UniProt
Entry: A0A0S8HEX6_9BACT
ID   A0A0S8HEX6_9BACT        Unreviewed;       852 AA.
AC   A0A0S8HEX6;
DT   17-FEB-2016, integrated into UniProtKB/TrEMBL.
DT   17-FEB-2016, sequence version 1.
DT   27-SEP-2017, entry version 9.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KPK83811.1};
GN   ORFNames=AMJ81_07265 {ECO:0000313|EMBL:KPK83811.1};
OS   Phycisphaerae bacterium SM23_33.
OC   Bacteria; Planctomycetes; Phycisphaerae.
OX   NCBI_TaxID=1703412 {ECO:0000313|EMBL:KPK83811.1, ECO:0000313|Proteomes:UP000054531};
RN   [1] {ECO:0000313|EMBL:KPK83811.1, ECO:0000313|Proteomes:UP000054531}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=SM23_33 {ECO:0000313|EMBL:KPK83811.1};
RX   PubMed=25922666; DOI=10.1186/s40168-015-0077-6;
RA   Baker B.J., Lazar C.S., Teske A.P., Dick G.J.;
RT   "Genomic resolution of linkages in carbon, nitrogen, and sulfur
RT   cycling among widespread estuary sediment bacteria.";
RL   Microbiome 3:14-14(2015).
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:KPK83811.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; LJUF01000105; KPK83811.1; -; Genomic_DNA.
DR   Proteomes; UP000054531; Unassembled WGS sequence.
DR   GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like.
DR   InterPro; IPR001547; Glyco_hydro_5.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   Pfam; PF00150; Cellulase; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   SUPFAM; SSF51445; SSF51445; 2.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000054531};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054531};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     22       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        23    852       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5006647707.
FT   DOMAIN       85    221       Cellulase. {ECO:0000259|Pfam:PF00150}.
FT   DOMAIN      291    405       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   852 AA;  94375 MW;  39105593300A7C80 CRC64;
     MIMRAQALVA EIALAVCSPA VGQTFVPFVI PAEPNPKSLI ALRSGPPIST DGPRLVARGG
     HFFVGQRRVR IWGVNLCFGA CFPAQADAER VAERLAAAGI NSVRFHHMDS AAFPRGIWDR
     GDPKRLSAEA LDRLDYFIDR LARRGIYANV NLHVSRTHSR VLKLPDTDRL SNYDKMVDIF
     TPALVEAQRS YARDLLTHVS KYRKVRYADD PAVAFVEINN EDSLFMWGAD RALPALPPYY
     AEALQAAHIA WLKGRYGGTA RLRAAWNEGT EPLGENLLTD ERLRQLAEGK GAWRLERHGD
     CVAKAVKTET GVRIEIAKAD AASWHIQLNQ SGLAVKAGRY YTVVFRARAD QPRRIGYNVG
     QAHAPWGLLG LSRTADLTPQ WQTFRAGFGA SADDDNARLN LQLGGSEVAA ELADVELRPG
     GREGLRQGES LEAGKVAVFA ETETEARTLD RWRFLAETEK AYFDGMYGFI KAELGCKALV
     TGTIVFGPLG LYGQSGMDYI DGHAYWQHPR FPGRPWDPGN WTVEQKAMVD HPDESPLFRL
     AAQRLAGKPY TVSEYNHPAP NDYQAECVPM VASFAAAQDW DGVWLFAYSH RTDDWDREHF
     SSFFDIQANP AKWGFVPAGT IIFREGGVPQ GHPRWVVGLA GGRDGLSDLT RLHLRHGRDL
     TAAAADCAGS SSRMDWLNRP LAVTLGRAGR PATGGAAEAG RPLKWSGFGG KLGRFEAKGA
     LAQVELGRLP RPDGRVHPQV FVMASLDKRP LDESRRILIT ACGRCENTDM EFSADRRTVG
     RDWGGPPVRI EAVTARISLR PDKWRCDALG PDGRAGSQVP IQVQQERDER PPESWVELSP
     KYETMWYLLM RQ
//