GenomeNet

Database: UniProt
Entry: W4VAE8_9FIRM
ID   W4VAE8_9FIRM            Unreviewed;      1051 AA.
AC   W4VAE8;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 36.
DE   RecName: Full=VWFA domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=JCM21531_3280 {ECO:0000313|EMBL:GAE89724.1};
OS   Acetivibrio straminisolvens JCM 21531.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC   Acetivibrio.
OX   NCBI_TaxID=1294263 {ECO:0000313|EMBL:GAE89724.1, ECO:0000313|Proteomes:UP000019109};
RN   [1] {ECO:0000313|EMBL:GAE89724.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JCM 21531 {ECO:0000313|EMBL:GAE89724.1};
RA   Yuki M., Oshima K., Suda W., Sakamoto M., Kitamura K., Iida T., Hattori M.,
RA   Ohkuma M.;
RT   "Draft Genome Sequence of Clostridium straminisolvens Strain JCM 21531T,
RT   Isolated from a Cellulose-Degrading Bacterial Community.";
RL   Genome Announc. 2:e00110-14(2014).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GAE89724.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BAVR01000044; GAE89724.1; -; Genomic_DNA.
DR   RefSeq; WP_038290168.1; NZ_BAVR01000044.1.
DR   AlphaFoldDB; W4VAE8; -.
DR   STRING; 1294263.JCM21531_3280; -.
DR   OrthoDB; 1656124at2; -.
DR   Proteomes; UP000019109; Unassembled WGS sequence.
DR   GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR   GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   CDD; cd00198; vWFA; 1.
DR   Gene3D; 2.60.40.710; Endoglucanase-like; 1.
DR   Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR   InterPro; IPR001956; CBM3.
DR   InterPro; IPR036966; CBM3_sf.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   Pfam; PF00942; CBM_3; 1.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   Pfam; PF00092; VWA; 1.
DR   SMART; SM01067; CBM_3; 1.
DR   SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS51172; CBM3; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000019109}.
FT   DOMAIN          35..190
FT                   /note="CBM3"
FT                   /evidence="ECO:0000259|PROSITE:PS51172"
FT   DOMAIN          711..924
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          831..851
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1051 AA;  114316 MW;  12C7828DC7C25D44 CRC64;
     MLKKSLYKQS TLFLTVCLLV QIISAAVIGL SVYASSEGIS VEFFNGNLGA NVGSMNLNFK
     VINNGASAVE LSDIKLRYYF TDDGVSPITV FIDHAANSGN VINTCITYTI KDINTSGANK
     YIELGFNAQA GSLAPNTSVL VKARAYQSNY SQDFTQTNDY SFCQTNTDFA AWDKVTGYLN
     GVLFSGTEPV MLTPTPSLAT PTPQMSPIPT VTSSATPTPE VIPTPSGLET PADWMNQAVP
     NGDFEAGTIL WSFYCDSVSG ANASNAVYSE PSGNKMSKTA IENVGANYWA IQLKHAGIVL
     ENSKTYRLTF DAKSTMPRDI IVSLQDATSS LIEYYGKIIK IDSHMKTYTC EFTLNSSEGT
     SVAIVFEMGK IGAIANKAHD IILDNVHIEK ITSPLPPGPP APKEDALVAS VTASRNSVYE
     IELGKEADIS LSQSGEIALE GSIDTKKEVV LVLDNSGALN SYVKDILSPL DFGIYSNRDL
     TIQGKSASIN GSVHTNSLFT STADSIQISQ TCSAASFNIV SKNVDIKTLK NITTPIEMPY
     FHNELINDAA EDFMVFRPED YPPSFFPYPM PGQEDIFIIY NIFAGRFEIF GMGTLVINSS
     MYFKGNVLIS LRSTNNVSEG FIVADGNIII QGENLYPSGP NDKLYVYSIG GNIEYQTSNS
     TINGIAYAPG NPANPDSGKI LFLGNNNTIN GAIAGNQLNF TASDLKVNHT EGQFNVVEEK
     YMQNTSHLKL VKDVAKSLID RFVGSKTKIS VIQYSDSAND NDFKQYDLFM SGNAVTLKQK
     IHAIEPGTSG FSNMGDAMRR AYHILNKPSK DPVSKYIVVL AGSVPNRWTA MNNVGNEPKT
     GNGKADHIKP DNEAYSSTDY AKDIGKMITS AGINLQFIDF SDENIGTVME EIAAESGVES
     IEATGKHYYR ANDFIELADI FDNICLKINY DVVLDNVLYE EILPAGVLPV EVPDWMSTQS
     VSVGGVTRTK ITGAINNIPL TYKGKGYSFD IGSFKIKVKF MKPGTIVFDG ADSKIIYTID
     YIDKDGKNQS RSLDKYFNDI TVNVKMSVDI N
//