ID W4VAE8_9FIRM Unreviewed; 1051 AA.
AC W4VAE8;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=VWFA domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=JCM21531_3280 {ECO:0000313|EMBL:GAE89724.1};
OS Acetivibrio straminisolvens JCM 21531.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Acetivibrio.
OX NCBI_TaxID=1294263 {ECO:0000313|EMBL:GAE89724.1, ECO:0000313|Proteomes:UP000019109};
RN [1] {ECO:0000313|EMBL:GAE89724.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JCM 21531 {ECO:0000313|EMBL:GAE89724.1};
RA Yuki M., Oshima K., Suda W., Sakamoto M., Kitamura K., Iida T., Hattori M.,
RA Ohkuma M.;
RT "Draft Genome Sequence of Clostridium straminisolvens Strain JCM 21531T,
RT Isolated from a Cellulose-Degrading Bacterial Community.";
RL Genome Announc. 2:e00110-14(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAE89724.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BAVR01000044; GAE89724.1; -; Genomic_DNA.
DR RefSeq; WP_038290168.1; NZ_BAVR01000044.1.
DR AlphaFoldDB; W4VAE8; -.
DR STRING; 1294263.JCM21531_3280; -.
DR OrthoDB; 1656124at2; -.
DR Proteomes; UP000019109; Unassembled WGS sequence.
DR GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 2.60.40.710; Endoglucanase-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR001956; CBM3.
DR InterPro; IPR036966; CBM3_sf.
DR InterPro; IPR003305; CenC_carb-bd.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR Pfam; PF00942; CBM_3; 1.
DR Pfam; PF02018; CBM_4_9; 1.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM01067; CBM_3; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51172; CBM3; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019109}.
FT DOMAIN 35..190
FT /note="CBM3"
FT /evidence="ECO:0000259|PROSITE:PS51172"
FT DOMAIN 711..924
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 831..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1051 AA; 114316 MW; 12C7828DC7C25D44 CRC64;
MLKKSLYKQS TLFLTVCLLV QIISAAVIGL SVYASSEGIS VEFFNGNLGA NVGSMNLNFK
VINNGASAVE LSDIKLRYYF TDDGVSPITV FIDHAANSGN VINTCITYTI KDINTSGANK
YIELGFNAQA GSLAPNTSVL VKARAYQSNY SQDFTQTNDY SFCQTNTDFA AWDKVTGYLN
GVLFSGTEPV MLTPTPSLAT PTPQMSPIPT VTSSATPTPE VIPTPSGLET PADWMNQAVP
NGDFEAGTIL WSFYCDSVSG ANASNAVYSE PSGNKMSKTA IENVGANYWA IQLKHAGIVL
ENSKTYRLTF DAKSTMPRDI IVSLQDATSS LIEYYGKIIK IDSHMKTYTC EFTLNSSEGT
SVAIVFEMGK IGAIANKAHD IILDNVHIEK ITSPLPPGPP APKEDALVAS VTASRNSVYE
IELGKEADIS LSQSGEIALE GSIDTKKEVV LVLDNSGALN SYVKDILSPL DFGIYSNRDL
TIQGKSASIN GSVHTNSLFT STADSIQISQ TCSAASFNIV SKNVDIKTLK NITTPIEMPY
FHNELINDAA EDFMVFRPED YPPSFFPYPM PGQEDIFIIY NIFAGRFEIF GMGTLVINSS
MYFKGNVLIS LRSTNNVSEG FIVADGNIII QGENLYPSGP NDKLYVYSIG GNIEYQTSNS
TINGIAYAPG NPANPDSGKI LFLGNNNTIN GAIAGNQLNF TASDLKVNHT EGQFNVVEEK
YMQNTSHLKL VKDVAKSLID RFVGSKTKIS VIQYSDSAND NDFKQYDLFM SGNAVTLKQK
IHAIEPGTSG FSNMGDAMRR AYHILNKPSK DPVSKYIVVL AGSVPNRWTA MNNVGNEPKT
GNGKADHIKP DNEAYSSTDY AKDIGKMITS AGINLQFIDF SDENIGTVME EIAAESGVES
IEATGKHYYR ANDFIELADI FDNICLKINY DVVLDNVLYE EILPAGVLPV EVPDWMSTQS
VSVGGVTRTK ITGAINNIPL TYKGKGYSFD IGSFKIKVKF MKPGTIVFDG ADSKIIYTID
YIDKDGKNQS RSLDKYFNDI TVNVKMSVDI N
//