ID F4XFF5_9FIRM Unreviewed; 457 AA.
AC F4XFF5;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2013, sequence version 2.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN ORFNames=HMPREF0866_02330 {ECO:0000313|EMBL:EGJ46175.2};
OS Ruminococcaceae bacterium D16.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae.
OX NCBI_TaxID=552398 {ECO:0000313|EMBL:EGJ46175.2, ECO:0000313|Proteomes:UP000002801};
RN [1] {ECO:0000313|Proteomes:UP000002801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=D16 {ECO:0000313|Proteomes:UP000002801};
RG The Broad Institute Genome Sequencing Platform;
RA Ward D., Earl A., Feldgarden M., Gevers D., Young S., Zeng Q., Koehrsen M.,
RA Alvarado L., Berlin A.M., Borenstein D., Chapman S.B., Chen Z., Engels R.,
RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D.I., Hepburn T.A., Howarth C., Jen D., Larson L., Mehta T.,
RA Park D., Pearson M., Richards J., Roberts A., Saif S., Shea T.D.,
RA Shenoy N., Sisk P., Stolte C., Sykes S.N., Walk T., White J., Yandava C.,
RA Sibley C.D., White A.P., Crowley S., Surette M.G., Strauss J.C.,
RA Ambrose C.E., Allen-Vercoe E., Haas B., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium sp. D5.";
RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EGJ46175.2, ECO:0000313|Proteomes:UP000002801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=D16 {ECO:0000313|EMBL:EGJ46175.2,
RC ECO:0000313|Proteomes:UP000002801};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Sibley C.D., White A.P.,
RA Crowley S., Surette M.G., Strauss J.C., Ambrose C.E., Allen-Vercoe E.,
RA Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M., Berlin A.M.,
RA Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M.,
RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., Murphy C.,
RA Pearson M., Poon T.W., Priest M., Roberts A., Saif S., Shea T., Sisk P.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Ruminococcaceae bacterium D16.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGJ46175.2}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADDX02000001; EGJ46175.2; -; Genomic_DNA.
DR AlphaFoldDB; F4XFF5; -.
DR STRING; 552398.HMPREF0866_02330; -.
DR eggNOG; COG2304; Bacteria.
DR HOGENOM; CLU_595695_0_0_9; -.
DR OrthoDB; 1981177at2; -.
DR Proteomes; UP000002801; Unassembled WGS sequence.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002801}.
FT DOMAIN 39..217
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
SQ SEQUENCE 457 AA; 48075 MW; 639F8115424C02A0 CRC64;
MGVTNSNKVI NTDRIPCDGT LRVTLALTAA PDIVSNPTDI VLVLDRSGSM TGTPLADMKL
GAKTFIDLID EATDSSQDGQ IGSGSRMGVV SFSNTAVADT QLITSVDALK AAVDNLSAGG
STNHADAFAK AIQLFDPASA NAKVMVMFTD GNTTIGAPPA PVAAAARAQG IIIYCIGLIG
SDGLDITALN DWATDPDASH VAVTPNAADL EELFAELAAN ISKPGATEIV IDEVVNPDFV
ITSISSPTKG SATMLDAHSL QWNIAQLGVT SSESAVLDFF IRHVGQQTGT KLVNESITYS
DKEGNVVSFP KPMVSVECDI VVHPEPCPEP VELTVEGCQD SVLVDLGDVY LESQGRIIQM
DVTIQSVCPG KRVALAAILT EVDKDGMEHQ RGMKAFTIPA HSAPVCRDVL VKCIKFVVPE
DLSVSGGAMC SPRKFKARFL ANNIDTDYRC CESTMTL
//