ID R6ZH91_9CLOT Unreviewed; 1585 AA.
AC R6ZH91;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=von Willebrand factor type A domain {ECO:0000313|EMBL:CDD42809.1};
GN ORFNames=BN593_01971 {ECO:0000313|EMBL:CDD42809.1};
OS Clostridium sp. CAG:299.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262792 {ECO:0000313|EMBL:CDD42809.1, ECO:0000313|Proteomes:UP000017929};
RN [1] {ECO:0000313|EMBL:CDD42809.1, ECO:0000313|Proteomes:UP000017929}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:299 {ECO:0000313|Proteomes:UP000017929};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD42809.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBGZ010000197; CDD42809.1; -; Genomic_DNA.
DR STRING; 1262792.BN593_01971; -.
DR Proteomes; UP000017929; Unassembled WGS sequence.
DR CDD; cd00198; vWFA; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR Pfam; PF00092; VWA; 1.
DR Pfam; PF13519; VWA_2; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000017929}.
FT DOMAIN 561..755
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1012..1233
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 237..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 920..939
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1533..1585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 441..468
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 264..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..308
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 309..326
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..423
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1533..1549
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1556..1579
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1585 AA; 171733 MW; 3797135CA76717D1 CRC64;
MMEGKAGVSM ESRKKMSRAK SNSPMKRTVR RAVAWLLVFC MCMANMNSAV YAAEIATSSD
ALMAATPSDA AATDSNAAEI LMPEQTVSGE ELKEEAIRAI SAGHEFDFDS EIRVMKDAEG
KNESYSELFK GYRSFTLFAD NGNGGRLTGG NDNAYGYIVV RVDKDSYEAF EKEEGTARAT
GSDAADSSAA ERVWSLTGDE ELIFLYVNAD NGTVTFSLNI ENLEADDIVV PSRTELYEDT
EEEAEDTSPA EKPAEDANEA GGSGSGGGSG SGSGSSGDVQ DDPTDEGASD DSQAEDGQTD
DSQNENDSEN DTDAEKPEER PEDGQNQEGG SDSGQENNDS SNTDKEDGNT DSGNDGNSGS
GSENAGDSGN ADSNGGNSGS GSRSDSGKDE GGSSNQGSSS DKGGSDSDSA KLSKSSLSLP
TVMGPNPNAE FEEDEFGDMS FDEWQEMMEE EEEEAEEDEE DYTEEVSVAY DSEDEGISYL
NMTMLPAVGA VVKKTEQAKG FAKLFKSSRS GEAEAVVMAG VVPLSGMLPA EGTYYKQIEQ
NGDRSYRLHL GVTGGEQQGL DIVLAIDLSN SMNDKIGYSK TTRLEALKDT LGYRKEVEGE
RWPPSHWEEK NGFIDDLFAQ SPNSRFSVVT YSYDSAVELG WTELGNTGSG KTRIKETIGG
LKADGGTNYE AGLYKTIETL KERGNSSNIP VVIFLSDGKP TYYYAPVNEF NEAREESGKG
SKIDDNTGDG TIFAAGKFHD EMEKMNGSVY TVGFGIPGLS DEYRHYQPEQ YLYAISQGLT
WDKDWENKLI APENNDGTTL ETDGADAQGL KDAFADILSN LKMENVTISD QLSEFVTFKG
NTAGASNVKV QTFKKNQSGE LVLQNELQEG RDYESLEFNS ETGTIRLIFG KDKKLESGVI
YELSFDIQVK DGVEIQPEDK VIGEPNTDYG KGQISSGKPG VGTNTEGYFT FGEDGTTKVY
YPHPIIPASA VYTPEHQKYI RDNGDGTYDL TLNVTSTKES SEDTHTESVP ADVMFLVDKS
RSMVHRLYSD SEGEYKGSRA EVVNNALSAA INTLGGYKDI QLGGYKWSDN PSKFLGWYES
TEQANAKLLL DWGEDKWVGG PLGILGYWEY AQGGDIESGG TCPSSALKAA ISKLKEDDRE
NVKKYIIFLT DGEPSYNYEY ENSYNAVADL QNLIPGTKLY AVGMTNNTDN TFMETVVRKA
NGLGKYDSTD GLYINGSNEE KVNAALKQIA DEIVNEVTNS TPGVTNVTIT DTLSKYAEFA
FDTNNLDNVK VTRKTADGTE TELQKDEYTV NISGKTITLS LTNVNDASGK TNELEKNVTY
SITFQVKPTE EAKQDYQTDN GYYRDPNGNL QQEVFTGAEG TDAPGNTTSS GQPGFPSNEK
AVLSWQYDGA AGKIEYDHPV LQVPQTGQFV VQKLVEIDDN ETPLADAKFI INLDKADESG
NYTEFSSVAL KADGTSSPAV KTEGEAQFKI SEAVPMEYSL TGIEVFQKAA SGELSDVTDT
RLQNDILTVQ PGDDLIVKVT NNLAHEGYFH STDQVTNWTN GNPETPFTSD KSAAREAAES
QPKADAGKKN KKVTEMEEEE GDPLV
//