ID A0A1Y4WIM2_9FIRM Unreviewed; 1236 AA.
AC A0A1Y4WIM2;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN ORFNames=B5E43_00385 {ECO:0000313|EMBL:OUQ82272.1};
OS Flavonifractor sp. An100.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Flavonifractor.
OX NCBI_TaxID=1965538 {ECO:0000313|EMBL:OUQ82272.1, ECO:0000313|Proteomes:UP000196191};
RN [1] {ECO:0000313|Proteomes:UP000196191}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An100 {ECO:0000313|Proteomes:UP000196191};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUQ82272.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFMA01000001; OUQ82272.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y4WIM2; -.
DR OrthoDB; 1837798at2; -.
DR Proteomes; UP000196191; Unassembled WGS sequence.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR Pfam; PF02368; Big_2; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000196191};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1236
FT /note="VWFA domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011966386"
FT DOMAIN 629..745
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 46..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1154..1205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1236 AA; 132967 MW; 052D02F04E6578A9 CRC64;
MAKMRTRLKK LLAVMLAVSM TMSLLNLTAF ATDGSDPLTV EVGQTLDLSG RDDGSTSTEP
VTDPDEATDE SWRSSDPQIA SVDPAGTVTG VYQGEAVITH TSYFYLWTNP DDESQQDPYD
FDEVDDPDEN MTLQRTVTTW EIQVPGPAAE VGDTAYDTLD EAVEKAPEGS TIVLLRDCTT
KGINLKKDLT IKSEDGQKYT ITFTDNGIAL WGKSLTFEEC DVVLEGIGST PYTAEWNWMT
ICASKDASLT LNNATMSLDG TGAGNVHAIY FCSNNQLNLD YSTLIIKNYK QDALEWDGGD
GGYNVNITNS TYISDNNRSG FTGTFYATIK NSNVEVINSA GNGSNGSHFI IEDSTVDFSG
NGSRGLSAGL LHIKNSTVTA NSNNGMGITV NNELQITDNS TVTVMDNASN SSYGYAAVRL
YNDYACLVDA TSKLYIKDNH NTGLYVRQGS LTVEEGATLE ITGNQVGNSL LDGYGGGLYV
GYGANYDPTV VLPSDAAIYN NHSSVGGDDI YVSEGVEGPT LTFGATDSEW VLDDCNHAID
GWYQDGPDNR WAAHKKPLHT EEFTGYTASP VVGPLALKAA HGLIPLEPDD PDLPDWDISK
SKKATNLDED YQSQVTLSLP AASYSQALDV VFVLDGSTST DEDDLATAAS QLLGELAGFE
NLNTKAGLVI FGGSEPVLYS SGSLLSLEDP ANLAALQTEM TDSSYDGMDG RSGSNLQAGV
EAARAMLGAD RSVSAEDKYM ILLTDCAARM WYEDGAAMAQ AYWCNDRVYW NSNCDFVDFR
YPGNTPCPSF SDTWADAQMG ITIGAYGMTE AEKNAASQSD VASTDTVIND PGYYTTYEAA
AYYAATSLTE AANEAHLVLV SYPYHNETNF GQYIESFRAW LDDEGYVTRY DSGSLSTEKI
FDNVKDELIQ LVDTGSKVVD VIGSGVDNAG NDYNFDFVNQ ADKLSLTVGG TALAVTTIDE
NTYGFGTAKD GVYPFLLRYY AKGEDGSSQE CFVWEINQPV TKTAPVQLTY SVVLTNPQST
SGTYGVYDEY GTNHETSLLT NLEATLYPVD STGAAGVPEN FAKPTVSYTV ETSGGGGGGG
TTTSYDLIVR YLEEGTEKVL ATAYSTTKVS GSSYDVTDRT EKEIDGYVQT DITGDPVKGT
MNSDKEIIVW YVSEEDIEEP ETPTTETPEN PEDPKDPGTE LPDGETPTSE LPEEEVPKAD
VPATGDPSLI WLAASALSGS GLAWLALSER KGKRED
//