ID R9M9L2_9FIRM Unreviewed; 1158 AA.
AC R9M9L2;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=VWFA domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=C816_00373 {ECO:0000313|EMBL:EOS67341.1};
OS Oscillibacter sp. 1-3.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Oscillibacter.
OX NCBI_TaxID=1235797 {ECO:0000313|EMBL:EOS67341.1, ECO:0000313|Proteomes:UP000014108};
RN [1] {ECO:0000313|EMBL:EOS67341.1, ECO:0000313|Proteomes:UP000014108}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=1-3 {ECO:0000313|EMBL:EOS67341.1,
RC ECO:0000313|Proteomes:UP000014108};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Oscillibacter bacterium 1-3.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall, S-layer
CC {ECO:0000256|ARBA:ARBA00004237}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS67341.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTC01000002; EOS67341.1; -; Genomic_DNA.
DR RefSeq; WP_016320710.1; NZ_KE159703.1.
DR AlphaFoldDB; R9M9L2; -.
DR STRING; 1235797.C816_00373; -.
DR PATRIC; fig|1235797.3.peg.423; -.
DR eggNOG; COG2304; Bacteria.
DR eggNOG; COG4099; Bacteria.
DR HOGENOM; CLU_275552_0_0_9; -.
DR OrthoDB; 1699243at2; -.
DR Proteomes; UP000014108; Unassembled WGS sequence.
DR GO; GO:0030115; C:S-layer; IEA:UniProtKB-SubCell.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR001119; SLH_dom.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR43308:SF4; OUTER MEMBRANE PROTEIN ALPHA; 1.
DR PANTHER; PTHR43308; OUTER MEMBRANE PROTEIN ALPHA-RELATED; 1.
DR Pfam; PF13306; LRR_5; 1.
DR Pfam; PF00395; SLH; 3.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51272; SLH; 3.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022601};
KW Reference proteome {ECO:0000313|Proteomes:UP000014108};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-layer {ECO:0000256|ARBA:ARBA00022601};
KW Secreted {ECO:0000256|ARBA:ARBA00022601}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1158
FT /note="VWFA domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004486431"
FT DOMAIN 526..725
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 920..985
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 986..1049
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1050..1107
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 465..500
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..500
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1158 AA; 125666 MW; 62146C17916F4106 CRC64;
MSKKKISCRR GLSLFLALVM CAGLLQMPAL AAEHEHNSAG WACEKDELTC TILEHTHEEA
CTSEEPLCGL EEHRHTDSCW SWNCAEPEPD EPPVQPGLSA QDELPVQDEL PVQDETLVQD
EVPAPDEQPV QDVQTHVPSA YMDIGEMTPE GKVDRIIELM KEVPNQITEE NMDVYGPILE
ECDQIMGGEY MENFGDAELD YAYEVVFEEF FAAIDKFVPA FNAYQELLAE QGNDGKALIE
SRDEGIQLTY ADGVLTVSGE GMFTGLLINK VSYTSIKDYN SERPETAISE VVIEGENAEN
PIKIGSSLFR DAGLTKLTAG NILAESMAFQ EAFVQGANIE LTDSTVCASV FTDCEGLGTV
TLNHVTFHDN YGRFHLFSGS SIDRLVIRNM DRIGGYAFSG CKIKGEVDLS GVLYIGEHAF
SGTGSPELKN ISEDAVVEYS DVFASQVENW SERVAGILQG KFRMDPPKTA EPIAPQGWTS
SKTGKENSTE GQTSTQITEE ARWSDAELTE ADVLIQTYLA DVQQMDFIFV MDLSNSMSSR
GGSDSRYSRF NEMQSKLLDV SGELLASGDE YDCQVAFVTF GGDSDPWYPL PWGESVITKG
TMDFQDFTDE VDEAAAFITG SKVYDEYTDY GLGLNKALEL AESNQEKGRS TAVILISDGS
PNKGDGKTEA EQIKALGVPV FGVLQSVPER EMENAEAAMR NICTNGFFFN GDDTEGFSKA
VNDAVHNALR TCILTVDVNS SFTLNADSIT ASAGVADVSE DGTAITWTIV GKTFEKHTLS
YKLRLNPENG AYPEGTFDTN AGSAHLLVSG LEVNEVETPQ LTRTQPVRSH TLTVRYQYSN
GTQAAAAAVQ TVAEGASYSV TSPAISGFRA SIAVVSGTMG TEDLLFTVTY TSTGGGGGGG
GGGGGGGTGG GGGGGGGGGG TGGTTIADTE VPLAGDLQLN REDHFAYIKG YEDGTVRPNN
PLTRAQVATI FYRLLDEMSR TIYFQDTNEF TDVADTFWAN KAISTLTNAG IITGFQDGTF
RPNAYITRAQ FAAIAARFDN VVPGLENPFS DVAEDYWARD LIAYAADRGW INGQGGKFRP
LENITRVEAM DFINNVLERH VDEEGLLENA TTWSDVPVGD PNYYVVEEAT NSHDYTRRKE
GELMENWTAL NEDPVWDE
//