ID R7B362_9CLOT Unreviewed; 1071 AA.
AC R7B362;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=Repeat TIGR02543 family / cell wall-binding repeat multi-domain protein {ECO:0000313|EMBL:CDD58476.1};
GN ORFNames=BN653_01983 {ECO:0000313|EMBL:CDD58476.1};
OS Clostridium sp. CAG:43.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262805 {ECO:0000313|EMBL:CDD58476.1, ECO:0000313|Proteomes:UP000018284};
RN [1] {ECO:0000313|EMBL:CDD58476.1, ECO:0000313|Proteomes:UP000018284}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:43 {ECO:0000313|Proteomes:UP000018284};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD58476.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBHG010000337; CDD58476.1; -; Genomic_DNA.
DR AlphaFoldDB; R7B362; -.
DR STRING; 1262805.BN653_01983; -.
DR Proteomes; UP000018284; Unassembled WGS sequence.
DR CDD; cd00063; FN3; 1.
DR Gene3D; 2.160.20.110; -; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR Pfam; PF02368; Big_2; 1.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
DR PROSITE; PS51170; CW; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018284};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 282..356
FT /note="BIG2"
FT /evidence="ECO:0000259|Pfam:PF02368"
FT REPEAT 959..980
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 857..903
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..875
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 888..903
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1071 AA; 116275 MW; 3F25A00400B5450A CRC64;
MVPTGLFQEI SSKCEIKNLT IENAVVRASN DDARVGILVG DVYDSLTVEN CYVSGTIETT
DGTNKIEAAG GLIGNVREKY SVEIQSCYAD AEIKGTASKR FVGGLVGWTG GTTTIENSYA
VVDMDVDKGD YIGGLVGSGN VTISHSYAAG EALTKNPTGA SVAGISDNGS ISSCVSIFPE
MRSLNRIGGT SGEYTNNYGF AGTVARKSDG TILTPDPNMM GADKLYGADA TEANLKDPEF
YKGLGWDFAS TWTMDSTDRY AFPILKKQTL SPNLTLDLKP SVTGITLDKT NETIYPRGSV
QLTATVDVAN GASREVTWKS SDPAVKVEDG LVTAAAAANG TYTITAISKA DPSKWAECQL
TVDTAEHMVT VGREPGHTNS LNAVVKAYAS LNDAKGGTNP ISTTGSETGT FTFSQKAGEI
VYLAFTGLDS EDVVSKVTIT DANGSKVDAT LCNFEDPTVY YFTMPCSNAS VQVSYAVNLN
ATQYTWFVGQ EWGTWGTTAA FETKEWSGGD HIGSLKVTKI INGKLFREFK IKSMSLYKKD
SVIPKKVNSQ SELTENGNYC IEKDNTTGLP TLYVYLEGPG MVAVDIEVQD NPNAEFSITK
KPGSSSYYML NRTTAKVGDL VTATLTDEGV RRMKEMQNKN ACLTYSGGLL VVIYPPKFTE
SGGKWTASFN MPAQNIETNV YFGEKDKVTL KGTDKEVDYD GAPKSVEDGI RATIGGQDLS
EQFQGQYEVH YEGVNGTVYS SMTPPTNAGT YSCKIKIPDS NVYYRSDPIT VQLTIKKIAT
KTPKAAQAAA WTDTSVTLEA PSAFVDGTAI PAGYELEYCV DQGEWQDSPV FTGLTPETTY
KFYVRLKEGV NTTASAASEA VTVQTKKASS DPSNPTKPNP SNPNPSTPGN PQGTTTSGRD
RNTSTWVKES TGWRYRLSNG TYLSGSLVLD PATGRQVEQI VWKQLRGAWW AFGADGYIRT
GWVYDYSAGE WYYVDENTGM RTGWYLDPQD GRWYYLDPAT GEMLTEWQLI PDLGYVYLNP
YAPQPTWAYD EELKTWVYME GAGRPYGSLY MAEWTPDGYY VNADGVWEPA R
//