ID Q97E42_CLOAB Unreviewed; 874 AA.
AC Q97E42;
DT 01-OCT-2001, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2001, sequence version 1.
DT 27-MAR-2024, entry version 118.
DE SubName: Full=Possible surface protein, responsible for cell interaction contains cell adhesion domain and ChW-repeats {ECO:0000313|EMBL:AAK81208.1};
GN OrderedLocusNames=CA_C3274 {ECO:0000313|EMBL:AAK81208.1};
OS Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710
OS / VKM B-1787).
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=272562 {ECO:0000313|EMBL:AAK81208.1, ECO:0000313|Proteomes:UP000000814};
RN [1] {ECO:0000313|EMBL:AAK81208.1, ECO:0000313|Proteomes:UP000000814}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787
RC {ECO:0000313|Proteomes:UP000000814};
RX PubMed=11466286; DOI=10.1128/JB.183.16.4823-4838.2001;
RA Nolling J., Breton G., Omelchenko M.V., Makarova K.S., Zeng Q., Gibson R.,
RA Lee H.M., Dubois J., Qiu D., Hitti J., Wolf Y.I., Tatusov R.L., Sabathe F.,
RA Doucette-Stamm L., Soucaille P., Daly M.J., Bennett G.N., Koonin E.V.,
RA Smith D.R.;
RT "Genome sequence and comparative analysis of the solvent-producing
RT bacterium Clostridium acetobutylicum.";
RL J. Bacteriol. 183:4823-4838(2001).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE001437; AAK81208.1; -; Genomic_DNA.
DR PIR; E97302; E97302.
DR RefSeq; NP_349868.1; NC_003030.1.
DR RefSeq; WP_010966548.1; NC_003030.1.
DR AlphaFoldDB; Q97E42; -.
DR STRING; 272562.CA_C3274; -.
DR GeneID; 44999769; -.
DR KEGG; cac:CA_C3274; -.
DR PATRIC; fig|272562.8.peg.3452; -.
DR eggNOG; COG4886; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_335791_0_0_9; -.
DR OrthoDB; 1910526at2; -.
DR Proteomes; UP000000814; Chromosome.
DR Gene3D; 2.60.40.1080; -; 2.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR006637; ChW.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR025875; Leu-rich_rpt_4.
DR InterPro; IPR003591; Leu-rich_rpt_typical-subtyp.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR46652; LEUCINE-RICH REPEAT AND IQ DOMAIN-CONTAINING PROTEIN 1-RELATED; 1.
DR PANTHER; PTHR46652:SF3; LEUCINE-RICH REPEAT-CONTAINING PROTEIN 9 ISOFORM X1; 1.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF07538; ChW; 6.
DR Pfam; PF12799; LRR_4; 3.
DR SMART; SM00635; BID_2; 2.
DR SMART; SM00728; ChW; 6.
DR SMART; SM00365; LRR_SD22; 11.
DR SMART; SM00369; LRR_TYP; 5.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR SUPFAM; SSF52058; L domain-like; 2.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS51450; LRR; 9.
PE 4: Predicted;
KW Leucine-rich repeat {ECO:0000256|ARBA:ARBA00022614};
KW Reference proteome {ECO:0000313|Proteomes:UP000000814};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..874
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039646336"
FT DOMAIN 175..265
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
SQ SEQUENCE 874 AA; 93820 MW; 71F61E9536EC29B4 CRC64;
MNRFKIKYLL ATLACFVSFS AFTCTSSVKA DTTDTTPHVT YDAHVENIGW QDPWSKDGAE
IGTDGKGLRV EALKIKLLNA PAGAKISYQA HVQNVGWQDW VSDGTEAGTD GKGLRVEALK
IKLENMPGYS IQYQAHVQNI GWQDWVSDGA EAGTDGKGLR VEALRIKIVK NNDTPASSIS
LSKSTDSLKV GDTDTLSATL SNNANSKNIT WTSSDSSIVS VDNNGKITAL KEGTANVTAS
SDGKTASCAV TVAKAATTDT APSLSYSAHV QNIGWQTPVT DGMEVGTDGK GLRVEAFKLN
LLNAPAGAKI TYQAHVQNIG WQDWVSNGAE AGTDGKGLRV EALRIKLENM PGYSIEYQAH
VQNIGWQNWV NDGEEAGTDG QGLRVEALRI RIVKSVPVDS ISLNKQTDTI AVGNSDTLSA
AVAPTNSTFV WTSSDSSIAS VDASGKVTGI KAGTATITAS SLDGKKTASC SVTVADKTVV
TFKDPVLEQA VRKEINKTTG QLYNTDVNKI TSLGIIKDTV INSLDGIEQL SNLKQLWLNY
GNVTDLTPIS KLSNLKILSL NGDTDIDISP IGNLTNLNQL DIGESKISNI NVLNKLNNLN
YLILDKNTSI KDFSPLGSLT NLTLLQASYC NFSDLTPLAK MKNLSRVSLN YNNITSIEPL
KSSTNLVDLV LSGNKISDIT PVANLTNLES ISLSYNQVNN ISSLAKLTKL KSLMLDHTGI
SDISSLSGLT NLNYLGVQDN NIEDITSLKN LTNLANLKIS QNKISNVDAI GNLTNLTLLD
MNNNQISNIN AIKNSTKLIS LSMHHNKVSD ISALSKLTNL ESLNLGNNPI NDVTPLKDLS
HLYEVDLTTS QANIDYLNSV LRLGHAYNNK NQDN
//