ID R7C8I8_9CLOT Unreviewed; 258 AA.
AC R7C8I8;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE SubName: Full=Bacterial group 2 Ig-like protein {ECO:0000313|EMBL:CDD74406.1};
GN ORFNames=BN737_00677 {ECO:0000313|EMBL:CDD74406.1};
OS Clostridium sp. CAG:62.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262828 {ECO:0000313|EMBL:CDD74406.1, ECO:0000313|Proteomes:UP000018137};
RN [1] {ECO:0000313|EMBL:CDD74406.1, ECO:0000313|Proteomes:UP000018137}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:62 {ECO:0000313|Proteomes:UP000018137};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD74406.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBHQ010000091; CDD74406.1; -; Genomic_DNA.
DR AlphaFoldDB; R7C8I8; -.
DR Proteomes; UP000018137; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 2.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR Pfam; PF02368; Big_2; 2.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018137};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..258
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004429486"
FT DOMAIN 37..119
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 172..241
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 144..197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..185
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 258 AA; 28444 MW; 0B4B59C1BC248A14 CRC64;
MKKRCFRLFA GVLALAVMMT YTVIWNPSQA KMTKSKDVKK VEVTNVQGKN ITLGKGKKLK
LSVNVILKKG SKASKAVKFE SSQKSVVTVS KKGVLKAKKI GKAKITVKSK ANSAKQVKFT
VTVAKKNVLI KKISMKKKLT LHMPLVEDED DSDDDSDDDT EDSDEDLDDE DDDDEDDEDE
EDYEFEPRIS PSNATNQTLK WTSSNKKVVK VDEDGYVYIV DAGKAVITAM ATDGSGVKAK
CTVTVIDDAD TDDEDDDE
//