ID R6CA73_9FIRM Unreviewed; 1277 AA.
AC R6CA73;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CDA73473.1};
GN ORFNames=BN718_00072 {ECO:0000313|EMBL:CDA73473.1};
OS Ruminococcus sp. CAG:579.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=1262963 {ECO:0000313|EMBL:CDA73473.1, ECO:0000313|Proteomes:UP000018387};
RN [1] {ECO:0000313|EMBL:CDA73473.1, ECO:0000313|Proteomes:UP000018387}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:579 {ECO:0000313|Proteomes:UP000018387};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA73473.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBCK010000146; CDA73473.1; -; Genomic_DNA.
DR AlphaFoldDB; R6CA73; -.
DR STRING; 1262963.BN718_00072; -.
DR Proteomes; UP000018387; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030247; F:polysaccharide binding; IEA:UniProtKB-UniRule.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00037; CLECT; 1.
DR Gene3D; 2.60.40.290; -; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR001919; CBD2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR012291; CBM2_carb-bd_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR028946; Ntox44.
DR InterPro; IPR002477; Peptidoglycan-bd-like.
DR InterPro; IPR036365; PGBD-like_sf.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR PANTHER; PTHR37467; EXPORTED CALCIUM-BINDING GLYCOPROTEIN-RELATED; 1.
DR PANTHER; PTHR37467:SF1; EXPORTED CALCIUM-BINDING GLYCOPROTEIN-RELATED; 1.
DR Pfam; PF00553; CBM_2; 2.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF15607; Ntox44; 1.
DR Pfam; PF01471; PG_binding_1; 1.
DR Pfam; PF18884; TSP3_bac; 4.
DR SMART; SM00637; CBD_II; 2.
DR SMART; SM00034; CLECT; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 2.
DR SUPFAM; SSF47090; PGBD-like; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS51173; CBM2; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018387};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1277
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004401985"
FT DOMAIN 25..140
FT /note="CBM2"
FT /evidence="ECO:0000259|PROSITE:PS51173"
FT DOMAIN 140..252
FT /note="CBM2"
FT /evidence="ECO:0000259|PROSITE:PS51173"
FT DOMAIN 675..789
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT REGION 395..472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..472
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1277 AA; 143438 MW; 54411D8DC0412AA9 CRC64;
MKLKSNFKRI LSGVMSLAMA ATLVPSIPVL AEDFTTQNYV YDNYAVSYNV TNSWGDTEVV
SVTLSNTGDS TIENWMLYFD PNGSTSSIWD AQWAETSTGV SYVKNAGYNA RIEPNSSITF
SYTVDNCEAV PEAFKLCQGR INKESGYDVQ LVVNETWGDN FNGAIVITNN TDSPIEAWEL
TFDTNFTITE ITNSWAATVT ELEPYSYRLK GTYTGTVYAN SSVTLGFTGV KNGDPEISNT
SLTEVVANES LIDFIKNYPD GVSIYAYGDY NNEANAIDIE WYTDYEAEVF DVWQSDDNES
YTLVSEVSDA DSYQYVITED FQTKYFKVSI PNDFGETIES VPFVVTKTED EYSVDFLDSD
GDGLPDIYEN MIGTDLNNPD TDGDGLTDYQ EVYITGTDPT KYDSVTDGVS DADADPDEDG
LTNAQEIELG TDPQNDDTDG DGLKGGEEVN DYHTDPLNPD TDRDGLPDGD EPHIGLDPSD
PETFEVPDAD FVFEQSIPAE SKALEDINTE DNAYELTVEF KSSGYAVNSA EIKQSPYSNA
IKNDAVLGKS VEISYESTCK VDECVLYYKI KDNYTNNISD KYTAYTEELD GIKRLQAFRF
DEDLHMLLPV ETTYDLENNT VICEASELGT YCLMDIEMWL DSLGFEVEEP EIAVMALAET
VTEEFIESSM IKANYNGHTY GICSVSGYDW DSAEDICEAL GGHLVTINDS DEQCFIEEKL
LSKGTKNSYW IGGQYTSSGW RWLTGEDFSA YTKWTPTQPD NYLGQEDKLM MYRNTNPLCT
SGTFGYWNDL NNDGTCNGEA FFGLNNFGFI YEKDSYKPKT YYIVIGNNFK KLTLVSPLKR
GAWTNSDTDS LTDWQEFDSR NSMLAWDSDN EPVLPTLKTV LKLKGDVKIS NEQTRELNGL
DDYLDVVRVA PFLSDPTVED TDGDNLLDNF DPKPKSVNNN GAVMKELSIE RIYNAYMLHL
EDNVSEVYDI YTATGKDSEM SWEEFIEFYS GVVDFNFSLK NASETELKIT TLQRCLEYLG
FLDMGGSAYG AMGGATQSAI QNFQLNYGLN ISEQVEFYGK WFIEIDDITY LTIANVAANH
GFYVGDKPVE AHTEYKMLCN LTAEGYGKTF FDYIPSVVPI LNIEQINNDI SIYSDINGKF
DTVYYLDYTE PLKNIITDMT NAAESFIYYP SFNGYVQFYT NVNHGGIWDV KVRESWNNTI
STIHYFSQGF KFVYNDIIMN SENLGNYLYG CTGHATGFTL NILYKGSGYA ASTGNTIDNQ
DDLDFIKRGY DYYDEIC
//