GenomeNet

Database: UniProt
Entry: R6CA73_9FIRM
LinkDB: R6CA73_9FIRM
Original site: R6CA73_9FIRM 
ID   R6CA73_9FIRM            Unreviewed;      1277 AA.
AC   R6CA73;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 34.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CDA73473.1};
GN   ORFNames=BN718_00072 {ECO:0000313|EMBL:CDA73473.1};
OS   Ruminococcus sp. CAG:579.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC   Ruminococcus.
OX   NCBI_TaxID=1262963 {ECO:0000313|EMBL:CDA73473.1, ECO:0000313|Proteomes:UP000018387};
RN   [1] {ECO:0000313|EMBL:CDA73473.1, ECO:0000313|Proteomes:UP000018387}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MGS:579 {ECO:0000313|Proteomes:UP000018387};
RA   Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA   Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA   Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA   Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA   Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA   Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA   Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA   Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA   Wang J., Brunak S., Ehrlich S.D.;
RT   "Dependencies among metagenomic species, viruses, plasmids and units of
RT   genetic variation.";
RL   Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CDA73473.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CBCK010000146; CDA73473.1; -; Genomic_DNA.
DR   AlphaFoldDB; R6CA73; -.
DR   STRING; 1262963.BN718_00072; -.
DR   Proteomes; UP000018387; Unassembled WGS sequence.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR   GO; GO:0030247; F:polysaccharide binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   CDD; cd00037; CLECT; 1.
DR   Gene3D; 2.60.40.290; -; 2.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   Gene3D; 4.10.1080.10; TSP type-3 repeat; 1.
DR   InterPro; IPR001304; C-type_lectin-like.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR001919; CBD2.
DR   InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR   InterPro; IPR012291; CBM2_carb-bd_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR028946; Ntox44.
DR   InterPro; IPR002477; Peptidoglycan-bd-like.
DR   InterPro; IPR036365; PGBD-like_sf.
DR   InterPro; IPR028974; TSP_type-3_rpt.
DR   PANTHER; PTHR37467; EXPORTED CALCIUM-BINDING GLYCOPROTEIN-RELATED; 1.
DR   PANTHER; PTHR37467:SF1; EXPORTED CALCIUM-BINDING GLYCOPROTEIN-RELATED; 1.
DR   Pfam; PF00553; CBM_2; 2.
DR   Pfam; PF00059; Lectin_C; 1.
DR   Pfam; PF15607; Ntox44; 1.
DR   Pfam; PF01471; PG_binding_1; 1.
DR   Pfam; PF18884; TSP3_bac; 4.
DR   SMART; SM00637; CBD_II; 2.
DR   SMART; SM00034; CLECT; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49384; Carbohydrate-binding domain; 2.
DR   SUPFAM; SSF47090; PGBD-like; 1.
DR   PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR   PROSITE; PS51173; CBM2; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000018387};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..31
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           32..1277
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004401985"
FT   DOMAIN          25..140
FT                   /note="CBM2"
FT                   /evidence="ECO:0000259|PROSITE:PS51173"
FT   DOMAIN          140..252
FT                   /note="CBM2"
FT                   /evidence="ECO:0000259|PROSITE:PS51173"
FT   DOMAIN          675..789
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   REGION          395..472
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        437..472
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1277 AA;  143438 MW;  54411D8DC0412AA9 CRC64;
     MKLKSNFKRI LSGVMSLAMA ATLVPSIPVL AEDFTTQNYV YDNYAVSYNV TNSWGDTEVV
     SVTLSNTGDS TIENWMLYFD PNGSTSSIWD AQWAETSTGV SYVKNAGYNA RIEPNSSITF
     SYTVDNCEAV PEAFKLCQGR INKESGYDVQ LVVNETWGDN FNGAIVITNN TDSPIEAWEL
     TFDTNFTITE ITNSWAATVT ELEPYSYRLK GTYTGTVYAN SSVTLGFTGV KNGDPEISNT
     SLTEVVANES LIDFIKNYPD GVSIYAYGDY NNEANAIDIE WYTDYEAEVF DVWQSDDNES
     YTLVSEVSDA DSYQYVITED FQTKYFKVSI PNDFGETIES VPFVVTKTED EYSVDFLDSD
     GDGLPDIYEN MIGTDLNNPD TDGDGLTDYQ EVYITGTDPT KYDSVTDGVS DADADPDEDG
     LTNAQEIELG TDPQNDDTDG DGLKGGEEVN DYHTDPLNPD TDRDGLPDGD EPHIGLDPSD
     PETFEVPDAD FVFEQSIPAE SKALEDINTE DNAYELTVEF KSSGYAVNSA EIKQSPYSNA
     IKNDAVLGKS VEISYESTCK VDECVLYYKI KDNYTNNISD KYTAYTEELD GIKRLQAFRF
     DEDLHMLLPV ETTYDLENNT VICEASELGT YCLMDIEMWL DSLGFEVEEP EIAVMALAET
     VTEEFIESSM IKANYNGHTY GICSVSGYDW DSAEDICEAL GGHLVTINDS DEQCFIEEKL
     LSKGTKNSYW IGGQYTSSGW RWLTGEDFSA YTKWTPTQPD NYLGQEDKLM MYRNTNPLCT
     SGTFGYWNDL NNDGTCNGEA FFGLNNFGFI YEKDSYKPKT YYIVIGNNFK KLTLVSPLKR
     GAWTNSDTDS LTDWQEFDSR NSMLAWDSDN EPVLPTLKTV LKLKGDVKIS NEQTRELNGL
     DDYLDVVRVA PFLSDPTVED TDGDNLLDNF DPKPKSVNNN GAVMKELSIE RIYNAYMLHL
     EDNVSEVYDI YTATGKDSEM SWEEFIEFYS GVVDFNFSLK NASETELKIT TLQRCLEYLG
     FLDMGGSAYG AMGGATQSAI QNFQLNYGLN ISEQVEFYGK WFIEIDDITY LTIANVAANH
     GFYVGDKPVE AHTEYKMLCN LTAEGYGKTF FDYIPSVVPI LNIEQINNDI SIYSDINGKF
     DTVYYLDYTE PLKNIITDMT NAAESFIYYP SFNGYVQFYT NVNHGGIWDV KVRESWNNTI
     STIHYFSQGF KFVYNDIIMN SENLGNYLYG CTGHATGFTL NILYKGSGYA ASTGNTIDNQ
     DDLDFIKRGY DYYDEIC
//
DBGET integrated database retrieval system