GenomeNet

Database: UniProt
Entry: C2KXD2_9FIRM
LinkDB: C2KXD2_9FIRM
Original site: C2KXD2_9FIRM 
ID   C2KXD2_9FIRM            Unreviewed;       901 AA.
AC   C2KXD2;
DT   16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT   16-JUN-2009, sequence version 1.
DT   24-JAN-2024, entry version 52.
DE   SubName: Full=Ricin-type beta-trefoil lectin domain protein {ECO:0000313|EMBL:EEJ51570.1};
GN   ORFNames=HMPREF6123_1151 {ECO:0000313|EMBL:EEJ51570.1};
OS   Oribacterium sinus F0268.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC   Oribacterium.
OX   NCBI_TaxID=585501 {ECO:0000313|EMBL:EEJ51570.1, ECO:0000313|Proteomes:UP000004121};
RN   [1] {ECO:0000313|EMBL:EEJ51570.1, ECO:0000313|Proteomes:UP000004121}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=F0268 {ECO:0000313|EMBL:EEJ51570.1,
RC   ECO:0000313|Proteomes:UP000004121};
RA   Qin X., Bachman B., Battles P., Bell A., Bess C., Bickham C., Chaboub L.,
RA   Chen D., Coyle M., Deiros D.R., Dinh H., Forbes L., Fowler G.,
RA   Francisco L., Fu Q., Gubbala S., Hale W., Han Y., Hemphill L.,
RA   Highlander S.K., Hirani K., Hogues M., Jackson L., Jakkamsetti A.,
RA   Javaid M., Jiang H., Korchina V., Kovar C., Lara F., Lee S., Mata R.,
RA   Mathew T., Moen C., Morales K., Munidasa M., Nazareth L., Ngo R.,
RA   Nguyen L., Okwuonu G., Ongeri F., Patil S., Petrosino J., Pham C., Pham P.,
RA   Pu L.-L., Puazo M., Raj R., Reid J., Rouhana J., Saada N., Shang Y.,
RA   Simmons D., Thornton R., Warren J., Weissenberger G., Zhang J., Zhang L.,
RA   Zhou C., Zhu D., Muzny D., Worley K., Gibbs R.;
RL   Submitted (APR-2009) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EEJ51570.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ACKX01000115; EEJ51570.1; -; Genomic_DNA.
DR   RefSeq; WP_007158072.1; NZ_GG668537.1.
DR   AlphaFoldDB; C2KXD2; -.
DR   STRING; 585501.HMPREF6123_1151; -.
DR   eggNOG; COG2273; Bacteria.
DR   eggNOG; COG5263; Bacteria.
DR   eggNOG; COG5492; Bacteria.
DR   HOGENOM; CLU_321554_0_0_9; -.
DR   InParanoid; C2KXD2; -.
DR   OrthoDB; 1912376at2; -.
DR   Proteomes; UP000004121; Unassembled WGS sequence.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR   CDD; cd00161; RICIN; 1.
DR   Gene3D; 2.80.10.50; -; 3.
DR   Gene3D; 2.10.270.10; Cholin Binding; 1.
DR   Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR   Gene3D; 2.60.40.4270; Listeria-Bacteroides repeat domain; 1.
DR   InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR   InterPro; IPR007921; CHAP_dom.
DR   InterPro; IPR013378; InlB-like_B-rpt.
DR   InterPro; IPR042229; Listeria/Bacterioides_rpt_sf.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR035992; Ricin_B-like_lectins.
DR   InterPro; IPR000772; Ricin_B_lectin.
DR   Pfam; PF09479; Flg_new; 1.
DR   Pfam; PF14200; RicinB_lectin_2; 2.
DR   SMART; SM00458; RICIN; 2.
DR   SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   SUPFAM; SSF50370; Ricin B-like lectins; 2.
DR   PROSITE; PS50911; CHAP; 1.
DR   PROSITE; PS51170; CW; 1.
DR   PROSITE; PS50231; RICIN_B_LECTIN; 2.
PE   4: Predicted;
KW   Lectin {ECO:0000313|EMBL:EEJ51570.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000004121};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..901
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5039623474"
FT   DOMAIN          170..302
FT                   /note="Peptidase C51"
FT                   /evidence="ECO:0000259|PROSITE:PS50911"
FT   REPEAT          803..826
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REGION          39..120
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          692..764
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        40..119
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        692..751
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   901 AA;  98876 MW;  34C8D543B99CF179 CRC64;
     MRRRQKQILA FFISLAMVGS STVLSYAEQQ NVASKTALKE NIEGKSEEES LEQSVASSEK
     EEGRTEETKE ALVPEEKKQV SEVKAQEEKG KEESEDKENS RKETKIEAKE KDVAEGSVEK
     EENTYSIVDL SKLKSGLKET EEEGIEEAPA AETASLFGYS GFRSVGGKTQ AEAEAWIDSM
     LGTAKDYDGQ YGVQCVDLIS YYYRFLGHNI FSYVTGGGYA YTFVYSPLPA GWIRLQNNAV
     PRPGDIVVFG ANQYGMLGTG HIGLVRAVDN TSYKFLDYNG TGHYDAGTWR WKPLHHFTCL
     IRPDFATPPP KPDRLPLPAN GSDDDIADGE YLITSTLAEG RCLAYGGNGQ SGQNVFIRDY
     RTWGEAPYYW RLERQPDDSF IIRSKAGTVL DVNGGPGAAG NGPNIQVWNQ TGTNANQRFY
     IVKQGDAYEF IPQCSGFRMD VDNAGIADGT NVRQYEPNGS YAQRFKLFKY YKPKVNEKSA
     PEVKSAGYVL RSKLSPDKAV VVEKGNPTKL GTNVLLWDYN KTAPDATVVW NLEKQADGSF
     LVKNQYNNQY LDVVADSLYD QVNILTWEKH NKPSEQWYVV SNGDGTYRFV NKNSGKVMDV
     YGGQTANGTN VQQYLWHGHD AQRFYLDRYP AAVYYNVTFK GMNGENLSTQ RVEKGENASY
     PSAPLVQGYE FTGWDKDARN VQGNITITAQ YKKKAQTGGN SGSSSSGSSG STSGSSSSGS
     SSGGGSSFSG SSSGGGGSSF SGSSSGGGGG GGGGSSSGGF GKPSVSGGNV GQVLGVERSL
     AGGQWMQDGT GWWYKKADGS YPKNNWGNED YNGKTYWYYF LDSGYMATGW IELNGSKYYL
     FPTSDGWKGR MVTGWQWIDG YCYYLEEGGA NQGRLYRNEQ KDGYQLDSEG RWTVNGKPVK
     R
//
DBGET integrated database retrieval system