ID R5IL95_9CLOT Unreviewed; 1353 AA.
AC R5IL95;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE RecName: Full=Leucine-rich repeat domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN757_01484 {ECO:0000313|EMBL:CCY41049.1};
OS Clostridium sp. CAG:7.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262832 {ECO:0000313|EMBL:CCY41049.1, ECO:0000313|Proteomes:UP000018268};
RN [1] {ECO:0000313|EMBL:CCY41049.1, ECO:0000313|Proteomes:UP000018268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:7 {ECO:0000313|Proteomes:UP000018268};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY41049.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAYE010000038; CCY41049.1; -; Genomic_DNA.
DR STRING; 1262832.BN757_01484; -.
DR Proteomes; UP000018268; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 4.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45661:SF3; ANTIGEN BSP, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF13306; LRR_5; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018268}.
FT REGION 46..269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1216..1353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..60
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..82
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 145..201
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..222
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1280..1295
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1303..1353
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1353 AA; 148503 MW; 318E0B661DE960F9 CRC64;
MAAGAVSLVL VAGGFHVLWE DYQRGNLFKP ELFVKNRELQ GNQIMFPEKE NIQQNGNDPG
ENDNKKLEHD PEAKDPYGQE KKNEAAYELA NNQMEADPES ARNIFSDPSF DYAGGTGGTN
WAPEGQSVVL APDAQEKVNL PAGHSGLTTE DTGNKDTPAS SNGSGSDSGD TTISGGGSTG
GGGGSGSNGN QDNTITPGGN DVPAPTPTPD PTPTPTPTPD QPDNGDDTPT VDPDYPDDSK
QPTLPSDPSL PGGEEITLPD FPSEGLPEGD EAENAVLSII SSSDSVGELY RGAVLTDWKL
LCSVYAYVDT PTGTYRLREY NDNFKIGDHP KIAEGDFTVT FYFRPNANSP WTEIEHEFSV
KYCKMVVMGP EDEAGNRRTL DSVYLGEDEE ICLLKELKEL YTAQQNIWGM GLYDSLLQVV
PGWYVNGEQE VFTDYYKPEA PGRYEIYPMD RVDIPEYFGG YLTWDFDERG NYVYRQFLHM
CSEFLDEVEI PQGIQQVNFN GYVTGKIYIP ESVTTLLDDF IMVTQGYEVA EGNPYFCSVD
GFLYNKAKTE LLGIPVGLEE IDIPEDVTRV VIPEWNSLTK INFASPTPPD INLGRISGTQ
LVVPAEYYAD YLLTWRNSLG GNSLIPSSEE TDYTYVNGAV LSSGGTVLNR ISSDNSGLWI
VPDTVRRIKS YAADKCPNVE RLVISEGVEV LESESLNGEG LTEIYFQSDV PPEISADTFG
DLDTKNLVIY VSEDNRDAYA EAWGEILGEE TAENLIQVGK CELVETEAGM IYLDVAGSAV
LIQAPEDITS LDELDLPDGV ELTEIGNSAF KGCIDLILAE LPETVTKVGR NAFSQCEKLE
GVISYAPDYI YIGQDAFVGL RYMAFNTGYL DLEDPLMARD TLTYVTSACL LDETAARYAI
GCGDAIVLGD TEETRPLVFG LDGDDTYLLN GTTDFSGEIQ APEGRVITYV IRGALQDCQG
EFTLPAEVAE HIRAIQSAAF KNSGLTGTVE FSDDLFQIGS DAFIGCVDLT EVKFSNPASD
LGDADPGLSV DPYAFARTGL TEIEFPEDLR SVGYSAFEDT DMQSITFTGE NVPLLTYPGR
GTFYSFGVET EGLVQLEGAA AENEDAYVDV WKYSIQGYES DDEIESNIYY TVLNAAVSEW
MDAAGDFPVD EDWNYLPEFL EYIDDCTPYT IENIKFTGEE RVYTLLGKEV PDHPDRLEKP
DINEYVDAWR DRIEEEKNEK EEQAVVLDKI PEVPADVPVD PEVPQDPDQG NGTTQDPDEP
SEDPQDPETP EEPEKPETPT NPDAPETPEI PETPTNPDAP EELDQPQEPE NPDPSEDTDQ
TENESGGSME EDKEDTEEKD ASSEETSDTE ESL
//