ID R5KPV1_9CLOT Unreviewed; 1331 AA.
AC R5KPV1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Cell surface protein {ECO:0000313|EMBL:CCY68835.1};
GN ORFNames=BN753_02121 {ECO:0000313|EMBL:CCY68835.1};
OS Clostridium sp. CAG:678.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262831 {ECO:0000313|EMBL:CCY68835.1, ECO:0000313|Proteomes:UP000017959};
RN [1] {ECO:0000313|EMBL:CCY68835.1, ECO:0000313|Proteomes:UP000017959}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:678 {ECO:0000313|Proteomes:UP000017959};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY68835.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAYP010000077; CCY68835.1; -; Genomic_DNA.
DR STRING; 1262831.BN753_02121; -.
DR Proteomes; UP000017959; Unassembled WGS sequence.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 5.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF13306; LRR_5; 5.
DR SUPFAM; SSF52058; L domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017959};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1331
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004396085"
FT REGION 1167..1301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1187..1208
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1220..1285
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1331 AA; 145895 MW; 3DA9E837677347D6 CRC64;
MKHRFTKLLS ILLAALMIFS SLPITAFAAD GDEKLYGYEW LYTDAAGVDY IKLNEYNGND
ENLVIPAEYK GKIVDSLGIN FFDADQLKSI TVSEGIKNID QLWFWESGYV DLYLPSSLVS
ISSEAMRGNK INNLYFSEGL RGIGHYAFKA VQFKNSNIKL PESLEYLNPS SFNKSNITSI
YFGANARIGE FQYNYGDPIA YPEGEEDSDF SLFHECRSLG SITVSPQNPY LTAIDDVLYN
KKMTVLYRFP SKHDDTFEKN NPVYNYVMPK SVKSIAEFSF GDADIKINKL TFNGTLETIP
TRAFREASIN EIDWGDNSSV KVIEVEAFYE LDCESPLTLP RSVERIGADA FSRSTITAVD
FETPSNCRVI EDGAFSNCKY VKSVFIPASV ESLGTEGRSD YGAFENCTAN RSIVFEDGSK
AKIWPSDLFE GTFTGQLVPG KNSSVEEILC DFSGSDIQSV DFSSCPNLRY IAGGAFAGCD
RLISADLSNT KLFEIRTKLF NGCDNLESVK LPDSCFKIGN SAFKNCAALK EINVNNVAVI
RSNSFSGCTS LNIDVSSEEK TTEDNFTYYE ADDYAVITGC KQNGGNIVIP QTINGKPVTA
INDEAFYNEY DNRYIYNVEI PDTVKYIGSY AFAKCRLESL DLPSALEYIG TEAFYDNRNI
NVDLTLPNNV KEIGDNAFTY SGITGITLNN GLRAIGDEAF SQNDITSLVI PDSVTVCGKK
LVNVGSYDYE GRFLTEITFG AGCRNIEDYF QYGTYTIQKI SVSEDNPYYS AENGVLYNKA
KTVLLLCMEG NTNKRLVIPD TVTEICDEAF SGNRHIEYVY IPNSVKVMGD YAFQNCASLN
TVEFEKGSSY ERLYFTFNSC KNLENVIIPA DVRIGDLYAT FDDTGLTHCE LPDNVNSMSY
TFRSTPNLKS VKMPANLIKF GENCFSLSGV ESIVIPDNIK YLPYNSFYGC SNLSYVDMNN
VSSLGWRAFA YCTSLESIDL TNIKHYTRVG NLASFYGCDN LTKFYFTKET ADTDIPESGY
DGNPTVETIV VGNSVTEIKD RAFADCTNLK TALIADSVTE ISDTAFENCD NLNIVCEEGS
YAVNYAKRNS IPYTTFVVAP IPDQEYTGKA ITPELDVTAQ SKALSAGSDY TAVYSDNIHV
GTAKVNVIGL GDYSIFASLV KFNIVGEEPQ DTAPEETVPV PPQQDAEEGN GSENEGAENQ
TGSAADGNQA DHNAGAGNVP GSAQADTNGR SDTAQSAAGN PSAGAQQGNA DGTGNGSGIT
GQNSDTTGGA SNTQNENTAG ESGAANHDQQ APDLPDSGSS EETDMKWYEV ILSAILSFFN
KIIEFFRSLF S
//