ID R5S653_9CLOT Unreviewed; 1028 AA.
AC R5S653;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Repeat protein {ECO:0000313|EMBL:CCZ42451.1};
GN ORFNames=BN479_02342 {ECO:0000313|EMBL:CCZ42451.1};
OS Clostridium sp. CAG:122.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262773 {ECO:0000313|EMBL:CCZ42451.1, ECO:0000313|Proteomes:UP000017991};
RN [1] {ECO:0000313|EMBL:CCZ42451.1, ECO:0000313|Proteomes:UP000017991}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:122 {ECO:0000313|Proteomes:UP000017991};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCZ42451.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAZY010000244; CCZ42451.1; -; Genomic_DNA.
DR AlphaFoldDB; R5S653; -.
DR Proteomes; UP000017991; Unassembled WGS sequence.
DR Gene3D; 2.160.20.110; -; 1.
DR Gene3D; 2.60.40.3630; -; 1.
DR Gene3D; 2.60.40.4270; Listeria-Bacteroides repeat domain; 3.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 1.
DR InterPro; IPR011493; GLUG.
DR InterPro; IPR022038; Ig-like_bact.
DR InterPro; IPR013378; InlB-like_B-rpt.
DR InterPro; IPR042229; Listeria/Bacterioides_rpt_sf.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR NCBIfam; TIGR02543; List_Bact_rpt; 1.
DR Pfam; PF07523; Big_3; 1.
DR Pfam; PF09479; Flg_new; 3.
DR Pfam; PF07581; Glug; 1.
DR Pfam; PF13306; LRR_5; 1.
DR SUPFAM; SSF52058; L domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017991}.
FT DOMAIN 270..291
FT /note="GLUG"
FT /evidence="ECO:0000259|Pfam:PF07581"
FT DOMAIN 602..664
FT /note="Ig-like"
FT /evidence="ECO:0000259|Pfam:PF07523"
FT REGION 792..888
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 798..820
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 821..888
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1028 AA; 111751 MW; 24E195676A79D167 CRC64;
MKCKFSEVQK GTVSFDLTTA GTGYADKTKA IGHEYGSLPV PEKDGYDFVG WSETADGNNI
ITAFDRLKTP TVTLYAIWSK QVTEQEGKPT RIPLSEEEQE KDIVYTEINS VADFIKIREN
LSGNYKLMSD IDLSKVTKDD KNVGKLGWIP FGYGTDGKWQ GSFSGIFDGN GHSISGLTMK
GNIDGSKNGE TPYTSTGLFG RVEGGKLKNV NINDADITVS NAATGCCIHT GILAGFVDSD
EETGANASVE NVSVSGKVIF SSPKQTVEEN LAVGGIIGYA GKADITNCTN NAFVGYESKE
KEMPVNPTSR FLGGITGYLQ RGNITSCVNK GNVVSYRCYY GNYNAGGDII DSALSGLNCY
NASGGICGVA AQDGLIQKCY NIAKVDTYTS NTYSLFDFTT NSYSNTMSGG ICGALYLSTQ
IKNCYNASDI HSYVNRDTTI VSPDAGDSIF DSIMKKKKLD EMTPTVPFQN AIAYAAGIVG
YSTGSSSGPV AYCFNTGKIQ GDNKHTYPIL CGSVPVFYSV YEQQKDLEGN DLYTRGAESC
DDKNSTTTAR TTDELKSEDA FRGFDFTNVW VSAEETGLSY PQLYDNMENA ITDVKYQADG
LKTEYKYGEK LDLSGLKVSY KLNGKTITKT VPDDKECDYD RFKEGTQKIT INYCNVPYTF
NVKVAEKLYK VIFTDSDGKT VIAEQELKKG QDATAPEAPA KEGYEFTGWD RSFTNVTRDI
VVKAVYTKLY KVTFKSKDGQ TELKVQYVRS GEAAEAPEAP EIEGYTFSRW SDDFTSVYRD
MVITAYYDII ATPKPTKTPK PTKTPAPTKT PKPTSTPKPT KTPKPTATVK PTGTASPNES
SNPDETVKPT ETPAQTVAPT KNPNATASPT KTPLKKNQTV TPVKNNKKSD AASYKVTDVK
KKTVTYSKTK TTSKKAVVPD TITVNGTKLK VTAVGASAFA GNKKIKTVTL GKNITKIGTK
AFYKAKNLSQ ITVNGNTIKS IGKNAFSGVK KNCKITVRAK DKKQYNKIVK LIKKAGAKKV
KFAYKKKK
//