ID R5E9V1_9CLOT Unreviewed; 676 AA.
AC R5E9V1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE SubName: Full=RHS repeat-associated core domain protein {ECO:0000313|EMBL:CCX85522.1};
GN ORFNames=BN724_00966 {ECO:0000313|EMBL:CCX85522.1};
OS Clostridium sp. CAG:590.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262825 {ECO:0000313|EMBL:CCX85522.1, ECO:0000313|Proteomes:UP000017939};
RN [1] {ECO:0000313|EMBL:CCX85522.1, ECO:0000313|Proteomes:UP000017939}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:590 {ECO:0000313|Proteomes:UP000017939};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCX85522.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAXF010000017; CCX85522.1; -; Genomic_DNA.
DR AlphaFoldDB; R5E9V1; -.
DR STRING; 1262825.BN724_00966; -.
DR Proteomes; UP000017939; Unassembled WGS sequence.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 2.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR03696; Rhs_assc_core; 1.
DR NCBIfam; TIGR01643; YD_repeat_2x; 6.
DR PANTHER; PTHR32305; -; 1.
DR PANTHER; PTHR32305:SF15; PROTEIN RHSA-RELATED; 1.
DR Pfam; PF05593; RHS_repeat; 5.
DR SUPFAM; SSF69304; Tricorn protease N-terminal domain; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017939}.
SQ SEQUENCE 676 AA; 75681 MW; 3AE70CABB99A91EB CRC64;
MVSVTDADGN VITNMYDSFG NLIKQMNEST KDVATYAYIG GRYLAASSDA LGNTATMTYD
SMGNIATLTN PNGGVTSYTY DLNSNLTEER IGEDYHIGYT YNAQNLVASK TNSRNQGTVY
VYDAVGRIIR QTDEEGVIEY SYDANGNVLT VTETKGENVF TITRTYDGLN RVTSYEDGKG
KRIGYVYDKV GNLTDVIYPN GKKVTYVYDK NGSIKKLTDW DKRITTYDYD VNGRLIKTTR
PNGTVETRAY DKMGRLTLIL DKAGGTEVNH QEYCYDAAGN ITETKSLGNK TLEHSDVSDV
KMTYDKNNRL ITYNGKDVEY DKDGNMVYGP LQGKMVSFLY DCRNRLVQAG DTTYEYDAEN
NRIAEINGSR RIEYVINTQP ELGQILQSIS IMGNRKEETY YYYGNGLTAQ DNGTDYLTYH
FNNVGSTMAV TDEKGNIAGA YEYSPYGQIL HKEGNANITF LYNGQYGVAT DENGLYYMRA
RYYNVDIKRF INQDVLTGTL ERISSLNRYA YVEGNPINYL DPFGLEAYDT TVLHEVAMIA
GLISFVASKI CPKIAIIIAA AANGFDIGLY LYDSFYDIKQ GEMDEFAKNL KGIAIDLIGI
VTAGVTNGYF KAAESAANYK GIMYGYEQQL SMLYDKWQYA DNAVSNGTVA LQVLSKFYLF
LKEKVMGNDK KNEKTL
//