ID R5DRQ1_9FIRM Unreviewed; 350 AA.
AC R5DRQ1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Collagen triple helix repeat (20 copies) {ECO:0000313|EMBL:CCX82863.1};
GN ORFNames=BN462_00735 {ECO:0000313|EMBL:CCX82863.1};
OS Ruminococcus sp. CAG:108.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=1262950 {ECO:0000313|EMBL:CCX82863.1, ECO:0000313|Proteomes:UP000018037};
RN [1] {ECO:0000313|EMBL:CCX82863.1, ECO:0000313|Proteomes:UP000018037}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:108 {ECO:0000313|Proteomes:UP000018037};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCX82863.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAXD010000061; CCX82863.1; -; Genomic_DNA.
DR AlphaFoldDB; R5DRQ1; -.
DR Proteomes; UP000018037; Unassembled WGS sequence.
DR Gene3D; 2.60.120.220; Satellite virus coat domain; 2.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR PANTHER; PTHR24637:SF428; SCAVENGER RECEPTOR CLASS A MEMBER 3; 1.
DR Pfam; PF01391; Collagen; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:CCX82863.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018037}.
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 48..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 109..199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 48..66
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..129
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 168..195
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 350 AA; 37133 MW; 64AB1B5AD44E4ED8 CRC64;
MKGLDGKDGI DGKNGVDGID GKNGDTPFIG ENGNWWLGVT DIGVKAAGVD GEKGDKGDKG
DAGENGKDGA NGKTPEFRVN ENNLQWRYVG DEIWLNLYDL SILKGLDGAD GKDGINGKDG
VDGKDGSDGK DGTNGQNGAD GKDGNTPFIG ENGNWWIGTT DTGVKATGVD GEKGDKGDTG
EKGEKGDKGD KGDKGDAGQN GSCSGYFYGE ATSPSAVVLN NSRANITVYQ KINKGGLISS
YMNNITLKKG HIYNVCLSGS LEVGSNEANK SGNYSIQMTD GYDDDLCREL TRIKRDGAKI
PYTNDQHSFN FNRMYDASNK DITLQLWFEN SAYNTYLGGF RGTITITALD
//