GenomeNet

Database: UniProt
Entry: R5ILS4_9CLOT
ID   R5ILS4_9CLOT            Unreviewed;       186 AA.
AC   R5ILS4;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Collagen triple helix repeat (20 copies) {ECO:0000313|EMBL:CCY41269.1};
GN   ORFNames=BN757_01605 {ECO:0000313|EMBL:CCY41269.1};
OS   Clostridium sp. CAG:7.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC   Clostridium.
OX   NCBI_TaxID=1262832 {ECO:0000313|EMBL:CCY41269.1, ECO:0000313|Proteomes:UP000018268};
RN   [1] {ECO:0000313|EMBL:CCY41269.1, ECO:0000313|Proteomes:UP000018268}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MGS:7 {ECO:0000313|Proteomes:UP000018268};
RA   Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA   Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA   Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA   Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA   Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA   Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA   Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA   Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA   Wang J., Brunak S., Ehrlich S.D.;
RT   "Dependencies among metagenomic species, viruses, plasmids and units of
RT   genetic variation.";
RL   Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCY41269.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAYE010000049; CCY41269.1; -; Genomic_DNA.
DR   AlphaFoldDB; R5ILS4; -.
DR   STRING; 1262832.BN757_01605; -.
DR   Proteomes; UP000018268; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:CCY41269.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018268}.
FT   REGION          29..160
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        63..79
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   186 AA;  18020 MW;  0D2018774EE3A0C0 CRC64;
     MCFDDLDGRN EFGRRPGCCG PDGGRGGNCG PCSGSPGNGG PDGCGPGGWR PDPWRPGCGC
     TGRPGPMGPP GPRGPQGFPG YPGPQGETGA TGETGPQGPA GATGPQGPQG PVGETGATGP
     QGATGPQGPQ GPEGPRGETG PQGPAGPAGT VTPAAAVDDA TSTEDIVTQF NLLLNRLREA
     GLLETE
//