GenomeNet

Database: UniProt
Entry: D3E6X8_GEOS4
LinkDB: D3E6X8_GEOS4
Original site: D3E6X8_GEOS4 
ID   D3E6X8_GEOS4            Unreviewed;       767 AA.
AC   D3E6X8;
DT   23-MAR-2010, integrated into UniProtKB/TrEMBL.
DT   23-MAR-2010, sequence version 1.
DT   27-MAR-2024, entry version 57.
DE   SubName: Full=Collagen triple helix repeat protein {ECO:0000313|EMBL:ACX66451.1};
GN   OrderedLocusNames=GYMC10_4225 {ECO:0000313|EMBL:ACX66451.1};
OS   Geobacillus sp. (strain Y412MC10).
OC   Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX   NCBI_TaxID=481743 {ECO:0000313|EMBL:ACX66451.1, ECO:0000313|Proteomes:UP000002381};
RN   [1] {ECO:0000313|Proteomes:UP000002381}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Y412MC10 {ECO:0000313|Proteomes:UP000002381};
RA   Lucas S., Copeland A., Lapidus A., Glavina del Rio T., Dalin E., Tice H.,
RA   Bruce D., Goodwin L., Pitluck S., Saunders E., Brettin T., Detter J.C.,
RA   Han C., Larimer F., Land M., Hauser L., Kyrpides N., Ovchinnikova G.,
RA   Brumm P., Mead D.;
RT   "Complete sequence of Geobacillus sp. Y412MC10.";
RL   Submitted (OCT-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ACX66451.1, ECO:0000313|Proteomes:UP000002381}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Y412MC10 {ECO:0000313|EMBL:ACX66451.1,
RC   ECO:0000313|Proteomes:UP000002381};
RX   PubMed=23408395; DOI=10.4056/sigs.2605792;
RA   Mead D.A., Lucas S., Copeland A., Lapidus A., Cheng J.F., Bruce D.C.,
RA   Goodwin L.A., Pitluck S., Chertkov O., Zhang X., Detter J.C., Han C.S.,
RA   Tapia R., Land M., Hauser L.J., Chang Y.J., Kyrpides N.C., Ivanova N.N.,
RA   Ovchinnikova G., Woyke T., Brumm C., Hochstein R., Schoenfeld T., Brumm P.;
RT   "Complete Genome Sequence of Paenibacillus strain Y4.12MC10, a Novel
RT   Paenibacillus lautus strain Isolated from Obsidian Hot Spring in
RT   Yellowstone National Park.";
RL   Stand. Genomic Sci. 6:381-400(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CP001793; ACX66451.1; -; Genomic_DNA.
DR   AlphaFoldDB; D3E6X8; -.
DR   KEGG; gym:GYMC10_4225; -.
DR   HOGENOM; CLU_001074_9_1_9; -.
DR   Proteomes; UP000002381; Chromosome.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:ACX66451.1}.
FT   REGION          1..157
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          171..202
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          214..272
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          348..438
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          444..463
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          469..490
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          516..606
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          612..631
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          637..658
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          684..767
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        7..33
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        42..108
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        123..157
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        178..201
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        217..252
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        348..375
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        384..438
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        516..543
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        552..606
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        684..711
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        720..767
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   767 AA;  62926 MW;  9668C86E515B1023 CRC64;
     MIGATGVTGT TGETGATGVT GATGATGETG VTGATGATGA IGVTGATGTT GATGATGATG
     VTGETGATGV TGATGETGAT GITGATGATG VTGATGVTGA TGVTGATGET GATGAAGVTG
     ATGVTGATGS TGATGETGAT GATGATGATG VTGATGDTGA TGVTGATGAA GVTGATGATG
     VTGATGATGV TGATGDTGAT GVTGATGAVG VTGATGATGV TGATGATGVT GATGATGVTG
     ATGVTGETGA TGATGATGEA GATGVTGETG ATGATGVMGV TGATGATGAT GATGATGATG
     VTGATGATGA TGATGVTGAT GATGATGITG ATGATGATGA TGVIGATGAT GTTGETGATG
     VTGATGATGE TGVTGATGAT GAIGVTGATG TTGATGATGA TGVTGETGAT GVTGATGETG
     ATGITGATGA TGVTGATGVT GATGVTGATG ETGATGAAGV TGATGVTGAT GTTGATGATG
     ATGVTGATGA TGATGITGAT GATGATGATG VIGATGVTGT TGETGATGVT GATGATGETG
     VTGATGATGA IGVTGATGTT GATGATGATG VTGETGATGV TGATGETGAT GITGATGATG
     VTGATGVTGA TGVTGATGET GATGAAGVTG ATGVTGATGT TGATGATGAT GVTGATGATG
     ATGITGATGA TGATGATGVI GATGATGTTG ETGATGVTGA TGATGETGVT GATGATGAIG
     VTGATGTTGE TGVTGATGAT GTTGAATGAT GVRGNRSNRI NSNRSYR
//
DBGET integrated database retrieval system