ID C3GTE4_BACTU Unreviewed; 398 AA.
AC C3GTE4;
DT 16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT 16-JUN-2009, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE RecName: Full=Collagen triple helix repeat protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=bthur0010_55630 {ECO:0000313|EMBL:EEM74434.1};
OS Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1.
OC Bacteria; Bacillota; Bacilli; Bacillales; Bacillaceae; Bacillus;
OC Bacillus cereus group.
OX NCBI_TaxID=527029 {ECO:0000313|EMBL:EEM74434.1};
RN [1] {ECO:0000313|EMBL:EEM74434.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGSC 4BA1 {ECO:0000313|EMBL:EEM74434.1};
RX PubMed=22645259; DOI=10.1101/gr.134437.111;
RA Zwick M.E., Joseph S.J., Didelot X., Chen P.E., Bishop-Lilly K.A.,
RA Stewart A.C., Willner K., Nolan N., Lentz S., Thomason M.K.,
RA Sozhamannan S., Mateczun A.J., Du L., Read T.D.;
RT "Genomic characterization of the Bacillus cereus sensu lato species:
RT Backdrop to the evolution of Bacillus anthracis.";
RL Genome Res. 22:1512-1524(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEM74434.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACNH01000129; EEM74434.1; -; Genomic_DNA.
DR AlphaFoldDB; C3GTE4; -.
DR HOGENOM; CLU_059323_0_0_9; -.
DR Proteomes; UP000006660; Chromosome.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1086; CUTICLE COLLAGEN 99-RELATED; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
SQ SEQUENCE 398 AA; 38510 MW; 777A04096A937EDD CRC64;
MKEMSQANIP NITPTITIGL EDAINLLLVS VALEELGVSH IINAEAEKLQ YTLGTIPGIS
TPATISDLLA INSSIKDTIV ESIKLEIVLE KKLEAVLNTP LKGATGATGA TGPTGETGAT
GAAGVTGVTG ATGATGATGV TGATGAAGAT GVTGPTGATG PTGATGVTGA TGATGITGVT
GVTGVTGATG TTGTTGAIGP TGVCPCVTGP TGPTGATGVG VTGATGETGA TGVAGATGVM
GATGATGATG ETGATGVQVT FNNAFFRTPG TTLVDSGAPI PFAENFTLNG NAITHVAGSP
DILLAPNQTY YITYRTKDEL LTPGSITGHV QLQLNGVPIP GTDSLMDSLP GAPQAITSLI
SNTIINTTGD TPSVLNLVNL DPTAQSFDLT TLTVVKLQ
//