ID A0A344U252_9ACTN Unreviewed; 411 AA.
AC A0A344U252;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Collagen-like protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=C0216_17320 {ECO:0000313|EMBL:AXE24973.1};
OS Streptomyces globosus.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=68209 {ECO:0000313|EMBL:AXE24973.1, ECO:0000313|Proteomes:UP000252004};
RN [1] {ECO:0000313|EMBL:AXE24973.1, ECO:0000313|Proteomes:UP000252004}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LZH-48 {ECO:0000313|EMBL:AXE24973.1,
RC ECO:0000313|Proteomes:UP000252004};
RA Ran K., Li Z., Wei S., Dong R.;
RT "Draft genome Sequence of streptomyces globosus LZH-48.";
RL Submitted (JAN-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP030862; AXE24973.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A344U252; -.
DR KEGG; sgz:C0216_17320; -.
DR Proteomes; UP000252004; Chromosome.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000252004}.
FT REGION 54..95
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 108..259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..157
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 195..227
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 411 AA; 38030 MW; E84B4C3F47CDE9E8 CRC64;
MIPGRTPRPS FGPLPGRGTW LRGGALASLA LVLTGLTSPT AAVVPRAAAP LAGTPAAPGD
EGPCPEGAPL GPGAAAARPG QAGPPPGGVD EAPPDYLAAD VLGRYTESRQ QPAAVAQQTA
AGDDRTGCRR GPTGPVGPKG PAGPKGPAGP TGPTGPAGPA GPAGDDGVDG QDGLDGTPGA
TGATGIPGVT GATGATGATG ATGTTGIPGA TGATGATGET GLQGATGATG PTGAAGEAGA
AGAAGPAGPT GATGATGATG GTGAPGATGA TGATGATGAT GATGAAGPCS DIDSYAPSRA
EGFHAVLTDG TAFAGRAVPF GGPVIAWQDL TNPVAVGTDP ANPGYPAGAC AIGIEAQGDD
AYVNVVTAAG AVWQTHGDVN GAGFVWDEPW VQRTTPAPVV RRGLKPSGPH G
//