ID G7MBN6_9CLOT Unreviewed; 455 AA.
AC G7MBN6;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 31.
DE SubName: Full=Major capsid protein HK97 {ECO:0000313|EMBL:EHI96992.1};
GN ORFNames=CDLVIII_0254 {ECO:0000313|EMBL:EHI96992.1};
OS Clostridium sp. DL-VIII.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=641107 {ECO:0000313|EMBL:EHI96992.1, ECO:0000313|Proteomes:UP000005106};
RN [1] {ECO:0000313|EMBL:EHI96992.1, ECO:0000313|Proteomes:UP000005106}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DL-VIII {ECO:0000313|EMBL:EHI96992.1,
RC ECO:0000313|Proteomes:UP000005106};
RX PubMed=23929491;
RA Taghavi S., Izquierdo J.A., van der Lelie D.;
RT "Complete Genome Sequence of Clostridium sp. Strain DL-VIII, a Novel
RT Solventogenic Clostridium Species Isolated from Anaerobic Sludge.";
RL Genome Announc. 1:e00605-e00613(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001240; EHI96992.1; -; Genomic_DNA.
DR RefSeq; WP_009167678.1; NZ_CM001240.1.
DR AlphaFoldDB; G7MBN6; -.
DR STRING; 641107.CDLVIII_0254; -.
DR eggNOG; ENOG503468Y; Bacteria.
DR HOGENOM; CLU_600921_0_0_9; -.
DR Proteomes; UP000005106; Chromosome.
DR Gene3D; 3.30.2400.10; Major capsid protein gp5; 1.
DR InterPro; IPR024455; Phage_capsid.
DR NCBIfam; TIGR01554; major_cap_HK97; 1.
DR Pfam; PF05065; Phage_capsid; 1.
DR SUPFAM; SSF56563; Major capsid protein gp5; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000005106}.
FT COILED 6..68
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 455 AA; 50303 MW; 1CAF5419E9E49065 CRC64;
MDFKKYKEYK AERSKLLNEA QTIMRKVNEK NQMTQADQEK VDSLFNQIDK LDNENKELDK
KYKSDFENYI NSLIDPNESA LEQANRRALE DNHRVTDIYS KSVDIKNGVV IDSIGDSIRG
ENKVSNLFLN KADKLADRVS VSDERTKELL NQDGALGTVI KGMVTGKWSN QEFKNIVTTT
STGVLIPEVL SANIIDLARN LSLFTNAGVP VVPMESNNMT ISRVKTDPTF AFKAEGTEGA
EGSFELDSVE LKAKTVYGYA YVSLESINSS TNLDQIIRQV FAQAMANTID KAFIYGQANA
DNTGFETFAP GGIMNDSAIN SISATTGGGY DDFIKAISKV RQANGNPTAY GINAETEELL
SLLKTNDGQY LSAPKAVTDL QQIVSNQLKY DTTNGSDSLV FDPMAMLIGI QNNIQIKIIE
DTECLKKGLI AFQIYSMLDC KTTRPKHICK ITGIK
//