ID B1RA68_CLOPF Unreviewed; 421 AA.
AC B1RA68;
DT 20-MAY-2008, integrated into UniProtKB/TrEMBL.
DT 20-MAY-2008, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=Phage major capsid protein, HK97 family {ECO:0000313|EMBL:EDT22893.1};
GN ORFNames=AC1_2797 {ECO:0000313|EMBL:EDT22893.1};
OS Clostridium perfringens B str. ATCC 3626.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=451754 {ECO:0000313|EMBL:EDT22893.1, ECO:0000313|Proteomes:UP000004342};
RN [1] {ECO:0000313|EMBL:EDT22893.1, ECO:0000313|Proteomes:UP000004342}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=B str. ATCC 3626 {ECO:0000313|Proteomes:UP000004342};
RA Paulsen I., Sebastian Y.;
RT "Annotation of Clostridium perfringens B str. ATCC 3626.";
RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDT22893.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABDV01000026; EDT22893.1; -; Genomic_DNA.
DR RefSeq; WP_003459483.1; NZ_ABDV01000026.1.
DR AlphaFoldDB; B1RA68; -.
DR Proteomes; UP000004342; Unassembled WGS sequence.
DR InterPro; IPR024455; Phage_capsid.
DR NCBIfam; TIGR01554; major_cap_HK97; 1.
DR Pfam; PF05065; Phage_capsid; 1.
DR SUPFAM; SSF56563; Major capsid protein gp5; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils}.
FT REGION 395..421
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 28..74
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 398..421
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 421 AA; 47674 MW; 804647B26AB07456 CRC64;
MNLFERLKEL RAKKKELEEK RCGIVEEIRS LAKEKKEEEA RSKALEREKI EARMEIIEEE
IESVMTAIDE ERKNTNFTGG RVIINGDSKE EKRSLQLSAM SKTIRGIQLS EEERDIMSST
NNGAVIPQEF VNEFEKLKEG YPSLKEHCHV IPVNRNAGKM PVRAGASVDK LANLAKDTEL
VKAMLKTQPM AYDIDDYGLL APIDNSLLED SEINFLEFVN EEFAEFAVNT ENAEIVKQAK
AVLAEETIND YAGLVKTINS LVPNARKRAI IVTNSDGRAY LDGLMDKQGR PLLKELSDGG
DLVFKGRPVI ELEESIFDVG DETKFIVSDF KTLIKFMDRK QYLIDQSKEA GYTKNETIAR
IIERFDVNSP LDKSSDAEKI RKFGVIVKLQ EVLKSSPRSG KNKNESKEEI KEEGEATQQN
E
//