ID C6LAR5_9FIRM Unreviewed; 343 AA.
AC C6LAR5;
DT 22-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 22-SEP-2009, sequence version 1.
DT 24-JAN-2024, entry version 44.
DE RecName: Full=CRISPR-associated endonuclease Cas1 {ECO:0000256|HAMAP-Rule:MF_01470};
DE EC=3.1.-.- {ECO:0000256|HAMAP-Rule:MF_01470};
GN Name=cas1 {ECO:0000256|HAMAP-Rule:MF_01470,
GN ECO:0000313|EMBL:EET62046.1};
GN ORFNames=BRYFOR_05709 {ECO:0000313|EMBL:EET62046.1};
OS Marvinbryantia formatexigens DSM 14469.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Marvinbryantia.
OX NCBI_TaxID=478749 {ECO:0000313|EMBL:EET62046.1, ECO:0000313|Proteomes:UP000005561};
RN [1] {ECO:0000313|EMBL:EET62046.1, ECO:0000313|Proteomes:UP000005561}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 14469 {ECO:0000313|EMBL:EET62046.1,
RC ECO:0000313|Proteomes:UP000005561};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Nash W.E., Warren W., Chinwalla A.,
RA Mardis E.R., Wilson R.K.;
RL Submitted (JUL-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: CRISPR (clustered regularly interspaced short palindromic
CC repeat), is an adaptive immune system that provides protection against
CC mobile genetic elements (viruses, transposable elements and conjugative
CC plasmids). CRISPR clusters contain spacers, sequences complementary to
CC antecedent mobile elements, and target invading nucleic acids. CRISPR
CC clusters are transcribed and processed into CRISPR RNA (crRNA). Acts as
CC a dsDNA endonuclease. Involved in the integration of spacer DNA into
CC the CRISPR cassette. {ECO:0000256|HAMAP-Rule:MF_01470}.
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|HAMAP-Rule:MF_01470};
CC Name=Mn(2+); Xref=ChEBI:CHEBI:29035;
CC Evidence={ECO:0000256|HAMAP-Rule:MF_01470};
CC -!- SUBUNIT: Homodimer, forms a heterotetramer with a Cas2 homodimer.
CC {ECO:0000256|ARBA:ARBA00038592, ECO:0000256|HAMAP-Rule:MF_01470}.
CC -!- SIMILARITY: Belongs to the CRISPR-associated endonuclease Cas1 family.
CC {ECO:0000256|HAMAP-Rule:MF_01470}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EET62046.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACCL02000003; EET62046.1; -; Genomic_DNA.
DR RefSeq; WP_006860507.1; NZ_CP102268.1.
DR AlphaFoldDB; C6LAR5; -.
DR STRING; 168384.SAMN05660368_03893; -.
DR eggNOG; COG1518; Bacteria.
DR OrthoDB; 9803119at2; -.
DR Proteomes; UP000005561; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004520; F:DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0051607; P:defense response to virus; IEA:UniProtKB-UniRule.
DR GO; GO:0043571; P:maintenance of CRISPR repeat elements; IEA:UniProtKB-UniRule.
DR CDD; cd09721; Cas1_I-C; 1.
DR Gene3D; 1.20.120.920; CRISPR-associated endonuclease Cas1, C-terminal domain; 1.
DR Gene3D; 3.100.10.20; CRISPR-associated endonuclease Cas1, N-terminal domain; 1.
DR HAMAP; MF_01470; Cas1; 1.
DR InterPro; IPR002729; CRISPR-assoc_Cas1.
DR InterPro; IPR042206; CRISPR-assoc_Cas1_C.
DR InterPro; IPR019856; CRISPR-assoc_Cas1_DVULG.
DR InterPro; IPR042211; CRISPR-assoc_Cas1_N.
DR NCBIfam; TIGR00287; cas1; 1.
DR NCBIfam; TIGR03640; cas1_DVULG; 1.
DR PANTHER; PTHR34353; CRISPR-ASSOCIATED ENDONUCLEASE CAS1 1; 1.
DR PANTHER; PTHR34353:SF2; CRISPR-ASSOCIATED ENDONUCLEASE CAS1 1; 1.
DR Pfam; PF01867; Cas_Cas1; 1.
PE 3: Inferred from homology;
KW Antiviral defense {ECO:0000256|ARBA:ARBA00023118, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|HAMAP-Rule:MF_01470};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842, ECO:0000256|HAMAP-Rule:MF_01470};
KW Manganese {ECO:0000256|ARBA:ARBA00023211, ECO:0000256|HAMAP-Rule:MF_01470};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722, ECO:0000256|HAMAP-Rule:MF_01470};
KW Reference proteome {ECO:0000313|Proteomes:UP000005561}.
FT BINDING 166
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
FT BINDING 234
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
FT BINDING 249
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
SQ SEQUENCE 343 AA; 39083 MW; BF2D5AD80773258D CRC64;
MKKLMNTLYV TSEDSYLSLD GENVVVLDKE REIGRLPLHN LEGIVSFGYR GTSPALMGAC
AERNISLCYL TPQGKFLARV SGKIKGNVIL REQQYRSFLD EKKSLEVAKN CITGKIYNAR
WVLERATRDH SMQVDVEKLK AASENLKNSL SLVRDCQSTE QLRGFEGEAA KVYFGVFDEL
ILQQKKDFSF HGRSRRPPMD NVNALLSFVY TLLTNMVTSA LETVGLDPYV GCFHTERPGR
ASLALDLVEE LRPVLADRFV VSLINKRMVT EKGFKAKENG AVLMDDETRK ILLTEWQNKK
KETLMHPFLN EKVEWGIVPY VQAMLLARYL RGDLDGYPPF LWK
//