ID F4A3B8_MAHA5 Unreviewed; 331 AA.
AC F4A3B8;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 24-JAN-2024, entry version 51.
DE RecName: Full=CRISPR-associated endonuclease Cas1 {ECO:0000256|HAMAP-Rule:MF_01470};
DE EC=3.1.-.- {ECO:0000256|HAMAP-Rule:MF_01470};
GN Name=cas1 {ECO:0000256|HAMAP-Rule:MF_01470};
GN OrderedLocusNames=Mahau_2201 {ECO:0000313|EMBL:AEE97373.1};
OS Mahella australiensis (strain DSM 15567 / CIP 107919 / 50-1 BON).
OC Bacteria; Bacillota; Clostridia; Thermoanaerobacterales;
OC Thermoanaerobacterales Family IV. Incertae Sedis; Mahella.
OX NCBI_TaxID=697281 {ECO:0000313|EMBL:AEE97373.1, ECO:0000313|Proteomes:UP000008457};
RN [1] {ECO:0000313|Proteomes:UP000008457}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 15567 / CIP 107919 / 50-1 BON
RC {ECO:0000313|Proteomes:UP000008457};
RG US DOE Joint Genome Institute (JGI-PGF);
RA Lucas S., Copeland A., Lapidus A., Bruce D., Goodwin L., Pitluck S.,
RA Kyrpides N., Mavromatis K., Pagani I., Ivanova N., Teshima H., Brettin T.,
RA Detter J.C., Han C., Tapia R., Land M., Hauser L., Markowitz V.,
RA Cheng J.-F., Hugenholtz P., Woyke T., Wu D., Spring S., Pukall R.,
RA Steenblock K., Schneider S., Klenk H.-P., Eisen J.A.;
RT "The complete genome of Mahella australiensis DSM 15567.";
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:AEE97373.1, ECO:0000313|Proteomes:UP000008457}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 15567 / CIP 107919 / 50-1 BON
RC {ECO:0000313|Proteomes:UP000008457};
RX PubMed=21886860;
RA Sikorski J., Teshima H., Nolan M., Lucas S., Hammon N., Deshpande S.,
RA Cheng J.F., Pitluck S., Liolios K., Pagani I., Ivanova N., Huntemann M.,
RA Mavromatis K., Ovchinikova G., Pati A., Tapia R., Han C., Goodwin L.,
RA Chen A., Palaniappan K., Land M., Hauser L., Ngatchou-Djao O.D., Rohde M.,
RA Pukall R., Spring S., Abt B., Goker M., Detter J.C., Woyke T., Bristow J.,
RA Markowitz V., Hugenholtz P., Eisen J.A., Kyrpides N.C., Klenk H.P.,
RA Lapidus A.;
RT "Complete genome sequence of Mahella australiensis type strain (50-1
RT BON).";
RL Stand. Genomic Sci. 4:331-341(2011).
CC -!- FUNCTION: CRISPR (clustered regularly interspaced short palindromic
CC repeat), is an adaptive immune system that provides protection against
CC mobile genetic elements (viruses, transposable elements and conjugative
CC plasmids). CRISPR clusters contain spacers, sequences complementary to
CC antecedent mobile elements, and target invading nucleic acids. CRISPR
CC clusters are transcribed and processed into CRISPR RNA (crRNA). Acts as
CC a dsDNA endonuclease. Involved in the integration of spacer DNA into
CC the CRISPR cassette. {ECO:0000256|HAMAP-Rule:MF_01470}.
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|HAMAP-Rule:MF_01470};
CC Name=Mn(2+); Xref=ChEBI:CHEBI:29035;
CC Evidence={ECO:0000256|HAMAP-Rule:MF_01470};
CC -!- SUBUNIT: Homodimer, forms a heterotetramer with a Cas2 homodimer.
CC {ECO:0000256|HAMAP-Rule:MF_01470}.
CC -!- SIMILARITY: Belongs to the CRISPR-associated endonuclease Cas1 family.
CC {ECO:0000256|HAMAP-Rule:MF_01470}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002360; AEE97373.1; -; Genomic_DNA.
DR RefSeq; WP_013781800.1; NC_015520.1.
DR AlphaFoldDB; F4A3B8; -.
DR STRING; 697281.Mahau_2201; -.
DR KEGG; mas:Mahau_2201; -.
DR eggNOG; COG1518; Bacteria.
DR HOGENOM; CLU_052779_2_0_9; -.
DR OMA; YYVGSFY; -.
DR OrthoDB; 9803119at2; -.
DR Proteomes; UP000008457; Chromosome.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004520; F:DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0051607; P:defense response to virus; IEA:UniProtKB-UniRule.
DR GO; GO:0043571; P:maintenance of CRISPR repeat elements; IEA:UniProtKB-UniRule.
DR CDD; cd09722; Cas1_I-B; 1.
DR Gene3D; 1.20.120.920; CRISPR-associated endonuclease Cas1, C-terminal domain; 1.
DR Gene3D; 3.100.10.20; CRISPR-associated endonuclease Cas1, N-terminal domain; 1.
DR HAMAP; MF_01470; Cas1; 1.
DR InterPro; IPR002729; CRISPR-assoc_Cas1.
DR InterPro; IPR042206; CRISPR-assoc_Cas1_C.
DR InterPro; IPR019858; CRISPR-assoc_Cas1_HMARI/TNEAP.
DR InterPro; IPR042211; CRISPR-assoc_Cas1_N.
DR NCBIfam; TIGR00287; cas1; 1.
DR NCBIfam; TIGR03641; cas1_HMARI; 1.
DR PANTHER; PTHR43219; CRISPR-ASSOCIATED ENDONUCLEASE CAS1; 1.
DR PANTHER; PTHR43219:SF1; CRISPR-ASSOCIATED ENDONUCLEASE CAS1; 1.
DR Pfam; PF01867; Cas_Cas1; 1.
PE 3: Inferred from homology;
KW Antiviral defense {ECO:0000256|ARBA:ARBA00023118, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|HAMAP-Rule:MF_01470};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842, ECO:0000256|HAMAP-Rule:MF_01470};
KW Manganese {ECO:0000256|HAMAP-Rule:MF_01470};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|HAMAP-
KW Rule:MF_01470};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722, ECO:0000256|HAMAP-Rule:MF_01470};
KW Reference proteome {ECO:0000313|Proteomes:UP000008457}.
FT BINDING 157
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
FT BINDING 223
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
FT BINDING 238
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_01470"
SQ SEQUENCE 331 AA; 39211 MW; E854742B2416C47F CRC64;
MKKSFYIFSS GEFERKDNTV YFKSEAGSKY IPIEDISEIM IFGEVNFNKR FIEFLSQKEV
LLHFFNHYGY YTGSFYPREH LNSGYMILKQ AEYYMDVDKR LCLARHFVQG ATDNIQHVLK
YYINRGRDSL QAISDEIDKL YAITYECTGV EELMAIEGNI RDQYYKGFDT IIDKPAFAFE
QRTRRPPQNR MNTLISFGNS IIYTLVLSEI YKTHLDPRIG YLHTTNFRRF TLNLDVAEIF
KPILVDRAIF TLLGKNMITA NDFQSYADGI VLKEKAQKAF VTELDKRFAT TINHRDLHRQ
VSYRSIIRME LYKLEKHFMG EKEYSPFVSR W
//