ID A0A174GPY0_9CLOT Unreviewed; 219 AA.
AC A0A174GPY0;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=pre-crRNA processing endonuclease {ECO:0000256|PIRNR:PIRNR029950};
DE EC=3.1.-.- {ECO:0000256|PIRNR:PIRNR029950};
GN Name=cas5c {ECO:0000313|EMBL:RGD67258.1};
GN ORFNames=DWX31_28295 {ECO:0000313|EMBL:RGD67258.1}, DXC39_04060
GN {ECO:0000313|EMBL:RGM09132.1}, DXD79_06280
GN {ECO:0000313|EMBL:RGJ06886.1}, ERS852407_03421
GN {ECO:0000313|EMBL:CUO63248.1};
OS Hungatella hathewayi.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX NCBI_TaxID=154046 {ECO:0000313|EMBL:CUO63248.1, ECO:0000313|Proteomes:UP000095651};
RN [1] {ECO:0000313|EMBL:CUO63248.1, ECO:0000313|Proteomes:UP000095651}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2789STDY5608850 {ECO:0000313|EMBL:CUO63248.1,
RC ECO:0000313|Proteomes:UP000095651};
RG Pathogen Informatics;
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000261023, ECO:0000313|Proteomes:UP000261257}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AF19-13AC {ECO:0000313|EMBL:RGD67258.1,
RC ECO:0000313|Proteomes:UP000261023}, TF05-11AC
RC {ECO:0000313|EMBL:RGM09132.1, ECO:0000313|Proteomes:UP000261257}, and
RC TM09-12 {ECO:0000313|EMBL:RGJ06886.1,
RC ECO:0000313|Proteomes:UP000263014};
RA Zou Y., Xue W., Luo G.;
RT "A genome reference for cultivated species of the human gut microbiota.";
RL Submitted (AUG-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: CRISPR (clustered regularly interspaced short palindromic
CC repeat) is an adaptive immune system that provides protection against
CC mobile genetic elements (viruses, transposable elements and conjugative
CC plasmids). CRISPR clusters contain spacers, sequences complementary to
CC antecedent mobile elements, and target invading nucleic acids. CRISPR
CC clusters are transcribed and processed into CRISPR RNA (crRNA).
CC {ECO:0000256|PIRNR:PIRNR029950}.
CC -!- SIMILARITY: Belongs to the CRISPR-associated protein Cas5 family.
CC Subtype I-C/Dvulg subfamily. {ECO:0000256|PIRNR:PIRNR029950}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CYZE01000009; CUO63248.1; -; Genomic_DNA.
DR EMBL; QTJW01000027; RGD67258.1; -; Genomic_DNA.
DR EMBL; QSON01000002; RGJ06886.1; -; Genomic_DNA.
DR EMBL; QSSQ01000001; RGM09132.1; -; Genomic_DNA.
DR RefSeq; WP_025532413.1; NZ_QTJW01000027.1.
DR AlphaFoldDB; A0A174GPY0; -.
DR OrthoDB; 5621871at2; -.
DR Proteomes; UP000095651; Unassembled WGS sequence.
DR Proteomes; UP000261023; Unassembled WGS sequence.
DR Proteomes; UP000261257; Unassembled WGS sequence.
DR Proteomes; UP000263014; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0051607; P:defense response to virus; IEA:UniProtKB-UniRule.
DR GO; GO:0043571; P:maintenance of CRISPR repeat elements; IEA:UniProtKB-UniRule.
DR CDD; cd09752; Cas5_I-C; 1.
DR Gene3D; 3.30.70.2660; -; 1.
DR InterPro; IPR021124; CRISPR-assoc_prot_Cas5.
DR InterPro; IPR013422; CRISPR-assoc_prot_Cas5_N.
DR InterPro; IPR010155; CRISPR-assoc_prot_Cas5d.
DR NCBIfam; TIGR01876; cas_Cas5d; 1.
DR NCBIfam; TIGR02593; CRISPR_cas5; 1.
DR Pfam; PF09704; Cas_Cas5d; 1.
DR PIRSF; PIRSF029950; Cas_CT1134; 1.
PE 3: Inferred from homology;
KW Antiviral defense {ECO:0000256|ARBA:ARBA00023118,
KW ECO:0000256|PIRNR:PIRNR029950};
KW Endonuclease {ECO:0000256|PIRNR:PIRNR029950};
KW Hydrolase {ECO:0000256|PIRNR:PIRNR029950};
KW Nuclease {ECO:0000256|PIRNR:PIRNR029950};
KW Reference proteome {ECO:0000313|Proteomes:UP000095651};
KW RNA-binding {ECO:0000256|PIRNR:PIRNR029950}.
SQ SEQUENCE 219 AA; 25555 MW; 4801A750861EDA77 CRC64;
MSRGVRVRVW GDLALFSRPE MKVERCSYDV ITPSAARGML EAVYWHPGMK WVIDKIYVRK
PIQFTSIRRN EVKSKVLASS VLNVMNGGNK PLLISCRQEI VQRAAILLKD VDYVIEAHFD
MTDHASDCDN PGKFKDIIMR RLRRGECYHT PYFGCREFPA KFELYEGDDV TTEYKGMERE
LGYMFYDFDY SNPEDIQPLF FRAVLKDGVL DVRDQEVVR
//