ID A0A1C2IAU2_ACITH Unreviewed; 886 AA.
AC A0A1C2IAU2;
DT 02-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=DOD-type homing endonuclease domain-containing protein {ECO:0000259|PROSITE:PS50819};
GN ORFNames=A6P07_19980 {ECO:0000313|EMBL:OCX67511.1};
OS Acidithiobacillus thiooxidans (Thiobacillus thiooxidans).
OC Bacteria; Pseudomonadota; Acidithiobacillia; Acidithiobacillales;
OC Acidithiobacillaceae; Acidithiobacillus.
OX NCBI_TaxID=930 {ECO:0000313|EMBL:OCX67511.1, ECO:0000313|Proteomes:UP000094893};
RN [1] {ECO:0000313|EMBL:OCX67511.1, ECO:0000313|Proteomes:UP000094893}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A02 {ECO:0000313|EMBL:OCX67511.1,
RC ECO:0000313|Proteomes:UP000094893};
RA Zhang X., Feng X., Tao J., Ma L., Xiao Y., Liang Y., Liu X., Yin H.;
RT "Comparative genomics of the extreme acidophile Acidithiobacillus
RT thiooxidans reveals intraspecific divergence and niche adaptation.";
RL Int. J. Mol. Sci. 0:0-0(2016).
CC -!- SIMILARITY: Belongs to the ribonucleoside diphosphate reductase large
CC chain family. {ECO:0000256|ARBA:ARBA00010406}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OCX67511.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWSA01000348; OCX67511.1; -; Genomic_DNA.
DR RefSeq; WP_051488096.1; NZ_LZYI01000193.1.
DR AlphaFoldDB; A0A1C2IAU2; -.
DR STRING; 930.GCA_002079865_00432; -.
DR Proteomes; UP000094893; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR Gene3D; 3.20.70.20; -; 1.
DR Gene3D; 3.10.28.10; Homing endonucleases; 1.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR006142; INTEIN.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR004860; LAGLIDADG_2.
DR InterPro; IPR013346; NrdE_NrdA_C.
DR InterPro; IPR000788; RNR_lg_C.
DR InterPro; IPR039718; Rrm1.
DR NCBIfam; TIGR02506; NrdE_NrdA; 1.
DR PANTHER; PTHR11573; RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE LARGE CHAIN; 1.
DR PANTHER; PTHR11573:SF6; RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE LARGE SUBUNIT; 1.
DR Pfam; PF14528; LAGLIDADG_3; 1.
DR Pfam; PF02867; Ribonuc_red_lgC; 1.
DR PRINTS; PR00379; INTEIN.
DR PRINTS; PR01183; RIBORDTASEM1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF55608; Homing endonucleases; 1.
DR SUPFAM; SSF51998; PFL-like glycyl radical enzymes; 1.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS00089; RIBORED_LARGE; 1.
PE 3: Inferred from homology;
FT DOMAIN 90..245
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000259|PROSITE:PS50819"
FT REGION 249..269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..264
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 886 AA; 99214 MW; B45C2632D7B654A1 CRC64;
MTYAQNDPMV EIRIKHSLMP IQVTTGHPVL AIQNVALGET SARTLKRLEN GKRSPTWVDA
GNLQVGDYVA QTIPGEVIPV AGFTDEEAYL YGIMLGDGHV AQKQGVNREY GVTLNATSKA
HLAQFVRQYL LEHDIHFWEN SAHGSVVQLR WAYGRTCLRD AVNGQFVAGE EAPVLPFDFA
DLYNAKGEKK VSARFLHLPE SQTLAILKGL LQTDGHLARG KEIFFTTASL TLAENVRYLL
LRLGIPSSGR QRDRRDEVHT ATRKDGSQDT LSGGLSYELR IPAYPKIAAL LGCEPVQKHM
ALRIGRWLFS RVTSIADITP VPFVCDLKVE GDESYMTTAF LAHNGGKRKG AVCAYLETWH
LDIEEFLELR KNTGDDRRRT HDMNTANWIP DLFMKRVMDN AEWTLFSPNE VPDLHELYGT
RFEERYAEYE SMAENGQLEQ FKKVPAVELW RKMLTMLFET GHPWITFKDP CNVRSPQRHA
GVVHSSNLCT EVTLNSNDEE TAVCNLGSVN LMQHLITNPV ISEHGDELET EDWPYDQLRE
SMTPVNALRH INKEALFETV GTAIRMLDNV IDINFYPTRK ARNANMRHRA IGLGVMGFAD
ALQALRIPMD SEAAVTFAGV SQEVISFAAI QTSADLAVAR GSYSSFQGSD WSRGILPINT
VQRLETERGM ELLTAQQPSA IDPDLWDAVR VRVQHGIRNS NIMAIAPTAT ISNIVGVSQG
IDPIYQNLYV KSNLSGEFTV VNTQMVADME KLDLWDDVMV NDLKYFDGSL AHIDRIPSWM
KRLYATAFEI DPLWLVRMTA ERQKWIDQSV SLNLYMAKPS GKALDNLYKQ AWMYGLKTTY
YLRTMGATSA EKSTVTEGTL NAVQGGTPAP AIKACLIDDP TCEACQ
//