ID C4K9E0_THASP Unreviewed; 440 AA.
AC C4K9E0; A0A5C7SM39;
DT 07-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 07-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=IS1182 family transposase {ECO:0000313|EMBL:TXH84700.1};
DE SubName: Full=Transposase IS4 family protein {ECO:0000313|EMBL:ACR02651.1};
GN OrderedLocusNames=Tmz1t_4074 {ECO:0000313|EMBL:ACR02651.1};
GN ORFNames=E6Q80_10835 {ECO:0000313|EMBL:TXH84700.1};
OS Thauera aminoaromatica.
OC Bacteria; Pseudomonadota; Betaproteobacteria; Rhodocyclales; Zoogloeaceae;
OC Thauera.
OX NCBI_TaxID=164330 {ECO:0000313|EMBL:ACR02651.1, ECO:0000313|Proteomes:UP000002186};
RN [1] {ECO:0000313|Proteomes:UP000002186}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MZ1T {ECO:0000313|Proteomes:UP000002186};
RG US DOE Joint Genome Institute;
RA Lucas S., Copeland A., Lapidus A., Glavina del Rio T., Dalin E., Tice H.,
RA Bruce D., Goodwin L., Pitluck S., Sims D., Brettin T., Detter J.C., Han C.,
RA Larimer F., Land M., Hauser L., Kyrpides N., Mikhailova N., Sayler G.S.;
RT "Complete sequence of chromosome of Thauera sp. MZ1T.";
RL Submitted (MAY-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ACR02651.1, ECO:0000313|Proteomes:UP000002186}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MZ1T {ECO:0000313|EMBL:ACR02651.1,
RC ECO:0000313|Proteomes:UP000002186};
RX PubMed=23407619; DOI=10.4056/sigs.2696029;
RA Jiang K., Sanseverino J., Chauhan A., Lucas S., Copeland A., Lapidus A.,
RA Del Rio T.G., Dalin E., Tice H., Bruce D., Goodwin L., Pitluck S., Sims D.,
RA Brettin T., Detter J.C., Han C., Chang Y.J., Larimer F., Land M.,
RA Hauser L., Kyrpides N.C., Mikhailova N., Moser S., Jegier P., Close D.,
RA Debruyn J.M., Wang Y., Layton A.C., Allen M.S., Sayler G.S.;
RT "Complete genome sequence of Thauera aminoaromatica strain MZ1T.";
RL Stand. Genomic Sci. 6:325-335(2012).
RN [3] {ECO:0000313|EMBL:TXH84700.1, ECO:0000313|Proteomes:UP000321192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bin_27_1 {ECO:0000313|EMBL:TXH84700.1};
RA Stamps B.W., Spear J.R.;
RT "Metagenome Assembled Genomes from an Advanced Water Purification
RT Facility.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP001281; ACR02651.1; -; Genomic_DNA.
DR EMBL; SSFD01000170; TXH84700.1; -; Genomic_DNA.
DR RefSeq; WP_004301215.1; NZ_SSFD01000170.1.
DR AlphaFoldDB; C4K9E0; -.
DR STRING; 85643.Tmz1t_4074; -.
DR KEGG; tmz:Tmz1t_4074; -.
DR eggNOG; COG3666; Bacteria.
DR HOGENOM; CLU_021293_12_2_4; -.
DR OrthoDB; 111180at2; -.
DR Proteomes; UP000002186; Chromosome.
DR Proteomes; UP000321192; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0004803; F:transposase activity; IEA:InterPro.
DR GO; GO:0006313; P:DNA transposition; IEA:InterPro.
DR InterPro; IPR047629; IS1182_transpos.
DR InterPro; IPR002559; Transposase_11.
DR InterPro; IPR008490; Transposase_InsH_N.
DR NCBIfam; NF033551; transpos_IS1182; 1.
DR PANTHER; PTHR33408:SF2; TNP_DDE_DOM DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR33408; TRANSPOSASE; 1.
DR Pfam; PF01609; DDE_Tnp_1; 1.
DR Pfam; PF05598; DUF772; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002186}.
FT DOMAIN 56..123
FT /note="Transposase InsH N-terminal"
FT /evidence="ECO:0000259|Pfam:PF05598"
FT DOMAIN 274..431
FT /note="Transposase IS4-like"
FT /evidence="ECO:0000259|Pfam:PF01609"
FT REGION 214..268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 214..267
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 440 AA; 49357 MW; 0C09F39FD76D8AF5 CRC64;
MTSYLPYCPQ QQMLLPQALQ EWLPEGHLAY FISDAVDGLD LSAFHARYAG GGPRNQPFHP
AMMVKVLLYA YATGVFSSRK IARKLHEDVA FRVLAADNFP AHRTLSDFRA VHLKELSELF
VQVVRLAREM GLVKLGTVAI DGTKVKANAS RHKAMSYGHM VKAEAELKRQ IEALLNRAKA
ADDAERNEPE WDVPAEIARR EARLTAIAEA RARLEQRQRE ADQARGRSDD DERRPRGGDG
KPKGGRYKRD FGVPEDKAQE NFTDPDSRIM KRAGGGFDPS YNAQTAVDET AHIIVAAELT
NNASDAGQLA GVLQAVRDNV EHRPRQALAD TGYRSEQTFR ELDGCGTELV VALGREGKRR
LGFDRERNPH TAQMADKLES EAGKSAYRKR KWIAEPPNGW IKNVLGFRQF SLRGLERVKA
EWKLVCMALN LRRMSTLRTA
//