GenomeNet

Database: UniProt
Entry: Q4BZZ1_CROWT
LinkDB: Q4BZZ1_CROWT
Original site: Q4BZZ1_CROWT 
ID   Q4BZZ1_CROWT            Unreviewed;       360 AA.
AC   Q4BZZ1;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   27-MAR-2024, entry version 42.
DE   SubName: Full=Transposase, IS605 OrfB {ECO:0000313|EMBL:EAM49465.1};
GN   ORFNames=CwatDRAFT_2331 {ECO:0000313|EMBL:EAM49465.1};
OS   Crocosphaera watsonii WH 8501.
OC   Bacteria; Cyanobacteriota; Cyanophyceae; Oscillatoriophycideae;
OC   Chroococcales; Aphanothecaceae; Crocosphaera.
OX   NCBI_TaxID=165597 {ECO:0000313|EMBL:EAM49465.1, ECO:0000313|Proteomes:UP000003922};
RN   [1] {ECO:0000313|EMBL:EAM49465.1, ECO:0000313|Proteomes:UP000003922}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49465.1,
RC   ECO:0000313|Proteomes:UP000003922};
RG   DOE Joint Genome Institute;
RL   Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:EAM49465.1, ECO:0000313|Proteomes:UP000003922}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49465.1,
RC   ECO:0000313|Proteomes:UP000003922};
RG   US DOE Joint Genome Institute (JGI-ORNL);
RA   Larimer F., Land M.;
RT   "Annotation of the draft genome assembly of Crocosphaera watsonii WH
RT   8501.";
RL   Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAM49465.1, ECO:0000313|Proteomes:UP000003922}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49465.1,
RC   ECO:0000313|Proteomes:UP000003922};
RG   US DOE Joint Genome Institute (JGI-PGF);
RA   Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T.,
RA   Hammon N., Israni S., Pitluck S., Richardson P.;
RT   "Sequencing of the draft genome and assembly of Crocosphaera watsonii WH
RT   8501.";
RL   Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: In the C-terminal section; belongs to the transposase 35
CC       family. {ECO:0000256|ARBA:ARBA00008761}.
CC   -!- SIMILARITY: In the N-terminal section; belongs to the transposase 2
CC       family. {ECO:0000256|ARBA:ARBA00011044}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAM49465.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AADV02000068; EAM49465.1; -; Genomic_DNA.
DR   AlphaFoldDB; Q4BZZ1; -.
DR   KEGG; cwa:CwatDRAFT_2331; -.
DR   Proteomes; UP000003922; Unassembled WGS sequence.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR   GO; GO:0032196; P:transposition; IEA:UniProtKB-KW.
DR   InterPro; IPR010095; Cas12f1-like_TNB.
DR   InterPro; IPR001959; Transposase.
DR   NCBIfam; NF040570; guided_TnpB; 1.
DR   PANTHER; PTHR30405:SF20; RNA-GUIDED DNA ENDONUCLEASE INSQ-RELATED; 1.
DR   PANTHER; PTHR30405; TRANSPOSASE; 1.
DR   Pfam; PF01385; OrfB_IS605; 1.
DR   Pfam; PF07282; OrfB_Zn_ribbon; 1.
PE   3: Inferred from homology;
KW   DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Reference proteome {ECO:0000313|Proteomes:UP000003922};
KW   Transposition {ECO:0000256|ARBA:ARBA00022578}.
FT   DOMAIN          133..246
FT                   /note="Probable transposase IS891/IS1136/IS1341"
FT                   /evidence="ECO:0000259|Pfam:PF01385"
FT   DOMAIN          260..325
FT                   /note="Cas12f1-like TNB"
FT                   /evidence="ECO:0000259|Pfam:PF07282"
SQ   SEQUENCE   360 AA;  42088 MW;  D5310CD2C1CAE7C5 CRC64;
     MGQKDIYKYT TQLRNEYPFV KTLNSTACQQ ACERTWTAIL KFYNNCKNNI RGLKGYPKYS
     KRTHSVEFKK SGWKLNRDTK RITFTDGKNI GELKLIGSRD LFYFQEWQIQ RVRIIRRADG
     YFVQLILKLD VREITPQLEP SKKCVGLDMG LKYLYADSDN NVVEPPKYYR KAEKRLNKLN
     RRKSRKFKRG QKQSNNYINA RRKYAKGHLK VSRQREEFAK RLALRLIQSN DLIAYEDLKV
     KNLVRNKKLA KSINDAGRSQ LRKWIEYFGV KYDRLTIAVN PTYTSQECFN CGQLIKKSLS
     VRTHVCSCGY TEDRDTMAAL NILKKATQGH WGSWSEDLNA WGDSTSILVG SNTCQGKFSQ
//
DBGET integrated database retrieval system