ID R7D9Z4_9ACTN Unreviewed; 745 AA.
AC R7D9Z4;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Integrase catalytic region {ECO:0000313|EMBL:CDD86668.1};
GN ORFNames=BN589_01203 {ECO:0000313|EMBL:CDD86668.1};
OS Collinsella sp. CAG:289.
OC Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC Coriobacteriaceae; Collinsella.
OX NCBI_TaxID=1262851 {ECO:0000313|EMBL:CDD86668.1, ECO:0000313|Proteomes:UP000017952};
RN [1] {ECO:0000313|EMBL:CDD86668.1, ECO:0000313|Proteomes:UP000017952}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:289 {ECO:0000313|Proteomes:UP000017952};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD86668.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBHV010000118; CDD86668.1; -; Genomic_DNA.
DR AlphaFoldDB; R7D9Z4; -.
DR Proteomes; UP000017952; Unassembled WGS sequence.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR025948; HTH-like_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR011646; KAP_P-loop.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR048020; Transpos_IS3.
DR InterPro; IPR010921; Trp_repressor/repl_initiator.
DR NCBIfam; NF033516; transpos_IS3; 1.
DR PANTHER; PTHR46889; TRANSPOSASE INSF FOR INSERTION SEQUENCE IS3B-RELATED; 1.
DR PANTHER; PTHR46889:SF1; TRANSPOSASE INSO FOR INSERTION SEQUENCE ELEMENT IS911B-RELATED; 1.
DR Pfam; PF13276; HTH_21; 1.
DR Pfam; PF07693; KAP_NTPase; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF13333; rve_2; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF48295; TrpR-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000017952}.
FT DOMAIN 578..742
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT COILED 402..429
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 745 AA; 83528 MW; CBD5A55A3DBBD280 CRC64;
MDNSNLSLAT DTPIKAREQD LIGRIPFAER LAGILKSAAG PESLVIGLYG PWGSGKTSVI
NLVENALSRK NDDGKAGVSV VRFEPWNYLT AEQLLAQFLK EVGGALDKDA HGRRTLFSKL
RGKRPEVLNA FAAYSEALII TAGAAASLAG APLAGVAVPA FGNRLASKLR KSADRAGSVS
AEKQRLEEEL LCLCQVGNEQ WGLRWESWTK SEQEGSRMYS AEQRKIAIET FVKFDHSYAD
TIAELGYPTR ACLRNWWNEY RDTGEVPISK FTTNPRYTAE MRRRAVEHYL EHGKSLTRTM
RALGYPKSRE VLGDWIDEIA PGQRKYRGPN PKRDPVPVEK KVQVVAELEA RSGPAAEIAE
RHGVSRTAPY IWRREIMGDN GGDPEKKGVP VSKEYDDLPD DIEVLQGMLR EAKMQLRKVQ
LELEVRQATL EIVKKDPGAE PGLLTNAEKA AMVTALRPRW KLCEVLPVVG MAKSSYEYAR
SAQARGETEE HAAARKAVIE AFEASGGTYG YRRIYARVNA DVEDGEAIGE WTVRSIMEEE
NLVARAAKKK RRYSSYEGEI SEAPPNLLRD DRGKHHFRAN KPNELWITDV TEFRIPAGKV
YLSPIVDCFD GMPLSWSIST SPDAEMANSS LLGACEWLGE GDHPKIHSDR GCHYRWPEWI
RICDENGLVR SMSRKGCSPD NARCEGYFGR LKIEFFHGCD WAGVTIEEFM GMLDAYLRWY
RDVRIKGDLD YRSPMQYRRD LGLAA
//