ID T1IH58_STRMM Unreviewed; 808 AA.
AC T1IH58;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Strigamia maritima (European centipede) (Geophilus maritimus).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Myriapoda; Chilopoda;
OC Pleurostigmophora; Geophilomorpha; Linotaeniidae; Strigamia.
OX NCBI_TaxID=126957 {ECO:0000313|EnsemblMetazoa:SMAR000164-PA, ECO:0000313|Proteomes:UP000014500};
RN [1] {ECO:0000313|Proteomes:UP000014500}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Brora {ECO:0000313|Proteomes:UP000014500};
RA Richards S.R., Qu J., Jiang H., Jhangiani S.N., Agravi P., Goodspeed R.,
RA Gross S., Mandapat C., Jackson L., Mathew T., Pu L., Thornton R., Saada N.,
RA Wilczek-Boney K.B., Lee S., Kovar C., Wu Y., Scherer S.E., Worley K.C.,
RA Muzny D.M., Gibbs R.;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:SMAR000164-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (FEB-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH429716; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; T1IH58; -.
DR STRING; 126957.T1IH58; -.
DR EnsemblMetazoa; SMAR000164-RA; SMAR000164-PA; SMAR000164.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_001650_11_6_1; -.
DR OrthoDB; 3095763at2759; -.
DR PhylomeDB; T1IH58; -.
DR Proteomes; UP000014500; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00122; MBD; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR42648:SF11; TRANSPOSON TY4-P GAG-POL POLYPROTEIN; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000014500}.
FT DOMAIN 417..582
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 699..770
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 158..194
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 694..718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..185
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 808 AA; 92180 MW; 95406D4CCC69E642 CRC64;
MSKDESKSAT AGIEKLDKIN YEKWKMNIQF LMEEYQLWGF HDGTETQPPV GSCVADRLKL
PLLKLPTCKE VWDHPAKLFE PTSIARNAKL YENFYLIQRN ENEELETFIN RIQKAADDLA
AIKVTIDNGI KAYMLLNQAE NVRRRFAEES QSKLDEVAAM YSGSKSRKNQ KTPEENVPAS
GDTSETPTKE SDDKSWLTCY NEELDTMLGT VPARVFKGVV HLVEASHVAD DRDEAVASCL
WPIQGCTRIE NWYIDSGASD HICGQRSAFS TFEEVEPVEL ELGKGTSTIT GRGQVVIRNE
IDGKFSNLTQ DNRANYHTHI YHGMMKVYFK NTRQCLMHAK REDNNLYCVL GEVTTENQVV
WENPADSKPT GRRVENYTVS IDMWHKRFCH MYARGLNELA KNTQVKGLDL DSTVKDFSCE
PCNLAKSTRV NYPKDNEKVS LGKASYLLCI VDDATRYAWV FPIPSKDAVF GVFKTFHARA
ERLSGFKLKA ICTDRGGEFT AGEFEKYLRH HGIEIQRTNA YSPQMNGTAE RINRTILDGV
RAMLADTDLP QEFSVTLTCL KNRYPYARLK QEIPYVNWRK RHLSLRHLRR PSCVAYVNIP
EQKQDGKLNV RAWKETNEVV ETKQVKFRED QEWKDWSKAS DESVSNDKFY DAKDLLDDLS
LTPVKTPGPA AQRRDALPPV QPAAIHSPLQ KLQKLKPPRP TRVAVRKPGR DGFDREEVTR
TTGGSVGRVD VYYYGPNGKR LRSRKEVKDY CTANGLTYEI LDFDFSSNQP VTSTLLDDPV
DIVNLGQVKK LLGVEFERKG GNKFIHQR
//