ID G0M9S5_CAEBE Unreviewed; 1904 AA.
AC G0M9S5;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
GN ORFNames=CAEBREN_16458 {ECO:0000313|EMBL:EGT30954.1};
OS Caenorhabditis brenneri (Nematode worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068};
RN [1] {ECO:0000313|Proteomes:UP000008068}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068};
RG Caenorhabditis brenneri Sequencing and Analysis Consortium;
RA Wilson R.K.;
RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL379787; EGT30954.1; -; Genomic_DNA.
DR STRING; 135651.G0M9S5; -.
DR EnsemblMetazoa; CBN16458.1; CBN16458.1; WBGene00155183.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_000526_5_1_1; -.
DR InParanoid; G0M9S5; -.
DR Proteomes; UP000008068; Unassembled WGS sequence.
DR GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR040676; DUF5641.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR008042; Retrotrans_Pao.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR22955:SF74; PROTEIN CBG26950; 1.
DR PANTHER; PTHR22955; RETROTRANSPOSON; 1.
DR Pfam; PF18701; DUF5641; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF05380; Peptidase_A17; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000008068};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 1442..1630
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 175..224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 237..295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1748..1904
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1765..1803
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1815..1851
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1869..1890
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1904 AA; 213877 MW; CE9341D8B4A120AA CRC64;
MPPTDQVILK PGTPCLPRIN VILKPGPHVS NGSSNPKTGT QCLPRINKLG SKLPTIAIMN
KSPIQLFKRK ATLAANKAET SITAAVTLLA MADEDIGANN LSNTINSLED CKKKMDECET
DIQTIIDEDP SATQEEKDIT EKSLRDHLVS AKFPQLLEGI AETTDSLLER LDSFDSDGST
WRGNTTAQSD RPPAGSNQQS APKTSQATQN SAQDVSTQQR MDRMEETLSR VLSLLENSGR
NDSGHANSSS SGSNSGSMPH SHGASPTSTG TGNASRTPHF PTNPQPFGSN PGFNPNFNAG
FNNGCNTGFK NGSNPISDDE GDKDRSELCS TTSFAEEDDS IKLLPSTLVC NTFTTAEQFT
QLLDAFDVDV DPQLQEPMRM KVYKVSDTQA QLPFIQLHTP TGGTLLALVD TGAQTSIIST
QAAEKLQLQI VGRRKMLYSG FISQTPERWC TFYRLELLDL SGNSWTTCLP SYDQMSITFS
APEHTLEDQS FIKRNGLDCE GVTNLQDFDG QKIDMILGNN VLNKVKDIQK ANTFYLPSRR
AIEHLLVGFV HHPPILDDSF VPIDRNKPLS ISDDTKEIWI NTISIEDVTT APADEDSSNN
VSTRKLDRLL ERLWTLDVLG LMPPTVKDSK DALNADLISE FKKSAIMDKD NKIYVTWPFN
GRQDELKNNF PVAKIRLQSL LERQLAKIED RKEYHDIITK QVEEGIVEEI ALSSKSTGPE
YYIPHRVVVK EDSLTTKLRI VLDASSHMKN ELSLNDCLHP GPSILQPILG IMIRSRLSKF
LLMSDIQRAF HQVRIQEQYR DVTKFLWIAD PDKGFTEDNL RAFRFTRLPF GVSSSPFLLA
VTILRYLEIN ENSLNERIKE NLYVDNVILT SNDEEDIKEC YKQSKAIFNL MHMNLREYLS
NSPSVMGEVA ERDRHPDHVC KLLGHRWNST SDTIIVKIAK PPKGVPTKKQ LVAFSARNYD
PSGIITPIIV PVKQLISSMW SRDIKWKEKI PEDLVPAWNA IKEQFTDTTY SIPRQLTTNY
DFSSAQLIVF CDASKAHYAT AAYIRYGYKD DTYTSGLIFS KSRIRPSNGG SEYTIPRMEL
MALEIGSNAA VNLAKELHMD LKDVVLFSDS TCCLFWVLSK VNNNYGSKWV SNRVQKIHKN
ILELQLLKLE PTVRYVPSEQ NPADIASRGC SLKELRDNKL WHYGPEFLRQ PETSWTKKLD
NTPADPYEFR KQAAETGLVP ELSTYALDVQ SVVNKVSNSV PQHAIPYERT YSMHKLTIWM
TRALQWICRP IQRRNKRHPD KPIQFKDNLL ALFWSAFEAK DPLGEADLVR KLIIKCHYKD
AEERFNEFPP QRLFPILHED GSWRYKTRFS DSSDSRLKED MRFPIIIISN HPLAKLLVYE
SHEKLQHQGI QDVISDIHQR YWIEHLGRIV RAVRAHCFLC QRKHGKTFKY NFNRILPASR
TTFVGPFQFV GLDYIGPLQY KRSDGQGKLW ILLVSCIFTR AVHLEVVPDN TTVSFINGLR
RFISRRGAPQ FILSDNAPLF KLAYSIINED LKTIVNENEE LTSYLAQKHI KIKLITPLSP
WQGGAYERLV GLVKNVIQKV LSKEIRSFLE METLVIDTEG IINSRPVTPN KRAEEDAPAI
RPCDFLNPGV QLALPEKVDS VFGVIKPGET EKLTRSLLEG LGKAKEDLWD QFALSYFQTL
RELKEDGANH SAQTPKPGMI VLVESAKTKS RHHWPLGRII SVSRSMDGAP RSVLVKCGKH
ILEKPVNQLV PLEDPGDSED ETKISVPQLP PTTSYPRITL PQQDQESLQT PPAQSTSQPD
TPVQPKKKRG RPPKSKTTSS APQPTPSASS ISTTPKESQE ASAQAGDRVQ ATTKPASAPR
HRVFLPRSAK ASTQAQVSTD LANDDGTSTF LQKVDPPPPG VSRP
//