ID A0A182YS55_ANOST Unreviewed; 774 AA.
AC A0A182YS55;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
OS Anopheles stephensi (Indo-Pakistan malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=30069 {ECO:0000313|EnsemblMetazoa:ASTEI11291-PA, ECO:0000313|Proteomes:UP000076408};
RN [1] {ECO:0000313|Proteomes:UP000076408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Indian {ECO:0000313|Proteomes:UP000076408};
RX PubMed=25244985; DOI=10.1186/preaccept-1262842421127991;
RA Jiang X., Peery A., Hall A.B., Sharma A., Chen X.G., Waterhouse R.M.,
RA Komissarov A., Riehle M.M., Shouche Y., Sharakhova M.V., Lawson D.,
RA Pakpour N., Arensburger P., Davidson V.L., Eiglmeier K., Emrich S.,
RA George P., Kennedy R.C., Mane S.P., Maslen G., Oringanje C., Qi Y.,
RA Settlage R., Tojo M., Tubio J.M., Unger M.F., Wang B., Vernick K.D.,
RA Ribeiro J.M., James A.A., Michel K., Riehle M.A., Luckhart S.,
RA Sharakhov I.V., Tu Z.;
RT "Genome analysis of a major urban malaria vector mosquito, Anopheles
RT stephensi.";
RL Genome Biol. 15:459-459(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASTEI11291-PA}
RP IDENTIFICATION.
RC STRAIN=Indian {ECO:0000313|EnsemblMetazoa:ASTEI11291-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182YS55; -.
DR STRING; 30069.A0A182YS55; -.
DR EnsemblMetazoa; ASTEI11291-RA; ASTEI11291-PA; ASTEI11291.
DR VEuPathDB; VectorBase:ASTE008233; -.
DR VEuPathDB; VectorBase:ASTE011800; -.
DR VEuPathDB; VectorBase:ASTEI11291; -.
DR VEuPathDB; VectorBase:ASTEI20_031704; -.
DR VEuPathDB; VectorBase:ASTEI20_036390; -.
DR OMA; GCLMWGI; -.
DR Proteomes; UP000076408; Unassembled WGS sequence.
DR GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0071897; P:DNA biosynthetic process; IEA:UniProt.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR37984:SF14; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000076408};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 438..594
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 774 AA; 87238 MW; 9D6DD1E635F540A9 CRC64;
MELDTGAPCS IIAEETLRLI KPLYTLQPSD RQFSSYTGHR VTCVGRLKVQ VTIGMRTRSE
QLYVVSGVSD SLLGREWISH FADQINLNEL FSAQTPIHSI ESTLTPDREK QLARILDSFS
SVFSDSPGKL VGPPAKVHLK QNASPVFAKA RDVPHALRER YAEEIEKKIR SGFYEKVDYS
EWASPTHVVV KKNGKLRITG NYKPTVNPLM IVDEHPILRI EDIFNRMKGA TLFCHLDVTD
AYTHLPIDEQ FRQVLTLNTT THGLIRPTRA VYGAANIPAI WQRRMEEVLR GLTNVVSFFD
DIIVFAKDFD ELLQIQQLPI KAEQIARETR KDQHLGKILK DLEMGRDLQQ TGYKAPEAKY
TTVANCLLFE HRVVIPDIFR SAILQDLHVA HIGVVKMKSL ATSYVYWPGI DKDIEQLAKS
CHECAQTASA SPKFNKHHWE YPSNAWERVH IDYAGPVADA MLLIIVDAYS KWVEVKVTHS
TTTEATIKIL DDIFASYGAP VTVVTDNGPQ FTSEDFSNFL QRSGVKFHKR SAPYHPATNG
QAERYVQTVK RALKAMHSTS ATLQANLNKF LLQYRKVPHS ETGEPPAKLF LGRNIRSRID
LVCPQSVQSR TAEKQRCDLR PSYRTFVPGQ LVYCLSGNPR KGKWIRGTVV TRLGDLHYTV
ECDGIHIKRH VDQIRPALDD NKPVNQRSVS GSPQPPEVHH RRHYYGSTQP QQVPDVAPRT
VPVSSESSDS ACSSSSSTTS ESPYGTPIGS PTPIRDTPFS VRRSTRQRTF TLLT
//