ID A0A3M0KGG4_HIRRU Unreviewed; 864 AA.
AC A0A3M0KGG4;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=ribonuclease H {ECO:0000256|ARBA:ARBA00012180};
DE EC=3.1.26.4 {ECO:0000256|ARBA:ARBA00012180};
GN ORFNames=DUI87_09841 {ECO:0000313|EMBL:RMC12328.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMC12328.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMC12328.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMC12328.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMC12328.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMC12328.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000106; RMC12328.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0KGG4; -.
DR STRING; 333673.A0A3M0KGG4; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 1.10.375.10; Human Immunodeficiency Virus Type 1 Capsid Protein; 1.
DR Gene3D; 2.30.30.10; Integrase, C-terminal domain superfamily, retroviral; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR010661; RVT_thumb.
DR PANTHER; PTHR41694; ENDOGENOUS RETROVIRUS GROUP K MEMBER POL PROTEIN; 1.
DR PANTHER; PTHR41694:SF3; RNA-DIRECTED DNA POLYMERASE-RELATED; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF00552; IN_DBD_C; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF06817; RVT_thumb; 1.
DR SUPFAM; SSF50122; DNA-binding domain of retroviral integrase; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF47943; Retrovirus capsid protein, N-terminal core domain; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 3: Inferred from homology;
KW DNA integration {ECO:0000256|ARBA:ARBA00023195};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Viral genome integration {ECO:0000256|ARBA:ARBA00023195};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296}.
FT DOMAIN 1..350
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 570..709
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 687..769
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 779..826
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS51027"
FT DNA_BIND 779..826
FT /note="Integrase-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00506"
FT REGION 50..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 759..780
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 818..864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 760..774
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 834..864
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 864 AA; 95843 MW; C484811A09B0CE58 CRC64;
MRPVFDKSKS SRAVGSSSTA CWTLPPCQVT VRPRDRGQFW GEVKAKAEGL NDGADAGAGH
AVSGVQPVPS RVPASDPVGP TGQDKTGAAA SSEGLQAFPD LQGATHNTYQ PLAWQALAEL
RDAVRKYGLG SAEVMQVLRY FNASLLTPFD IRSLARALFP LVEYDFFEYK WTQLAGRAVE
WNRTLGPGDP RRMVNIDMLM GTGNYTRAEG QAGYEPLVQE QCQQTGMAAL VQTLQLATPQ
QPFATIVQGV DEPFLCFAGR LNAAVEKQDM RNSPVLCQWY VARALSGVHK QFPDAHVYHY
MDDILVAAPT QDELLRIQPQ LLNALHSHGL QVAPEKVQQQ PPWKYLGVKI LEQTIRHQEV
QFVQSMKTLN DAQKLVGVIT WLRPYLGLTT AQLSPLFELL KGDTDLKSPC ELTPEACKAL
EEVQQAVSAS QVYRIEPSID VTVFITTPDL HPTGIIGQWN DDWTDPLHIL ERVFLPHQPH
KTVTALFELM ARLIIKCRQR CLQLMDADPS KIILLVQREE FDWSYANNVS LQSALEGFSG
QITYHLPSHK LLQVGKNTQF SLRPKSSQEP VQGPTVFTDG SGKTGKAIVT WQDGSEWQVL
ESHEDGSAQL AELRAAVMAF EKFSQEPFNL ITDSAYVADI AQRLSCSVLK EVSNPALFNL
LKALWCAIQA RFHPYMFCMY GVTLTCQGVN PRGLKALELW QTDVTQVAEF GRLKYVHVTV
DTFSSAMWAS AHTGEKARDV IAHWRQAFAV LGIPSAVKTD NGPANASQKA SDGEQQPRAK
VRVRNLVTKQ WEGPYDLITM GQGYTCVSTD TGTRWLPSKC VRPDLQPQRQ NSADRQGESR
DQLESHQVDE SSSDHSDDSS TDSD
//