ID E4XS48_OIKDI Unreviewed; 651 AA.
AC E4XS48;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=ribonuclease H {ECO:0000256|ARBA:ARBA00012180};
DE EC=3.1.26.4 {ECO:0000256|ARBA:ARBA00012180};
GN ORFNames=GSOID_T00001997001 {ECO:0000313|EMBL:CBY12596.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY12596.1};
RN [1] {ECO:0000313|EMBL:CBY12596.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC -!- SIMILARITY: Belongs to the beta type-B retroviral polymerase family.
CC HERV class-II K(HML-2) pol subfamily. {ECO:0000256|ARBA:ARBA00010879}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653130; CBY12596.1; -; Genomic_DNA.
DR AlphaFoldDB; E4XS48; -.
DR InParanoid; E4XS48; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT DOMAIN 1..59
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 502..651
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 651 AA; 73966 MW; F5AD87A4EA5FC25D CRC64;
MRRNVQSYVD DLIQFGGSFT EYRESLRQLL KAVIKFGVKL KASKCQFLQR EAHFLGRVIT
KAGVQTDPAY TRSLLSMPPP TNHRELRSLV GSLTWLKEFA EARMGEEISS HLFAHVMRPI
TALLVTCKRG VIPPPFQWTP EADSAFTQLK TRLANPPVIS FPDFRHTFIL HTDASDLACG
GILTQIINGK TKLVAAVSHT FTRAEANWSV SEKECFGILW SVEKLSRLLK GTKFIIHTDH
YSLTYMDKTA FRNSKIARWQ SRLAEYDFVL QYIKGSKNNF ADWISRPFGT DNLKSRDTGP
VENAGRFLNI GNSDLVVYIP SWCTEQTNLP ITARKLIASV NVAKIIRPSP DPEMEGEMAQ
FAIHQQDDPF LAKITRAVRK ARATSSKVDL ESIIDKNDHR RVELLKIANR LSICRTSNCL
VINDRRGPRA VVPEALRAAF VRRAHDLQAH CGLPRMKENL KMLWWIDMDK DCENYVRSCV
SCLKTKGAHG RPQAPPSGQV QKGRFPGDIL NIDYVMMKEP SNGYRYMLTC ICSFSRYLWA
IPVRRDNALS AAQGLTSICL QYDFWPRLIH SDRGLHFVNS TIDEFCKSNE ILHSLTCAWR
PEANGVVERA HRTLKNGLYS TCHSENMTWT KALPYVTRAM NASICKQCSK F
//