ID A0A267FJG5_9PLAT Unreviewed; 1462 AA.
AC A0A267FJG5;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BOX15_Mlig010238g1 {ECO:0000313|EMBL:PAA73132.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA73132.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA73132.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA73132.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA73132.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA73132.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01001035; PAA73132.1; -; Genomic_DNA.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0006259; P:DNA metabolic process; IEA:UniProt.
DR CDD; cd15489; PHD_SF; 1.
DR CDD; cd09276; Rnase_HI_RT_non_LTR; 1.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR36688; ENDO/EXONUCLEASE/PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR36688:SF1; ENDO_EXONUCLEASE_PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF14529; Exo_endo_phos_2; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00249; PHD; 2.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 156..210
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 721..995
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1198..1328
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT REGION 132..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1462 AA; 161538 MW; 4303027C1EBF78C6 CRC64;
MHSLPCEVPE CSERCHCQKK CSGMHRAEIV RSPWRCSVHR PLELDSASAA HLRDLPAPEP
RMSQPTRCHS CSGPIRTGTT PLTCAAKGCA LACHRSSNCS HISRYRGAPS WLCPLHRQQA
VQLPGAHSIF RRPALTRRQA DPTTDRDLND PSDAQRRYCA KCSTAIRRGV AFVKCNSCSA
AYHKCCTGLN RNAADAAARS PWDCPVCTAR TQGAPAPSAA TSRVPLQDKV ARDTKFADSL
RLLQWNADGI ASKLPELAER LSDSDIDIAA VQETKLSNRT QLPKINGYIA VRKDRGDGSG
GGLLLLIRDK LAFRSSPQPT ATADMEIQTV DIQISTANWI TITNVYAPPV RTCVEDRQLP
DWNHLGTSSQ GIILGDFNSH SPAWDPIQPQ DERGSTLLDW SIDTQLEILN DGSPTRVNKA
TGGESSPDVS FASSKLASKC TWQVSDDLGS DHTPIVIRLD LKVRTLPKAQ SRLIWRKPWV
DWNAFQTVVE DKFKALHKEH LTFDARAKRF NDIVMQAANL CIGKVRTSHR PRPWLTPELK
AAIKLRNRLR RSVGENREDW LSACANVHKI ARATKEAAWR DTITDLEKEP NSNKAWKFIR
SLNGTPDSNC PNEALQVGNK VLTDPRDKAN AFAKQYASVS RYHISHSDRA TIREAKAILR
LPLPLPGSTH DGYAAITEQE LDSAIQHQRA NGAAGADEVT PRFIKALGPL ARAELLSLFN
WSFQHAAVAQ SWRTAIIIPL LKAGKSARDI AAFRPISLTS CIVKLLERIL VTRLHHLAER
HGWISPNQAG FRANYSCEDQ LLRVTQDISD GMNLKPSERT IIALLDFCKA FDKVWRERLI
TVMARKRVPL VFIKWTAAFL RHRIARVQLH GARSSPVIMR QGLPQGSVIA PFLFLIFIDT
VNDVVNPPAE ISMYADDIAL RARHRNKLEA QRAVQRALDN VARWSNEHKM VLNPDKSEAA
FFSTDTSEAA WSPAISIEGR NLRHNPTPRL LGLTLDRTLS FRSHLEHVCN RTTSRCRLLA
CLASKEWGWS KKLLLRIYRA MQLSIINFAA PAWHPWLSES AFQQLERAQN AALRIVTGQH
RTTPVEALRL EAGICSVRTG SNVICLTAFE RAQRLPDSHP CRIAATGNTA HRLRRNSWRN
HVTDLLPLIA SSVTPRAMIC TARPPPWDPF STDVRVDMAL PEPGPARTAT AIDTIRHLPG
LITIYCDGSA SGGTTNGGAS AIVTTGDPAH PIILATLQQR GAPSTSSFEE ELRAASLAIG
WINQAALTAA TNICSDSQSM LRALASGNPR IRSALPTASA EITWQWVPGH QNIPGNELAD
QAAKEATTLV SPASQISIHS AKNVIRRSVL DPPIRHERTR HAYRLLNRQL EEDSITNRAD
AVLLARLRSG HCTLFNAYRS IVDPAVDPAC HRCGHPCDDL EHWLDCPALA GTRLRLLGGP
TTDISSISAD PLGLIALARH CP
//