ID A0A0V0RGX3_9BILA Unreviewed; 1216 AA.
AC A0A0V0RGX3;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KRX13745.1};
GN Name=TY3B-I {ECO:0000313|EMBL:KRX13745.1};
GN ORFNames=T07_8503 {ECO:0000313|EMBL:KRX13745.1};
OS Trichinella nelsoni.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX13745.1, ECO:0000313|Proteomes:UP000054630};
RN [1] {ECO:0000313|EMBL:KRX13745.1, ECO:0000313|Proteomes:UP000054630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS37 {ECO:0000313|EMBL:KRX13745.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX13745.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDL01000185; KRX13745.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V0RGX3; -.
DR STRING; 6336.A0A0V0RGX3; -.
DR Proteomes; UP000054630; Unassembled WGS sequence.
DR GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR PANTHER; PTHR37984:SF13; RIBONUCLEASE H; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000054630}.
FT DOMAIN 454..632
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 984..1134
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 201..222
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..219
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1216 AA; 136555 MW; 713B458AEEAD72F5 CRC64;
MNTTTDMKTI THFEEFDTSN PAGWEEYSER LVFFLEANSI CEGPRRLAVL CSVCGPKTYS
IIKSLASPDP PSSKTFDEVM KLLRNHFMPR PSEVYQRFLY HRRLQQPGEG VAAYVAELRH
LAQHCNFGET LESRLRDQLV CGLRDGNLQK QLLADGELTF AKALERALSA EAATTQVSDI
RAANPTVTTE VQLVAHKKSD CNSSMRNVRR QQESPQPQSH KPCYRCGGAP HRTLVVSRIA
KSQSTVKKEA AKPNRYATNS IAVEEEEYRI NHLTAPNEVI ATLPFPDVRV SLNNVVIPMQ
VDSGASLTII SEHTFKRVCL PHQRHLEPFH SVLRDFQGRE VDVLGVSSLP VKFSSFTGSL
PVVVVKGPRR SLLGRNWFKP LGIRLVGVHS VAPTSVQDLI DEYAELFSDT LGTVKGPPVV
LHTDESIPPI QMNARRVPFA LKDRISEELD RLVEQGILEP VQHTTWTTPI VPVIKNDGSI
RICGDYKCTV NKALRKDLYQ IPAVNDILAT LKKGRIFAKL DLAQAYQQLE VDEASAELQT
IITHKGAFKA KRLQFGIASA PGIFQRFMDS LLSNLEGVVP YFDDVLIVAE SQHELLEVLR
RVFDRLRDAG IRLNREKCVF VSNSVEFLGY RIDAEGIHPS EKKVEAIHKA PRPKNKQELQ
AFMGLLNFYH NFLANKAEAA EPLHRLLDKG ALWKWTHRHE KAFQKVKALI TSNAVLAQYD
DQLPLILTCD ASPHGVGCVL AHRLPCGREA PIAFHSRTLA AAERKYAQID REALAIIVGV
KKFHNYVFGR HVEIRTDHKP LLGLLGNSIQ TPASMSPRMT RWSILLSAYD YSLVYRPGLK
LGNADALSRL PQPGNKISVP DPLEVLLLEA MPTLPISAEH LADRTGKDAV LAPVRNWLEK
GWPAELRSEE FKPFHCRRDE LSLHKGCVLW GCRVVIPLAS RESILTMLHS GHPGIARMKG
LAPCAECQET RHEPAKNVMD AWPEATEPWT RIHADFFGPI GGKIFLLVVD AFSKWLEVRI
VPSTSSVAAI EVFRELFATH GLPDCLVTDN GTAFKSAEFL AFMQSNGIKH VTTAPFHPSS
NGLAERAVQS TKEALKRITS GSWSARLARL LLSQHSTPDP RSNLSPAELL MKLKLKTYLD
RILPNNAEIA KRKVTPPPHR EFKVDDHVFV RSYNQQKKWE KAKIVKRIGR LLYIVRTEIG
LIWKRHVDQI RPREIK
//