ID A0A0C2NDW5_THEKT Unreviewed; 1259 AA.
AC A0A0C2NDW5;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Transposon Ty3-G Gag-Pol polyprotein {ECO:0000313|EMBL:KII72172.1};
GN ORFNames=RF11_10266 {ECO:0000313|EMBL:KII72172.1};
OS Thelohanellus kitauei (Myxosporean).
OC Eukaryota; Metazoa; Cnidaria; Myxozoa; Myxosporea; Bivalvulida;
OC Platysporina; Myxobolidae; Thelohanellus.
OX NCBI_TaxID=669202 {ECO:0000313|EMBL:KII72172.1, ECO:0000313|Proteomes:UP000031668};
RN [1] {ECO:0000313|EMBL:KII72172.1, ECO:0000313|Proteomes:UP000031668}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wuqing {ECO:0000313|EMBL:KII72172.1};
RX PubMed=25381665; DOI=10.1093/gbe/evu247;
RA Yang Y., Xiong J., Zhou Z., Huo F., Miao W., Ran C., Liu Y., Zhang J.,
RA Feng J., Wang M., Wang M., Wang L., Yao B.;
RT "The genome of the myxosporean Thelohanellus kitauei shows adaptations to
RT nutrient acquisition within its fish host.";
RL Genome Biol. Evol. 6:3182-3198(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KII72172.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JWZT01001363; KII72172.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0C2NDW5; -.
DR EnsemblMetazoa; KII72172; KII72172; RF11_10266.
DR OMA; LANCTIF; -.
DR Proteomes; UP000031668; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000031668};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 255..327
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT DOMAIN 431..608
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 953..1136
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 175..194
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1259 AA; 141353 MW; 73710AAA817F2F92 CRC64;
MEEISAISHA NAVAIKLPTF WTAQPRVWFV QTEAQFHLRG IVSDTTKYYY VVGALDQETA
GRMIDTLSKP PLEGKYENLK SKLLSVFGLT RRDRACRLLD MTGLGDRKPS ALLSEMSSLA
NGHTSCMLFE EIFLRQMPEA IRMQLAGQDF TNLDLVSERA DELWQSMNSR RCESEINKVG
QSKSTSNRPV SVSHTDSENK EGWCFYHTRF GTKSYKCREP CSFANSTSKS AKIATVLARI
GTQPLLFVWD RISGRRFLVD TGAEVSVIPA THRDRQAGNH GPSLVAANDT PMRTYGRITI
PLNFNSRCFH WSFTTADVPQ PLLGADFLRS NNLLVDLKRK KLVDAESYLS ISCGQTTGHT
SKLASVAKSD DRFNNLLSEF PNLSVPTFSQ TTTKHGVEHY ISTKGPPVHG RPRRLSPEKL
AFAKLEFQKM LEMGIIRRSN SPWASPLHMV PKHSENWRPV GDYRRLNEAT IPDRYPIPNI
QDFSSSLANC TIFSKIDLVR GYHQIPVNKD DVAKTAVITP FGLFEFLRMP FGLRNAAQTF
QRMMDTVGQG LEFIFIYLDD ILVFSKSPAE HETHLRLLFE RLQQHGLVIN TDKCSFGQSS
INFLGHFISS EGILPLRDKV EAIQQFPRPC TIKGLQEFNG MVNVYRGFIP GFANTMLPLY
AALSKGSKHL VWSPEMLEAF NKTKNALADA TMLNYPQTDA PTALTTDASE TAVGAVLEQL
VDGVWQPLSF FSKKLRPSET RYSAFDRELL ALYLSVRHFR YYLEGRPFTA FTDHKPLTFA
FKKASDPWSA RQQRHLAAIS EFTTSVEHLS GKENVVADAL SRVNVNSLHS TTPGIDYEEM
AKAQCNEDKV SISHLNLSSG LVLEEIPFGN GTSKLLCDIS TGHPRPVVPV VYRRRVFDVF
HGLAHPSIRT TKRIISNKFV WHGLRKDVSK WAKACIPCQT SKVYRHTEAP LESIKVPRRR
FDHIHIDIVG PLPISCGHTH LFTIVDRFTR WPEAIPLKDT STISCARALI SSWISRFGVP
SHISSDRGAQ FTSEIWTCIT QLLGSKIHHT TAYHPQANGL VERFHRHLKS SLTARLTGPN
WFDELPWVLL GIRTAPKDDL KTSSAELVYG TPLAVPGDFL WNSQCSTPVD KFLSKLRTKV
GMLAPIPTTR HGNKGAPYVH KDLNSCPFVF VRRDKSHPPL QRCYDGPYKV LKAGPKHFQL
DIGGHAETIS IDRLKPAHLD IDEPIQVAQP PKRGRPPKSS VVGELCSGHA GRPITITHL
//