Database: UniProt
Entry: A0A0C2NDW5_THEKT
ID   A0A0C2NDW5_THEKT        Unreviewed;      1259 AA.
AC   A0A0C2NDW5;
DT   01-APR-2015, integrated into UniProtKB/TrEMBL.
DT   01-APR-2015, sequence version 1.
DT   27-MAR-2024, entry version 33.
DE   SubName: Full=Transposon Ty3-G Gag-Pol polyprotein {ECO:0000313|EMBL:KII72172.1};
GN   ORFNames=RF11_10266 {ECO:0000313|EMBL:KII72172.1};
OS   Thelohanellus kitauei (Myxosporean).
OC   Eukaryota; Metazoa; Cnidaria; Myxozoa; Myxosporea; Bivalvulida;
OC   Platysporina; Myxobolidae; Thelohanellus.
OX   NCBI_TaxID=669202 {ECO:0000313|EMBL:KII72172.1, ECO:0000313|Proteomes:UP000031668};
RN   [1] {ECO:0000313|EMBL:KII72172.1, ECO:0000313|Proteomes:UP000031668}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wuqing {ECO:0000313|EMBL:KII72172.1};
RX   PubMed=25381665; DOI=10.1093/gbe/evu247;
RA   Yang Y., Xiong J., Zhou Z., Huo F., Miao W., Ran C., Liu Y., Zhang J.,
RA   Feng J., Wang M., Wang M., Wang L., Yao B.;
RT   "The genome of the myxosporean Thelohanellus kitauei shows adaptations to
RT   nutrient acquisition within its fish host.";
RL   Genome Biol. Evol. 6:3182-3198(2014).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KII72172.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JWZT01001363; KII72172.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A0C2NDW5; -.
DR   EnsemblMetazoa; KII72172; KII72172; RF11_10266.
DR   OMA; LANCTIF; -.
DR   Proteomes; UP000031668; Unassembled WGS sequence.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR001995; Peptidase_A2_cat.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Reference proteome {ECO:0000313|Proteomes:UP000031668};
KW   RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          255..327
FT                   /note="Peptidase A2"
FT                   /evidence="ECO:0000259|PROSITE:PS50175"
FT   DOMAIN          431..608
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          953..1136
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          175..194
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1259 AA;  141353 MW;  73710AAA817F2F92 CRC64;
     MEEISAISHA NAVAIKLPTF WTAQPRVWFV QTEAQFHLRG IVSDTTKYYY VVGALDQETA
     GRMIDTLSKP PLEGKYENLK SKLLSVFGLT RRDRACRLLD MTGLGDRKPS ALLSEMSSLA
     NGHTSCMLFE EIFLRQMPEA IRMQLAGQDF TNLDLVSERA DELWQSMNSR RCESEINKVG
     QSKSTSNRPV SVSHTDSENK EGWCFYHTRF GTKSYKCREP CSFANSTSKS AKIATVLARI
     GTQPLLFVWD RISGRRFLVD TGAEVSVIPA THRDRQAGNH GPSLVAANDT PMRTYGRITI
     PLNFNSRCFH WSFTTADVPQ PLLGADFLRS NNLLVDLKRK KLVDAESYLS ISCGQTTGHT
     SKLASVAKSD DRFNNLLSEF PNLSVPTFSQ TTTKHGVEHY ISTKGPPVHG RPRRLSPEKL
     AFAKLEFQKM LEMGIIRRSN SPWASPLHMV PKHSENWRPV GDYRRLNEAT IPDRYPIPNI
     QDFSSSLANC TIFSKIDLVR GYHQIPVNKD DVAKTAVITP FGLFEFLRMP FGLRNAAQTF
     QRMMDTVGQG LEFIFIYLDD ILVFSKSPAE HETHLRLLFE RLQQHGLVIN TDKCSFGQSS
     INFLGHFISS EGILPLRDKV EAIQQFPRPC TIKGLQEFNG MVNVYRGFIP GFANTMLPLY
     AALSKGSKHL VWSPEMLEAF NKTKNALADA TMLNYPQTDA PTALTTDASE TAVGAVLEQL
     VDGVWQPLSF FSKKLRPSET RYSAFDRELL ALYLSVRHFR YYLEGRPFTA FTDHKPLTFA
     FKKASDPWSA RQQRHLAAIS EFTTSVEHLS GKENVVADAL SRVNVNSLHS TTPGIDYEEM
     AKAQCNEDKV SISHLNLSSG LVLEEIPFGN GTSKLLCDIS TGHPRPVVPV VYRRRVFDVF
     HGLAHPSIRT TKRIISNKFV WHGLRKDVSK WAKACIPCQT SKVYRHTEAP LESIKVPRRR
     FDHIHIDIVG PLPISCGHTH LFTIVDRFTR WPEAIPLKDT STISCARALI SSWISRFGVP
     SHISSDRGAQ FTSEIWTCIT QLLGSKIHHT TAYHPQANGL VERFHRHLKS SLTARLTGPN
     WFDELPWVLL GIRTAPKDDL KTSSAELVYG TPLAVPGDFL WNSQCSTPVD KFLSKLRTKV
     GMLAPIPTTR HGNKGAPYVH KDLNSCPFVF VRRDKSHPPL QRCYDGPYKV LKAGPKHFQL
     DIGGHAETIS IDRLKPAHLD IDEPIQVAQP PKRGRPPKSS VVGELCSGHA GRPITITHL
//
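
The feature table (FT lines) above gives each domain as a `start..end` residue range followed by `/note` and `/evidence` qualifiers. As a minimal sketch of how those coordinates could be read programmatically, the Python below parses the FT lines copied from this entry and reports each feature's length. The function name `parse_features` and both regular expressions are illustrative assumptions, not part of any UniProt tooling, and the sketch only handles the simple single-line `start..end` form seen in this record (no fuzzy positions such as `<1` or `?`, no multi-line notes).

```python
import re

# FT lines copied verbatim from the entry above.
FT_LINES = """\
FT   DOMAIN          255..327
FT                   /note="Peptidase A2"
FT                   /evidence="ECO:0000259|PROSITE:PS50175"
FT   DOMAIN          431..608
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          953..1136
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          175..194
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
"""

# Hypothetical patterns: a feature header has the key right after "FT   ",
# while qualifier lines are indented further and start with "/".
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
NOTE_RE = re.compile(r'^FT\s+/note="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts with type, start, end (1-based, inclusive) and note."""
    features = []
    for line in text.splitlines():
        header = FEATURE_RE.match(line)
        if header:
            features.append({
                "type": header.group(1),
                "start": int(header.group(2)),
                "end": int(header.group(3)),
                "note": None,
            })
            continue
        note = NOTE_RE.match(line)
        if note and features:
            # Attach the note to the most recently seen feature header.
            features[-1]["note"] = note.group(1)
    return features


features = parse_features(FT_LINES)
for feat in features:
    # Coordinates are inclusive, so the length is end - start + 1.
    length = feat["end"] - feat["start"] + 1
    print(f'{feat["type"]:<8}{feat["start"]:>5}..{feat["end"]:<5} {length:>4} aa  {feat["note"]}')
```

For real work, an established flat-file parser is likely a better choice than hand-written regular expressions, since the full format allows position and qualifier forms that this sketch ignores.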