ID A0A3S3NKA6_9ACAR Unreviewed; 889 AA.
AC A0A3S3NKA6;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=Tcoingi protein-like protein {ECO:0000313|EMBL:RWS01070.1};
DE Flags: Fragment;
GN ORFNames=B4U79_01309 {ECO:0000313|EMBL:RWS01070.1};
OS Dinothrombium tinctorium.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Acari;
OC Acariformes; Trombidiformes; Prostigmata; Anystina; Parasitengona;
OC Trombidioidea; Trombidiidae; Dinothrombium.
OX NCBI_TaxID=1965070 {ECO:0000313|EMBL:RWS01070.1, ECO:0000313|Proteomes:UP000285301};
RN [1] {ECO:0000313|EMBL:RWS01070.1, ECO:0000313|Proteomes:UP000285301}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UoL-WK {ECO:0000313|EMBL:RWS01070.1};
RA Dong X., Chaisiri K., Xia D., Armstrong S.D., Fang Y., Donnelly M.J.,
RA Kadowaki T., McGarry J.W., Darby A.C., Makepeace B.L.;
RT "Genomes of trombidid mites reveal novel predicted allergens and laterally-
RT transferred genes associated with secondary metabolism.";
RL Gigascience 0:0-0(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RWS01070.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NCKU01009915; RWS01070.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3S3NKA6; -.
DR STRING; 1965070.A0A3S3NKA6; -.
DR Proteomes; UP000285301; Unassembled WGS sequence.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR GO; GO:0071897; P:DNA biosynthetic process; IEA:UniProt.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR36688; ENDO/EXONUCLEASE/PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR36688:SF1; ENDO_EXONUCLEASE_PHOSPHATASE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF14529; Exo_endo_phos_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000285301};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 601..714
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..30
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 498..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 834..889
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..521
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 868..889
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:RWS01070.1"
SQ SEQUENCE 889 AA; 100667 MW; A5933C5D12BFFD99 CRC64;
WIPKSGPKRT QTTSDYWRRA THRERPTPDG LTQTLLLRSG DIHPNPGPKS PKLPNLRIIQ
LNINGLNRKL PELTLLIKAH KPHIITLQET KLSTRTKTPR IPDYAAVRSD RPSNNGGGLI
IYIHRSIPYN QIPTPQLPDY AQHQAINFYI NRKPYTLFNF YIPPPPNPTS HYHPDINSFI
IHPNPIITGD FNAHHPLWLK TQPSDIRGTA IEEHIQNGTF LVLNTHSPTR IAPNQRPTSP
DLTITTPELA AKITWQPLPK FSSDHLPLQI DIAVKVPQTR NPKSTYTNFK KANWPAYTLK
LESLLTNYDP NTFSTIDQAE KALRTAIITA AKDTIPQGKR NTYIPNYNPE ITKLIEEIPY
YPPCKQQTLP HPIKQKRWND FVSKIDRKTS PSLLWHTIKA ITNNTPVPTE IIKQAHKIDP
NPKAQATALI KHYSSISKFP TPKEFPETQT AIVQLTNSKA LGPDQLCNLH LKHLGCTAIA
AITALINRSL STAQTYHDTL APQQTRRKTP QTQTKQYPDT PPDATWLSLP NVHHNCSCRS
NTNLSQASIN LNHQPEQLLN YQTIPNNHSP HYLRWLTNFL LGRKSLVKLQ NTLSAPRSFP
NGVPQGSVLS PTLFTFYIND IPTPPEPLKM ITYADDITVL SPHPNSKTAT TNLQQYLPTI
EHWATLKHLK ISPDKSTVTL LTPDPAEYSR TINLQLNQLP IPTEKHPKIL GLTLDPKLTF
NTHTTAVIEK ATRRIHSLKA IASHTWGQDK ETLTLLYKQY IRSVLEYASP AWFPTLSQTN
LQKLKTLENR ALRIATGCIL MTDIPHLHSE TQTLPLPYHS DLLGAQFFAR ISDPQHPTAR
APPYLNPCPH HTNPSHKFRP PAPNQATPEQ NPRSHTTRYQ PERKNPTSL
//