ID A0A016UEE9_9BILA Unreviewed; 986 AA.
AC A0A016UEE9;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN Name=Acey_s0044.g903 {ECO:0000313|EMBL:EYC13301.1};
GN ORFNames=Y032_0044g903 {ECO:0000313|EMBL:EYC13301.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC13301.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC13301.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001380; EYC13301.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016UEE9; -.
DR STRING; 53326.A0A016UEE9; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR CDD; cd09076; L1-EN; 1.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR46238; REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR46238:SF8; RIBONUCLEASE H; 1.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 542..800
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..35
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 924..986
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..17
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 930..986
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 986 AA; 112282 MW; D54D5BBACBEEAFE2 CRC64;
MRGSIWNTVN ARTSPGSDLQ EARPPPVVKK RGLPREERPR LKTLVRVGSL NVGTLTGKTR
EIADFMARRR IHVLCLQETR WKGSKAREIG DGVKLFYYGI EAKRNGVAIA VSEPLKEYVS
SVNRVSDRII SLRVATEDGF WTVMSVYAPQ CGCTEAEKEA FYDELDDVIR SAPEGDYITV
AGDFNGHVGQ DRKGFERVHG GRGFGRRNQE GERIVELAEA HDLAIASTFF MKRESQKITY
CSGGRQSEID YILVRRRFLK TVKNVKTIPG EEIAGQHRPV VADVCIALQK HTKAKREPRI
RWWKLAGETQ KTFRDKIIAA GLPEPYGHID SVWASAATTI LTCARDTLGE TKGGRRGDRA
TWFWSEDVQK IVKAKKDAYK AWQKTKSLSS LAEYKLRKKE AKAAVARAKN AVMDELYDKL
ESSQAEKHVF RLAKARHRAS LDVTEVRAVK NEDGEVLRDP VAVKERWRVY FEHLLNEEFP
RKPKAPAEPV AGPMQPWTAD EVRKAIKKMK AGKATGPDGI PVEAWRSLGE LGVRWLTEFF
NNITRSAKMP EAWRDSIIVP IFKRKGDAMN CTNYRGIKLI AHTMKIYERL LDMRLREMVE
ISPDQFGFVP ERSTIDAIFI ARQLMEKYRE KNKPCHLAFL DLEKAYDRLP RTVLWEVMRE
RGIPECMVRT VQVMYDGSTA RVRTSHGMTS KFDITVGVHQ GSALSPFLFI MTLDTVVKHL
LEGPPSTLLY ADDVALIADS RAELQLKIQK WQTALADAGF KLNLKKTEVM SSIGGGDAML
DENGTAFTQT EEFQYLGSVL SADGTVDAAV RGRIACAWLK WRESTGILCD RRCSRVLKGK
IYRTVVRPAM MYGSECWPVT KAHERMLNTA EMRMLRWACG LTRRDKVRNE DIRALMQTAL
IQQKLRAQRL RWFGHVMRRS PLHPTRQAME MEVSGKRPRG APKKRWKDTV SKDMRELGVT
KDDAQDRDLW RRRTKTADPA NARDKR
//