ID A0A016S258_9BILA Unreviewed; 1402 AA.
AC A0A016S258;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN Name=Acey_s0311.g2144 {ECO:0000313|EMBL:EYB84723.1};
GN ORFNames=Y032_0311g2144 {ECO:0000313|EMBL:EYB84723.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB84723.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB84723.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001647; EYB84723.1; -; Genomic_DNA.
DR STRING; 53326.A0A016S258; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 355..393
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT DOMAIN 515..693
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1066..1222
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 299..327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1342..1402
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..318
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1363..1385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1402 AA; 160090 MW; 5A571CED6E231F40 CRC64;
MDPQVMMAMF REMMREERKE MMELFLKHMS GSGQGSATNE VVSVPNVMAA LSNRIEKFIF
DPEIDMSFTK WYTRYKEVFV EDAKQLTESA RVRLLCEKLD ADTFERYQRH VLPKEVTSIG
FEETVATLKQ LFDVKTSEFT LRYQCLKLEK SNAEDFLVYT GRVNEFCEKA KIHELDSDGI
KCLLWIFGLK SQREAEIRQR LIAVLDREYK AGRKVSLQEL YRECENFLSL KKDSETIAGN
VKTVEAAVKE ERRKRECWNC CGDHFAQQCK SKPWFCKLCK KTGHKERFCE VAKRRETAEN
ESEWRRSRQN SDNRRNRKTM QSSRKHVRGV KIANATAQVN STRMYVEARV NQRPVSFLLD
TGSDITLLNE KVWRSMGAPK LEKTNVVVKN ASGSSMKIHG KLWCEFEIKG SRSEGYAYVT
PHNSLLGLEW IQKNEDMSYY MRMMVAEVKA DQNGDVAMKL KETYPEVFEE GLGLCTKEKA
DLQLVGDVRP VFKACRPVPH AAVEAVEREL DRLLEMNVIA PVTHSEWAAP IVCVRKSNGK
LRVCADFSTG LNKALESFDY PLPVPEDIFA TLNGGAVFSQ IDLSDAYLQI ELSDESKKMV
VINTHRGLFQ YNRLPFGIKT APGIFQQVMN KMVTGLRGVA TYLDDILVCG RTEQEHMENL
LALFERISEY GFKVRIEKCS FAKPEIRYLG FIVDKNGRRP NPEKIEAIKS MVEPKNVGQL
RAFLGMITYY AAFMPTVKDL RGPLDALLKE DVKWEWTSKQ QLAFEKLKKA LSSELNLAHY
DPRQKIVVAA DACDYGIGCV VSHRYADGSE KPIAHASRSL TAAEKNYSQI EKEALGIVFA
VKKFHKYVFG RKFLLLTDHK PLLAIFGDKK GVPVYSANRL MRWATILLGY DFDIEYVNTT
KFGQADGLSR LMQKHQVEDE DIVIASVEND VDSLLKECIR RLPVTVVDVE SYTRTDPVLR
KVISCVKSGK WPKANQKLAH FHNRCETLSV VGGCLMSGER VVIPPELRSK VLKELHVGHP
GIVRMKKLAR SYVYWPNVDA DCEDMVRGCT SCQEAAKNPT KVPLKAWPSP TRVWQRVHVD
FAGPLQGVYY LVVVDAFSKW PEMLEMNNIS ATRTVKALKS LFARYGLPQT IVSDNGTQFT
SEQFKTMCDE GGIVHIKTAP YHPQSNGQAE RFVDTLKRGI KKLKGEERPS EETLNVVLQA
YRVTPNSSLD EKTPAEVFLG RKLRTRMSLL VPHPESGEDP LAKARRERME QQFDRKHGVV
KRKFEVGDKV YAKQWKPPHF HWVKGVVKRR VGSVNYEVEL DGRIVRKHAN QLRSREDKGH
KEESDTLKVL LEVLAAEDAY KRQENNAEPN PDRAIPNARL PAVSEPTPTS TMTRIQGQSP
PATLRRSTRI RRPVQRFDPS PA
//