GenomeNet

Database: UniProt
Entry: A0A016S258_9BILA
ID   A0A016S258_9BILA        Unreviewed;      1402 AA.
AC   A0A016S258;
DT   11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT   11-JUN-2014, sequence version 1.
DT   27-MAR-2024, entry version 37.
DE   RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN   Name=Acey_s0311.g2144 {ECO:0000313|EMBL:EYB84723.1};
GN   ORFNames=Y032_0311g2144 {ECO:0000313|EMBL:EYB84723.1};
OS   Ancylostoma ceylanicum.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC   Ancylostomatinae; Ancylostoma.
OX   NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB84723.1, ECO:0000313|Proteomes:UP000024635};
RN   [1] {ECO:0000313|Proteomes:UP000024635}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX   PubMed=25730766; DOI=10.1038/ng.3237;
RA   Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA   Aroian R.V.;
RT   "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT   ceylanicum identify infection-specific gene families.";
RL   Nat. Genet. 47:416-422(2015).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EYB84723.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JARK01001647; EYB84723.1; -; Genomic_DNA.
DR   STRING; 53326.A0A016S258; -.
DR   Proteomes; UP000024635; Unassembled WGS sequence.
DR   GO; GO:0042575; C:DNA polymerase complex; IEA:UniProt.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR001995; Peptidase_A2_cat.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF13650; Asp_protease_2; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW   RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          355..393
FT                   /note="Peptidase A2"
FT                   /evidence="ECO:0000259|PROSITE:PS50175"
FT   DOMAIN          515..693
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1066..1222
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          299..327
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1342..1402
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        299..318
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1363..1385
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1402 AA;  160090 MW;  5A571CED6E231F40 CRC64;
     MDPQVMMAMF REMMREERKE MMELFLKHMS GSGQGSATNE VVSVPNVMAA LSNRIEKFIF
     DPEIDMSFTK WYTRYKEVFV EDAKQLTESA RVRLLCEKLD ADTFERYQRH VLPKEVTSIG
     FEETVATLKQ LFDVKTSEFT LRYQCLKLEK SNAEDFLVYT GRVNEFCEKA KIHELDSDGI
     KCLLWIFGLK SQREAEIRQR LIAVLDREYK AGRKVSLQEL YRECENFLSL KKDSETIAGN
     VKTVEAAVKE ERRKRECWNC CGDHFAQQCK SKPWFCKLCK KTGHKERFCE VAKRRETAEN
     ESEWRRSRQN SDNRRNRKTM QSSRKHVRGV KIANATAQVN STRMYVEARV NQRPVSFLLD
     TGSDITLLNE KVWRSMGAPK LEKTNVVVKN ASGSSMKIHG KLWCEFEIKG SRSEGYAYVT
     PHNSLLGLEW IQKNEDMSYY MRMMVAEVKA DQNGDVAMKL KETYPEVFEE GLGLCTKEKA
     DLQLVGDVRP VFKACRPVPH AAVEAVEREL DRLLEMNVIA PVTHSEWAAP IVCVRKSNGK
     LRVCADFSTG LNKALESFDY PLPVPEDIFA TLNGGAVFSQ IDLSDAYLQI ELSDESKKMV
     VINTHRGLFQ YNRLPFGIKT APGIFQQVMN KMVTGLRGVA TYLDDILVCG RTEQEHMENL
     LALFERISEY GFKVRIEKCS FAKPEIRYLG FIVDKNGRRP NPEKIEAIKS MVEPKNVGQL
     RAFLGMITYY AAFMPTVKDL RGPLDALLKE DVKWEWTSKQ QLAFEKLKKA LSSELNLAHY
     DPRQKIVVAA DACDYGIGCV VSHRYADGSE KPIAHASRSL TAAEKNYSQI EKEALGIVFA
     VKKFHKYVFG RKFLLLTDHK PLLAIFGDKK GVPVYSANRL MRWATILLGY DFDIEYVNTT
     KFGQADGLSR LMQKHQVEDE DIVIASVEND VDSLLKECIR RLPVTVVDVE SYTRTDPVLR
     KVISCVKSGK WPKANQKLAH FHNRCETLSV VGGCLMSGER VVIPPELRSK VLKELHVGHP
     GIVRMKKLAR SYVYWPNVDA DCEDMVRGCT SCQEAAKNPT KVPLKAWPSP TRVWQRVHVD
     FAGPLQGVYY LVVVDAFSKW PEMLEMNNIS ATRTVKALKS LFARYGLPQT IVSDNGTQFT
     SEQFKTMCDE GGIVHIKTAP YHPQSNGQAE RFVDTLKRGI KKLKGEERPS EETLNVVLQA
     YRVTPNSSLD EKTPAEVFLG RKLRTRMSLL VPHPESGEDP LAKARRERME QQFDRKHGVV
     KRKFEVGDKV YAKQWKPPHF HWVKGVVKRR VGSVNYEVEL DGRIVRKHAN QLRSREDKGH
     KEESDTLKVL LEVLAAEDAY KRQENNAEPN PDRAIPNARL PAVSEPTPTS TMTRIQGQSP
     PATLRRSTRI RRPVQRFDPS PA
//