ID K1VH30_TRIAC Unreviewed; 1662 AA.
AC K1VH30;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=A1Q2_05590 {ECO:0000313|EMBL:EKD00111.1};
OS Trichosporon asahii var. asahii (strain CBS 8904) (Yeast).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Trichosporonales; Trichosporonaceae; Trichosporon.
OX NCBI_TaxID=1220162 {ECO:0000313|EMBL:EKD00111.1, ECO:0000313|Proteomes:UP000006757};
RN [1] {ECO:0000313|EMBL:EKD00111.1, ECO:0000313|Proteomes:UP000006757}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 8904 {ECO:0000313|EMBL:EKD00111.1,
RC ECO:0000313|Proteomes:UP000006757};
RX PubMed=23193141; DOI=10.1128/EC.00264-12;
RA Yang R.Y., Li H.T., Zhu H., Zhou G.P., Wang M., Wang L.;
RT "Genome sequence of the Trichosporon asahii environmental strain CBS
RT 8904.";
RL Eukaryot. Cell 11:1586-1587(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKD00111.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMBO01000349; EKD00111.1; -; Genomic_DNA.
DR STRING; 1220162.K1VH30; -.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_000384_38_3_1; -.
DR InParanoid; K1VH30; -.
DR OMA; PPLCESI; -.
DR OrthoDB; 1706856at2759; -.
DR Proteomes; UP000006757; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000006757};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 744..923
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1297..1456
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1605..1662
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 28..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 233..269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 314..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 610..644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 44..66
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..266
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..341
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1662 AA; 190554 MW; 96BF3F07F46D14F9 CRC64;
MTSTQRLQAL EAQLEALIDK QNNIENENTA LRSTIDDLRT QRESTDTSTT STSGRHEPKV
SSPEYFSGQR NKVTTFITQV RMVIGLQPSR FPTENSKVLY AGSFLCDTAF LWLQPYVASD
HPPAWLNDFN LFCKELRSMF GDPDEVATAE RQLYNLRQRG SASAYVADFT RFAAVVNWND
EALCAQFYRG LKDPIKDELA RTDKPKDLKA YKETAVRIDT RLFERHNEKD RSVKTTSFTS
TRPVTTGAPV RTTFTKSTST FGPRFRSQSP EREHLAAVVS TIRGRISREE YDRRRKNNLC
LYCGEKGHMV GQMPRGSTVQ AQQPGKSLSL RSNSSQGSEG EHRTADYLGS VHGIRDNAIR
KHITIPFHVQ RPMGRRQTVF APEHLHALVD SGATSNFIDI RFAHELRLKL QPVPHRDLIL
LDGAGQKSTI ESEVTLCLNF ENFGPHWVNC AVTSLHTFPI VLGLPWLKEH DPFVSWSLMT
IMPSTWNKSK WPVLDRATAL DLIGRNDTTR MVDETSQHAT MGYETTTPDL GSDAEQGLLS
SRAAASSKIG RAAANSMKRA ASGEELLSKA KRNRTSNNLR INLRSDTGFE FTRFAPFLPP
FDYVRHVDDV DDSDSDGTST EIGTDTDSDK TYHASRSTSK GVSWSDDPEL RLADFAMGAL
VSRTASWLCN STEAEEEDDA MFVPPEYHEY LDVFSKVEAD KLPPHRPFDH HITLQDGKTP
PFGPIYSLSE KELGVLREYL DENLEKGFIV PSESPAAAPI LFVKKKDGSL RLCVDYRGLN
KITVKNRYPL PLIPELLDRL RKAKVFTKID LRGAYNLLRI AEGDEWKTAF RTRYGLFEYK
VMPFGLTNAP ASFQHLMNHN FRDMLDDFVI CFLDDIMVFS DTTEEHEHHV KQVLQRLREV
GLYAKASKCE FNKDSVEFLG FIISDKGIGM DQKKVATILE WPKPCNLHDV RSFLGFCNFY
RRFIKGYSTI AGPLIRLTRN DVPFQWTAKE QQAFDAMKGC FITAGFLSHY DPNQHLVLET
DASDFAIAGV LSQKINDELR PIAFFSRKLS PAELNYEIHD KEMLAIVACF KEWRHYLEGA
AHQITVYTDH RSLEYFTTSK QLNRRQARWS EFLSEFDFVI IYRPGLKGTK PDALTRRPDY
HPLGKGCTLS TAANPQNHRA LLRPGQYLAS AMSFTSDIVL RLKNLMEKDS NSNTYHTKAQ
DPEDKDFAYD DNGLVTYQGR WYAPDDNELR LQLVKESHDH PTAGHPGQRK TLQNLQRNYW
WPRMKEFVNS YVDTCHECKR AKARRHSPYG FLRPMPVPPY PWSSVSMDLI EGLPLSNGFD
SILVIVDRLT KMAIFIPTTK TITAEEVARL FIKHVFAKHG VPQTIVSDRG SEFDSRFFRA
FSEMLGIDLA MSTAYHPETD GQTERVNQVL EQYIRLYVNY KQDDWFWMLP IAEFTYNNTT
HSATTVSPFF ANKGYHPRSH FTPRDSSEVT MNSPDARIQV EGLADLHSHL KEQMRVAAES
AKYFYDRHRS HAPHLRRNQK VWLDARNIKT TRPMQKLDHK YLGPFKIKKK ISDEAYKLDL
PKDMKARGLH DVFNIKLLEP YHKNKIPGRH QPPPPPVEID GEEEYEVEAI LDHKINKRYK
DPNWYLIRWR GYDSNEDSWV QTEALENAQD ILRDFWAKRQ EH
//