ID I1CCB3_RHIO9 Unreviewed; 1568 AA.
AC I1CCB3;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 24-JAN-2024, entry version 47.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=RO3G_10804 {ECO:0000313|EMBL:EIE86093.1};
OS Rhizopus delemar (strain RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL
OS 43880) (Mucormycosis agent) (Rhizopus arrhizus var. delemar).
OC Eukaryota; Fungi; Fungi incertae sedis; Mucoromycota; Mucoromycotina;
OC Mucoromycetes; Mucorales; Mucorineae; Rhizopodaceae; Rhizopus.
OX NCBI_TaxID=246409 {ECO:0000313|EMBL:EIE86093.1, ECO:0000313|Proteomes:UP000009138};
RN [1] {ECO:0000313|EMBL:EIE86093.1, ECO:0000313|Proteomes:UP000009138}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL 43880
RC {ECO:0000313|Proteomes:UP000009138};
RX PubMed=19578406; DOI=10.1371/journal.pgen.1000549;
RA Ma L.-J., Ibrahim A.S., Skory C., Grabherr M.G., Burger G., Butler M.,
RA Elias M., Idnurm A., Lang B.F., Sone T., Abe A., Calvo S.E.,
RA Corrochano L.M., Engels R., Fu J., Hansberg W., Kim J.-M., Kodira C.D.,
RA Koehrsen M.J., Liu B., Miranda-Saavedra D., O'Leary S.,
RA Ortiz-Castellanos L., Poulter R., Rodriguez-Romero J., Ruiz-Herrera J.,
RA Shen Y.-Q., Zeng Q., Galagan J., Birren B.W., Cuomo C.A., Wickes B.L.;
RT "Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals a
RT whole-genome duplication.";
RL PLoS Genet. 5:E1000549-E1000549(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH476739; EIE86093.1; -; Genomic_DNA.
DR STRING; 246409.I1CCB3; -.
DR VEuPathDB; FungiDB:RO3G_10804; -.
DR eggNOG; KOG0017; Eukaryota.
DR InParanoid; I1CCB3; -.
DR OrthoDB; 1367639at2759; -.
DR Proteomes; UP000009138; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00024; CD_CSD; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR33064; POL PROTEIN; 1.
DR PANTHER; PTHR33064:SF35; RIBONUCLEASE H; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000009138};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 681..864
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1188..1346
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1482..1540
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 1539..1568
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1550..1568
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1568 AA; 176952 MW; 8B1156DAC6D023DB CRC64;
MVKEDQTPSP FTHGETDTFE TNMEEIPKAI EDTEMTNAEE TSSPMATSHV ASDDKPVDAV
MGLRVTLENL RQQIAHAVIS GAPQEHLKGL QERAVTIKNC IMFLDEAQAF CVSPSTPTGE
AVLGPGNHLN TSYPRSAHVI PPDLPIWQWQ GNVWRKEADV HDSVEDLLDT FALIVESNGL
SIDSSWSRLV PIKMNRDQRS WFNEVLKGRN LVWSEVRSII VKTYAAQDVA QELEYMDQLL
SLKMVPAESI EAFTDRFQRI RRAAKWDDDI RTASIYKRAL PAFLRQEVSR SLLNLGRDQQ
DSVAKVAAKA RVVLSSNLCS EGSPSPRQDS VPVKSLSMSL ASNGTEASKY NPRNSHLLSN
LQGITKKSSS PGNVKNKFHC AIHGLANHPT DKCNKYKNLL SQSSSSVSPS PTTTNPPMSF
VSVAKKCFRC SGNVPWSREH AAICPRDKPY HGPSKAIRSA RLVTSSSNGS KLNLTITPQA
RPQQASSKMS SGDSNLMDVD DEGYPVNYDF DRNVSGSVIL ATSDNVSKRF GTTKFPLSVI
YGDNDNNLIH TSHSFEVLPL SLDTEVVIGL DLMHKLNILV TNLAIRHPNL APVIDKEITD
DTPEPNKAPY GTREQQVHFH NAIKPFVDQN ALIPKNSFCT VPESVIRLDT VKGKTAYRAP
YRTPFKLLPI MRECIDTWLK DEVIERASPN SDWNSPLTLA PKKDLLGNLT GHRPCLDPRL
LNSILVSNDR HPIPKIEEIF DQLQGSTIFT TLDLRQAFHR FQIYEPDRVK TTFTFEGQQY
QFRGCPFGLK HIASRYQRVI NIVLRDLPYA QAFVDDIIIF SKSYEEHITH VQNIIQRLTK
VNLILNPDKC HFAQSTVYLL GFCVDAKGSR LDPRKVTNAL TWPRPSSGKE IQRFLGLVNY
FRKYLPNISE VTAPLDKLRF EGKLDKLWTS EQESAFEKIK ALLSSAPLLH HPDLEQPFYV
ATDASNYSIG AVLYQVIKNE TRYIGFMARS LSSSEKNYST TKRELLAVIF ALKKFHPFLW
GNPFTLYTDH KALTYLHTQP VANAMMINWL DTILDYNFKI IHRPGIQNIL PDALSRLFEP
EKTLEGDNKT IKTIVTSQII NSNGSILTSR MMMPADLMTP APEDRQKLLM DTHLEGHRGA
QAIVTALHSD GIHWTKLKED ALEIIRSCPD CQKFNIAKHG YNPLTSIYAD APWDHICIDT
AGPFPTSVQG NQYILLVVDV FTRYCVLKAL PDKSSLTIAL ALRSILSLFG RPKIIQSDNG
TEYVNEIVRL YVESSGIDHR LISAYHPRAN GIVERWVGKA KNILHKRLQG RTEDWDLYVD
STQEALNNTH TALHGTRPFS LMFARRPNEN KDYNNVLDKT KSPETMKQLE TRINEFNDTV
LPAIREKIKT SQAASRDKFN QTHRILTDIP TGSQVTLKNV NRVAKSDPLY VGNYTVKRKT
QGGSYVLVDA TGALLPRDVP PSQIKVISQE VSLSNTDQSE SYDVEAVLHH KGSPGNYLYK
VRWKGYGEED DTWEPASHFH DYRPIQKYWS RISEQEPARE VQLVPKKDTT KKRKNVHRNV
TNSKRNRR
//