ID K9FG06_PEND2 Unreviewed; 725 AA.
AC K9FG06;
DT 06-FEB-2013, integrated into UniProtKB/TrEMBL.
DT 06-FEB-2013, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Retrovirus polyprotein, putative {ECO:0000313|EMBL:EKV07102.1};
GN ORFNames=PDIG_73760 {ECO:0000313|EMBL:EKV07102.1};
OS Penicillium digitatum (strain PHI26 / CECT 20796) (Green mold).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium.
OX NCBI_TaxID=1170229 {ECO:0000313|EMBL:EKV07102.1, ECO:0000313|Proteomes:UP000009882};
RN [1] {ECO:0000313|Proteomes:UP000009882}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PHI26 / CECT 20796 {ECO:0000313|Proteomes:UP000009882};
RX PubMed=23171342; DOI=10.1186/1471-2164-13-646;
RA Marcet-Houben M., Ballester A.-R., de la Fuente B., Harries E.,
RA Marcos J.F., Gonzalez-Candelas L., Gabaldon T.;
RT "Genome sequence of the necrotrophic fungus Penicillium digitatum, the main
RT postharvest pathogen of citrus.";
RL BMC Genomics 13:646-646(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKV07102.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKCT01000266; EKV07102.1; -; Genomic_DNA.
DR STRING; 1170229.K9FG06; -.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_000384_38_1_1; -.
DR InParanoid; K9FG06; -.
DR OMA; ICAPTIM; -.
DR OrthoDB; 5490242at2759; -.
DR Proteomes; UP000009882; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00024; CD_CSD; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000009882};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 1..67
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 383..547
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 667..687
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 725 AA; 84484 MW; 40CE9388C29EDD60 CRC64;
MNDVLRECLD EYAVAFVDDI LIYSENVEDH QRQVREVLRR LQKAGLQVAL SKSEFSVKET
RFLGFIVSTD GIAVDPEKIR VVQSWTIPTT VKGIQSFLGF CNFYRRFIQG YSAISKPLHR
LTRQDIPFEW SENCEKAFQT LKKKLVSAPI LRHYDPVRQT RVETDVSDGV LGAVLSQYYE
QEDFWHPVAF YSKTMQPAEL NYEVRDKELL AIVRALQEWR PELEVLSQED RLEIFTDHQS
LEYXGLLFEN NATRRAPIHL EETETIGIVE RIKEANRQSS DLDEFRQMAQ DRGNSRWTLL
DGLLLYEGRL EVPNEGDLCA RLPDEIHRQP LTAHPGIEKL KKLVSTRYHW FGWVTDVKRY
VDVDNCLICK RTKTWRDRTP GLLRPLPVPE RTWQPISMDF RSFPKDRHGY DAVLVVVDRL
SKRPISIPCH KDTNAKQMAR LFIDHVIRIT GIPETIVSDR GGQFISEFWT EFCRILGIKR
KLSTAHHPQT DGQSEIANQY MAQRLRPYVE QNQDNWSEIL PMVDFAASIL PQDTTKKSPF
FVERGYEPSM TSDWKDQETL TPNEQDAVQM LSELQDIWTQ TKEQIAKSQQ LQIRQANKHR
REEDFGVGDL VFITTKDWLQ DRPSRKLSHL ASGPYRIIEK VGNSYKIDLP DAIRVHPIFH
PSKLRKAATT EPLEGQHVDP PPPIQVGETD EWEAEKILDA RTHYRKLQCR VQWLGNDLDL
QWYPA
//