ID M1W2W0_CLAP2 Unreviewed; 1762 AA.
AC M1W2W0;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2013, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCE32206.1};
GN ORFNames=CPUR_06066 {ECO:0000313|EMBL:CCE32206.1};
OS Claviceps purpurea (strain 20.1) (Ergot fungus) (Sphacelia segetum).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Clavicipitaceae; Claviceps.
OX NCBI_TaxID=1111077 {ECO:0000313|EMBL:CCE32206.1, ECO:0000313|Proteomes:UP000016801};
RN [1] {ECO:0000313|EMBL:CCE32206.1, ECO:0000313|Proteomes:UP000016801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=20.1 {ECO:0000313|EMBL:CCE32206.1,
RC ECO:0000313|Proteomes:UP000016801};
RX PubMed=23468653; DOI=10.1371/journal.pgen.1003323;
RA Schardl C.L., Young C.A., Hesse U., Amyotte S.G., Andreeva K., Calie P.J.,
RA Fleetwood D.J., Haws D.C., Moore N., Oeser B., Panaccione D.G.,
RA Schweri K.K., Voisey C.R., Farman M.L., Jaromczyk J.W., Roe B.A.,
RA O'Sullivan D.M., Scott B., Tudzynski P., An Z., Arnaoudova E.G.,
RA Bullock C.T., Charlton N.D., Chen L., Cox M., Dinkins R.D., Florea S.,
RA Glenn A.E., Gordon A., Gueldener U., Harris D.R., Hollin W., Jaromczyk J.,
RA Johnson R.D., Khan A.K., Leistner E., Leuchtmann A., Li C., Liu J., Liu J.,
RA Liu M., Mace W., Machado C., Nagabhyru P., Pan J., Schmid J., Sugawara K.,
RA Steiner U., Takach J.E., Tanaka E., Webb J.S., Wilson E.V., Wiseman J.L.,
RA Yoshida R., Zeng Z.;
RT "Plant-symbiotic fungi as chemical engineers: Multi-genome analysis of the
RT Clavicipitaceae reveals dynamics of alkaloid loci.";
RL PLoS Genet. 9:E1003323-E1003323(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCE32206.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAGA01000038; CCE32206.1; -; Genomic_DNA.
DR STRING; 1111077.M1W2W0; -.
DR VEuPathDB; FungiDB:CPUR_06066; -.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_000384_13_0_1; -.
DR OrthoDB; 2734036at2759; -.
DR Proteomes; UP000016801; Unassembled WGS sequence.
DR GO; GO:0005739; C:mitochondrion; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006338; P:chromatin remodeling; IEA:UniProt.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Mitochondrion {ECO:0000256|ARBA:ARBA00023128};
KW Reference proteome {ECO:0000313|Proteomes:UP000016801};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 576..756
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1228..1404
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1545..1601
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1031..1089
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1034..1050
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1069..1083
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1762 AA; 195512 MW; 79457514730D29F4 CRC64;
MADTTDTTGT AGPASPADPT TPPKNLFSAY PAWDGAADSF ENWLLTITAR LRDPGMRSYL
GSPCHVCTTL FSRIPSARQA ECAAYIRARV PDGASLDEDA SLPPFVIQDY VEVLRDAFLP
KGIAARALQK VYAIRQGPAQ PLALFMGEFV AWCTRAGSLA PTGSARVSTN KMALNFTIRD
AAATRGYASD TDFEVYSKAL LQLATEREQM PQFLSMKGSK TSLYIDVHNI ATYVSLSGAP
VPAQAPAPSP APVPSLPAPR MDLDGDTVMG GVNAVSTVTD VVAQIAALAS QIAALQSNGS
GRREALEEDC ALDLDKPPTT EEITKAKSSL DSRPFLVDCL INNHFMLSAL FDTGCLPYTA
ISDSLVKRHH LPRIPVESRE LKLAKDDNHS HPIDSITYFN LDINGRRERI FGYVIKDLHY
DLILGKGWAE ANHVVYKAGK RLLRIGNGPT RINVREAGWM NRPAVHERTK HIRDATVVSA
RLFKALAERA HRREEVMQLC SVSISDINKA LDKLAESKQP MTLTEIRARL PPQVLPENAP
AFLEDPPGDL QPPHGPLYDM SHEELLVLRA TLTDLLDKGW IRASASPASS PVLFARKPGG
GLRFCVDYRG LNAITSKDRY PLPLIRETLR QLAKALHLTK LDVRTAFHRM RMAAGEEWKT
TFRTRQGSFE WLVCPFGLTG APAHFQRWIN TVLGDALDVF CSAYMDDVII YTDGDEDDHF
AKVNLVLSRM LAAGLNIDLA KCAFNVTEIK YLGFIVETGV GIKVDPEKVE AISAWEEPTN
ASAVRSFLGF ANFYREFAPQ FADIAAPLTE LTRRGKLWRW DEVHQTAFDR MKEILVCAPV
LAMYDPELET IVEADSSGYG LGGVVSQVGQ DDLLRPIGFY SRKLTAAEIN YQIHDKELLS
IISTVKHFRG ELRSCQKTFT ILSDHRNLQY FMTTRMLSER QIRWAEELSY YNFTIKFRAG
KDSEKPDFLS RRDQVMPKDA SDERLSKRKF QLIRDRWLTP PMVTKETIQG DICAVLQISA
VRTRAALRAS ASTPLPLPPS PAPAPADGDE VSPPTPSDAS SPALIDVPSL TPGAPPDPDT
PTSAPDTLRP AQGAAIFDDF EMQTLWDRAV SMDPVYRASH AAVHRGDRSL PTELDHKVQM
PDCSFDERGA LCYRNAVWVP DHEPLRTALI QRTHDSYITG HPGRDATLAI LARGYFWPQQ
YLAVRQFVRN CAVCNRSKVT RQQGRGLLRP LPVPDRFHSE IAIDFMTELP AENDGDPRFL
MVISDRLLHS VTLEAMTTMD AEACAKVFVN SHWRFHGFPA ALTSDRGSNW TGRFWRRLCH
LVGIEQRLST AFHPQTDGAT ERWNQEVLAV LRAFISYSQT DWPQLLPCVQ LALANRDNAR
TGMSAFYLTH GYHLDPIQQA HSQASPATKD PQARANAFVS RLYEGQEIAK AAMVTAQQIM
ESQANRKRRP AEQLRVGDRV WLNLRNVTTP QLKKKLAWTQ AKYRVTKIIA PDVYELDVPS
SIHNRFFVDI LRRDPGDPLP SQVTDDAQPP PMSDGLLPDD EAPMYAVERI LRAGPWKGTR
GMLVKWAGYK EPSWTYRANL TLTDAFREFV QRYGEGDDVG EAGTGGYTGS TGKKRKGRAI
PLLQNIELQS SEITEDYADD PGEGGFAMIA GTPYFKAVSM RITRYALAKT VPQLDYARAE
LDKQRRDAGY QFTRCKGRHT SSLGLVAVTA GGMIGNMEIR SGGTADRAMR EDSVGGAVTA
LLLLSRRSNS LAHDDCHAGE VS
//