ID M3B4T4_SPHMS Unreviewed; 2852 AA.
AC M3B4T4;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2013, sequence version 1.
DT 24-JAN-2024, entry version 44.
DE SubName: Full=Pre-mRNA splicing factor {ECO:0000313|EMBL:EMF14802.1};
GN ORFNames=SEPMUDRAFT_148402 {ECO:0000313|EMBL:EMF14802.1};
OS Sphaerulina musiva (strain SO2202) (Poplar stem canker fungus) (Septoria
OS musiva).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Mycosphaerellales; Mycosphaerellaceae; Sphaerulina.
OX NCBI_TaxID=692275 {ECO:0000313|EMBL:EMF14802.1, ECO:0000313|Proteomes:UP000016931};
RN [1] {ECO:0000313|EMBL:EMF14802.1, ECO:0000313|Proteomes:UP000016931}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SO2202 {ECO:0000313|EMBL:EMF14802.1,
RC ECO:0000313|Proteomes:UP000016931};
RX PubMed=23236275; DOI=10.1371/journal.ppat.1003037;
RA Ohm R.A., Feau N., Henrissat B., Schoch C.L., Horwitz B.A., Barry K.W.,
RA Condon B.J., Copeland A.C., Dhillon B., Glaser F., Hesse C.N., Kosti I.,
RA LaButti K., Lindquist E.A., Lucas S., Salamov A.A., Bradshaw R.E.,
RA Ciuffetti L., Hamelin R.C., Kema G.H.J., Lawrence C., Scott J.A.,
RA Spatafora J.W., Turgeon B.G., de Wit P.J.G.M., Zhong S., Goodwin S.B.,
RA Grigoriev I.V.;
RT "Diverse lifestyles and strategies of plant pathogenesis encoded in the
RT genomes of eighteen Dothideomycetes fungi.";
RL PLoS Pathog. 8:E1003037-E1003037(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB456262; EMF14802.1; -; Genomic_DNA.
DR RefSeq; XP_016762923.1; XM_016904901.1.
DR STRING; 692275.M3B4T4; -.
DR GeneID; 27902038; -.
DR eggNOG; KOG1795; Eukaryota.
DR HOGENOM; CLU_000380_3_0_1; -.
DR OMA; ANKWNTS; -.
DR OrthoDB; 246127at2759; -.
DR Proteomes; UP000016931; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 3.10.28.10; Homing endonucleases; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR000555; JAMM/MPN+_dom.
DR InterPro; IPR037518; MPN.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 2.
DR SMART; SM00232; JAB_MPN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF55608; Homing endonucleases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS50249; MPN; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000016931};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 1836..1934
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000259|PROSITE:PS50819"
FT DOMAIN 2620..2751
FT /note="MPN"
FT /evidence="ECO:0000259|PROSITE:PS50249"
FT REGION 1..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1733..1788
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..37
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..72
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1743..1763
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2852 AA; 327822 MW; 5449E08CC7975CF3 CRC64;
MAGIPPPPGW APPPPPGAPP GLSSSGIPPP PPGAVPAKDP QAEKLQQKKQ KWLRQQRQRF
GEKRRGGFVE TQKADMPPEH LRKIVKDIGD VSQKKFSADK RSYLGALKFM PHAVLKLLEN
MPMPWESSRE VKVLYHVNGC LTLVNEIPRV IEPVFHAQWA TMWVVMRREK SDRRHFKRMR
FPPFDDEEPP LSWSENIEDV EPLEPIQMEL DENEDEPVFE WFYDHRPLSD TSHVGGPSYK
DWNLSLPQMA TLHRLSNPLL SDTNDKNYFH LFDMPAFATA KALNVAIPGG PRFEPLYKDI
DPNDEDFGEF NAIDRIIFRA PIRTEYRVAF PYLYNSLPRS VKLGVYSYPQ TVYVKTEDPN
LPPFYFDPVI NPISSRQVMP KNLTISHEDE IFGHGNNEEP GEEDGGFSMP EGVDPFLDDE
DLFNDETAAA IALWWAPYPF DRRSGKMVRA QDVPLVKQWY LEHVPAGQPV KVRVSYQKLL
KTYVLNELHK KPPKAQNKQL LGRNLKRTKF FQQTTIDWVE AGLQVCRQGF NMLNLLIHRK
NLTYLHLDYN FNLKPIKTLT TKERKKSRFG NAFHLMREIL RLTKLIVDAQ VQYRMGNIDA
FQLADGILYA FNHVGQLTGM YRYKYKLMHQ IRSCKDLKHL IYYRFNSGPV GKGPGCGFWA
PAWRVWLFFL RGIIPLLERW LGNLLSRQFE GRHSKGVAKT VTKQRVESHF DLELRASVMA
DLMDMMPEGI KQQKVNTVLQ HLSEAWRCWK SNIPWKVPGL PKPIEDVILR YVKSKADWWV
SVAHYNRERI RRGATVDKTV AKKNLGRLTR LWLKAEQERQ HNYLKDGPYV STEEGVAIFT
TAVHWLESRK FQPIPFPSVS YKHDTKILIL ALERLREAYS VKGRLNQSQR EELALIEQAY
DSPGTTLARI KRFLLTQRSF KEVGIDMNDN YSSINPVYDI EPIEKITDAY LDQYLWYQAD
QRRLFPAWIK PSDSEVPPLL TYKWAQGINN LSNVWSVGEG ECNVMLETRL DKVYEKIDIT
LLNRLLRLIM DHNLADYISS KNNVQLNYKD MNHTNSYGMI RGLQFSAFVF QYYGLIIDLL
LLGLQRASEM AGPPNAPNDF LQFRDRATES RHPIRLYTRY IDKIWIFFRF TADESRDLIQ
RFLTEQPDPN FENVIGYKNK KCWPRDSRMR LMRHDVNLGR AVFWDMKNRL PRSITTVEWD
DTFASVYSRD NPNLLFAMNG FEVRILPKSR NQNDEFPTKD SVWALVDNAT KERTAHAFLQ
VTGEDIAKFN NRIRQILMSS GSTTFTKIAN KWNTTLIALF TYYREAAVST VELLDTIVKC
ETKIQTRVKI GLNSKMPSRF PPAVFYTPKE LGGLGMISGS HILIPASDKR WSKQTDSGVS
HFRAGMSHDE ETLIPNIFRY IIPWEAEFID SQRVWTEYSQ KRLEANQQNR RLTLEDLEDS
WDRGLPRINT LFQKDRSTLS FDKGFRARTE FKLYQHMKSN PFWWTSQRHD GKLWNLNAYR
TDVIQALGGV ETILEHTLFK ATAFPSWEGL FWEKACLAFG TEVLRLDHSR VKVEDVKDGD
LLLGPDGQPR RVYNTVSGEE RLYRISISED FEDLVCTSNH ILVLYHVKTD HGKDIAGPSE
DPEGETVYMT AKEFAGLPES DRSRYRVLRP AGHELPEQDV SVTPYFLGMW IGNGNHTDTT
NTDDHEEHIR DFIVAHAAEL DLHITNHGCL GSKVRESRDT IYARRLADGW KNEGRFCLPP
SDQPNDLNDT NKHVRGSTTP SSPPPTRRLR PGTIDTSVDA VQDPVDSPMD MSLPALNGNG
KDSEEGFVIS SDCEIIRMET GKAAYGELIQ DEEDLLVGDA LGQSQPKNVN SLLVAMRSLG
ILTPNEADGH DGRKRIPTMY MRNSRDVRLR VLAGFIDSNG WYAHSQNHIA FAQSERCDKD
LFWEVVYLAR SLGFGVSVKP ENCLASSGQH EIPQLRATIT GNLMEIPCLL PLKQAGAHVQ
TCRNTFTIKD IKLEAQSTKW AGFKVDKDQT YLRHDYLVLH NSGFEESMRY KKLTNAQRSG
LNQIPNRRFT LWWSPTINRA NVYVGFQVQL DLTGIFLHGK IPTLKISLIQ IFRAHLWQKI
HESVVMDLCQ VFDQELEALG IETVQKETIH PRKSYKMNSS CADILLFASH KWSVSNPSLL
YDTKDNMGLT TTNKFWVDVQ LRYGDYDSHD IERYVRAKYL DYTTDSMSIY PSATGLMIGI
DLAYNLYSAY GQYFPGLKQL VQQAMAKIMK ANPALYVLRE RIRKGLQLYA SESNQEFLNS
QNYSELFSNQ IQLFIDDTNV YRVTIHKTFE GNLTTKPING AIFIFNPRTG QLFLKIIHTS
VWAGQKRLGQ LAKWKTAEEV AALIRSLPVE EQPKQLIVTR KGLLDPLEVH LLDFPNISIR
ASELQLPFQA AMKVEKLGDM ILRATEPQMV LFNLYDEWLK SISSYTAFSR LILILRALHV
NQDKTKLLLR PDKTVITQEH HIWPTLSDED WVKVEVQLRD LILNDYGKKN NVNTSSLTNS
EIRDIILGME ISAPSMQRQQ AAEIEKAQQD QAQLTAVTTK TQNVSGEEMI VTTTSAYEQQ
SFASKTEWRT RAIATSNLRT RANNIYISSE DIRDDEHHFT YVMPKNILKR FIAIADLRVQ
VAGFLYGTSP PDNKQVKEIK TIVMVPQVGS TRDIQLPRSL PENEMLHGLE ALGVIHTAAG
NETNYMTAQD VTQHAKLMAA HPTWDRKTVT MTVNFTPGSV SLSAWSLTPQ GYQWGAENKD
LGSDQPAGFS TAFGEKSQLL LSDKIRGYFL VPEDERWNWS FLGSGFGERE KGRVFVQVGI
PRRFYDDLHR PIHFQNFAEL EDVWVDRSDN LA
//