ID A0A066X7R2_COLSU Unreviewed; 2369 AA.
AC A0A066X7R2;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 03-MAY-2023, entry version 39.
DE SubName: Full=Putative PROCN domain-containing protein {ECO:0000313|EMBL:KDN65007.1};
GN ORFNames=CSUB01_05544 {ECO:0000313|EMBL:KDN65007.1};
OS Colletotrichum sublineola (Sorghum anthracnose fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Glomerellales; Glomerellaceae; Colletotrichum;
OC Colletotrichum graminicola species complex.
OX NCBI_TaxID=1173701 {ECO:0000313|EMBL:KDN65007.1, ECO:0000313|Proteomes:UP000027238};
RN [1] {ECO:0000313|Proteomes:UP000027238}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TX430BB {ECO:0000313|Proteomes:UP000027238};
RX PubMed=24926053; DOI=10.1128/genomeA.00540-14;
RA Baroncelli R., Sanz-Martin J.M., Rech G.E., Sukno S.A., Thon M.R.;
RT "Draft genome sequence of Colletotrichum sublineola, a destructive pathogen
RT of cultivated sorghum.";
RL Genome Announc. 2:E0054014-E0054014(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KDN65007.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JMSE01001075; KDN65007.1; -; Genomic_DNA.
DR STRING; 1173701.A0A066X7R2; -.
DR eggNOG; KOG1795; Eukaryota.
DR HOGENOM; CLU_000380_3_0_1; -.
DR OMA; ANKWNTS; -.
DR OrthoDB; 246127at2759; -.
DR Proteomes; UP000027238; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR000555; JAMM/MPN+_dom.
DR InterPro; IPR037518; MPN.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SMART; SM00232; JAB_MPN; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50249; MPN; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000027238};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2136..2268
FT /note="MPN"
FT /evidence="ECO:0000259|PROSITE:PS50249"
FT REGION 1..46
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 56..75
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..43
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2369 AA; 274761 MW; FCD356C5F3493137 CRC64;
MSFPPPPPPG WGPPPPPPPP PSSSLPPPPA APAPPPPGYR PPADPQIAKF AQKKKEWLRN
QRNRFGEKRK GGFVETQKAD MPPEHLRKIV KDIGDVSQKK YTSDKRSYLG ALKFMPHAVM
KLLENMPMPW ESDRKVKVLY HVNGCLTLVN EIPRVIEPVF FAQWAMMWTF MRKEKADRRL
FKRMRFPPFD DEEPPLSWSE NIEDVEPLEP IQMELNEEED AAVYDWFYDH RPLIDTPHVN
GESYRTWNLN LQQMATLFRL SRPLISDVVD KNYFYLFDLK SFLTGKALNV ALPGGPRFEP
LYKDIDPNDE DFGEFNAIDR IIFRNPIRTE FRVAYPYLYN SLPRSVHLSW HSHPQVVFER
SDNPDLPAFH FDRRIHPISS RAVAPRNLTV SHEDELFGPG SNEEPEDEAF ELPAGFEPFL
ADEELTNEDT ASAIELWWAP FPFNRRSGRM VRAQDVPLIK QWYLEHPPSD KPPVKVRVSY
QKLLKNYVLN ELHKKKPKSQ QSQNLMRNLK QTKFFQQTEI DWVEAGLQVC RQGFNMLNLL
IHRKNLTYLH LDYNFNLKPV KTLTTKERKK SRFGNAFHLM REILRLTKLI VDAQVQYRLG
NIDAFQLADG ILYAFNHVGQ LTGMYRYKYK LMHQIRSCKD LKHLIYYRFN SAPVGKGPGC
GFWAPAWRVW LFFMRGIIPL LERWLGNLLS RQFEGRHSKG VAKTVTKQRV ESHFDLELRA
SVMADLMDMM PEGIKQNKVN TVLQHLSEAW RCWKSNIPWK VPGLPAPIEN IILRYVKAKA
DWWISVAHYN RERIRRGATV DKTVAKKNVG RLTRLWLKAE QERQHNHMKD GPYVSSEEAV
AIYTTTVHWL ESRKFSPIPF PSVSYKHDTK ILILALERLR EAYSVKGRLN QSQREELALI
EQAYDSPGTT LERIKRFLLT QRAFKEVGID MNDNYSTINP VYDIEPIEKI SDAYLDQYLW
YQADQRHLFP AWIKPSDSEV PPLLTYKWAQ GINNLDKVWE TENGECNVMI ETELSKVYEK
IELTLLNSLL RLIMDHNLAD YITAKNNVQL TYKDMNHVNS YGMIRGLQFS AFVFQYYGLV
LDLLLLGPQR ASEIAGPPQS PNDFLQFRDK ETETRHPIRL YTRYIDKIWV FLRFTADESR
DLIQRFLTEQ PDPNFENVIG YKSKKCWPRD SRMRLMRHDV NLGRAVFWDL KNRLPRSVTT
IEWDDSFASV YSRDNPNLLF SMCGFEVRIL PKCRNQNDDF SVKDSVWSLV DNTSKERTAH
AFLQVTEEDI QKFNNRIRQI LMSSGSTTFT KIANKWNTAL IALFTYYREA AVSTVSLLDT
IVKCETKIQT RVKIGLNSKM PSRFPPAVFY TPKELGGLGM ISGSHILIPA SDKRWSKQTD
TGVTHYRAGM THDEETLIPN IFRYIIPWEA EFIDSQRVWT EYSQKRMEAN QQNRRLTLED
LEDSWDRGLP RINTLFQKDR STLSFDKGFR ARAEFKIYQL MKNNPFWWTS QRHDGKLWNL
NAYRTDVIQA LGGVETILEH TLFKATGFPS WEGLFWEKAS GFEESMKFKK LTNAQRSGLN
QIPNRRFTLW WSPTINRANV YVGFQVQLDL TGIFLHGKIP TLKISLIQIF RAHLWQKIHE
SVVMDLCQVF DQELESLGIE TVQKETIHPR KSYKMNSSCA DILLFASHKW NVTRPSLLFD
TKDVIEPTTT NKFWVDVQLR YGDYDSHDIE RYTRAKYLDY TTDSASIYPS ATGLMIGIDL
AYNLYSAYGM YFPGLKVLIQ QAMAKIMKAN PALYVLRERI RKGLQLYASE SNQEFLNSQN
YSELFSNQTQ LFIDDTNVYR VTIHKTFEGN LTTKPINGAI FIFNPKTGQL FLKIIHTSVW
AGQKRLGQLA KWKTAEEVAA LIRSLPVEEQ PKQLIVTRKG LLDPLEVHLL DFPNISIRAS
ELQLPFQAAM KVEKLGDMIL RATEPQMVLF NLYDEWLKSI SSYTAFSRLI LILRALHVNP
DKTKLILRPD KTVITQDHHI WPSLSDEDWI KVETQLRDLI LNDYGKKNNV NVSSLTSSEV
RDIILGMEIS APSMQRQQAA EIEKQQQEQQ QLTAVTTKTQ NVHGEEIIVT TTSQFEQQTF
ASKTEWRTRA IATSNLRTKA NNIYVSSVDN DLDDVTYVMP NNILKRFITI SDLRVQVAGY
LYGSSAPDND QVKEIKCIVM IPQIGGLRNV QLPQQLPQHE FLKDMEPLGV IHTASGSELP
YMSAADVTEH ARLLDAHQEW DKHNTVTVNV AFTPGSVSLS AWGLTPQGYK WGAENKDTQS
DQPQGFTTAM GEKRKLLLSP RFRGFFLVPE NNMWNYSFMG SAFAGMEKKS IHVKLDTPLP
FYSDQHRPVH FHSFAELEDI WVDRTDNFA
//