ID G3J3E0_CORMM Unreviewed; 936 AA.
AC G3J3E0;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Pre-mRNA splicing factor {ECO:0000313|EMBL:EGX95670.1};
GN ORFNames=CCM_00324 {ECO:0000313|EMBL:EGX95670.1};
OS Cordyceps militaris (strain CM01) (Caterpillar fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Cordycipitaceae; Cordyceps.
OX NCBI_TaxID=983644 {ECO:0000313|EMBL:EGX95670.1, ECO:0000313|Proteomes:UP000001610};
RN [1] {ECO:0000313|EMBL:EGX95670.1, ECO:0000313|Proteomes:UP000001610}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CM01 {ECO:0000313|EMBL:EGX95670.1,
RC ECO:0000313|Proteomes:UP000001610};
RX PubMed=22112802; DOI=10.1186/gb-2011-12-11-r116;
RA Zheng P., Xia Y., Xiao G., Xiong C., Hu X., Zhang S., Zheng H., Huang Y.,
RA Zhou Y., Wang S., Zhao G.P., Liu X., St Leger R.J., Wang C.;
RT "Genome sequence of the insect pathogenic fungus Cordyceps militaris, a
RT valued traditional Chinese medicine.";
RL Genome Biol. 12:RESEARCH116.1-RESEARCH116.21(2011).
CC -!- SUBUNIT: Associated with the spliceosome.
CC {ECO:0000256|ARBA:ARBA00011524}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH126399; EGX95670.1; -; Genomic_DNA.
DR RefSeq; XP_006665547.1; XM_006665484.1.
DR AlphaFoldDB; G3J3E0; -.
DR STRING; 983644.G3J3E0; -.
DR GeneID; 18162359; -.
DR KEGG; cmt:CCM_00324; -.
DR VEuPathDB; FungiDB:CCM_00324; -.
DR eggNOG; KOG0495; Eukaryota.
DR HOGENOM; CLU_007010_0_0_1; -.
DR InParanoid; G3J3E0; -.
DR OMA; DGWAWYY; -.
DR OrthoDB; 655233at2759; -.
DR Proteomes; UP000001610; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR010491; PRP1_N.
DR InterPro; IPR045075; Syf1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR11246; PRE-MRNA SPLICING FACTOR; 1.
DR PANTHER; PTHR11246:SF1; PRE-MRNA-PROCESSING FACTOR 6; 1.
DR Pfam; PF06424; PRP1_N; 1.
DR Pfam; PF13428; TPR_14; 2.
DR Pfam; PF14559; TPR_19; 1.
DR SMART; SM00386; HAT; 11.
DR SMART; SM00028; TPR; 4.
DR SUPFAM; SSF48452; TPR-like; 3.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00023187};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Reference proteome {ECO:0000313|Proteomes:UP000001610};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 12..174
FT /note="PRP1 splicing factor N-terminal"
FT /evidence="ECO:0000259|Pfam:PF06424"
FT REGION 1..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 113..138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 240..262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..89
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..262
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 936 AA; 104274 MW; 35DC3FA75C92B2C2 CRC64;
MTTRRDFLGQ PAPENYVAGL GRGATGFTTR SDLGPARDGP SDDQIKEALA KRAQQLGLAP
EGGAKKGKDE DESGGGGGGG DDERFQDPDN EVGLFAGGLY DKDDEEADKI WEWVDERMDR
RKRQREQREE AERDEYERNN PKIQQQFTDL KRALATVSDD EWANLPEVGD LTGKNRRSKQ
ALRQRFYAVP DSVLAAARDS TEMGTMVTDD GGASSSGETS DGTMTNFAEI GAARDKVLKS
RLEQASRSGN GDAANGSSTS IDPQGYITSL NNMVMPESAT QVGDINRVRE LLQSVVKTNP
NNALGWIAAA RLEELAGKTG AARKTIDQGC ERCPKSEDAW LENIRLNQES NNAKIIARRA
IEANNRSVRL WVEAMRLEHI PNNKKRVIRQ ALDHIPESEA LWKEAVNLEE NPDDAKLLLA
KATELIPLSV DLWLALARLE TPANAQKVLN RARKACPTSH EIWIAAARLQ EQLGQANKVN
VIQRGVQVLA KEQAMPKREQ WIAEAETCEA DGATITCENI IRETLGWGLD EDDDRKETWT
EDARSSINRG RYETARAIYA YALRVFVNSK TLWHAAADLE RAHGSRASLW QVLDKAVEAC
PHSEDLWMLL AKEKWQAGEM DGARLVLKRA FQQNPNNEDI WLSAVKLESE SGHAEQARKL
LAVAREQAPT DRVWTKSVVF ERVHGDADAA LDLVLQALPL FPAAPKLWML KGQIYEALGK
TGLAREAYAA GVKAAPRSVP LWLLYARLEE GAGLTVKARS VLDRARLAVP KSPELWCESV
RLERRAGQLA QARALMARAL HEVPRSGLLY VEQIWHLEAR TQRKPRSLDA IKKVDNDPAL
FVGVARLFWA ERKLDKAQAW FERALALDAA RGDTWAWYYR FLGQHGTEEK RAEVVAKCVS
CEPRYGETWP AVAKKPENAH KSVEELLKLV AEELDQ
//