ID A0A167BW97_9PEZI Unreviewed; 694 AA.
AC A0A167BW97;
DT 06-JUL-2016, integrated into UniProtKB/TrEMBL.
DT 06-JUL-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Prp31 c terminal domain-containing protein {ECO:0000313|EMBL:KZL81817.1};
GN ORFNames=CI238_01717 {ECO:0000313|EMBL:KZL81817.1};
OS Colletotrichum incanum.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Glomerellales; Glomerellaceae; Colletotrichum;
OC Colletotrichum spaethianum species complex.
OX NCBI_TaxID=1573173 {ECO:0000313|EMBL:KZL81817.1, ECO:0000313|Proteomes:UP000076584};
RN [1] {ECO:0000313|EMBL:KZL81817.1, ECO:0000313|Proteomes:UP000076584}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MAFF 238704 {ECO:0000313|EMBL:KZL81817.1,
RC ECO:0000313|Proteomes:UP000076584};
RA Hacquard S., Kracher B., Hiruma K., Weinman A., Muench P., Garrido Oter R.,
RA Ver Loren van Themaat E., Dallerey J.-F., Damm U., Henrissat B.,
RA Lespinet O., Thon M., Kemen E., McHardy A.C., Schulze-Lefert P.,
RA O'Connell R.J.;
RT "Survival trade-offs in plant roots during colonization by closely related
RT pathogenic and mutualistic fungi.";
RL Submitted (JUN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the PRP31 family.
CC {ECO:0000256|ARBA:ARBA00005572}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KZL81817.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LFIW01001556; KZL81817.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A167BW97; -.
DR STRING; 1573173.A0A167BW97; -.
DR Proteomes; UP000076584; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000244; P:spliceosomal tri-snRNP complex assembly; IEA:InterPro.
DR Gene3D; 1.10.287.4070; -; 1.
DR Gene3D; 1.10.246.90; Nop domain; 1.
DR InterPro; IPR042239; Nop_C.
DR InterPro; IPR002687; Nop_dom.
DR InterPro; IPR036070; Nop_dom_sf.
DR InterPro; IPR012976; NOSIC.
DR InterPro; IPR027105; Prp31.
DR InterPro; IPR019175; Prp31_C.
DR PANTHER; PTHR13904; PRE-MRNA SPLICING FACTOR PRP31; 1.
DR PANTHER; PTHR13904:SF0; U4_U6 SMALL NUCLEAR RIBONUCLEOPROTEIN PRP31; 1.
DR Pfam; PF01798; Nop; 1.
DR Pfam; PF09785; Prp31_C; 1.
DR SMART; SM00931; NOSIC; 1.
DR SUPFAM; SSF89124; Nop domain; 1.
DR PROSITE; PS51358; NOP; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000076584};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 353..472
FT /note="Nop"
FT /evidence="ECO:0000259|PROSITE:PS51358"
FT REGION 56..117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 467..506
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 640..694
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 100..114
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 694 AA; 74629 MW; 84ED21ED79BE0A86 CRC64;
MAKATQPRKA SMELKDNFSS LQSYVSRRAD SHHTRKYDQT HATEVATGIM ATLADELLQD
FEDSGSEAED TQQDNDPLGL LDNAPEANGR KISGDMELDG DQSEHEDEDE EMEGVSKSAV
DPVENAEDAK AKVEKMQLGS VNDVRNVASL MQTLEPVLEV SQPPGNTIGS LEAVFTSSLG
RTSPDIVTRT LQKIAHYQSQ PTPTQLLGNI EDHPEYHLLT QSNSLSTLID SEIVLVHKYI
RDHYSIRFPE LETLITNPLE YAKVVAILGN GPLDSENIKK LQHATENPLK VSLRSVLDGP
SLMIVTVEAT TTKGREMSQD ELERVYRACE MVISLDRAKK TLTEYVQSRM NLFAPNLTAL
IGSLTAAQLL NAAGGLTGLA KTPSCNIPAW GSKKGAGAAG LATNIGVRQQ GFLYHSPIIQ
GIPNDLKKQA MRIVAAKLVL AARVDRIHSS PDGSTGEDLK EQCLTRLEKL TEPPPNKGPR
ALPVPDDKPS RKRGGRRARK AKEATAMTEL RKAQNRMAFG QEEKEVGYGT GESTAGMGMI
GQGNDGRIRN LQIDQRTRAK LSQKNKGWGG ATPMNGAASS LKGFGQSANS NIDLRGKGLR
TSGVGTSLGA GAGTASSLAF TPVQGLELVD PKVQAELSRK RKAEEDRWFK GGTFTQVAGS
SSNGGGGFKK PDLPPNKKVD TGAGKMGPPP PRSG
//