ID Q2HAJ0_CHAGB Unreviewed; 606 AA.
AC Q2HAJ0;
DT 21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2006, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=Nop domain-containing protein {ECO:0000259|PROSITE:PS51358};
GN ORFNames=CHGG_02764 {ECO:0000313|EMBL:EAQ90829.1};
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901 {ECO:0000313|EMBL:EAQ90829.1, ECO:0000313|Proteomes:UP000001056};
RN [1] {ECO:0000313|Proteomes:UP000001056}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970
RC {ECO:0000313|Proteomes:UP000001056};
RX PubMed=25720678; DOI=10.1128/genomeA.00021-15;
RA Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT "Draft genome sequence of the cellulolytic fungus Chaetomium globosum.";
RL Genome Announc. 3:E0002115-E0002115(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the PRP31 family.
CC {ECO:0000256|ARBA:ARBA00005572}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408030; EAQ90829.1; -; Genomic_DNA.
DR RefSeq; XP_001229280.1; XM_001229279.1.
DR AlphaFoldDB; Q2HAJ0; -.
DR STRING; 306901.Q2HAJ0; -.
DR GeneID; 4388369; -.
DR VEuPathDB; FungiDB:CHGG_02764; -.
DR eggNOG; KOG2574; Eukaryota.
DR HOGENOM; CLU_026337_2_0_1; -.
DR InParanoid; Q2HAJ0; -.
DR OMA; IGNGPMD; -.
DR OrthoDB; 4493115at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000244; P:spliceosomal tri-snRNP complex assembly; IEA:InterPro.
DR Gene3D; 1.10.287.4070; -; 1.
DR Gene3D; 1.10.246.90; Nop domain; 1.
DR InterPro; IPR042239; Nop_C.
DR InterPro; IPR002687; Nop_dom.
DR InterPro; IPR036070; Nop_dom_sf.
DR InterPro; IPR012976; NOSIC.
DR InterPro; IPR027105; Prp31.
DR InterPro; IPR019175; Prp31_C.
DR PANTHER; PTHR13904; PRE-MRNA SPLICING FACTOR PRP31; 1.
DR PANTHER; PTHR13904:SF0; U4_U6 SMALL NUCLEAR RIBONUCLEOPROTEIN PRP31; 1.
DR Pfam; PF01798; Nop; 1.
DR Pfam; PF09785; Prp31_C; 1.
DR SMART; SM00931; NOSIC; 1.
DR SUPFAM; SSF89124; Nop domain; 1.
DR PROSITE; PS51358; NOP; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001056};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 272..390
FT /note="Nop"
FT /evidence="ECO:0000259|PROSITE:PS51358"
FT REGION 1..82
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 392..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 576..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..66
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 606 AA; 64458 MW; C399C111F6CE86B6 CRC64;
MSTLADELLQ DFEDSGSEAG GDEHDDGLFN EAGFSGGARA DGGDIAMEEL RDEDADENED
ADMMDEADGS TTPAANGDDE KANIEKMQLG GVRDVRAVAG LMKTLTPVLE KIAHYQSQPA
EAIDNVGSVE DHPEYHLLTQ SNGLSTQIDN EIVLVHKFIR DHYSVRFPEL ETLITNPLEY
AKAVAILGNG PMDSESIKSL QTSTDNPLGM TLKSVLDGPS LMIVTVEATT SKGQAMSPEQ
LQRVVQACEM VIALDKAKKT LTEYVQSRMN IFAPNLTALI GSLTAAQLLN QAGGLTGLSK
APACNLPAWG SKKQASAALA TNVGIRHQGF IFQSPVIRTI PSDIKKQAIK MFANKIVMCA
RTDCFHQFRD GSEGERLKDE CLDRLDKLQQ KPLSKGARAL PAPDDKPSRK RGGRRARKAK
EATAVTELAK AQNRVAFNKE ELEVGYGAGD STRGMGMIGQ RDDGRLRVTQ IDNRTRAKLS
AKSKGWGGAS SLTSGSASSL RGLAGGTGVS NLSLASSKGL RTSGVGTTLG SGSATAGTVS
SLAFTATQGL ELVDPKVQAE LSRKRKADDD RWFKSGAFTQ VGGGNDGFKK PALPPSKKLD
TGSTKQ
//