ID G0SBU2_CHATD Unreviewed; 531 AA.
AC G0SBU2;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=Pre-mRNA processing factor 4 (PRP4)-like domain-containing protein {ECO:0000259|SMART:SM00500};
GN ORFNames=CTHT_0054790 {ECO:0000313|EMBL:EGS18868.1};
OS Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719)
OS (Thermochaetoides thermophila).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Thermochaetoides.
OX NCBI_TaxID=759272 {ECO:0000313|Proteomes:UP000008066};
RN [1] {ECO:0000313|EMBL:EGS18868.1, ECO:0000313|Proteomes:UP000008066}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1495 / CBS 144.50 / IMI 039719
RC {ECO:0000313|Proteomes:UP000008066};
RX PubMed=21784248; DOI=10.1016/j.cell.2011.06.039;
RA Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., Devos D.P.,
RA Arumugam M., Bork P., Hurt E.;
RT "Insight into structure and assembly of the nuclear pore complex by
RT utilizing the genome of a eukaryotic thermophile.";
RL Cell 146:277-289(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL988045; EGS18868.1; -; Genomic_DNA.
DR RefSeq; XP_006695813.1; XM_006695750.1.
DR AlphaFoldDB; G0SBU2; -.
DR STRING; 759272.G0SBU2; -.
DR GeneID; 18259517; -.
DR KEGG; cthr:CTHT_0054790; -.
DR eggNOG; KOG0272; Eukaryota.
DR HOGENOM; CLU_000288_57_20_1; -.
DR OMA; LNEPICY; -.
DR OrthoDB; 5476798at2759; -.
DR Proteomes; UP000008066; Unassembled WGS sequence.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 4.10.280.110; Pre-mRNA processing factor 4 domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR014906; PRP4-like.
DR InterPro; IPR036285; PRP4-like_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19846:SF0; SFM DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR19846; WD40 REPEAT PROTEIN; 1.
DR Pfam; PF08799; PRP4; 1.
DR Pfam; PF00400; WD40; 6.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00500; SFM; 1.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF158230; PRP4-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 4.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008066};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 66..114
FT /note="Pre-mRNA processing factor 4 (PRP4)-like"
FT /evidence="ECO:0000259|SMART:SM00500"
FT REPEAT 294..335
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 336..377
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 382..416
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 489..520
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 104..141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..139
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 531 AA; 58409 MW; 691B0E4D2B80A49C CRC64;
MMHPARKAYV EDAEMEDRGG VTLDNIPIDH DYEIPATAAG IPAEKASALL SQFERKRLAA
SIAVPTKDEQ VRAKLRELGE PITLFGEGPA DRRDRLRELL TEQLQKAQKE GSQEGADVEM
KDAQKEEEEE ADEQEEEFYS RGSEELLQAR INIAQYSIPR AKRRIEFQKK EASIPLRTHV
KFRKEIKERL QGFELQGSQA AGDRHVSMVR ISPNGKMVAT GNWGGQVKLI DIPSLEHRQT
LRGHVNKISG LSWMPGATLP ESNISEDTVN LASGGAEGNV LLWSLTKDTP LATLSGHAQR
VCRVEFHPSG RYVASASEDT SWRLWDVETT TELLLQEGHS RGVYAVAFNT DGSLLASAGL
DSIGRIWDLR SGRTVMILDG HTDGHIKPIY GLDWGADGHR VLTASADGWI KCWDVRKVQR
TGGIGAHTST VADVRWFKGL DDPLLGTPPG EDERGNQIPK KSGTFIVSAG FDHKVNIFSA
DDWALVQSLA GHTGPVASAD VSMDGRWIVS GGHDRTVKLW GRNDSAGMYG D
//