ID A0A1V6PI20_PENDC Unreviewed; 1803 AA.
AC A0A1V6PI20;
DT 07-JUN-2017, integrated into UniProtKB/TrEMBL.
DT 07-JUN-2017, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=PENDEC_c004G01694 {ECO:0000313|EMBL:OQD76695.1};
OS Penicillium decumbens.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium.
OX NCBI_TaxID=69771 {ECO:0000313|EMBL:OQD76695.1, ECO:0000313|Proteomes:UP000191522};
RN [1] {ECO:0000313|Proteomes:UP000191522}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IBT 11843 {ECO:0000313|Proteomes:UP000191522};
RX PubMed=28368369; DOI=10.1038/nmicrobiol.2017.44;
RA Nielsen J.C., Grijseels S., Prigent S., Ji B., Dainat J., Nielsen K.F.,
RA Frisvad J.C., Workman M., Nielsen J.;
RT "Global analysis of biosynthetic gene clusters reveals vast potential of
RT secondary metabolite production in Penicillium species.";
RL Nat. Microbiol. 2:17044-17044(2017).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OQD76695.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MDYL01000004; OQD76695.1; -; Genomic_DNA.
DR STRING; 69771.A0A1V6PI20; -.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000191522; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05696; S1_Rrp5_repeat_hs4; 1.
DR CDD; cd05697; S1_Rrp5_repeat_hs5; 1.
DR CDD; cd05698; S1_Rrp5_repeat_hs6_sc5; 1.
DR CDD; cd04461; S1_Rrp5_repeat_hs8_sc7; 1.
DR CDD; cd05706; S1_Rrp5_repeat_sc10; 1.
DR CDD; cd05707; S1_Rrp5_repeat_sc11; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 11.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 5.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 12.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 11.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50126; S1; 10.
DR PROSITE; PS50005; TPR; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000191522};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT DOMAIN 153..252
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 457..532
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 549..623
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 643..712
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 827..896
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 937..1013
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1045..1116
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1130..1199
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1224..1293
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1313..1384
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REPEAT 1604..1637
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REGION 1..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..918
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1385..1514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..52
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1391..1449
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1480..1494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1803 AA; 198499 MW; C8CF4AAF928ED835 CRC64;
MAPIKRKGNA PEETAARQPQ KRAKVGAEES KKESKPQDTT SKASELSVLR DDEPSFPRGG
GSVLTPLERK QIQIQANKDV LFEQKGQKKS TREFDDDNDD DEDDVEMEDG DNTTTAKKPR
KKKAKGKKSG DKGEPEKEGV RIEPLTFKRI VPGSMILGQV SSINAHNVAL SLPNNLTGYV
PLTSVSKVLD DKIEKLLNED DEDNEDSSGE ESFDLKDFFY LGQYLRAYVV STGTEASEAG
GRSKKRIELS LDPRQTNTGL KKSDLVVESA IQASVVSVED HGLVMDLGIE GSDLKGFMSS
KEIDPHVDYT KIKEGSVFLS VVTGQNASGN VIKLSGNLQS AASIKKSHFL SAAPTINSYL
PGVAAEILLT EVTENGMMGK IMGMLDTTVD LVQSGANDGK VDLVRKFKIG AKIKARIVST
FPSAEPFKVG FSMLDHIVKL SPDAHGPGSS EDAPALSAIV PEAKVVKVDP GMGVYVQVGE
SKHLGFVHIS RLSDGKVESI SAESGPFKVG TTHEARIVGY NNLDNLYFLS FEKTVIEQPF
LRIEDVTVGA TVKGKVEKLL VGPTGIDGLI VSLADGITGL VPSMHFADTQ LQFPEKKFRE
GLKISARILS VNLEKRQIRL TLKKSLLNSE SAIWKDYKKI TPGSQSPGTI VNLQAHGAVV
QFYGTVRGFL PVSEMSEAYI KDPAQHFRLG QVVNVHALSV DASQGRLAVS CKDPSTFTEK
YRKAFEDMHP GILVSGSVFE KSNDDVLLKL DDSGLVARLD VQHIIDGSPS KQSATLSKIR
VGQKLNELLV LDIQRAHRLI RVSSRASLKK AAKQGSMPRK FEEVQEGSTV TGFIRNITPA
GLFVEFLGDL TGLIPKRLIE DANLSQPEFG MSKGQTVSPT VHSVDIDFQR FILSLKPSEA
THAGPKKTKT KTPTQTDDAL ANPIDENIST ISDLAFGRVV RCKVASIKAT QVNVQLADNI
QGRIDVSEIF DKWEDLKDRK HPLRFFKAKE VLSARILGVH DARSHKFLPI SHRTGKYPVF
ELSLKPSFMQ AANPSPLNME QVKVGSSWMG FVNNVADDCL WINLSPNVRG RLRFMDASDD
LSLLTDVEKN FPIGSALKVK VTAVNADRGH LDLTAKQGYD KLSFGDISVG MILPGRVTKV
TEKQLIMQLG EALVGAVDLI DMSDDYSKAN PTVHQKNEVL RACVVAVDKA HKKISLSLRP
SKVMSSSLPV QDREITSMKE LKPNDVVRGF VKRVTDSGLF VTIGHDVTAY VRISELSDSY
LKEWKDSFQA DQLVKGRITL VDAEQGKLQL SLKESVLDPN YKDPVTIQDL KPGQVVTGKV
RKVEEFGAFI VVDGSRNVSG LCHRSEMAEK RVEDARTLYN EGDAVKAKII NIDREQNKIS
FSLKASHFQD EEDEDSEDED MSDELGGVEL AGSGEDEDED DDDVSMGGVD VEDDSEDESE
DEDEESDEDI AMTGTKPSKS GGLGADGFDW SGNIQAEDKA AAYDSDDEDS KKKKKKHRKP
EIQVDRTGDL DANGPQTVAD YERLLLGEPD SSLLWLQYMA FQLELGEVEK ARTIGERALR
TITMGQDSEK LNIWVALLNL ENTYGSDDSL EEIFKRACEY NEPAEIYDRM ISIYIQSGKN
EKADELFKQA MKKKISNQSP KFFLNYASFL FDAMAAPERG RGLLSRALQS LPAHTHVETT
AKFGQLEFRS PNGDVERGRT VFEGLLSSFP KRIDLWNILL DLEIKTGDAE QVRRLFERVL
GLQNKKGNIS IGASKKVKPK QAKFLFKKWL AFEEKLVAEG GNEKMVEDIK ARAAAYVKSL
QEE
//