ID A7TIS6_VANPO Unreviewed; 1725 AA.
AC A7TIS6;
DT 02-OCT-2007, integrated into UniProtKB/TrEMBL.
DT 02-OCT-2007, sequence version 1.
DT 27-MAR-2024, entry version 82.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=Kpol_1043p58 {ECO:0000313|EMBL:EDO17868.1};
OS Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294 / BCRC 21397 / CBS
OS 2163 / NBRC 10782 / NRRL Y-8283 / UCD 57-17) (Kluyveromyces polysporus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Vanderwaltozyma.
OX NCBI_TaxID=436907 {ECO:0000313|Proteomes:UP000000267};
RN [1] {ECO:0000313|EMBL:EDO17868.1, ECO:0000313|Proteomes:UP000000267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 22028 / DSM 70294 / BCRC 21397 / CBS 2163 / NBRC 10782 /
RC NRRL Y-8283 / UCD 57-17 {ECO:0000313|Proteomes:UP000000267};
RX PubMed=17494770; DOI=10.1073/pnas.0608218104;
RA Scannell D.R., Frank A.C., Conant G.C., Byrne K.P., Woolfit M., Wolfe K.H.;
RT "Independent sorting-out of thousands of duplicated gene pairs in two yeast
RT species descended from a whole-genome duplication.";
RL Proc. Natl. Acad. Sci. U.S.A. 104:8397-8402(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS480397; EDO17868.1; -; Genomic_DNA.
DR RefSeq; XP_001645726.1; XM_001645676.1.
DR STRING; 436907.A7TIS6; -.
DR GeneID; 5546121; -.
DR KEGG; vpo:Kpol_1043p58; -.
DR eggNOG; KOG1070; Eukaryota.
DR HOGENOM; CLU_000845_0_0_1; -.
DR InParanoid; A7TIS6; -.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR PhylomeDB; A7TIS6; -.
DR Proteomes; UP000000267; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05706; S1_Rrp5_repeat_sc10; 1.
DR CDD; cd05708; S1_Rrp5_repeat_sc12; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 9.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 3.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 12.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 10.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 10.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000000267};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 126..246
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 514..584
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 611..680
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 700..773
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 798..867
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 900..988
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1008..1079
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1092..1163
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1181..1249
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1269..1340
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 66..106
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 171..200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1370..1397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 173..191
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1725 AA; 193116 MW; 2C9EFF986AC0E500 CRC64;
MAPQGKRKRD DGESPLSRED KTVQPAASSL DRNADEVSFP RGGASALTPL ELKQVANEAA
NDVLFANESS KKKAENSAET GTSDQRKSKK KKVTKKTDDD NEDSEEDIDV TSVIVSHINF
KNLKVGSQLL GQIKYITKED ICVTFADNIS GYVPLTSISD QFTDILEQLD ESMSDSSSDE
EGEDEEEYES SDDVSSDKKP LAKKEVPNLR NYFKIGQWLR CSVSANTALE TQSKKNKKKK
ILLSIEPSVV NILTEEDLNK STTVQCSVKS IEDHGAALDL GIEGITGFIS KKDYLNFDEL
KPGFVFLGNI AKKSGRSVVV NLNFSPKNKV SQISSIDAVI PGQSVDLLCE SVQPTGITGK
VYGLVPGFIS NVHLHTFDRE EIKHKYAIGS NVQGRILASL NNKNGDKVLI ISTLPHITSL
ENKLHEIENL EAFPFGYTFD SASIKGRDSE YLYLALDEDR IGEVHHSKIG NAFESEKISA
RVLGYNSIDN VFQLSTDPET LKLKYVRAAD IKIGELLTGC EIVSVSSKGI QLKIFNNQFS
AFVPPIHISD TRLVYPERKF KIGSKIKGRV LSVDFRGHIL VTLKKTLVNA EENETPLLDS
YKAAQTIKDS NSKTTATVQY FKPNGCVVSF FGGISGFLPN TEISEAFVKR PEEHLRLGQT
ILVKLLQVEK ERSRIIVTCK VSNDKADSQK ETIEKLIPGK SFVEASVIEK TKDSLVVEME
KTGLRGVVYV GHLSDLRIEQ NRAEIKKVRI GSKLNGLVID KDTRTYIFNM TLKESLIKDA
KEGKLPTSYS EVKTLDKSTP LHGYVKSVSD KGIFVAFNGK FVGLVLPSYA VENREIDFKK
SFYMNQSVTA YILRTDDEQE RFLLTLKEPK TKEKKKTSTD TAAINPIDSS IKDLSDFRVG
SIVEGKIKGV KKNQLNIILA DNLHGRVDIA EVFDNFADIK DKQQPLSNFK SGDLIKVKIL
GHHDVKSYKF LPISHSNLKN SVLELSIKPS ELKSSKINIP SIDEVKIDSE LTGFINNFAK
DCVWLTVSPI LKAKLPLFEL SDDGSRFSAI EDSFPLGTAL RVHVTSVDKT HNFVTVSNRS
KSVKSIDDVN VDDILPARII KVNDNYVLLD LGHKITGISF ITDALNDFSS SLHDEYDDKL
NEMVSAKVIS IDRETKKINL SLQNEKASNM KLKSHEDLKQ KDVVKAIIKN VTEKGIFVYL
SSALEAFVPV SKLSDSYLKD WKKFYKPMQS VIGKVISCDD NSHILLTLRE SEVNGELKVL
KGYSDIKVGD IFNGNIKNVT DFGVFVKLDN TVNVSGLAHR TEIADTVPEN LSSIFGVGDR
VKAIVLKTNP EKQQISLGLK ASYFTSAKSK ENSDSDLESD EDEVMADVNY NEESEDEAEI
DDEEPKSSKT PMTTDGLSLS AGFDWTANIL NQAQSDDDLE DEMEDFTEVK RSKHKRSKTK
IIEDKTIDIN ARAPESVSDF ERLIVGNPNS SVIWMNYMAF RLQLSEIDKA REIAERALKT
INFREENEKL NIWIAMLNLE NTFGTEETLE DVFKRSCQYM DSFTMHNKLL SIYQMSEKLD
KAAELFKATS KKFGSEKVSV WVSWGEFLIN NKQAQEARSV LASALKSLPK RNHVELVRKF
AQLEFAKGDP ERGRSLFEGL IADAPKRIDI WNVYIDQEIK IGEKKKAEDL FERVINRKIT
RKQAKFFFNK WLQFEESQND EKTVSYVKAK ATEFAENNPK ASQTD
//