ID W6MFS3_9ASCO Unreviewed; 1702 AA.
AC W6MFS3;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=KUCA_T00000444001 {ECO:0000313|EMBL:CDK24481.1};
OS Kuraishia capsulata CBS 1993.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetales incertae sedis; Kuraishia.
OX NCBI_TaxID=1382522 {ECO:0000313|EMBL:CDK24481.1};
RN [1] {ECO:0000313|EMBL:CDK24481.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CBS 1993 {ECO:0000313|EMBL:CDK24481.1};
RA Genoscope - CEA;
RL Submitted (DEC-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:CDK24481.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CBS 1993 {ECO:0000313|EMBL:CDK24481.1};
RA Morales L., Noel B., Porcel B., Marcet-Houben M., Hullo M-F., Sacerdot C.,
RA Tekaia F., Leh-Louis V., Despons L., Khanna V., Aury J-M., Barbe V.,
RA Couloux A., Labadie K., Pelletier E., Souciet J-L., Boekhout T.,
RA Gabaldon T., Wincker P., Dujon B.;
RT "Complete DNA sequence of /Kuraishia capsulata/ illustrates novel genomic
RT features among budding yeasts (/Saccharomycotina/).";
RL Submitted (FEB-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG793125; CDK24481.1; -; Genomic_DNA.
DR STRING; 1382522.W6MFS3; -.
DR HOGENOM; CLU_000845_0_0_1; -.
DR OrthoDB; 167902at2759; -.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05706; S1_Rrp5_repeat_sc10; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 10.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 4.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 13.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 11.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50126; S1; 11.
DR PROSITE; PS50005; TPR; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT DOMAIN 101..208
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 224..289
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 408..475
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 492..564
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 585..654
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 668..747
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 770..839
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 867..943
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1059..1128
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1146..1214
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1234..1305
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REPEAT 1561..1594
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REGION 1..30
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 143..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1323..1389
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1401..1437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..165
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1336..1360
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1374..1389
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1412..1429
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1702 AA; 189028 MW; BBC9CAF44F121546 CRC64;
MSVSQKRKRS DSQTESKTSI DSTEISFPRG GGSVLTALEV KEISNEATRD VLFESKKQHQ
ASAGVVKKRK GTKGTAVKVA TVEKEKDVTV DHLDFKTLVP GSQVLGQIVQ INKMDLVISI
ADNLVGFVPI TSISQEITEQ LEALQESVES SDEDSEEEED NEEEEPDKKN FPNLSEIFQL
GQWLRATVIT NETKTKKKSK KRLELSIEPS IVNSNVAEDD ITQGAIIQVS VKSVEDHGVI
LNTGKGTNGF ISKKELDAGL VNIQDLHPGS VILVSIASIN SRTITCKIPH GQLKKQGSVS
TIDSITSIIP GMLVDALITG LKKDGIVAKV FGLADSSIGM THLGVYDFSK LKEKHQIGSK
IKARVIASFV LRGEAKLILS TLPHVLFLNK ESWDKETKSA PLESYPIGKV VDVTIIGKDG
TYIYTQVEGS LLGQVHYSRV SAKADLDMDF KIGSEHKARV LGYSWADNAL TLTLDKKQIE
QKYLRVQDIP IGEAVVGEVV RLITDNGLIL KIFDDFEAFV PRSHMSDIKL IYPERKFKIG
SMVKARVLKL MYKRGKASIL CTLKKSLVVT EDEDIVASFE DAQIGKRTPA TVERMQPNGC
IVAMFGQMKA FLPNSEISET FVKRPEDHLR LGQTIRVRII SSDSEAKRTV VSCRISDSLS
QTQLSAIEEM IPGKSVIKVS VVEKEKDNAI VEMEDSGLRG IIAFGHLSDK NYEQNRALLK
KTEIGSTLEA VVLMKDRKSR ILTLSAKESL VKAAKEGLLP ASFEEISIAP TTLHGFVSGI
TNSGVFVAFA NKLTGLVLPK YASDKFVDNL NSLFYVSQSV SCHVIRTDIE NKRFLLSLKS
KETDKDSEPA INPVDSKVKT ISEFVPGLIT KAQIKGIKAT QLNVQLADNQ QGRIDVTQLF
DSYDDIKNPK RPTDQFKKGD VIDVKIIGFH DTRNHKFLPV TNRKTKQVVL ELSAKKSDLG
LGAVKPLTLS NVKEGEKHLV FINNSARGFL WAGISPVIKG KLSLMDLCDD ASLLENLEEE
FPVGCALMAT IKEIEVENAT VTFSARSKPI HSIADISVGD VVPAKVLKTR ESYLLIQLCD
TVNAISFITD ALNDYSEKLD SVFSTNDICA AKVLEIDSDS SKVYVSVRTQ DTKDKLITSL
EDVKRGDVLR GFVKAVTDKG VLIALGRNIH AFVKVSDLSD SYIKEWKKTF HVHDPVQGKI
IEAKEESRIL MTLKDSEVNG ELNTLKRFED VEVGEIYEGS VRRITDFGVF VKLDGTLNIT
GLCHHSEISD KKISSIESVF GEGDRVKVKL LAKDDAKKQL SLGMKASYFI DEVAADEDIE
MEDNAVESSE QSEEEKVGDN DSEDELMEEA YAEQESGESD SESDKEVSKE SAGPEAQVKS
QGGLSTNGFD WTASILDQTM ANDESSDEED MDDEKPQKKK RRASVVKDKT AEMNTRAPQS
VSDFERLLIG NPNSSILWMN YMSFQLQLSE IEKAREIGSR ALNTINYREE QEKLNIWIAL
LNLENSFGSD ETLEDTFKRA CQYMDSFTIH QKLVGIFSIS ERNDKAEELL KTMTKKFGSE
VSVWVSYASF LLDAGRHDDA REVLARALQV LPKRSHIETV RKFAQLEFSK GDSEQGRSLF
EGLITDAAKR IDLWNVYIDQ EIKRGEKAKA EMLFERVLNR KLSRTQAKFF FNKWLNLEED
KGDDKGAEYV KAKAAEYAQN NQ
//