ID M3K6R4_CANMX Unreviewed; 1711 AA.
AC M3K6R4;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2013, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=G210_5791 {ECO:0000313|EMBL:EMG50930.1};
OS Candida maltosa (strain Xu316) (Yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida.
OX NCBI_TaxID=1245528 {ECO:0000313|EMBL:EMG50930.1, ECO:0000313|Proteomes:UP000011777};
RN [1] {ECO:0000313|EMBL:EMG50930.1, ECO:0000313|Proteomes:UP000011777}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Xu316 {ECO:0000313|Proteomes:UP000011777};
RA Yu J., Wang Q., Geng X., Bao W., He P., Cai J.;
RT "Genome sequence of Candida maltosa Xu316, a potential industrial strain
RT for xylitol and ethanol production.";
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EMG50930.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AOGT01000071; EMG50930.1; -; Genomic_DNA.
DR STRING; 1245528.M3K6R4; -.
DR eggNOG; KOG1070; Eukaryota.
DR HOGENOM; CLU_000845_0_0_1; -.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000011777; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05707; S1_Rrp5_repeat_sc11; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 11.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 4.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 13.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 12.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 12.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000011777};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 103..214
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 318..389
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 411..479
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 496..566
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 587..656
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 670..749
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 771..842
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 873..950
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 984..1055
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1068..1137
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1155..1223
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1243..1314
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 40..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 147..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1317..1432
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 40..55
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..85
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..165
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1322..1384
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1711 AA; 191798 MW; 93A18E71ECB396D8 CRC64;
MSDSSKSVLS NKEFAFPRGG SKPLSALEVK EISNEATKDV LFEQASESTN KRTSSTSDQP
RKKQKKVSKK KSKKQADEED AEGEESAAKV HVEHFTFKNI NSESLVLGQI KSIGKLDITL
ALGDNLVGYI PITSVSTQIT ELLEKFEAES SDEEDEDEDD EDDEEKTVAT VKSKETKEFP
NLSTIFKLGS WLRAKVFSSA EELKTKKKIR LTIEPEAVNK HLETEDLIPN NVLQCSVKSI
EDHGIILDTG IPEYSGFISN KELKTTYDIE SIKPGMVILC SISTKISGRI INLKPIENSI
YKKSAITTIS SIDAIQPGII VDALISDVTE QGLITRVYGL VDGTINLSQV QDFDIKSLKH
KYAIGSTVKA RVLAVLLKNG TKKLILSVLP HVLQLSDDSS EKKSLEAFPI GHVFDSVKVI
GSDNHYIYVN FGSTELYGQI HTSKIDDDKT LLDFNVGSSH KSRVIGFNEV DNLLMLTFES
KAINAKYLRV EDIPIGTYIP SIEIVKVLED GKGIEVKVLD EFNGFVPGNQ MSDIKLVYPE
RKFRVGSKVK GRLLGHQGKK IYITLRKALV NLEDDEILAD FDQAEVGFKT NAIVEKFVPN
GCIVKFFGNL RAFLPKNEIS ETFVDDAKNY LKLNQIVKVR ISDKNDEQKR LVVSLKQSSA
LTESQTTEIN KLISGKSIVN AIIVETGKET VMVELEGSSL RGVVAIGQLS DGNYEQNRSL
AKKLKVGTAL EVLILDKDLK ARTVIATAKK SLVDAAKHNS FPTEFSDIQQ NKVVKGYVKS
VTNLGLFVTF AGKLTGLILA KYINKAGNKD QDLSKKFYKY QSVTCRVINV DKDNERFLLT
LGSDGSGLND KEDEPLVKPV DETKKIVADY TPGVTTKALI KSVKGTQLNV QLADNLQGRV
DITQCFGSIN DIKNQHQPLS SHFNKGEIID VKVIGTHDAK NHTFLPITHR KSNKTTILEL
SIIDATSSDN KPISELKLSD IKPGDKFLGF VNNISSNVIW ISISPAIKGR ISFMDLTEDG
SIFEDIDNKL PIGSALPVTV KEVDLEHHTV SLSSRPKYID SFDEVEKGKT YPGRVFKIKP
NHVLVELGKN IVASSYITDA LDNYSNKLDH EFEVNDYVPA RVLDIDAENK RISVSLRSSS
SEDKTIESIN DLKRGQVVKG FIKNVGNNGV YVSLGRSVFA LVRISDLSDS YLKDWKKFFK
VNQLVTGKIV NCKEEGRILM TLKESEVNGE LKLLKTFEDL QVGDIFEGTV TSSTDFGVFV
KLDGAVNISG LCHHTEIAEQ DVDNVKSLFG DGDRVKVKIL NIDSEKRQLS LGMKASYFAD
GNDNDEEEEE EGEDVEMEDA EDEEDDDEEA SEEEDDVMEV DEEDEEEQSE GSDDDEEEDD
DEEKSLSTGL STNGFDWTAS ILDQAEEHDS SDEEDDEEDF TDGNKKKSKK KKSKTIVEDK
TGDLNTRAPQ STSDFERLLI GNPNSSILWM NYMSFQLQLS EIDKAREIGE RALKTINYRE
EQEKLNIWIA LLNLENTFGT DESLDDVFKK SIQYMDSYTM HQKLINIFIM SEKFDHAKEL
FQKMIKKFGK NISTWVSYGS FLLDQKQEIF EILSKALQVL PKREHIDLVK KFAQLEFNKG
DPEQGRSLFE GLISDTPKKI DLWNVYIDQE IKQDDKSKVE ELFERVLTKK INRKQAKFFF
NKWLNYEEDK QDENMVARVK AKAAEYVQNN S
//