ID A0A072P913_9EURO Unreviewed; 1828 AA.
AC A0A072P913;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=A1O9_08504 {ECO:0000313|EMBL:KEF55753.1};
OS Exophiala aquamarina CBS 119918.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; Exophiala.
OX NCBI_TaxID=1182545 {ECO:0000313|EMBL:KEF55753.1, ECO:0000313|Proteomes:UP000027920};
RN [1] {ECO:0000313|EMBL:KEF55753.1, ECO:0000313|Proteomes:UP000027920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 119918 {ECO:0000313|EMBL:KEF55753.1,
RC ECO:0000313|Proteomes:UP000027920};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Exophiala aquamarina CBS 119918.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KEF55753.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGV01000007; KEF55753.1; -; Genomic_DNA.
DR RefSeq; XP_013258343.1; XM_013402889.1.
DR STRING; 1182545.A0A072P913; -.
DR GeneID; 25283416; -.
DR VEuPathDB; FungiDB:A1O9_08504; -.
DR HOGENOM; CLU_000845_0_0_1; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000027920; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05697; S1_Rrp5_repeat_hs5; 1.
DR CDD; cd05698; S1_Rrp5_repeat_hs6_sc5; 1.
DR CDD; cd05706; S1_Rrp5_repeat_sc10; 1.
DR CDD; cd05707; S1_Rrp5_repeat_sc11; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 10.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 6.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 12.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 11.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 11.
DR PROSITE; PS50005; TPR; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000027920};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT DOMAIN 162..266
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 282..351
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 464..538
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 555..629
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 649..718
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 833..904
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 944..1020
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1052..1123
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1139..1208
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1233..1302
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1322..1393
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REPEAT 1632..1665
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REGION 1..141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 905..929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1401..1544
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..28
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 36..61
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 82..113
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1401..1473
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1505..1519
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1530..1544
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1828 AA; 199728 MW; 19F2AD1E4680DF97 CRC64;
MSSGKRKFST EERSAKPASK RSKPDPEAQP VKSSKAPKLD KTNPSPARKA STPQAASVLT
NEQPAFPRGG ASLLTPIERK QIQAKASRDA AKEFKKAQDL FGNSGSKEED TSDDDGNSAP
ELEPSKAVPK KSRKEKFRKN KAEIDGPEIR ISGLSYKRIT TGSLVLGQIS GISTRSLTVG
LPNSLVGFVP LTALSAQLNE KIQSLLDSGV ENGATSDGEE EADDKDIDLT EYFRVGQFLR
AAVTSTEQER TGPKTQSRKR IELSIEPALT NIGLSQTNLT AGTTVQASVS SIEDHGLAVD
LSLDEGSARG FILKKHLPEG LHLADVKTGA VFLCSVLESS PNAKVIKLSA NLSNAATLKT
APSVDAFLPG TKAEILISDK TDAGLAGKVM GLLDVTADVV HSGSFKDREA FSAKYEIGKK
VTGRLICTFP LSDNKKLGFS LLDHLLDIDS LGTSTSGHNK LSISTIIDSA SVVRVEQGLG
IYLKLAGDSI GFAHVSRLSD KKVDSLSELA GAYKLGSEHK ARILEYNSVD DLYIISLQPS
ILQQPFLRVD DVPIGAVVKG KIEKLLIGAE GVKGLIVNLA EGVTGLVPQI HLSDVELKHP
ERKFKEGQTV TTRVLSTDVV RRRLRLTLKK TLVNSDQKAW SNYDGIEVGD STVGTLVKVD
RVGAIVQFFA SVKGFLPVSE MSEAFIKDAT EHFRVGQVVT VNAISVNSSE RRLTVSCRDV
NTTNPSIETS LAELHAGTIT SGTVFEKSQD DILLRLEGSD AIARLVLDHV SDGSLKKRKA
AFAKVRVGQK LENLLILQVL AKRRLVLVSN KQSLIKAATE GTLLKSYEQL RENGVYTGVV
SNVVDNGIFL SFAAGISGFI PKGQVPVETE TEPDFGMSRL QPVTARVTNI DYKGATPRFW
LTMREGKSKP ESTPTEKAGG HPEAIPLSEP VDPSVTVLGD LHVGRITKAR IVSIKETQLN
VELAKDVQGR IDASEAFDAW EEINDRKKPL RQFSAKQELT VKILGAHDTR NHKFLPLSHR
NSKHTVYELS CKASAIKHPN AENLSLQSIS SGSRWLAFVN NISDSGLWVN ISPAVRGRIR
ATDVSDDLSL VADLNTNFPL GSALRVQVLS VDAEKNRLDL SAKSDNSASS ITLKDISAGL
VLPGRITKIS DRNIIVQLSD QVVGVVDLID MADDYSEANP AKYQKNDVLR VCVLRVDVPN
KKVTLSTRPS KVLSSSLPVN DAEITSISQL SVNDVVRGFV RNVADKGVFV TLGHNITAFV
RVSNLSDSFI KEWKDHFQRD QLVKGKVILV DEASGNVQIS LRESALDPNY KAPLNFNDVK
LGDVVSGKVV KAEAFGVFIL IDNSENVRGL CHRSEIAEQR IEDASKLFSE GDKVKAKVLK
LDPVTRRVNF GMKASYFADE LDEDSVDDSE DAASEDESLA DGGVDLDEDM ERAEEDEDDD
EMELENDQED SMDDEDDGDA GSNIEDDDPE VETTSETVSS TKPKGLNVGG FDWYGLSEPN
PSSKRPASDS EEDNLTAKAS KKKKKKRAKI QVDHTADLDR DGPQSVDDYE RLLLSEPDSS
LLWLQYMAFH LDLGDADQAR SIGERALKSI GLGQEAEKLN VWIALLNLEN AYGDDESIDA
IFKRACEYND PQEVYSRLAS IYIQSGKRDK ADDLFQRMLK KFTRDPKAWI NYVTFLFETG
VDSAEKARAL LPRALQTLPK FTHFDLTLKF AQLEFKQSSG TPERGRTIFE GLLSAFPKRV
DLFNVLLDLE LKLDDNEAQV RAVFERIFSG DTAAGARKLK PKQAKYFFKR WLAFEESVAE
RSGDGDESKI EAVKARAAEW VRSTPTTN
//