ID W9VUR5_9EURO Unreviewed; 1812 AA.
AC W9VUR5;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=A1O7_08802 {ECO:0000313|EMBL:EXJ55871.1};
OS Cladophialophora yegresii CBS 114405.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae;
OC Cladophialophora.
OX NCBI_TaxID=1182544 {ECO:0000313|EMBL:EXJ55871.1, ECO:0000313|Proteomes:UP000019473};
RN [1] {ECO:0000313|EMBL:EXJ55871.1, ECO:0000313|Proteomes:UP000019473}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 114405 {ECO:0000313|EMBL:EXJ55871.1,
RC ECO:0000313|Proteomes:UP000019473};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Cladophialophora yegresii CBS 114405.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXJ55871.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGW01000006; EXJ55871.1; -; Genomic_DNA.
DR RefSeq; XP_007760981.1; XM_007762791.1.
DR STRING; 1182544.W9VUR5; -.
DR GeneID; 19183366; -.
DR VEuPathDB; FungiDB:A1O7_08802; -.
DR eggNOG; KOG1070; Eukaryota.
DR HOGENOM; CLU_000845_0_0_1; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000019473; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05697; S1_Rrp5_repeat_hs5; 1.
DR CDD; cd05706; S1_Rrp5_repeat_sc10; 1.
DR CDD; cd05707; S1_Rrp5_repeat_sc11; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 9.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 6.
DR SMART; SM00386; HAT; 5.
DR SMART; SM00316; S1; 12.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 10.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 11.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000019473};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 150..253
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 269..338
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 452..526
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 543..617
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 637..706
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 821..892
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 929..1005
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1037..1108
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1124..1193
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1218..1287
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1307..1378
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1408..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1452..1532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..44
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..131
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1408..1436
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1499..1523
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1812 AA; 199256 MW; 162B46F51D00F3A3 CRC64;
MASAKRKPEA APSKQPSKRP KTDLSSKAEA SVKEQKKDQK PKQASVLTKE QPAFPRGGAS
LLTPLERKQI RAEAARDADR EQKPTEDLFG GVDRALDSES DKGEKVKSEI QPKRRERQRT
KGKKKPDSAA VRTKAEVQIG GLSYKRITTA SLILAQITFI SPRTLTVALP NNLVGYVPLT
AISPQLSDKI QGLLEKNDDE RDNSDDEEEE DDDISLTNYF HLGQYLRVAV TSTEQERRGQ
TSSARKRIEL SVEPALTNAG LNRPNLAIGA TVQASVSSVE DHGLAVDVSL EDDHVRGFVP
SKQLPSGWSL PDIKTGAVFL CHVLDAGSDN KVVKLSADLS KSAALKTAPS VDTFLPGTKA
EILISTMTEA GLSGKIMGML DVTADVLHSG SFRDREAFLA KFQVGKKIAG RIIYNFPLSD
NKKLGFSVLR NILDLTDVPE STDAGAGRLA LSSTIEAASV IRVEPGLGVY LQLDEQNIGF
AHLSRLSDKK LDAISELSGS FKLGSEHKAR VLEFNPVDNL YIVSLQESVL QQPFLRYEDV
PLGAVVKGTI EKLVIGESGV EGLLVILAEG ITGFVPKLHL SNVKLDHPEK KFRAGQAVTA
RVLTSNPARR RLRLTLKKAL VTNDQKAWLR FEDIEVGDST VGTLAKVDAL GAVVRFFGSV
KGFLPVSEMS EAYIKDARDH FRVGQVVSVS ALSINAAEKR MTLSCRDINR SNASIESSLS
RLLPGIITNG TVFEKSENDI LLRLEENDAI ARLTLDHVAD GSLKKRQSAF SNIRVNQTLE
SILILQVQPK RRIVLISNKR SLVSATRAGM FLKGYEQLQL GVVVTGYVKN ITEDGVFVGF
AAGITGLIPR SQVSEESEDQ ENFGMTEMQP VTAKVLNIEY KGAAPRFWLT MREAGPVTKS
EVPPTEPASF KLVNPVDASL TIVTDVSVGM KTKAKVISVK DTQINVELAK DVQGRIDVSE
VFDEWKDIKD RKKPLKPFYP QLELTVRVLG AHDTRNHRFL PLTHRKGKNT VFELTCKPSS
IATEKLPQLA LQDMTVGSSH VAFVNNITDD CVWVNISPTI RGRIRTIDIA DDLSLAANLP
RNFPLGSAVR VRVLAVDPEK GHLDLTARSD GTAGSLDYGN IALGLVLPGR VTKVTDRNIF
VQLSEHVAGS VELVDMADNY DEANAAKYQK NDIIRVTIVA VDVPNKRVSL STRPSKVLSS
SMGVVDREIA SVNRVAVNDI VRGFVKNVSD QGVFVTLGRG VVAFVRISNL SDSYLKEWKD
HFQRDQLVRG RVIMVDEASG QIQMSLKESA LKPDFKLPIT FNALKVGDLV TGQVVNVQPF
GVFILVDNSE KIRGLCHRSE IAEQRVEDPG NLFAVGDKVK AKVLKLDPAT RKVNFGMKAS
YFNDPTEEEA ESDADSESQG GAILDAEMPD AEADDSGSDD EVDEPELGDD DSHVGGDDEV
SDMLDIQIHP DINDTEDEDE DGATAMSAEA SKSTALSVGG FDWTGLRPAP ATSSKRLAEA
SDSDADTPKK KPKKKKNQPT MDVDRTADLD VNGPQSADDF ERLLLAEPDN SVLWLRYMAF
HLDLGDIDAA RAIGERALRT ITVGLEDEKF NVWVALLNLE VHFGDDENVE EETFRRACEV
NDQQEIHARL ASIYIKEGKF GKATDLFERM IKKFAQDPKV WVNYASFLLD KMGDNTGREK
AHALHARALQ TLPKFTHFET TKSFARLEFT SAHGLPERGR TLFEGLIAAF PKRVDLFDVL
LDLEMNVTKD DEQVRATFER IFTLKLKPKQ ARSFFKRWDR FETDKGDERR VEAVKARAAA
WVKENANAKS SD
//