ID W2NQD7_PHYPR Unreviewed; 1934 AA.
AC W2NQD7;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=L914_05222 {ECO:0000313|EMBL:ETM50821.1};
OS Phytophthora parasitica (Potato buckeye rot agent).
OC Eukaryota; Sar; Stramenopiles; Oomycota; Peronosporales; Peronosporaceae;
OC Phytophthora.
OX NCBI_TaxID=4792 {ECO:0000313|EMBL:ETM50821.1};
RN [1] {ECO:0000313|EMBL:ETM50821.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IAC_01/95 {ECO:0000313|EMBL:ETM50821.1};
RG The Broad Institute Genomics Platform;
RA Russ C., Tyler B., Panabieres F., Shan W., Tripathy S., Grunwald N.,
RA Machado M., Johnson C.S., Arredondo F., Hong C., Coffey M., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Abouelleil A., Alvarado L.,
RA Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M.,
RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., Murphy C.,
RA Pearson M., Poon T.W., Priest M., Roberts A., Saif S., Shea T., Sykes S.,
RA Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Phytophthora parasitica IAC_01/95.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI691933; ETM50821.1; -; Genomic_DNA.
DR EnsemblProtists; ETM50821; ETM50821; L914_05222.
DR VEuPathDB; FungiDB:PPTG_09583; -.
DR Proteomes; UP000054532; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 11.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 4.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 5.
DR SMART; SM00316; S1; 13.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 10.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50126; S1; 12.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 83..174
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 190..252
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 274..343
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 359..454
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 471..542
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 562..631
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 653..729
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 752..823
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1194..1265
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1279..1352
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1375..1442
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1461..1530
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..30
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1534..1580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1585..1604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1614..1647
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1538..1563
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1564..1580
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1934 AA; 212839 MW; 398A8715D18D0FEB CRC64;
MAVAAAATSF PRGRKPKAPS QKASTSVAAP GEALFGKRGL AVAAEEQPKK KAKKAVKSEK
TGAKDELKAA KATLLSFKTL RKGMLLLGCV RQVTDGQDLM ISLPNKLNGT VALSECSDEF
HEFLQTQKKA KQQDEEEEQL QPLSTIFKVG QFVPCVVLAT GKTDKRKQIQ LSLRTSLIHA
ELSPSSLTKG ASLHATVSSV EDHGAIVNLG IRGVHAFVPR KELATPVHKG QHLLVNVVSM
NTHTNTATVT IDRSQVVKAV TRGDSFTLKQ LVPGMLLNVR VEDVLENGLS VTFLTFFTAT
VEQNHMSLPC ERGWEESYRK GMKARARIIS IDYIAKQITL SMAPHVVHMQ VPESLFSVGD
IIEEATIERI DAGVGMLLSL KNQDEDVEMG DASDKKESTT NAKWKAFAPG YVHISNVSDK
RVDKLEKKFT VGSSIKCRVL GFSPFDAVAS VSCKEHSISQ TVLRHKDLKP GTKVNGKILS
VEPWGILMEI SEGVRGLVTP QHMPAFLLNK KANNGKYKVG KTANARVLHV DLDANKTYLT
MKSGLLSSDL PVLSSFQEAT MGLIAHGFIT KIGEYGVIVT FYNNVYGLVP MAVLQQAGIE
NLEEAYVLGQ VVKARVTRCD PNRKRLMLSF DTTSNSSSNK PTAAPETASK LVGTTITNVK
ITDVETTCFR VQTKDGMEGV LPFVQLTDFP RNTPLVDEIV KRFSAGDVIS EPLLVVSQES
DGVLTLSKKP LLLEFASRKA IVPRTFGDVQ ENAVLIGYVT SVNVSKGVFV KFLNNVVAVA
PKGFHKEQFV AQIDEGMFEI GETVTCSVEK INKEKKQFVV GFQQTNFVLP TNSTNKARPA
FFQAYLREQA SVRNAAEVKK APFALGKTEK AEFVGVRPYG AVFALEKDEE TVTVLVPSVT
ENKEWDEGDS VKLLLTDYDF SKNVYYGAVD ESLVKSGSKK SRKQKQRVKA GGKIAAASVL
AVSPTEKYAV VSFPDPQNAD LLQFGVVQLC DFWCPSQTSS QLGIEVGASI ECRVVQSTLK
SGSNSTPFDD LVLLALEEDE LVTKSKISSH KASFKAPKYT HEDLVLGSIV TGVISGISEN
SMEIRVESHK KAGKVRAVVS IVDVDGIDEK SGHSHPFDRY SVNTTVTGRV IAVTAKGANK
LKPVSEENPA KFHALQLSLR SEDVAGDEKV KDVQRFVRPD WLEGSAGRAL LKEGNSVEGV
VSDQDVDYLT IKLSSNVTGT LSCVEVSEDV GVIREFQDKF PVGKRVKCFL LQVDDEKKTV
DLSVIHSSSA QCKAVVKSGA IVNGVISTKK SAIRPPSIMV QLGAHTFGRV CITELQTKWE
NDMLELPQFA AGKVVRCVVL STSNNHIDLS LREDALANPK EYAKKTSKPA DRSVGDLVPA
IVATTTSSGC FVRVDRHTTA RVMLRDLSDD FVKDPQTQFP SGKLVAGRVT KKSDRGLELS
LKASVVSDDV SVFKWSDLKE GLTVKGTITK VQTYGVFVRI EKTTISGLCH ISEIADEKVT
QPLDQIFSEG DYVKAKVLKV EDRRVAFGLK PSYFEGEESS SEEESDDEEA DSDEDEAEPM
DVDEEEAPMK KHVKKSKPVA MEIDLGDDES SSEEEDASSA APVEFSWDGF SNALSKKTDS
KDDDESSSDE EDEEEATNSS KASKKNRLQS DEWVALREKA LASNEEVPQS ASDYERLLAV
SPQSSYLWIQ FMAFHVSLTE IDLARDVAVR ATSAVSFRDE KEKLNVWVAY LNLEHDFGDD
ASFLRVFKSA LQVNHPKRVY LHLVDLYARA EEHEDVKQTL ATMHKKFRTS KQTWIRSLQY
LVGEKQFAEA AETLQRSLKS LAAHKHLPVI LKYGQLLYEQ GELDKARTIF EGILANYPKR
MDLWNVYLDK EIKFGDVALV RALFERLLAM DFSAKKMKFL FKKYLQFEQD QGDDEHVEHV
KQLAKDFVAS AAAK
//