ID E1ZDW0_CHLVA Unreviewed; 1396 AA.
AC E1ZDW0;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=CHLNCDRAFT_145628 {ECO:0000313|EMBL:EFN56095.1};
OS Chlorella variabilis (Green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=554065 {ECO:0000313|Proteomes:UP000008141};
RN [1] {ECO:0000313|EMBL:EFN56095.1, ECO:0000313|Proteomes:UP000008141}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NC64A {ECO:0000313|EMBL:EFN56095.1,
RC ECO:0000313|Proteomes:UP000008141};
RX PubMed=20852019; DOI=10.1105/tpc.110.076406;
RA Blanc G., Duncan G., Agarkova I., Borodovsky M., Gurnon J., Kuo A.,
RA Lindquist E., Lucas S., Pangilinan J., Polle J., Salamov A., Terry A.,
RA Yamada T., Dunigan D.D., Grigoriev I.V., Claverie J.M., Van Etten J.L.;
RT "The Chlorella variabilis NC64A genome reveals adaptation to
RT photosymbiosis, coevolution with viruses, and cryptic sex.";
RL Plant Cell 22:2943-2955(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL433843; EFN56095.1; -; Genomic_DNA.
DR RefSeq; XP_005848197.1; XM_005848135.1.
DR STRING; 554065.E1ZDW0; -.
DR GeneID; 17355227; -.
DR KEGG; cvr:CHLNCDRAFT_145628; -.
DR eggNOG; KOG1070; Eukaryota.
DR InParanoid; E1ZDW0; -.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000008141; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:InterPro.
DR CDD; cd05695; S1_Rrp5_repeat_hs3; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 6.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR003029; S1_domain.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 2.
DR SMART; SM00316; S1; 7.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 5.
DR PROSITE; PS50126; S1; 7.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008141}.
FT DOMAIN 347..416
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 439..508
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 528..597
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 763..832
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 870..935
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1168..1239
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1258..1349
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1065..1092
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1316..1337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1078..1092
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1396 AA; 141339 MW; C0244F6ECF055672 CRC64;
MGKKRPPAGA IEEEETFVRG GGSGLAPVVE RKLKEEAYAE AEADLLSGKK GGKRRKGGQA
VAGGEGGDEE DAFFSGLALQ GKLPKFVELL KFKNLSRGCK LWGAVIEVSP RELVVSLPHG
LRGHVAYAEA SDWLAGQSKA AAAAGAEAAG EDGAAAIAAA AAGKKRKAGT AAATEVVLPP
LTDLFTIGQL VRGTVVALRS GSSGDSESAG KAGAKKAAAA EGGAGGAKKK RVDLSLRVSK
MNAGLGPESL REGLALPACV SSVEDHGYLL ALGVKGVSGF LPKKAAAAAG RALAPGMLLD
VAVPPGGAPK PAGGGGSVLG VVCAPEAVAM AVAREWEGLN IGSLLPGQLV AARVRNVLSD
GLLCSFLTYF SGTVDPFHLG ADLAADWRKQ FSPNQRLRAR ILYVDPASKR VALTLHRHLI
SASLPVNFPM LGQVDVRPGM PVSGTVSRVE EYGLLVALTS SIRALVPVLH ASDVGTAKAL
RKFKAGQTVA GKVLTVDPAT KKVTMTLKPS LVGSKLPPIA RTQDAVPGGR SHGVVTGARD
FGVFVSFFGG VTGLAHVSEC GLAADQKPPE AFQAGQVVKC RVLGADPSRK GLKLSLVTKP
KKAAAAAAET EAAPLAAAAA PGAAGEPAPG GSEAGGSSEA AAAAALASYQ AGELVEGTVA
AVHTKEVDGE AVPAYFELSV SSAGGGSDKA AAAGRLEVAH LADHPAAAAA LVAALTVGSR
LGPLLVLQRL ENVKQLRLTR KASLLTAAAA GLLPSAVEAV AEGALLAGYV ASVTSGAVFV
RFLDGLTGRA GLAQLSDTFV SDPHLFFREG QSVRATVVDA QRQRFSVALK QSLCGSRDAA
YLQSLFSDLE AAEALSNDAA DVDWAAFAIG GIAPGEVHEA KGYGLICDLE AHADVVGLVA
PHQMPAGASR EPGTSVRAVV LDANKREGVV DLSLQPRLVA AAQAAAAAAD AETAPKKKKQ
KKAAVGGKAA AAAELKEGQR LECKVELVKE EAGYCVVTLP AADGSPTGSP LLGFLPTTDF
NLQYQQHQQP PRPGDSLTAS VAALPSPATG GRLLLAAPVG GKPVKPAAAA GGTGAAKRPQ
SDKQQAQRMQ QHATGSCVEA TVGAVHALHA DLDLDKLAVA VLGRVTTAEG RRHSVLECSS
RPEAVSAARA GQALPRHPCP ALAALRPGQQ LQGYVQDVQQ GHVWCAFSPS VRGRAFATQA
ASSIEECERL GKRFKPGQPV QATVVHVDKK RKALDVSLLP AAPEAASEAA AAGAPAPGTV
ALGRVSAVGG GGVRVRLSAR SVGRVALTDI HDGAVEECLA GLQAGQYCQA VVLGPDTSAS
PDSGQEASGR KRGGRSSGAA AGQLLLSLRP SAGGRCAAHG AAQRRQGGAG AEVAAGQLQP
AQLKPGQKVR ACIFHR
//