ID V4LED6_EUTSA Unreviewed; 1926 AA.
AC V4LED6;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=EUTSA_v10019877mg {ECO:0000313|EMBL:ESQ48840.1};
OS Eutrema salsugineum (Saltwater cress) (Sisymbrium salsugineum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema.
OX NCBI_TaxID=72664 {ECO:0000313|EMBL:ESQ48840.1, ECO:0000313|Proteomes:UP000030689};
RN [1] {ECO:0000313|EMBL:ESQ48840.1, ECO:0000313|Proteomes:UP000030689}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23518688; DOI=10.3389/fpls.2013.00046;
RA Yang R., Jarvis D.E., Chen H., Beilstein M.A., Grimwood J., Jenkins J.,
RA Shu S., Prochnik S., Xin M., Ma C., Schmutz J., Wing R.A.,
RA Mitchell-Olds T., Schumaker K.S., Wang X.;
RT "The Reference Genome of the Halophytic Plant Eutrema salsugineum.";
RL Front. Plant Sci. 4:46-46(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI517408; ESQ48840.1; -; Genomic_DNA.
DR RefSeq; XP_006407387.1; XM_006407324.1.
DR STRING; 72664.V4LED6; -.
DR EnsemblPlants; ESQ48840; ESQ48840; EUTSA_v10019877mg.
DR GeneID; 18025318; -.
DR Gramene; ESQ48840; ESQ48840; EUTSA_v10019877mg.
DR KEGG; eus:EUTSA_v10019877mg; -.
DR eggNOG; KOG1070; Eukaryota.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000030689; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0009553; P:embryo sac development; IEA:EnsemblPlants.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05702; S1_Rrp5_repeat_hs11_sc8; 1.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05694; S1_Rrp5_repeat_hs2_sc2; 1.
DR CDD; cd05695; S1_Rrp5_repeat_hs3; 1.
DR CDD; cd05698; S1_Rrp5_repeat_hs6_sc5; 1.
DR CDD; cd04461; S1_Rrp5_repeat_hs8_sc7; 1.
DR CDD; cd05708; S1_Rrp5_repeat_sc12; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 11.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048058; Rrp5_S1_rpt_hs11_sc8.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 5.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 4.
DR SMART; SM00316; S1; 15.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 12.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 14.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000030689};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 128..210
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 226..289
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 312..382
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 398..471
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 488..555
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 575..644
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 659..731
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 751..820
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 864..928
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1053..1128
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1153..1224
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1260..1334
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1368..1437
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1458..1528
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..33
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 45..65
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1533..1558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1611..1658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..33
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1623..1658
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1926 AA; 214897 MW; 3F5E9D5ABD95BA95 CRC64;
MVAPSKKSVN GKRNDSTKVF KPAKKPFHKT KDDVAARSKA VAMQLEEVPD FPRGGGTSLS
QKEREKIYEE VDAEFDADER VSKRNKGLKP KKRTPTDVDE LGSLFDGAFT GKRPRYANKI
TIKNISPGMK LLGVVTEVNQ KDIVISLPGG LRGLVRASEA LDFTDFGTED DENELLQDRF
SVGQLVPCIV LQLDDDKKEA GKRKIWLSLR LSLLHKGFSL DSFQPGMVVA ANVKSVEDHG
YILHFGLPSI TGFIKISNDG SQELKTGQLI QGVVTNIDGE RKIVRLSSDP DSVAKCVTKD
LNGMSFDLLI PGMMVNARVQ SVLENGILLG FLMYFTGTVD LFHLQNPMCN KSWKDEYNQT
KMVNARILFI DPSTRAVGLT LNPHLVGNKA PPMHVSSGDI FDEAKVVRVD KSGLLLELPS
KPVSTPAYVS TYDVAEDEVK KLEKKFKEGN RIRVRILGLK QLEGLGIGTL KESAFEGPVF
THSDVKPGLV TKAKLISVDT FGAIVQFPGG LKAMCPLRHM SEFEVTKPRK KFKVGAELIF
RVLGCKSKRI TVTYKKTLVK SKLPILSSYA DATEGLVTHG WITKIEKHGC FVRFYNGVQG
FVPRFELGVE PGSDPNSVFH VGEVVKCRVT SAVHGTRKIN LSFMIKPTSV SEDDSIKLGS
VVSGVIDSIT PQAVIVRVKS KGFLKGTLSA EHLADHHEQA KLLISLLRPG YELDKLLVID
IEGNNLALSS KYSLIKLAEE LPSDFSQLQP NSVVHGYVCN LIENGCFVRF LGRLTGFAPR
SKAIDEPRAD LSESFFVGQS VRANIVDVNP EKSRVTLSLK QSSCASVDAS FVQEYFLMDE
KISDLQSSDI SESECSWVEK FSIGSLIKGT IQEQNDLGLV VNFDNITNVL GFIPQHHLGG
ATLEHGSIVQ ALVLDISRAE RLVDLSLRPE LINNSTREVS NSQSKKKRKR DISKELEVHQ
RVSAVVEIVK EQYLVLSIPE HGYAIGYASV SDYNTQKLPV KQFSTGQSVV ATVEALQNPL
TSGRLLLLLD SVSGISETSR SKRAKKKSSC EVGSVVHAEI TEIKPFEVRV NFAQSFRGRI
HITEVNDATI SEEPFAKFRI GQSISARVVA KPCHTDIKKS QLWELSVKPA TLRVDSSELN
DIQVREQLEF VAGERVSGYV YKVDKEWVWL AISRNVTARI FILDTACEAR ELEEFERRFP
IGKVVSGYVL TYNKEKKTLR LVQRPLLDTH KSIANGGGSK TDELDSTIPG DDATLFIHEG
DILGGRISRI LPCVGGLRVQ IGPYVFGRVH FTELNDSWVC NPLDGLHEGQ FVKCKVLEIS
NSSKGTLQIE LSLRASLDGM GSNHLAEASS NNVNVCKRIE RIEDLSPDMG IQGYVKNTMS
KGCFIMLSRT LDAKVLLSNL SDTFVKDPEK EFPVGKLVTG RVLNVEPLSK RVEVTLKTVN
GGGQQKSESY DLKKFQVGDI ISGRIKRVEP YGLFIEIDQT GMVGLCHKSQ LSDDRIEDVQ
ARYKAGESVT AKILKLDEEK RRISLGMKSS YLMNGDDVEA QPPSEENANE GSMECDPIND
SKSRVLAAVG DFGFQETTGE RHNGTSLVLA QVESRASIPP LEVDLDDIEE SDFDNNQNQE
KLQGANKDEK SKRREKQKDK EEREKQIQAA EGRLLENHAP ESADEFEKLV RSSPNSSFVW
IKYMAFVLSL ADIEKARSIA ERALRTINIR EEEEKLNIWV AYFNLENEHG SPPEEAVKKV
FERARQYCDP KKVYLALLGV YERTEQYKLA DKLLDEMIKK FKQSCKVWLR KVQSYLKQKE
EGIQSVVNRA LLCLPRHKHI KFISQTAILE FKCGVADRGR SLFEGVLREY PKRTDLWSVY
LDQEIRLGEV DVIRSLFERA ISLSLPPKKM KFLFKKFLEY EKCAGDEERV EYVKQRAMEY
ADSTLA
//