ID A0A3D9SZS3_9ACTN Unreviewed; 820 AA.
AC A0A3D9SZS3;
DT 16-JAN-2019, integrated into UniProtKB/TrEMBL.
DT 16-JAN-2019, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=DFJ69_5584 {ECO:0000313|EMBL:REF00061.1};
OS Thermomonospora umbrina.
OC Bacteria; Actinomycetota; Actinomycetes; Streptosporangiales;
OC Thermomonosporaceae; Thermomonospora.
OX NCBI_TaxID=111806 {ECO:0000313|EMBL:REF00061.1, ECO:0000313|Proteomes:UP000256661};
RN [1] {ECO:0000313|EMBL:REF00061.1, ECO:0000313|Proteomes:UP000256661}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 43927 {ECO:0000313|EMBL:REF00061.1,
RC ECO:0000313|Proteomes:UP000256661};
RA Klenk H.-P.;
RT "Sequencing the genomes of 1000 actinobacteria strains.";
RL Submitted (AUG-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:REF00061.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QTTT01000001; REF00061.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3D9SZS3; -.
DR OrthoDB; 9804714at2; -.
DR Proteomes; UP000256661; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 1.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 2.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000256661}.
FT DOMAIN 661..730
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 731..820
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 731..754
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 820 AA; 88253 MW; AEC23F7BFA5DD0D0 CRC64;
MSQTATGKAP DNALNVRIAE ELGVRERQVQ VAVELLDGGA TVPFIARYRK EATGALDDTR
LRRLEERLRY LREMEERRAA ILESIRSQGR LDEALERQIR EADTKARLED IYLPYKPKRR
TKAMIAREAG LEPLADGLLG DPTRDPQAAA QAFVDPDKGV ADAAAALDGA RSILVERFGE
DADLIGELRE RMWTQGRLVA KVREGREEAG AKFADYFDFA EPFTKLPSHR ILAMFRGEKE
EVLDLEPAPE ADDAEVSGPT WYERRIAGAF GIVDRGRPAD RWLSETVRWA WRTRILVHLG
IDLRTRLRQE AEDEAVRVFA ANLRDLLLAA PAGTRATMGL DPGLRTGVKV AVVDATGKVV
DTATIYPHEP RRKWDESLAV LQGLAAKHGV ELVAIGNGTA SRETDKLAGD LIRRHPELNL
TKIVVSEAGA SVYSASEYAS QELPELDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY
QHDLAEAKLS RSLDAVVEDC VNAVGVDVNT ASAPLLTRVS GIGEGLAAGI VAHREANGPF
RTRKGLKDVA RLGPKAFEQC AGFLRIPGGD DPLDASSVHP ESYPVVRRII DAAGGDLGSL
IGNGKVLRSL RPADFVDEAF GLPTVTDILS ELEKPGRDPR PAFRTAAFAD GVEKLTDLRP
GMVLEGVVTN VAAFGAFVDV GVHQDGLVHI SAMSKTFVSD PRDVAKPGDI VKVRVLDVDV
PRKRISLTLR LDEDPNERRE RSGPGRRDGG GRGDRGRQGG GQGQGGQSGQ GGQGGGRGGQ
GGARGGSRGS RGQNAPQPGG ALADALRRAG LDKGLPGGDR
//