ID A0A1I5SJC8_9FIRM Unreviewed; 893 AA.
AC A0A1I5SJC8;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=SAMN04487928_10698 {ECO:0000313|EMBL:SFP70751.1};
OS Butyrivibrio proteoclasticus.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Butyrivibrio.
OX NCBI_TaxID=43305 {ECO:0000313|EMBL:SFP70751.1, ECO:0000313|Proteomes:UP000182624};
RN [1] {ECO:0000313|Proteomes:UP000182624}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=P18 {ECO:0000313|Proteomes:UP000182624};
RA Varghese N., Submissions S.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FOXO01000006; SFP70751.1; -; Genomic_DNA.
DR RefSeq; WP_074885529.1; NZ_FOXO01000006.1.
DR AlphaFoldDB; A0A1I5SJC8; -.
DR OrthoDB; 9804714at2; -.
DR Proteomes; UP000182624; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 2.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 3.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
FT DOMAIN 722..791
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 584..609
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 636..669
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 590..606
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 636..665
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 893 AA; 97776 MW; BA58DF7D31CA3E29 CRC64;
MDIVLKIKEE LNVEKWQVEA AVKLIDEGNT IPFISRYRKE VTGSLNDEQL RNLDERLKYL
RGLEERREQV LASIEEQGKM TDELKSKIIA AETMVALEDL YLPYRPKRKT RASVAREKGL
EGLANILLAQ ETTKPLQEEA AAFVDPEKGV NDPAEALQGA MDIIAEDVSD NADFRTYIRE
TTMEQGMLTS KAKDEKAQSV YEMYYNYEEP VKKVLGHRVL ALNRGEAEKF LVVKIEAPEE
QILQYLDKKM IKNDNPVTSP VIKEINQDAY ERLIAPAIER DIRNELTEKA EDGAISVFGK
NLTQLLMAPP IAGKTVLGWD PAFRTGCKLA IVDATGKVLD TKVIYPTAPQ NKVEEAKAEL
KKLIDKYDVD LISVGNGTAS RESEQVIVEL LKELDKPVQY VIVSEAGASV YSASKLATEE
FPQFDVGQRS AASIARRLQD PLAELVKIDP KSIGVGQYQH DMNQKKLGEA LEGVVETCVN
KVGVDLNTAS APLLQYISGI SKVIAKNIVE YREENGKFNS RAELLNVPKL GPKAYEQCAG
FLRITDGENP LDATSVHPES YDATLKLMDK LGITFDDVRQ AQKNAAKASL EKPAAKKEEK
PKPQKKVKQV VIRNTGTAMG AALAAALAGS NLAVTENEAP SSNSKKEKAT ENTSNTAPAT
GSLSKKVSES DKKKLAEELG IGEITLTDIL SELEKPSRDP RENMPAPILR SDVLDMKDLK
PGMVLKGTVR NVIDFGCFVD IGVHQDGLVH ISHITDKYIK HPLEAVSVGD IVDVQVLDVE
LDKKRISLSM KLQDPAKVAA EAAAKAAERP APKVKAEDKV SKPAPKRTSV VDQVAKAAGV
TRKAVPVKRP VTSAAKAETP VAAPKAVQTA TATEAPKKFK KKGIVIFKGN AND
//