ID R5AXA3_9BACT Unreviewed; 716 AA.
AC R5AXA3;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=BN456_02057 {ECO:0000313|EMBL:CCX43821.1};
OS Prevotella sp. CAG:1031.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262917 {ECO:0000313|EMBL:CCX43821.1, ECO:0000313|Proteomes:UP000018183};
RN [1] {ECO:0000313|EMBL:CCX43821.1, ECO:0000313|Proteomes:UP000018183}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:1031 {ECO:0000313|Proteomes:UP000018183};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCX43821.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAWK010000064; CCX43821.1; -; Genomic_DNA.
DR AlphaFoldDB; R5AXA3; -.
DR STRING; 1262917.BN456_02057; -.
DR Proteomes; UP000018183; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 1.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 2.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018183}.
FT DOMAIN 643..712
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
SQ SEQUENCE 716 AA; 79539 MW; F163B3AB9C96FC3E CRC64;
MLQEHAAIIA EEINDTKRHV AAALKLFDDG ATIPFVARYR KEATGSMTET TLRTVWLRHQ
ALTELDKRKQ YVIETIKGKG ELTPEFKDRI LAITDATLLE DLFLPYKPHR RTRAQVAREA
GLEPLAKIIM AQQLGGIKMR ATCMARKLEN VFDGNAAIAG ALDIIAEWVS ESEKARAIVR
AKFLRNGMVS SMEIKRQEND KEGKYQNYYN LSEPVRSINS HRYLAMRRGE AEGVLKVNIS
IDDDEMGERL CRMFVKADSQ PEATALVRQA VVDGYRRLLR PSIETEIANM LKDRSDKAAI
SLFADNVEQM LMAPPLMRKR VMAIDPGFRT GCKIVCLDAQ GNLLAHDVIF PTPPANDFYG
SAQTLCYMVD RYQIDVIAIG NGHGSRETER FVSDVELPRK VDVVVVSEQG ASIYSASEVA
VEEFPNEDVT VRGAVSIGRR LIDPMAELVK IDPCSIGVGQ YQHDVNQTKL KEALDFAVSS
CVNSVGVNLN TASRQLLSYV SGIGPTLAGY IVDYRTEHGP FTNRNQLLEV PRMGAKTYQQ
CAGFLRIPRG ENVLDNTGVH PERYELVEQM ASDLGVDVEK LATDRATLHR VELDRYATRA
VGLPTLTDII LELEKPGRDP RAKIEDVVVH DDSVRTIADL HLGMTLSGKV NNITGFGVFV
DIGIKENGLI HISQLSDKFI TSATDVVRIG QIVNVKVLDI DIPRGRIALT MKGVPQ
//