ID A0A2U1CHV7_9BURK Unreviewed; 800 AA.
AC A0A2U1CHV7;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=C7440_3550 {ECO:0000313|EMBL:PVY60514.1};
OS Pusillimonas noertemannii.
OC Bacteria; Pseudomonadota; Betaproteobacteria; Burkholderiales;
OC Alcaligenaceae; Pusillimonas.
OX NCBI_TaxID=305977 {ECO:0000313|EMBL:PVY60514.1, ECO:0000313|Proteomes:UP000246145};
RN [1] {ECO:0000313|EMBL:PVY60514.1, ECO:0000313|Proteomes:UP000246145}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 10065 {ECO:0000313|EMBL:PVY60514.1,
RC ECO:0000313|Proteomes:UP000246145};
RA Goeker M.;
RT "Genomic Encyclopedia of Type Strains, Phase IV (KMG-IV): sequencing the
RT most valuable type-strain genomes for metagenomic binning, comparative
RT biology and taxonomic classification.";
RL Submitted (APR-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PVY60514.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QEKO01000008; PVY60514.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1CHV7; -.
DR STRING; 1231391.GCA_000308195_01875; -.
DR Proteomes; UP000246145; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 1.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 2.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000246145}.
FT DOMAIN 682..751
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 753..787
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 800 AA; 87176 MW; 40630023222C21DE CRC64;
MPRLNLFFRL AMTRTSAALL PDASAQGKPE HIIALLAHEL GAKPPQVAAA VELLDGGATV
PFIARYRKEA TGGLDDTVLR NLEVRLIYVR ELEERRAAVL ESIEQQGKLS EGLAREIRAA
DTKQRLEDLY APYKPKRRTR AQIAREAGLE PLADAIVADP DCDPAALAAQ YLNPEASIND
AKAALDGARD ILAERYAEDA DLLADLREHL WNTGRLYSKV ADGKEAEGAN FRDWFDFSET
LRTLPSHRVL ALLRGRQQGM LELRLGLEAE LEAQTPHPCV ARIAQRLGLD VDFSADAPPR
RRWLAEVCRW TWRVKLLTAF ETELIGRLRE NAEAEAIRVF AANLKDLLLA APAGPKAVMG
LDPGIRTGVK VAVIDHTGKV LDTATVYPFE PRRDREGSIK TLAALATRHR VELVAIGNGT
ASRETEKLVG DMLQAFPQLE LTRVVVSEAG ASVYSASELA AQEFPDLDVS LRGAVSIARR
LQDPLAELVK IEPKAIGVGQ YQHDVNQREL ARSLDAVIED CVNAVGVDVN TASAALLSRV
SGLNSLLARN IVAWRDENGA FANRKMLMKV TRFGDKAFEQ AAGFLRVANG DNPLDASAVH
PEAYPVVERI LKKISADVSQ VMGKQAALKG VSPSEFTDER FGLPTVRDIF AELEKPGRDP
RPEFQTAQFK EGVNTLNDLH EGMILEGVVT NVANFGAFVD IGVHQDGLVH ISALADRFVK
DPRDVVRVGQ TVKVKVQEVD IARKRVGLTM RLDDDAAPAA RRGGNAGSDR PRGAGQPRAG
RAEAGQSMGA MAAAFAKLKR
//