ID A0A0W0WP06_9GAMM Unreviewed; 781 AA.
AC A0A0W0WP06;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Transcriptional accessory protein {ECO:0000313|EMBL:KTD34058.1};
GN Name=yhgF {ECO:0000313|EMBL:KTD34058.1};
GN ORFNames=Lnau_2028 {ECO:0000313|EMBL:KTD34058.1};
OS Legionella nautarum.
OC Bacteria; Pseudomonadota; Gammaproteobacteria; Legionellales;
OC Legionellaceae; Legionella.
OX NCBI_TaxID=45070 {ECO:0000313|EMBL:KTD34058.1, ECO:0000313|Proteomes:UP000054725};
RN [1] {ECO:0000313|EMBL:KTD34058.1, ECO:0000313|Proteomes:UP000054725}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 49506 {ECO:0000313|EMBL:KTD34058.1,
RC ECO:0000313|Proteomes:UP000054725};
RA Burstein D., Amaro F., Zusman T., Lifshitz Z., Cohen O., Gilbert J.A.,
RA Pupko T., Shuman H.A., Segal G.;
RT "Genomic analysis of 38 Legionella species identifies large and diverse
RT effector repertoires.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KTD34058.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LNYO01000019; KTD34058.1; -; Genomic_DNA.
DR RefSeq; WP_058505044.1; NZ_LNYO01000019.1.
DR AlphaFoldDB; A0A0W0WP06; -.
DR STRING; 45070.Lnau_2028; -.
DR PATRIC; fig|45070.6.peg.2139; -.
DR OrthoDB; 9804714at2; -.
DR Proteomes; UP000054725; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 1.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 2.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
FT DOMAIN 648..717
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 730..759
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 740..759
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 781 AA; 86877 MW; C8D7EA16FC07BC8C CRC64;
MAQKIQASAA IIAQELKVKV SQVVTAIDLL DEGATVPFIA RYRKEATEGL DDTQLRLLAE
RLHYIRELDE RREVVLQSIR EQEKLTPELE KAILAADSKT RLEDLYLPYK PKRRTKAQIA
REAGLEPLAQ ALWQDPSQDP ETYAQQFINP EATVEDAKAA LDGAQQILME LFAEDAELIN
ELREHLWQNA VLKSTGSADK KGAGNKFADY FEYSEAIKKI PSHRALALFR GRRESVLQLA
LTLDDAEYGE KRVASYFKIS DEKRAADAWL LETVRLTWKV KLFTKLELEL LARLRESADE
EAINVFARNL RDLLLAAPAG PQVTIGLDPG IRTGVKVVVV DITGKLLDYT TVFPLPPQNE
WHQAIAELAK LAAKYQVNLF SVGNGTGSRE TERLVSDLIK MYPDLKLSKV IVSEAGASVY
SASELAAKEF PDLDVTLRGA VSIARRLQDP LAELVKIEPK AIGVGQYQHD VNQNRLARSL
DGVVEDCVNA VGVDVNTASV ALLTRVSGLN ETLAKNLVQY RDEHGVFANR EQLKKVARMG
EKTFQQAAGF LRIMQGDNLL DSSAVHPEAY PLVEKILADQ QMDIRQVIGN RDLLQSINAE
RYVTDDYGLP TIRDVLRELE KPGRDPRPEF KTANFKEGVE DISHLHEGMI LEGVVSNVTN
FGAFVDIGVH QDGLVHISAM TNRFITDPHA VVKAGDIVTV KVVEVDKERR RIGLSMRLTE
EKASVVQKKI AKQSQPMKKP QAVKKKEQGK KVEDAKKPAA AKKTVFNTAM ADALAKLKRG
S
//