GenomeNet

Database: UniProt
Entry: D4CP88_9FIRM
LinkDB: D4CP88_9FIRM
Original site: D4CP88_9FIRM 
ID   D4CP88_9FIRM            Unreviewed;       444 AA.
AC   D4CP88;
DT   18-MAY-2010, integrated into UniProtKB/TrEMBL.
DT   18-MAY-2010, sequence version 1.
DT   27-MAR-2024, entry version 68.
DE   RecName: Full=Transcription termination/antitermination protein NusA {ECO:0000256|HAMAP-Rule:MF_00945};
GN   Name=nusA {ECO:0000256|HAMAP-Rule:MF_00945,
GN   ECO:0000313|EMBL:EFE91072.1};
GN   ORFNames=GCWU000341_02179 {ECO:0000313|EMBL:EFE91072.1};
OS   Oribacterium sp. oral taxon 078 str. F0262.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC   Oribacterium.
OX   NCBI_TaxID=608534 {ECO:0000313|EMBL:EFE91072.1, ECO:0000313|Proteomes:UP000004602};
RN   [1] {ECO:0000313|EMBL:EFE91072.1, ECO:0000313|Proteomes:UP000004602}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=F0262 {ECO:0000313|EMBL:EFE91072.1,
RC   ECO:0000313|Proteomes:UP000004602};
RA   Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA   Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA   Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA   Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA   Chinwalla A., Mardis E.R., Wilson R.K.;
RL   Submitted (FEB-2010) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Participates in both transcription termination and
CC       antitermination. {ECO:0000256|HAMAP-Rule:MF_00945}.
CC   -!- SUBUNIT: Monomer. Binds directly to the core enzyme of the DNA-
CC       dependent RNA polymerase and to nascent RNA. {ECO:0000256|HAMAP-
CC       Rule:MF_00945}.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|HAMAP-Rule:MF_00945}.
CC   -!- SIMILARITY: Belongs to the NusA family. {ECO:0000256|HAMAP-
CC       Rule:MF_00945}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EFE91072.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ACIQ02000025; EFE91072.1; -; Genomic_DNA.
DR   RefSeq; WP_009215632.1; NZ_GG729935.1.
DR   AlphaFoldDB; D4CP88; -.
DR   STRING; 608534.GCWU000341_02179; -.
DR   eggNOG; COG0195; Bacteria.
DR   HOGENOM; CLU_029242_2_2_9; -.
DR   Proteomes; UP000004602; Unassembled WGS sequence.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006353; P:DNA-templated transcription termination; IEA:UniProtKB-UniRule.
DR   GO; GO:0031564; P:transcription antitermination; IEA:UniProtKB-UniRule.
DR   CDD; cd02134; KH-II_NusA_rpt1; 1.
DR   CDD; cd22529; KH-II_NusA_rpt2; 1.
DR   CDD; cd04455; S1_NusA; 1.
DR   Gene3D; 3.30.300.20; -; 2.
DR   Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR   Gene3D; 3.30.1480.10; NusA, N-terminal domain; 1.
DR   HAMAP; MF_00945_B; NusA_B; 1.
DR   InterPro; IPR015946; KH_dom-like_a/b.
DR   InterPro; IPR025249; KH_dom_NusA-like.
DR   InterPro; IPR009019; KH_sf_prok-type.
DR   InterPro; IPR012340; NA-bd_OB-fold.
DR   InterPro; IPR030842; NusA_bac.
DR   InterPro; IPR036555; NusA_N_sf.
DR   InterPro; IPR003029; S1_domain.
DR   InterPro; IPR013735; TF_NusA_N.
DR   InterPro; IPR010213; Tscrpt_termination_fac_NusA.
DR   NCBIfam; TIGR01953; NusA; 1.
DR   PANTHER; PTHR22648; TRANSCRIPTION TERMINATION FACTOR NUSA; 1.
DR   PANTHER; PTHR22648:SF0; TRANSCRIPTION TERMINATION_ANTITERMINATION PROTEIN NUSA; 1.
DR   Pfam; PF13184; KH_5; 1.
DR   Pfam; PF08529; NusA_N; 1.
DR   SMART; SM00316; S1; 1.
DR   SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR   SUPFAM; SSF54814; Prokaryotic type KH domain (KH-domain type II); 2.
DR   SUPFAM; SSF69705; Transcription factor NusA, N-terminal domain; 1.
DR   PROSITE; PS50084; KH_TYPE_1; 1.
DR   PROSITE; PS50126; S1; 1.
PE   3: Inferred from homology;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490, ECO:0000256|HAMAP-Rule:MF_00945};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|HAMAP-
KW   Rule:MF_00945};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163, ECO:0000256|HAMAP-
KW   Rule:MF_00945};
KW   Transcription antitermination {ECO:0000256|ARBA:ARBA00022814,
KW   ECO:0000256|HAMAP-Rule:MF_00945};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015, ECO:0000256|HAMAP-
KW   Rule:MF_00945};
KW   Transcription termination {ECO:0000256|ARBA:ARBA00022472,
KW   ECO:0000256|HAMAP-Rule:MF_00945}.
FT   DOMAIN          135..199
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   REGION          358..444
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        358..407
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        408..434
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   444 AA;  49073 MW;  3538A4C0F0E35B85 CRC64;
     MNTELRDALE LLEKEKGIPK QALIEAIELS LQTACRNHFG SADNVRVNVD PESCDFSVIA
     DKTVVETVSN PIAEIGLSEA KLVDPKYELG DIVQVPVDSR SFGRIATQNA KGVIVQKIRE
     EERRVLYREY YSMAREVVSG VVERDTGRSI IVNLGRVDGY LSENEQVKGE VLNPTDRIKV
     YVVEVRDSPK GPRVLLSRTH PELVKKLFEE EAPEIREGIV EIRAIAREAG SRTKMAVLSN
     DPDVDPVGSC VGLDGTRVNA VVEELRGEKI DIINYDENPA YLIENALSPA KVIAVIADPD
     NKDAMVIVPD TQLSLAIGKE GQNARLSAKL TGYKIDIKSE SQAQEQGIFD EMGIDYRGEG
     EEGEEDYDFP EDDGEEFTGE SEQEDFSEDS GEEDLSENSE EGADPAEDSG EGSERESMEE
     SEPSENAALE EREAEEEPKT EDDE
//
DBGET integrated database retrieval system