ID G7CLN1_MYCT3 Unreviewed; 786 AA.
AC G7CLN1;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=RNA-binding protein {ECO:0000313|EMBL:EHI11115.1};
GN ORFNames=KEK_19616 {ECO:0000313|EMBL:EHI11115.1};
OS Mycolicibacterium thermoresistibile (strain ATCC 19527 / DSM 44167 / CIP
OS 105390 / JCM 6362 / NCTC 10409 / 316) (Mycobacterium thermoresistibile).
OC Bacteria; Actinomycetota; Actinomycetes; Mycobacteriales; Mycobacteriaceae;
OC Mycolicibacterium.
OX NCBI_TaxID=1078020 {ECO:0000313|EMBL:EHI11115.1, ECO:0000313|Proteomes:UP000004915};
RN [1] {ECO:0000313|EMBL:EHI11115.1, ECO:0000313|Proteomes:UP000004915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 19527 / DSM 44167 / CIP 105390 / JCM 6362 / NCTC 10409 /
RC 316 {ECO:0000313|Proteomes:UP000004915};
RG Tuberculosis Structural Genomics Consortium;
RA Ioerger T.R.;
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHI11115.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGVE01000049; EHI11115.1; -; Genomic_DNA.
DR RefSeq; WP_003927392.1; NZ_AGVE01000049.1.
DR AlphaFoldDB; G7CLN1; -.
DR PATRIC; fig|1078020.3.peg.3868; -.
DR eggNOG; COG2183; Bacteria.
DR Proteomes; UP000004915; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR CDD; cd05685; S1_Tex; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.10.10.650; RuvA domain 2-like; 1.
DR Gene3D; 1.10.3500.10; Tex N-terminal region-like; 1.
DR Gene3D; 1.10.150.310; Tex RuvX-like domain-like; 1.
DR Gene3D; 3.30.420.140; YqgF/RNase H-like domain; 1.
DR InterPro; IPR041692; HHH_9.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR010994; RuvA_2-like.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR044146; S1_Tex.
DR InterPro; IPR023323; Tex-like_dom_sf.
DR InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR InterPro; IPR018974; Tex-like_N.
DR InterPro; IPR032639; Tex_YqgF.
DR InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR PANTHER; PTHR10724; 30S RIBOSOMAL PROTEIN S1; 1.
DR PANTHER; PTHR10724:SF10; S1 RNA-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12836; HHH_3; 1.
DR Pfam; PF17674; HHH_9; 1.
DR Pfam; PF00575; S1; 1.
DR Pfam; PF09371; Tex_N; 1.
DR Pfam; PF16921; Tex_YqgF; 1.
DR SMART; SM00316; S1; 1.
DR SMART; SM00732; YqgFc; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR SUPFAM; SSF47781; RuvA domain 2-like; 2.
DR SUPFAM; SSF158832; Tex N-terminal region-like; 1.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000004915}.
FT DOMAIN 655..724
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 723..786
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 723..763
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 786 AA; 84348 MW; 24779D155500EC97 CRC64;
MTGTVTLKSV NARLAEELEV GEAQVAAAVR LLDEGATVPF IARYRKEATG SLDDGQLRML
EERLRYLREL DERRAAVLSA IEEQGKLTDD LRAALLAADT KARVEDIYLP FKPKRRTKAQ
IAREAGLEPL ADRLLADPTL VPDQVAGEYV GESVADPAAA LDGARHILTD RAAEDAELVG
AIREKFWVQG TLRTAPWSDE VATSAAAQKF RDYFEFAEPL EEMPSHRVLA VLRGEKEEVL
ALTFDGGDEA GYHAMIADTL GIDLTAAAAA TPWLTGTVEW AWRTKLSVSA KVDARIRLRQ
RAEEEAVAVF ARNLRDLLLA APAGSRVTLG LDPGYRTGVK VAVVDGTGKV LDTATIYPHQ
PQKQWDAAKA TLAALVARHG VELIAVGNGT ASRETDALAA EVIADIRAAG ATPPVKAMVS
EAGASVYSAS SYAARELPDL DVTLRGAVSI ARRLQDPLAE LVKIEPKSIG VGQYQHDVTP
GYLARSLDAV VEDAVNAVGV DLNTASVPLL SRVSGISESL AESIVAHREK TGPFRNRKAL
LDVPRLGPKA FEQCAGFLRI RDGDDPLDAS GVHPEAYPVV RRILDRAGVT LAELIGNEKT
LRALRPADFA DDQFGIPTVT DILAELEKPG RDPRPAFATA TFTDGVDTIA DLKVGMVLEG
VVTNVAAFGA FVDIGVHQDG LVHVSAMADR FVSDPHDIVR SGQVVRVKVV DVDVERQRIG
LSLRLDDDPH QSGKRSGNRG AKGEAPKGGN GRRPDRGDRR NQNRSRGGNQ PNGAMAQALR
DAGFGR
//