ID G1WGN2_9ACTN Unreviewed; 355 AA.
AC G1WGN2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 28-JUN-2023, entry version 39.
DE RecName: Full=HTH OST-type domain-containing protein {ECO:0000259|PROSITE:PS51644};
GN ORFNames=HMPREF9452_00495 {ECO:0000313|EMBL:EGX67483.1};
OS Collinsella tanakaei YIT 12063.
OC Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC Coriobacteriaceae; Collinsella.
OX NCBI_TaxID=742742 {ECO:0000313|EMBL:EGX67483.1, ECO:0000313|Proteomes:UP000004830};
RN [1] {ECO:0000313|EMBL:EGX67483.1, ECO:0000313|Proteomes:UP000004830}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=YIT 12063 {ECO:0000313|EMBL:EGX67483.1,
RC ECO:0000313|Proteomes:UP000004830};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Morotomi M., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J.,
RA Nusbaum C., Birren B.;
RT "The Genome Sequence of Collinsella tanakaei YIT 12063.";
RL Submitted (JUN-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGX67483.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLS01000006; EGX67483.1; -; Genomic_DNA.
DR RefSeq; WP_009140531.1; NZ_JH126467.1.
DR AlphaFoldDB; G1WGN2; -.
DR STRING; 742742.HMPREF9452_00495; -.
DR GeneID; 62758263; -.
DR PATRIC; fig|742742.3.peg.473; -.
DR eggNOG; COG1432; Bacteria.
DR HOGENOM; CLU_034061_0_0_11; -.
DR OrthoDB; 2379772at2; -.
DR Proteomes; UP000004830; Unassembled WGS sequence.
DR GO; GO:0004540; F:RNA nuclease activity; IEA:InterPro.
DR CDD; cd10146; LabA_like_C; 1.
DR CDD; cd11297; PIN_LabA-like_N_1; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR Gene3D; 3.30.420.610; LOTUS domain-like; 1.
DR InterPro; IPR041966; LOTUS-like.
DR InterPro; IPR021139; NYN.
DR InterPro; IPR025605; OST-HTH/LOTUS_dom.
DR PANTHER; PTHR35811:SF1; HTH OST-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35811; SLR1870 PROTEIN; 1.
DR Pfam; PF01936; NYN; 1.
DR Pfam; PF12872; OST-HTH; 1.
DR PROSITE; PS51644; HTH_OST; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000004830}.
FT DOMAIN 276..352
FT /note="HTH OST-type"
FT /evidence="ECO:0000259|PROSITE:PS51644"
FT REGION 153..188
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 216..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 355 AA; 39585 MW; 0EBE58DDD3D91061 CRC64;
MNREPDLRFA ILIDADNVSE KYVQIVLDEV ANAGVATYKR IYGDWTSPRL SSWKKCLLDN
SIIPMQQYSY TFGKNATDSA MIIDAMDILY SGTVDGFAIV SSDSDFTRLV ARLRESGMYV
IGMGEQKTPR PFISACNQFK YLDLLLAARA SEDEADEDEE LEVPVKKRRS RSRGRKQSRA
AERSQDADAE LAIDRNEGDA LIEVAADIED AADAGAAQRR ARRRPGDRGV LAVPAGDMPG
DADALAEVSA GFSGEAEADA VEVEFGQMSK ADRRRHMRMI RESINAIIDK FSDDDGWVSL
GQLGDQLARR LPDFDVRNYG FKKLRPFLKS LGVYEFDEPS DDSGHRQIYL RVKAE
//