GenomeNet

Database: UniProt
Entry: G1WGN2_9ACTN
LinkDB: G1WGN2_9ACTN
Original site: G1WGN2_9ACTN 
ID   G1WGN2_9ACTN            Unreviewed;       355 AA.
AC   G1WGN2;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   28-JUN-2023, entry version 39.
DE   RecName: Full=HTH OST-type domain-containing protein {ECO:0000259|PROSITE:PS51644};
GN   ORFNames=HMPREF9452_00495 {ECO:0000313|EMBL:EGX67483.1};
OS   Collinsella tanakaei YIT 12063.
OC   Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC   Coriobacteriaceae; Collinsella.
OX   NCBI_TaxID=742742 {ECO:0000313|EMBL:EGX67483.1, ECO:0000313|Proteomes:UP000004830};
RN   [1] {ECO:0000313|EMBL:EGX67483.1, ECO:0000313|Proteomes:UP000004830}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=YIT 12063 {ECO:0000313|EMBL:EGX67483.1,
RC   ECO:0000313|Proteomes:UP000004830};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Morotomi M., Young S.K.,
RA   Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA   Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA   Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA   Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA   Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J.,
RA   Nusbaum C., Birren B.;
RT   "The Genome Sequence of Collinsella tanakaei YIT 12063.";
RL   Submitted (JUN-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EGX67483.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADLS01000006; EGX67483.1; -; Genomic_DNA.
DR   RefSeq; WP_009140531.1; NZ_JH126467.1.
DR   AlphaFoldDB; G1WGN2; -.
DR   STRING; 742742.HMPREF9452_00495; -.
DR   GeneID; 62758263; -.
DR   PATRIC; fig|742742.3.peg.473; -.
DR   eggNOG; COG1432; Bacteria.
DR   HOGENOM; CLU_034061_0_0_11; -.
DR   OrthoDB; 2379772at2; -.
DR   Proteomes; UP000004830; Unassembled WGS sequence.
DR   GO; GO:0004540; F:RNA nuclease activity; IEA:InterPro.
DR   CDD; cd10146; LabA_like_C; 1.
DR   CDD; cd11297; PIN_LabA-like_N_1; 1.
DR   Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR   Gene3D; 3.30.420.610; LOTUS domain-like; 1.
DR   InterPro; IPR041966; LOTUS-like.
DR   InterPro; IPR021139; NYN.
DR   InterPro; IPR025605; OST-HTH/LOTUS_dom.
DR   PANTHER; PTHR35811:SF1; HTH OST-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR35811; SLR1870 PROTEIN; 1.
DR   Pfam; PF01936; NYN; 1.
DR   Pfam; PF12872; OST-HTH; 1.
DR   PROSITE; PS51644; HTH_OST; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000004830}.
FT   DOMAIN          276..352
FT                   /note="HTH OST-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51644"
FT   REGION          153..188
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          216..236
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   355 AA;  39585 MW;  0EBE58DDD3D91061 CRC64;
     MNREPDLRFA ILIDADNVSE KYVQIVLDEV ANAGVATYKR IYGDWTSPRL SSWKKCLLDN
     SIIPMQQYSY TFGKNATDSA MIIDAMDILY SGTVDGFAIV SSDSDFTRLV ARLRESGMYV
     IGMGEQKTPR PFISACNQFK YLDLLLAARA SEDEADEDEE LEVPVKKRRS RSRGRKQSRA
     AERSQDADAE LAIDRNEGDA LIEVAADIED AADAGAAQRR ARRRPGDRGV LAVPAGDMPG
     DADALAEVSA GFSGEAEADA VEVEFGQMSK ADRRRHMRMI RESINAIIDK FSDDDGWVSL
     GQLGDQLARR LPDFDVRNYG FKKLRPFLKS LGVYEFDEPS DDSGHRQIYL RVKAE
//
DBGET integrated database retrieval system