GenomeNet

Database: UniProt
Entry: W4M8H4_9BACT
LinkDB: W4M8H4_9BACT
Original site: W4M8H4_9BACT 
ID   W4M8H4_9BACT            Unreviewed;       730 AA.
AC   W4M8H4;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   RecName: Full=CHAT domain-containing protein {ECO:0000259|Pfam:PF12770};
GN   ORFNames=ETSY2_20170 {ECO:0000313|EMBL:ETX05927.1};
OS   Candidatus Entotheonella gemina.
OC   Bacteria; Nitrospinae/Tectomicrobia group; Candidatus Tectomicrobia;
OC   Entotheonella.
OX   NCBI_TaxID=1429439 {ECO:0000313|EMBL:ETX05927.1, ECO:0000313|Proteomes:UP000019140};
RN   [1] {ECO:0000313|EMBL:ETX05927.1, ECO:0000313|Proteomes:UP000019140}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=TSY2 {ECO:0000313|Proteomes:UP000019140};
RX   PubMed=24476823; DOI=10.1038/nature12959;
RA   Wilson M.C., Mori T., Ruckert C., Uria A.R., Helf M.J., Takada K.,
RA   Gernert C., Steffens U.A., Heycke N., Schmitt S., Rinke C., Helfrich E.J.,
RA   Brachmann A.O., Gurgui C., Wakimoto T., Kracht M., Crusemann M.,
RA   Hentschel U., Abe I., Matsunaga S., Kalinowski J., Takeyama H., Piel J.;
RT   "An environmental bacterial taxon with a large and distinct metabolic
RT   repertoire.";
RL   Nature 506:58-62(2014).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ETX05927.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AZHX01000833; ETX05927.1; -; Genomic_DNA.
DR   AlphaFoldDB; W4M8H4; -.
DR   PATRIC; fig|1429439.4.peg.3429; -.
DR   HOGENOM; CLU_002404_0_0_7; -.
DR   Proteomes; UP000019140; Unassembled WGS sequence.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR   InterPro; IPR024983; CHAT_dom.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR019734; TPR_repeat.
DR   PANTHER; PTHR10098; RAPSYN-RELATED; 1.
DR   PANTHER; PTHR10098:SF106; TETRATRICOPEPTIDE REPEAT PROTEIN 28; 1.
DR   Pfam; PF12770; CHAT; 1.
DR   Pfam; PF13424; TPR_12; 1.
DR   SMART; SM00028; TPR; 5.
DR   SUPFAM; SSF48452; TPR-like; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000019140}.
FT   DOMAIN          460..726
FT                   /note="CHAT"
FT                   /evidence="ECO:0000259|Pfam:PF12770"
SQ   SEQUENCE   730 AA;  81479 MW;  2AF106F6253B234B CRC64;
     MEQGWNAFQS GAFSTAISFW QQAAHVYAKS GEVSAQGIAL TQLAQAYQAL GRDQKALTSL
     KQALKLTKRA HDPSQEASIL GHLGNVYLAM GQLEDASQWL QQSRQIAETL EDASLLAQIF
     HHMGSLSSAR DQRVEAMQHY GKSLQLATQI GNRALGTRAA INAGRTAILL QQYPLAKDHL
     NMAWLQVKNL PSSQDTAYSL ISIGQASKTL GEHLPEFRDA LLQRSVEALI AAAKMAEGQN
     DDLAASYAWG YLGQLYELEQ RYEDALQLSR RATFAAQRIR APESLYRWQW QIGRLLRATG
     DIQGAIAAHQ RAVDTLQSFR PEFGPVYGRP RASFRKVAEP VYTELIDLLL QQAATISDRA
     EQEARLLEAQ KAVELFKTAE LEDYFHDDCV VRVSDADLDK VSQTAVVIYP IILPDRLELL
     VRLPGELKRF TVSVGGKELT DTVRHFRRFL EKRTTREYLP YAHQLYAWLI KPLEAELKSV
     DIDTLVFVPD GPLRTIPMSA LHDGKGFLIQ TYAVATTPGL KLTDPRPLPR DDIRILSAGL
     TEAVQGFAPL PFVEEELDAI KHIYGNEPLL NETYLKTRIQ KKLREEPFSI VHIASHGQFE
     SAADETFILT YDSKLTLDQL DEMIGLLRLR EEPLELLTLS ACQTAAGDDR AALGLAGIAV
     KAGARSAVAT LWFISDQASS QLVSAFYRNL RNPALSRAVA LQRAQLSLLQ DLRYDHPAYW
     APFLLINNWL
//
DBGET integrated database retrieval system