GenomeNet

Database: UniProt
Entry: W4MGX5_9BACT
LinkDB: W4MGX5_9BACT
Original site: W4MGX5_9BACT 
ID   W4MGX5_9BACT            Unreviewed;       717 AA.
AC   W4MGX5;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   RecName: Full=Transglutaminase-like domain-containing protein {ECO:0000259|SMART:SM00460};
DE   Flags: Fragment;
GN   ORFNames=ETSY2_02320 {ECO:0000313|EMBL:ETX08952.1};
OS   Candidatus Entotheonella gemina.
OC   Bacteria; Nitrospinae/Tectomicrobia group; Candidatus Tectomicrobia;
OC   Entotheonella.
OX   NCBI_TaxID=1429439 {ECO:0000313|EMBL:ETX08952.1, ECO:0000313|Proteomes:UP000019140};
RN   [1] {ECO:0000313|EMBL:ETX08952.1, ECO:0000313|Proteomes:UP000019140}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=TSY2 {ECO:0000313|Proteomes:UP000019140};
RX   PubMed=24476823; DOI=10.1038/nature12959;
RA   Wilson M.C., Mori T., Ruckert C., Uria A.R., Helf M.J., Takada K.,
RA   Gernert C., Steffens U.A., Heycke N., Schmitt S., Rinke C., Helfrich E.J.,
RA   Brachmann A.O., Gurgui C., Wakimoto T., Kracht M., Crusemann M.,
RA   Hentschel U., Abe I., Matsunaga S., Kalinowski J., Takeyama H., Piel J.;
RT   "An environmental bacterial taxon with a large and distinct metabolic
RT   repertoire.";
RL   Nature 506:58-62(2014).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ETX08952.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AZHX01000090; ETX08952.1; -; Genomic_DNA.
DR   AlphaFoldDB; W4MGX5; -.
DR   HOGENOM; CLU_008973_4_2_7; -.
DR   Proteomes; UP000019140; Unassembled WGS sequence.
DR   Gene3D; 3.10.620.30; -; 1.
DR   InterPro; IPR013589; Bac_transglu_N.
DR   InterPro; IPR018667; DUF2126.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR002931; Transglutaminase-like.
DR   PANTHER; PTHR33490; BLR5614 PROTEIN-RELATED; 1.
DR   PANTHER; PTHR33490:SF1; SLL1233 PROTEIN; 1.
DR   Pfam; PF08379; Bact_transglu_N; 1.
DR   Pfam; PF09899; DUF2126; 1.
DR   Pfam; PF01841; Transglut_core; 1.
DR   SMART; SM00460; TGc; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000019140}.
FT   DOMAIN          172..248
FT                   /note="Transglutaminase-like"
FT                   /evidence="ECO:0000259|SMART:SM00460"
FT   NON_TER         717
FT                   /evidence="ECO:0000313|EMBL:ETX08952.1"
SQ   SEQUENCE   717 AA;  80037 MW;  3D51EF42AA7023E2 CRC64;
     MAIRVALHHK TTYRYDRLVD LSPHVVRLRP APHCRTPILS YSLTVYPETH FLNWQQDPQS
     NYLARLVFPE PTRMLQVEVD LVAEMVVINP FDFFLEPSVH QSPFAYEPGL AHELQPYLET
     EPAGPRLQAL LAGIDRTPQT TVDFLVAINQ QLHCEVDYIV RMEPGIQTCE ETLTLGRGSC
     RDSAWLMVQL LRQLGLAARF VSGYLIQLVA DEKPLDGPAG PEKDFTDLHA WAEVYLPGAG
     WVGLDPTSGL FAGEGHLPLA CSPHASSAAP ITGMMGPAEV DFSFTMSVTR VHEEPRVTKP
     YTEAQWQAID ALGEQVEANL EAGDVRLTMG GEPTFVSIDD MEGAEWTTEA VGETKRQLSG
     ALLKRLKARF APGGMLHYGQ GKWYPGESLP RWALACYWRT DGVPIWRDER WMADENADGD
     RTVADAYTFL EALGKRLQVD TRYIMPAYED VWTLLSQERH LPPNVTPETS ELDDPEARER
     LARAFERGLG AIKGYVLPLD RRRFSPPRWV TGPWPFRSGK LFLQPGDSPI GLRLPLDALP
     WIEPEHYPHV TAPDPFAPRF PLPGPTQSYR RPDAMTELSG QSVTEQRLGV LATSISAETS
     EPAAAVMEPD HDQNGIDAPT EGRLPGRVAD DVRTALCIEP RQGHLYIFLT PTRELEDYLA
     LVADIEATAE ALGMPVILEG YSPPADHRIR QLKVTPDPGV IEVNIHPSHS WYELVDT
//
DBGET integrated database retrieval system