GenomeNet

Database: UniProt
Entry: A0A1G0XSS0_9BACT
LinkDB: A0A1G0XSS0_9BACT
Original site: A0A1G0XSS0_9BACT 
ID   A0A1G0XSS0_9BACT        Unreviewed;       884 AA.
AC   A0A1G0XSS0;
DT   15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT   15-FEB-2017, sequence version 1.
DT   24-JAN-2024, entry version 16.
DE   RecName: Full=Transglutaminase-like domain-containing protein {ECO:0000259|Pfam:PF01841};
GN   ORFNames=A2X48_16705 {ECO:0000313|EMBL:OGV36638.1};
OS   Lentisphaerae bacterium GWF2_49_21.
OC   Bacteria; Lentisphaerota.
OX   NCBI_TaxID=1798573 {ECO:0000313|EMBL:OGV36638.1, ECO:0000313|Proteomes:UP000178513};
RN   [1] {ECO:0000313|EMBL:OGV36638.1, ECO:0000313|Proteomes:UP000178513}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=27774985; DOI=10.1038/ncomms13219;
RA   Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA   Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA   Williams K.H., Hubbard S.S., Banfield J.F.;
RT   "Thousands of microbial genomes shed light on interconnected biogeochemical
RT   processes in an aquifer system.";
RL   Nat. Commun. 7:13219-13219(2016).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OGV36638.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MHBH01000077; OGV36638.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1G0XSS0; -.
DR   STRING; 1798573.A2X48_16705; -.
DR   Proteomes; UP000178513; Unassembled WGS sequence.
DR   Gene3D; 2.60.40.3140; -; 1.
DR   Gene3D; 3.10.620.30; -; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR002931; Transglutaminase-like.
DR   Pfam; PF01841; Transglut_core; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..884
FT                   /note="Transglutaminase-like domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5009569635"
FT   DOMAIN          532..609
FT                   /note="Transglutaminase-like"
FT                   /evidence="ECO:0000259|Pfam:PF01841"
FT   REGION          259..290
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   884 AA;  96633 MW;  A89CB327D5ECC90E CRC64;
     MTKFRRIIFC LLLLGIAILP SFSQAQSPAR HAVVIGAKSG EQELFDENLS YCKRTSELLL
     KNGFAKDRVL VFFESPESLP GSTEASSDAI LSSLAKLAGS VREGDELWIF IYGDANINQR
     GLSLATKGKR LYDKPLAEAL DKIPGRQFVF GLCRQSAGLM DSMKNHPRVL FTATSDPNEL
     NPPRMAKHLI DTFAANPSSS LLEILKSASE KTEEEYKSKG LAISETPQLF DGKDILSFPY
     AGSDAKMLAV AIAAPKPEKS TLTESTLSGT APVPAVKTER PTASAKVAEP ATEETKALLA
     KGQALAAKHP GFKAVFLWEK KSMTVSQDNA VQTTSDTAIF LADGSASEIF GSLILEDSPP
     FVEKELLSAR IILPDASFLN VSPEPSITDT KMRRRLVRLK FPGACAGALL ELKSKTSEKP
     ENQMPMFEEE FQIQHEFPVG EAELVIQYPK DKPCRIKVYG SEFKPVQTQT PYSAVSTFKF
     GEVPAFEPLA GDPPIADCVV RMRISSLPSW DEFLKWALRI TEKSMELDDP VKALAVKLTS
     EAKTDTEKVK SIYEFLCELR YETTPVGVRS FRPRLPSEVC SSKYGDCKDK ANALAAMSRS
     LGIDAYLVLL NRGGFSDVSF PCWQFNHAIA FFPKLEGYPN GLWCDATDGS TPFGTLPPGD
     IGRAALIVKP GNFEFKTVTL PSGAENILKQ IVSLEEQTDG TVKGSMVVSA LGLADYELRQ
     QFKRLSPKQA ESLAQHITND SFSGLSASKL QLSPLHELSK PFELRAVLAG KSARLSFRSI
     NFPVPLWAMV AAETRDRPLL INDGQPLKVA QELSCKSTLP EPALPAPFKT EAPGFKASVV
     FSNKGGVRER IATIELSQPM LSPSDYPAFR QAVLEVLKAL DTDF
//
DBGET integrated database retrieval system