GenomeNet

Database: UniProt
Entry: A0A1G1FCU4_9BACT
ID   A0A1G1FCU4_9BACT        Unreviewed;      1215 AA.
AC   A0A1G1FCU4;
DT   15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT   15-FEB-2017, sequence version 1.
DT   24-JAN-2024, entry version 22.
DE   RecName: Full=Transglutaminase-like domain-containing protein {ECO:0000259|Pfam:PF01841};
GN   ORFNames=A2X56_13145 {ECO:0000313|EMBL:OGW29570.1};
OS   Nitrospirae bacterium GWC2_57_13.
OC   Bacteria; Nitrospirota.
OX   NCBI_TaxID=1801697 {ECO:0000313|EMBL:OGW29570.1, ECO:0000313|Proteomes:UP000177780};
RN   [1] {ECO:0000313|EMBL:OGW29570.1, ECO:0000313|Proteomes:UP000177780}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=27774985; DOI=10.1038/ncomms13219;
RA   Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA   Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA   Williams K.H., Hubbard S.S., Banfield J.F.;
RT   "Thousands of microbial genomes shed light on interconnected biogeochemical
RT   processes in an aquifer system.";
RL   Nat. Commun. 7:13219-13219(2016).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OGW29570.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MHDZ01000018; OGW29570.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1G1FCU4; -.
DR   STRING; 1801697.A2X56_13145; -.
DR   Proteomes; UP000177780; Unassembled WGS sequence.
DR   Gene3D; 3.10.620.30; -; 1.
DR   Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR002931; Transglutaminase-like.
DR   Pfam; PF01841; Transglut_core; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
FT   DOMAIN          375..520
FT                   /note="Transglutaminase-like"
FT                   /evidence="ECO:0000259|Pfam:PF01841"
FT   REGION          161..246
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        175..196
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        208..229
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1215 AA;  133441 MW;  B10B321FD362273F CRC64;
     MRAFLRIARF VSAITLFFFC WTYMPLYSIA AYAAEKKQVR NADPSRSPLG KGGGRGVAET
     AGDRFEKAIE TIREKVGKAE EKAGRGEDAA GEIEAVTKQR LEIETLDSEL RKEFSETEKK
     LKDTKLPQEI LDRHYKFVKH YEDNLAELKK NLSEVEQYTA DSKQGISKQQ RVYSKQGKDA
     LKRAKAHLDK TKAPSKHVPL DPNNLPFRSV KGKEREPRLK KEEFEKDFPP QRPQSSPRTA
     KLSDTDPHGY ARILATSEFN SALRTHNSEL LSAASPLTPD PSRILLAYND IASDVPFQLP
     RPSEERAEVR GGSSLAPSPI LPVTAFSVLS ESSVAHDLTP NFELSTLNLA AATAADLPTA
     ADLSQTPEVQ FTPEIQAKAL ELGNNPVKIY EWVRNNIEFV PTYGSIQGAD MCLQAQQCNA
     FDTASLLIAL LRASNIPARY VYGTIEVPIE KVLNWAGGFT DPMAAASLMA SGGIPVKPYI
     VGSKIAKVQM EHTWVETYIP YGNYRGAIMD QSIKTWIPMD GSFKEYVYAA GFDITPTVSF
     SQNDYLSQVQ SQNPVHYYQS QIQEYLDANM PETSIIDVKG YREIKQERYP FLSSTLPYKT
     IVRGGVFASV PANTQAKAVF SLPGASLAFT MPEIAGKRIT LSYIPATSSD EALISNYGGY
     IYGVPAYLLN LKPVLKIDGI IKLSGDAAMM GAEQALMLQL SQPKGLSETV QKKLIAGAYY
     AIGLDLQGIN ENVLGKRNYT LTTNVLSETA GTLGNDDLIG ELLYLRAVTY FLANDKIYRS
     GAKLFNTVVT RTLSEGITSF TLSVSHLFLI PRTATPSGIN MDVAMDRVIA VAKDGNVAKE
     KAYMDIAGLV SSYHEHDIFE RIDGFSSVSA VRAIQVAMAN GTPIHYINSS NIGQTLPLLQ
     VASEIKTEIQ NAVNAGKEVT IPQANVQIND WNGVGYFIKD VTGSGAYMIS GGLAGSDSTS
     QNDGMQIVQL HKEPLGWVKD SIDPQTRRTI ITAAELEIGE LIVKEAEDFK GYKYETIGQC
     VGLVRKAYKA AGICLDEWAG CGENLMKKNG IPYADGKNGV YYFYEIAKKL KINESIRTTD
     DKLIIGDMVF WDNTLDYNCN CEKDDDLTHV GIVVKVNIDG KGTLNFIHAG GKGVVSDNPM
     NVTNDYKSSK SPYNTPLRTL DKPSCCTCRG VEKCSGTSGC KEGKKPYTCT EEQWDSVPKS
     SGQLFIGFGT IRNPK
//