GenomeNet

Database: UniProt
Entry: A0A1Q3TQF2_9SPHI
LinkDB: A0A1Q3TQF2_9SPHI
Original site: A0A1Q3TQF2_9SPHI 
ID   A0A1Q3TQF2_9SPHI        Unreviewed;       670 AA.
AC   A0A1Q3TQF2;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   24-JAN-2024, entry version 15.
DE   RecName: Full=DUF3857 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=BGO52_18950 {ECO:0000313|EMBL:OJW04597.1};
OS   Sphingobacteriales bacterium 44-61.
OC   Bacteria; Bacteroidota; Sphingobacteriia; Sphingobacteriales.
OX   NCBI_TaxID=1895838 {ECO:0000313|EMBL:OJW04597.1, ECO:0000313|Proteomes:UP000186208};
RN   [1] {ECO:0000313|EMBL:OJW04597.1, ECO:0000313|Proteomes:UP000186208}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=44-61 {ECO:0000313|EMBL:OJW04597.1};
RA   Kantor R.S., Huddy R.J., Iyer R., Thomas B.C., Brown C.T., Anantharaman K.,
RA   Tringe S., Hettich R.L., Harrison S.T., Banfield J.F.;
RT   "Genome-resolved meta-omics ties microbial dynamics to process performance
RT   in biotechnology for thiocyanate degradation.";
RL   Submitted (SEP-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OJW04597.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MKTW01000001; OJW04597.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1Q3TQF2; -.
DR   STRING; 1895838.BGO52_18950; -.
DR   Proteomes; UP000186208; Unassembled WGS sequence.
DR   Gene3D; 2.60.120.1130; -; 1.
DR   Gene3D; 2.60.40.3140; -; 1.
DR   Gene3D; 3.10.620.30; -; 1.
DR   InterPro; IPR024618; DUF3857.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR002931; Transglutaminase-like.
DR   Pfam; PF12969; DUF3857; 1.
DR   Pfam; PF01841; Transglut_core; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..670
FT                   /note="DUF3857 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012388553"
FT   DOMAIN          71..222
FT                   /note="DUF3857"
FT                   /evidence="ECO:0000259|Pfam:PF12969"
FT   DOMAIN          321..388
FT                   /note="Transglutaminase-like"
FT                   /evidence="ECO:0000259|Pfam:PF01841"
SQ   SEQUENCE   670 AA;  76883 MW;  908BF80533A21BA5 CRC64;
     MKVQSILVIL LGLFSFEMSA QEKAKIKFGK VSPGDFKAVY PIDSNASAVI IGDIGSTDFV
     GNDKGSFALE FKRYCRVHIL NKNGYDISKV EIPIYRVDEA EEELTSIKAV TYNLENGKVV
     ETKLDPKSIF KEKISKKKVV KKFTFPAVRE GSIIEFEYKL KSDFLFNLRP WEFQGSYPTL
     WSEYIVGMPE FLNYVSLTQG YQPYYIKDQK DKRESYSVEI KKENAMLSGS DRINFSAGVT
     FFRWVMKDVP ALKEESYTST LSNHIARIEF QLAEYREPFI PKKVMNTWAE TARRLLDEED
     FGKSLQEENE WLNDAISTAG KNAKTSLEQA KNIYAWLRDN MTCTNYNSFY TTKSLKATLN
     SKSGSEAEIN LLLTAMLKKL GMEADPVILS TRSNGYAYAL YPLIDRFNYI VTRVVIDGKT
     YYLDASRPML GFNKLGYDCY NGHARVIEPN AAGVEFKPES LMEKELTSVF ITNDANGNLV
     GSLQHSPGYY SSYSLRSRLK EKGQDAYFND LKKEAGGEIE ITEPAIDSVN NYEAPLAIRY
     SFKLPTDKED IWYFNPMFSE GWKENPFKST ERLYPVEMPY GIDQTFLLRM DVPPGYAVDE
     LPKPIIVKLN PEDEGNFEYR AVETAGVVSI RSRIRLSRTY FSPEEYVMLR EFFNLVVKKH
     NEQIVFKKKK
//
DBGET integrated database retrieval system