ID C3QX07_9BACE Unreviewed; 639 AA.
AC C3QX07;
DT 16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT 16-JUN-2009, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=DUF3857 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BSCG_03419 {ECO:0000313|EMBL:EEO56491.1};
OS Bacteroides sp. 2_2_4.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=469590 {ECO:0000313|EMBL:EEO56491.1};
RN [1] {ECO:0000313|EMBL:EEO56491.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2_2_4 {ECO:0000313|EMBL:EEO56491.1};
RG The Broad Institute Genome Sequencing Platform;
RA Ward D., Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L.,
RA Berlin A., Borenstein D., Chen Z., Engels R., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C.,
RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T.,
RA White J., Yandava C., Allen-Vercoe E., Strauss J., Ambrose C., Lander E.,
RA Nusbaum C., Ilzarbe M., Galagan J., Birren B.;
RT "The Genome Sequence of Bacteroides sp. strain 2_2_4.";
RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; EQ973361; EEO56491.1; -; Genomic_DNA.
DR RefSeq; WP_004310052.1; NZ_EQ973361.1.
DR AlphaFoldDB; C3QX07; -.
DR HOGENOM; CLU_027424_1_0_10; -.
DR Proteomes; UP000003969; Unassembled WGS sequence.
DR Gene3D; 2.60.120.1130; -; 1.
DR Gene3D; 2.60.40.3140; -; 1.
DR Gene3D; 3.10.620.30; -; 1.
DR InterPro; IPR024618; DUF3857.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR002931; Transglutaminase-like.
DR Pfam; PF12969; DUF3857; 1.
DR Pfam; PF01841; Transglut_core; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..35
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 36..639
FT /note="DUF3857 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002929684"
FT DOMAIN 66..223
FT /note="DUF3857"
FT /evidence="ECO:0000259|Pfam:PF12969"
FT DOMAIN 284..392
FT /note="Transglutaminase-like"
FT /evidence="ECO:0000259|Pfam:PF01841"
SQ SEQUENCE 639 AA; 72365 MW; 50CEB8B91D4E6FCB CRC64;
MIRSSYLILK QMTHRAFSLS CVLICLSLHI QPACAQDILK DANSVIVEAR TEVLCKSMTQ
SIEKESLTIT ILNRKGLEAA HFFCGCDMFR SLQKFSGEII NADGQSVRKI KKSELQKSEY
SSSLSTDDYF YFYECNYPSL PFTVKYEWEV KCNNGLIGYP PFIPLADFNQ GVEKATYRIE
LPAGQGCRYR ELNTQGKGIQ VKESTGANGQ QVIEATASKL SPIIKEPFGP DFTELFPRVY
FAPSAFKYDK SEGDMSNWQK YGEWQYRLLD GRDLLTEPFR AKLHELTAHC TTDRDKVKAI
YDYLAKTTRY VSIQLGIGGL QPIAAADVCR TGFGDCKGLS NYTRAMLKEL GIASTYTVIS
TTNERLLPDF SSANQMNHVI LQVPLPQDTL WLECTNPSFP FGYVHQDIAG HDALLIEPTG
GQMYRLPTYP DSLNTQHIVA NITLSPTAEA RIEVNEISRI FQYENEAGIV YLEPNKQKDR
IRSSINLSQA DIQNLQISEC KEPNPSITFD YTATSNQYGH KTGNRLFIPT NVFRKEFSVP
PVTKRTYPIY INYGYTDTDS IRIQLPEGYV IEGLPKPLDV KSKFGSFHSG IQVKDKEIYI
THRLFMRKGV YSPDEYAAFI DFRKQVAGQY GGKIILKKE
//