ID R6DM20_9BACE Unreviewed; 651 AA.
AC R6DM20;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=DUF3857 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN772_03574 {ECO:0000313|EMBL:CDA85356.1};
OS Bacteroides sp. CAG:754.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=1262750 {ECO:0000313|EMBL:CDA85356.1, ECO:0000313|Proteomes:UP000017906};
RN [1] {ECO:0000313|EMBL:CDA85356.1, ECO:0000313|Proteomes:UP000017906}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:754 {ECO:0000313|Proteomes:UP000017906};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA85356.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBCP010000206; CDA85356.1; -; Genomic_DNA.
DR AlphaFoldDB; R6DM20; -.
DR Proteomes; UP000017906; Unassembled WGS sequence.
DR Gene3D; 2.60.120.1130; -; 1.
DR Gene3D; 2.60.40.3140; -; 1.
DR Gene3D; 3.10.620.30; -; 1.
DR InterPro; IPR024618; DUF3857.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR002931; Transglutaminase-like.
DR Pfam; PF12969; DUF3857; 1.
DR Pfam; PF01841; Transglut_core; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017906};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..651
FT /note="DUF3857 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004404131"
FT DOMAIN 55..233
FT /note="DUF3857"
FT /evidence="ECO:0000259|Pfam:PF12969"
FT DOMAIN 285..395
FT /note="Transglutaminase-like"
FT /evidence="ECO:0000259|Pfam:PF01841"
SQ SEQUENCE 651 AA; 73542 MW; D28C83CCBB361346 CRC64;
MRKQLSITLA CFLLTSATAF AQSWKPYENT AKGKTYETSD CVTLLDSTLV SVQPTGQGSF
AVCKVIKVQT PKGAVANRVI KYDYDPLTAY AEFKRATVYR ANGQVDELDV KKTCDYAAPA
RAIYWGARQI MLEIGALHPG DIVDYEIAKK GFTYALLGAG DEDDSRFIPP MRGQFYDIVP
FWSSVPTVRK VYKVAVPMEK ELQFQFYQGE CASSMRYEDN SKVYTFTMDN VMPFVREPNM
VDLFDAAPKL MMSSTPQWKD KSLWFNKVNE DYGSFAPLPE AQKKVDELIK GEKTEMEKIA
VLTHWVADNI RYSGISMGKG EGFTLHNTQM NYTDRCGVCK DIAGTLISFL RMAGFEAYPA
MTMAGSRVES IPADHFNHCV AVVKLSNGTY MPLDPTWVPF CRELWSSAEQ QQNYLPGVPE
GSDLCITPVS APENHYMRIK ADNRLDANGT LRGTFTLTAE GQSDSNIRRI FTTGFQSEWK
NTMERQLLAI SPKAKLLSVD YGKNPKDYQA APIKITFRYE IPDYAFKGEK EMFFRPLVMN
NLYNQVRSYL RIDTSLKERK YGFKDGCSRL VELDETIQLP AGYKLANANK NETMQGTGAN
FEGSLAQQGN KVLLHNKLAL KKRVYEASDW DSFRNSVNAH KAYGEYLVIK K
//