ID A0A242CCK2_9ENTE Unreviewed; 820 AA.
AC A0A242CCK2;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE RecName: Full=BIG2 domain-containing protein {ECO:0000259|SMART:SM00635};
GN ORFNames=A5880_002118 {ECO:0000313|EMBL:OTO07848.1};
OS Enterococcus sp. 4G2_DIV0659.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Enterococcaceae;
OC Enterococcus.
OX NCBI_TaxID=1834181 {ECO:0000313|EMBL:OTO07848.1, ECO:0000313|Proteomes:UP000195139};
RN [1] {ECO:0000313|EMBL:OTO07848.1, ECO:0000313|Proteomes:UP000195139}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=4G2_DIV0659 {ECO:0000313|EMBL:OTO07848.1,
RC ECO:0000313|Proteomes:UP000195139};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genomic Center for Infectious Diseases;
RA Earl A., Manson A., Schwartman J., Gilmore M., Abouelleil A., Cao P.,
RA Chapman S., Cusick C., Shea T., Young S., Neafsey D., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Enterococcus sp. 4G2_DIV0659.";
RL Submitted (MAY-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OTO07848.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NGLE01000003; OTO07848.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A242CCK2; -.
DR STRING; 1834181.A5880_002118; -.
DR Proteomes; UP000195139; Unassembled WGS sequence.
DR CDD; cd02619; Peptidase_C1; 1.
DR Gene3D; 2.60.40.1080; -; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR040528; Lectin-like.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF18560; Lectin_like; 1.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 4: Predicted;
FT DOMAIN 522..599
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 608..685
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
SQ SEQUENCE 820 AA; 91635 MW; 1104B373493B79DE CRC64;
MRKTIKKWTT LFGGIITLVI LPTNVIAVDL PKEIEYFPSD PTSALQIEIP NDIPETRLFN
EQRKQFPEKF DYTVGNFETV MKNQGELGLC WAYSGTDTIG ISAKKEFGEE YYISPNYFNY
YFSKNAFSDI LNPHNVGGTL NDGGSASRIF IQGALNNLGV SENTLPTPMW LDLNKPMIST
DFYNKTKDKL PIDIEKTIII PGVSYLAEEV DHKKKIMAIK ELVYTYGAST FYYDTEYSHD
STYYNYKTNA TYVPIEDAKA GLVPTYDGWL SANHGITIVG WDDTYAKENF VKKPKNNGAF
KMKNSWGVFP HDRGYFYMSY EDAYLLAAEN IAADTSKEKF DHVNSYITGE MNSYMDLKSD
SKDIYAGNVY TTSKNKEVLE AVSISTDQPH LSYEIYYLDK AVQKNKTFSG FEGLEKIASG
IKDASGIERI QTKKISLKPE SEYSIIVKYT YPRDVSLFRI NLQKVKDASK GQTPHLEAGR
SFFSNMNVSG SRHWLSLSDG SLWGENERFN TWINVYTRNV IENLDIAISP KSAELIVGDS
KKLDALITPE NATNKQVNWS SSNTAIATVS ENGDVTGISA GEVIITAQTA VGYKKATAKI
RIIPKKVAVT DITVPPSTIQ LKLKEVKQLT AVVKPENATN KKITWSSSDP SIATVSGNGE
ITGQALGKVT IIAQTEDGHK ESRCIVEVTQ DETDDHGDTG ETATKIIEGA IVKGKINSET
DIDVFEMPLP EGPDKTVVLE SPNGQAEKFS IQNGAGTWFH QFKGTKEPKI PNQPFRTMYN
SDKPKDKLIR FYIDKSFEKV GGTYEFKITV LHKGQKLSDY
//