GenomeNet

Database: UniProt
Entry: A0A242CCK2_9ENTE
ID   A0A242CCK2_9ENTE        Unreviewed;       820 AA.
AC   A0A242CCK2;
DT   25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT   25-OCT-2017, sequence version 1.
DT   24-JAN-2024, entry version 16.
DE   RecName: Full=BIG2 domain-containing protein {ECO:0000259|SMART:SM00635};
GN   ORFNames=A5880_002118 {ECO:0000313|EMBL:OTO07848.1};
OS   Enterococcus sp. 4G2_DIV0659.
OC   Bacteria; Bacillota; Bacilli; Lactobacillales; Enterococcaceae;
OC   Enterococcus.
OX   NCBI_TaxID=1834181 {ECO:0000313|EMBL:OTO07848.1, ECO:0000313|Proteomes:UP000195139};
RN   [1] {ECO:0000313|EMBL:OTO07848.1, ECO:0000313|Proteomes:UP000195139}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=4G2_DIV0659 {ECO:0000313|EMBL:OTO07848.1,
RC   ECO:0000313|Proteomes:UP000195139};
RG   The Broad Institute Genomics Platform;
RG   The Broad Institute Genomic Center for Infectious Diseases;
RA   Earl A., Manson A., Schwartman J., Gilmore M., Abouelleil A., Cao P.,
RA   Chapman S., Cusick C., Shea T., Young S., Neafsey D., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Enterococcus sp. 4G2_DIV0659.";
RL   Submitted (MAY-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OTO07848.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NGLE01000003; OTO07848.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A242CCK2; -.
DR   STRING; 1834181.A5880_002118; -.
DR   Proteomes; UP000195139; Unassembled WGS sequence.
DR   CDD; cd02619; Peptidase_C1; 1.
DR   Gene3D; 2.60.40.1080; -; 2.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR003343; Big_2.
DR   InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR   InterPro; IPR040528; Lectin-like.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   Pfam; PF02368; Big_2; 2.
DR   Pfam; PF18560; Lectin_like; 1.
DR   SMART; SM00635; BID_2; 2.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   4: Predicted;
FT   DOMAIN          522..599
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          608..685
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
SQ   SEQUENCE   820 AA;  91635 MW;  1104B373493B79DE CRC64;
     MRKTIKKWTT LFGGIITLVI LPTNVIAVDL PKEIEYFPSD PTSALQIEIP NDIPETRLFN
     EQRKQFPEKF DYTVGNFETV MKNQGELGLC WAYSGTDTIG ISAKKEFGEE YYISPNYFNY
     YFSKNAFSDI LNPHNVGGTL NDGGSASRIF IQGALNNLGV SENTLPTPMW LDLNKPMIST
     DFYNKTKDKL PIDIEKTIII PGVSYLAEEV DHKKKIMAIK ELVYTYGAST FYYDTEYSHD
     STYYNYKTNA TYVPIEDAKA GLVPTYDGWL SANHGITIVG WDDTYAKENF VKKPKNNGAF
     KMKNSWGVFP HDRGYFYMSY EDAYLLAAEN IAADTSKEKF DHVNSYITGE MNSYMDLKSD
     SKDIYAGNVY TTSKNKEVLE AVSISTDQPH LSYEIYYLDK AVQKNKTFSG FEGLEKIASG
     IKDASGIERI QTKKISLKPE SEYSIIVKYT YPRDVSLFRI NLQKVKDASK GQTPHLEAGR
     SFFSNMNVSG SRHWLSLSDG SLWGENERFN TWINVYTRNV IENLDIAISP KSAELIVGDS
     KKLDALITPE NATNKQVNWS SSNTAIATVS ENGDVTGISA GEVIITAQTA VGYKKATAKI
     RIIPKKVAVT DITVPPSTIQ LKLKEVKQLT AVVKPENATN KKITWSSSDP SIATVSGNGE
     ITGQALGKVT IIAQTEDGHK ESRCIVEVTQ DETDDHGDTG ETATKIIEGA IVKGKINSET
     DIDVFEMPLP EGPDKTVVLE SPNGQAEKFS IQNGAGTWFH QFKGTKEPKI PNQPFRTMYN
     SDKPKDKLIR FYIDKSFEKV GGTYEFKITV LHKGQKLSDY
//
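
The record above follows the UniProtKB flat-file layout: each line starts with a two-letter line code, the data begins at column 6, sequence lines are indented with blanks, and `//` terminates the entry. As a minimal sketch (not an official parser), the Python below pulls the declared length, the FT DOMAIN ranges, and the sequence residues out of such a record. The function name `parse_entry` and the shortened `RECORD` excerpt are assumptions made for illustration; the excerpt carries only the last sequence line, so its residue count is not expected to match the declared 820 AA.

```python
import re

# A few lines copied from the record above; a real script would read the full entry.
RECORD = """\
ID   A0A242CCK2_9ENTE        Unreviewed;       820 AA.
FT   DOMAIN          522..599
FT                   /note="BIG2"
FT   DOMAIN          608..685
FT                   /note="BIG2"
SQ   SEQUENCE   820 AA;  91635 MW;  1104B373493B79DE CRC64;
     SDKPKDKLIR FYIDKSFEKV GGTYEFKITV LHKGQKLSDY
//
"""


def parse_entry(text):
    """Return (declared_length, domains, sequence) for one flat-file entry.

    Hypothetical helper: handles only the SQ, FT DOMAIN, and sequence lines.
    """
    declared = None
    domains = []
    residues = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):  # entry terminator
            break
        code, body = line[:2], line[5:]
        if code == "SQ":
            # The SQ header declares the length, e.g. "SEQUENCE   820 AA; ..."
            declared = int(re.search(r"(\d+) AA", body).group(1))
            in_sequence = True
        elif in_sequence and code == "  ":
            # Sequence lines hold blocks of 10 residues separated by blanks
            residues.append(body.replace(" ", ""))
        elif code == "FT":
            match = re.match(r"DOMAIN\s+(\d+)\.\.(\d+)", body)
            if match:
                domains.append((int(match.group(1)), int(match.group(2))))
    return declared, domains, "".join(residues)


declared, domains, seq = parse_entry(RECORD)
print("declared length:", declared)
for start, end in domains:
    # Feature coordinates are 1-based and inclusive
    print(f"DOMAIN {start}..{end} ({end - start + 1} residues)")
print("residues in excerpt:", len(seq))
```

Run against the complete entry instead of the excerpt, the same function can be used to check that the concatenated sequence lines add up to the length declared on the SQ line.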