ID R5GR78_9BACT Unreviewed; 809 AA.
AC R5GR78;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=BIG2 domain-containing protein {ECO:0000259|SMART:SM00635};
GN ORFNames=BN773_01014 {ECO:0000313|EMBL:CCY15041.1};
OS Prevotella sp. CAG:755.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262935 {ECO:0000313|EMBL:CCY15041.1, ECO:0000313|Proteomes:UP000018353};
RN [1] {ECO:0000313|EMBL:CCY15041.1, ECO:0000313|Proteomes:UP000018353}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:755 {ECO:0000313|Proteomes:UP000018353};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY15041.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAXR010000025; CCY15041.1; -; Genomic_DNA.
DR AlphaFoldDB; R5GR78; -.
DR Proteomes; UP000018353; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 2.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR Pfam; PF02368; Big_2; 1.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018353}.
FT DOMAIN 135..209
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 380..456
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..29
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 809 AA; 85077 MW; E4BB554BA313F93B CRC64;
MGEGELGWTS SHAAQQFESN GSSSYEPRGV QFGTSKAPIN VFTLTANESL NGVTSVTIVA
SASGAGNTIA VKVGDTNFNG EVTEIASGTA NANKEYTFTG EATNGVITIT IDDKTKAVWV
KSISVKNQDV VPGKEAAGLS FGEETKFTVD LSGTFTAPTL TKATDAAATY ASSETSVATV
DASTGDVTLV GAGTTTITAT TPETDTYMAG SASYTLTITD VASMPKFQKV TTVTSGKRYL
IVVENNGEIS VAQPVPATGN PYGYLNVTAP AATDADGNIY MEDITNAFTI TTVEGGYYIQ
QADGRYLYQT GTFDSFNVGS AGSNAVWTIE SQADGTFLIT NVAMNKYVQF STRYNSFGSY
EEEQQGAIRP MLYEEVGGKE APEFSFAAQT ATLNLLDAGA FTAPALNSTS DGAVTYTSSN
TEVATIDADG RIEALAQGTT TITATVAETD EFTAATAEFT LHVTTLSDYM KTTAVESGKT
YILAADNEGT AVIAQTVAAN SRYDYLPVNE AAPADLMGAT DLLNGFTLTE TEGGYTIQDA
YGRYLYMDET HTSFNVSADR QDGDTWTVEP QADGTVKITN VLREKFIQYS TEHESYGAYT
DEQGIKPSLY VKNSIEVNED GYATAYSDNA VILPEGLQAA VITGVADGKL TIDYRYNNGD
VIPAGTAVLV KGEAGAHDYN LATSDEQAVT GNLLKGAASD VTTEGDGCLF YMLSYDANHQ
NLGFYWAAKD GAAFTSKAGK AYLALPQAAS AGVTGYALDG TPVGIDQATT DAQAPAVIYT
VDGRLVQQTD VKDLPKGLYI VNGKKVIIK
//