ID R5GYK3_9FIRM Unreviewed; 806 AA.
AC R5GYK3;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE SubName: Full=Ig domain protein group 2 domain protein {ECO:0000313|EMBL:CCY17827.1};
GN ORFNames=BN782_01562 {ECO:0000313|EMBL:CCY17827.1};
OS Eubacterium sp. CAG:786.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=1262893 {ECO:0000313|EMBL:CCY17827.1, ECO:0000313|Proteomes:UP000018127};
RN [1] {ECO:0000313|EMBL:CCY17827.1, ECO:0000313|Proteomes:UP000018127}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:786 {ECO:0000313|Proteomes:UP000018127};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY17827.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAXS010000164; CCY17827.1; -; Genomic_DNA.
DR AlphaFoldDB; R5GYK3; -.
DR STRING; 1262893.BN782_01562; -.
DR Proteomes; UP000018127; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 4.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR Pfam; PF02368; Big_2; 4.
DR SMART; SM00635; BID_2; 4.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 4.
PE 4: Predicted;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..806
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004391896"
FT DOMAIN 258..333
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 425..501
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 526..602
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 607..683
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
SQ SEQUENCE 806 AA; 85299 MW; 4BC31D38E545A86D CRC64;
MKLKRLMTAS MAAVMAVSSA IVCQISVSAA ETVLYEYKAG QITEVLFTEE AMTAVKAATK
SIEVTIDAEG DGSFFLLETS TWKGSGGSIT SSDENWAAFT AANGIKVCPS NYTKVNKITA
IIDGTASYTV YDVTAETKGI PFPKTGDIYK ISADDLANAG VTAENIADSK LVVTFDSVGE
NKELSLLATR EGWGDMFYWG NSMTAGEMEL PLTGFCDSVD LGTKPLTTGI SLFCKDSVVS
KIAIRTPAKH VESLKITQKD GSNVDPEYTY GETIALTAEV TPADADDADK VTWASETPDV
ATVNENGEVT LLKAGEAKIT ATADGKSESV TFTVKKKKLT VAPSALTEIY VKDNKIEAAA
KAVTAKYALE NDENVSLVKG TDYTVSEPQI SADKKYYSIT ITLSDDAAGK YELSTTTLKG
YIAYELTSVA LNSDTLNLVA GADGRQLVAT TTPDNALLDN LTFTYKSDNE TVATVDKNGL
VTPLKAGTAT ITVTAKAVVT TTNGMPILTK TATAKCTVTV TDNSIPATNI ELDTYSKTMT
VGEKAKLTAT VKPDDSTDKV TWKSNNDKIV SVDENGNITA LATGTTEITA TAGSVSAVCK
VTVEGVKVSK VELNKTSVSL KVGGTEQLTA TVSPDNATDK TVTWTSSNEK IATVADGKIT
AVAPGTATIT ATADGKSATC TVTVSKEAQI IKDPKKYPGV PKNYAKVDPV VTTTNSDGTK
DMLVMFSISD SDVKNFRGAR VTFKRADGKT FSRSLILGKY YTDVTYVKDG DNYNGNSQNY
IVIRLKNVDE SWGDISAKFE LINGLG
//