ID R9MH45_9FIRM Unreviewed; 1227 AA.
AC R9MH45;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=BIG2 domain-containing protein {ECO:0000259|SMART:SM00635};
GN ORFNames=C818_01804 {ECO:0000313|EMBL:EOS70389.1};
OS Lachnospiraceae bacterium MD308.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=1235799 {ECO:0000313|EMBL:EOS70389.1, ECO:0000313|Proteomes:UP000014117};
RN [1] {ECO:0000313|EMBL:EOS70389.1, ECO:0000313|Proteomes:UP000014117}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3-2 {ECO:0000313|EMBL:EOS70389.1,
RC ECO:0000313|Proteomes:UP000014117};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3-2.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS70389.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTE01000019; EOS70389.1; -; Genomic_DNA.
DR AlphaFoldDB; R9MH45; -.
DR STRING; 1235799.C818_01804; -.
DR PATRIC; fig|1235799.3.peg.1922; -.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_268041_0_0_9; -.
DR OrthoDB; 2042388at2; -.
DR Proteomes; UP000014117; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 2.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR Pfam; PF02368; Big_2; 2.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014117};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1227
FT /note="BIG2 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038373901"
FT DOMAIN 1059..1134
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 1146..1224
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 35..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..58
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..85
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 86..105
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1227 AA; 137042 MW; E112A83847125CD8 CRC64;
MKKRYLAMLL AIAIVFAQAV GTGQVVLAAQ IAEEEQQGED AAQEKEAEIE KTEEEKTEGE
EIVTDVPENG DEKEDADEEQ NEGDVQPEEE GKKEEVVEKK RAAALPEDKQ GVGEITILPA
EGTELRGSSD WTWLFENETQ TFHVDVDGSN TVQWEIGRYT EDSTEAFNEG VPGNPVTWKI
SEDGKSITVT AKSESSGLFI SASCGDAQHQ ILVDVKKRKM IPNIWVGREE KIEDFPGEYF
ISKTNGGDCY VENEDYPNGE ELPLTILSIK SEDENIASIG EEGTEWVLQL KKLGQTNIII
KCCLQNDETQ TFEKKIPVHV KNEVYSMDLM TSTGSTEMLF DDSLELTANV QGSLWNDSEE
EITPIDTKGL RVEWTYEVSS YGNDGEDDSA EVTKKVKLEP NADNSRACHI TAGKADHHYE
VVVRVKAYKD SEEVASDVIE LSVNDFYNLL EPVSINTDIL PEEVMEISGV GVYRYQVGKE
KELIKDIKIS FEYDPEILEI TKGDQAISSG DDPKLEAGGN FIVKRLSPKD TLLRVNSWVQ
EEDGQWDYAE TREWWFESCD YNDICFEKDS YEIFASDDPE ENETIDMKLN TESIEAEHTV
EWTVGLMSED GVFEQTIDPS NYTASGNVLT LDGNQIKNAL KALGQEREYL WVSVLAVVKI
SGTEVGSTYS VVDIKTPEYE LESVYDRTEV LSTWLYYDKE ISCHVENKNY PEGADIAVTL
LDIVIEQEER VWKKEEDASG IILNAVGCGE AKVTFKTQSK ELGEKDFTVN MNVTDDMYHI
YANDGRDYIQ LLPGKSHKLK VEVYHSYYDS KIEGIKEEKV NAPFSYLTYD SFDEEVISVK
DGTITAKTPL EIPMYSNLRI HLSVPREGRE NFECDELLHV EVTNCYYQAV AEKMYVEPGE
VVSSVPVKLL RFDTEHENGI EEKGITYSLD DEWNVTFNKD KTGFTVGNDL EDGETFEISV
IVEKKVGDEE IREFGQLELE ICKHSFVVKS TKQDNCTVEG VKTLECSKCH TAKTETIPKT
SHSYGAWQTT REATVFAQGA QVRTCRRCRA SETRTLAKLK PFIKLNAKTI PLKVKQSTTA
VKVTMEKGDS IASWKSSNKK IATVNSKGKI TGKKAGTAKI TVTLKSGVKA TIKVKVQKKA
VTTSKISVTG KTSKIGKKLT LKKGKSATLA VTVTPITSKE KVTYKSSNKK VAAVTSKGKI
TAKKPGKAKI TVKSGKKKVV ITVTVKK
//