ID A0A1Y4MGV4_9FIRM Unreviewed; 1149 AA.
AC A0A1Y4MGV4;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=BIG2 domain-containing protein {ECO:0000259|SMART:SM00635};
GN ORFNames=B5F12_00150 {ECO:0000313|EMBL:OUP65972.1};
OS Pseudoflavonifractor sp. An176.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Pseudoflavonifractor.
OX NCBI_TaxID=1965572 {ECO:0000313|EMBL:OUP65972.1, ECO:0000313|Proteomes:UP000196146};
RN [1] {ECO:0000313|Proteomes:UP000196146}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An176 {ECO:0000313|Proteomes:UP000196146};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUP65972.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFKO01000001; OUP65972.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y4MGV4; -.
DR Proteomes; UP000196146; Unassembled WGS sequence.
DR CDD; cd00688; ISOPREN_C2_like; 1.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.60.40.1080; -; 4.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR Pfam; PF02368; Big_2; 3.
DR SMART; SM00635; BID_2; 4.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000196146}.
FT DOMAIN 221..297
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 303..379
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 615..685
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 694..783
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 1084..1117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1149 AA; 124959 MW; 64757EC85D6BE83F CRC64;
MRAEARRRGR RWLSLLLTLA MVWTLVSGAL ALNTSRSGVQ TVTVTMEYAD ESGHTMYYLP
PTQVELEEGD ILVDVLQRGY QDRGSVTYSA LYGFTVTPND GQAVGEVNWG KKAWWPFVDS
VKVGNKYQVQ AGEVIRLIYV QDSAQGTPDY IPPAAEGGQG ALGINKDKLV SSLAQLTQAQ
IDANAQAYEQ ALQVAVDGAS TQEQVNEQTQ IVSQLLAQKV PATDITVTPG EVTLMVGETQ
TLAATVTPEE SNDTVVWYSE DTHVAQVTDQ GVVRAVGEGT AVITAQANET VKATVMVTVT
GIAAQSVTLS ETKLDLEEGQ GMRLTATVTP ADSTDAVVWQ SSAPDIVSVD DSGLVVARQA
GSAQITVTAG SCSAVCTVTV TPREVPTTPT VVFYHTDGRI TQWDDTHTCT LSALDEGRFV
LEGVAEGQST YWSCSQQNGG SESSTIHISS SGKFYPSVGE YQAYVYDKNP DWYVAQELAT
FTLKVVPTQV TDLKLYQDGW PVTGEEPLHL SGTQPQQLTV KGCLDGVYIT IPNQALEMTS
TEGSYIYTLD GENGLEFFAE DQSMHTFTVS LADNPQVTVS FQATAQRIDV TGLTVTYPEV
FYIEGWNGLG NQYVGITSHD TDPERRYEIN IEPHNATVKD VVWVSHDSDV AEFQATYGNG
IVPKKAGTAR FTVTSVDNPA ATQDLIIRFE YKYPLEKVEM EETALTLAQY ESMNLDLLVT
PANATNQRFT WTYSQDGIVK VNDYVTSTPG TMTTTHTLSA LEQGTVTVTG TPLDDTQGAK
PIQFTVTVTQ PDAVEDLDFD RYVTDNIRHA LSYLDSQLEG NYTYGAEWSL FTLLRAGASL
SQSDLDHYYA SITQQLESGG RMLPTDYFRV VVALLAMGKD PTDVAGMDLI ETLYNYPNLD
RMTSNMMSYT LLALDTKDYE VPQDARWTRE TLIEKILTFQ NANGGFGLSS ADTVSVDVTA
MTLQALAPYR DMEQVGLAFD RALEYLRSQM TSDCGYINEG DDNGCTAAQV LTALAVAGID
PLDPDNGFTH GNYNLVTKLD QFKRESGFTT FMSSDQPDGM GTVQIGYALE AYRRFVAGEN
TLFDLTQWTP APPTEDGDSG ENPGSGEEDS QVPQTGDASP LVESMVILGM SAAGLCAAVW
TDHKRKSRG
//