GenomeNet

Database: UniProt
Entry: G5SQC5_9BACT
LinkDB: G5SQC5_9BACT
Original site: G5SQC5_9BACT 
ID   G5SQC5_9BACT            Unreviewed;       900 AA.
AC   G5SQC5;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   24-JAN-2024, entry version 30.
DE   SubName: Full=Bacterial group 2 Ig-like protein {ECO:0000313|EMBL:EHH00328.1};
GN   ORFNames=HMPREF9441_01562 {ECO:0000313|EMBL:EHH00328.1};
OS   Paraprevotella clara YIT 11840.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC   Paraprevotella.
OX   NCBI_TaxID=762968 {ECO:0000313|EMBL:EHH00328.1, ECO:0000313|Proteomes:UP000003598};
RN   [1] {ECO:0000313|EMBL:EHH00328.1, ECO:0000313|Proteomes:UP000003598}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=YIT 11840 {ECO:0000313|EMBL:EHH00328.1,
RC   ECO:0000313|Proteomes:UP000003598};
RA   Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA   Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA   Hall O., Minx P., Tomlinson C., Mitreva M., Hou S., Chen J., Wollam A.,
RA   Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA   Chinwalla A., Mardis E.R., Wilson R.K.;
RL   Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHH00328.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFFY01000022; EHH00328.1; -; Genomic_DNA.
DR   RefSeq; WP_008619479.1; NZ_JH376597.1.
DR   AlphaFoldDB; G5SQC5; -.
DR   STRING; 762968.HMPREF9441_01562; -.
DR   GeneID; 78582618; -.
DR   PATRIC; fig|762968.3.peg.1400; -.
DR   eggNOG; COG2273; Bacteria.
DR   HOGENOM; CLU_321807_0_0_10; -.
DR   Proteomes; UP000003598; Unassembled WGS sequence.
DR   Gene3D; 2.60.40.1080; -; 1.
DR   InterPro; IPR003343; Big_2.
DR   InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR   Pfam; PF02368; Big_2; 1.
DR   Pfam; PF13290; CHB_HEX_C_1; 1.
DR   SMART; SM00635; BID_2; 1.
DR   SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
PE   4: Predicted;
FT   DOMAIN          470..545
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
SQ   SEQUENCE   900 AA;  95955 MW;  E0B071BD915ED401 CRC64;
     MTLLLLALVS GSVWGQEVLK SGDFRKLSAY SYTADKEVVV GNDTWLVSTS QYNASVFYLG
     CNSKNAAKGL LSDKWADVIA AIKNQDASFK ENTDHAYAMR LQLNGTSLSD VGHIQFDWAG
     SNDAMRMYLL VDEGDGLVLK NTLTSGSGPT VAGSISYEFA QPQSIKDIVL VAVPSSNSKT
     IRMSTYEITK AVATNKVATP TFEPIDGTTF DESLIVKATC ATEGASIHYT TNGDEPTVDS
     DVLTEAGITI TETTAVKAIA VKEGLDNSSV VTATYTKVEP FASLAELKEK GGATAAGVPC
     IMKLTDAVVT YADSRKAYIQ DETAGLYVFG SNKLKAGTKL NGVVAAQLAL YFGLYELKVD
     GGEFDNVAVT NDVEIPVQEV TVAELNQNFA QYESMRVKVV DATVTSSFND KNGEIEQNGE
     KIALRAADES ITADVQATVD VTGYPGLYNS TKQLNVILQE DIAVKTAGKT QATLTFDSDA
     YSVNVGESLT VKATTNSSAS VVYSSSDKTI ATVDENTGEV QAGNKAGTVT ITATVTENDK
     YTGATATCTV KVVDPGIVPD VVALVSEKDG IYYAMLNTTG NSKNKLNASE ATILNGKVIT
     DRMDLCGWVV DQSAGYIKDF NDNMFVAHGS GNTDLVLQSN GFKWEYSDEM WTCVDEKNKQ
     RAIGLNSSTN DDVTKYFFGT YLVNEIAAKY PAPVVMPITE GYHRNVTSGD YGTICLPYAV
     AAEDMAGAEF FSIAGKIMKD GEPQSIVLNQ VTTLEAGVPY IFSATSDKLI AAYSGKAVAV
     AEEANGLIGS FEGQDVAEGM YLISAQNKVQ LCGKSCKISG NRAYIDMNEV PEYSGEVGVN
     QRLISFEDAT GISETMVEGG LADVYTLSGV EVRHQVDKSE ATQGLPQGIY IVNGKKVVVK
//
DBGET integrated database retrieval system