Database: UniProt
Entry: M9LKH4_PAEPP
ID   M9LKH4_PAEPP            Unreviewed;       793 AA.
AC   M9LKH4;
DT   29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT   29-MAY-2013, sequence version 1.
DT   24-JAN-2024, entry version 25.
DE   SubName: Full=Bacterial surface protein {ECO:0000313|EMBL:GAC43815.1};
GN   ORFNames=PPOP_3215 {ECO:0000313|EMBL:GAC43815.1};
OS   Paenibacillus popilliae ATCC 14706.
OC   Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX   NCBI_TaxID=1212764 {ECO:0000313|EMBL:GAC43815.1, ECO:0000313|Proteomes:UP000029453};
RN   [1] {ECO:0000313|EMBL:GAC43815.1, ECO:0000313|Proteomes:UP000029453}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 14706 {ECO:0000313|EMBL:GAC43815.1,
RC   ECO:0000313|Proteomes:UP000029453};
RA   Iiyama K., Mori K., Mon H., Chieda Y., Lee J.M., Kusakabe T., Tashiro K.,
RA   Asano S., Yasunaga-Aoki C., Shimizu S.;
RT   "Draft Genome Sequence of Paenibacillus popilliae ATCC 14706T.";
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GAC43815.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BALG01000256; GAC43815.1; -; Genomic_DNA.
DR   AlphaFoldDB; M9LKH4; -.
DR   OrthoDB; 503324at2; -.
DR   Proteomes; UP000029453; Unassembled WGS sequence.
DR   Gene3D; 2.60.40.1080; -; 8.
DR   InterPro; IPR003343; Big_2.
DR   InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR   Pfam; PF02368; Big_2; 1.
DR   SMART; SM00635; BID_2; 7.
DR   SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000029453};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..793
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004100082"
FT   DOMAIN          74..154
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          158..240
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          244..329
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          430..510
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          518..598
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          628..708
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   DOMAIN          712..791
FT                   /note="BIG2"
FT                   /evidence="ECO:0000259|SMART:SM00635"
FT   REGION          28..54
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        34..51
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   793 AA;  86380 MW;  6254F2FCDEBA5967 CRC64;
     MLVCALVIGL LPLHGVANAE WGGTDVQFES DESAGWGSGD SKTTPEITEK YQGGTSEKKA
     IPNYMYEKLG IRSADTGVNI DKEMNLYPGM KLDWKAFAFD KAGNTENDVT TDASWRSSNP
     SVVEVDKGKL SLKDKGEAKI TVKHKGLTSN IKIKVEPPCK ELTLSPGKLI EFKFGDKAIN
     VTAKAKMKDG NLENVTDKAD WSTSDSDVVE VKDGAVKPIG VGTATITATY LGVTKSVTAV
     VRPSYKAMRI SPDKQQTMFL DSEPLNVQTF VLNSEGEQEE VTHLTEWHLA EWTSNHAVTV
     HQGQFYAKRA GTATVTAKYK GLSKSISVKV ISADNVRRLD WPEEDNTDNG GIRKMDIYME
     DSQSLPKVEA ILRLGIENVD VSDLAEWQSS NTGVISIEDG KMKAESRGTA ILTATVRNHT
     ISMEVTVKRK APILQTYTGK MNIVAGREQP VPDVTAIYMN GDEENITSEM KWESSSPNLL
     VVDGKLKGLV PGTAMLIGTY DNVKISVKVT IKVTIEEEVV RFEIEPGKLA LDLKKSQRIK
     VTGYYKNGKK ISLGSKVDWK SANEKIATVK GTSVKGVAIG STRLVGEFQG QKLEVPVIVE
     PKLTKQIAES NSVKLTIGQE AVEDVEEKLV RFEIEPGQLT LNLKKSQSIK VTGYYKNGKK
     VSLESKVKWY SGNEKVAMVK GASVKGVAIG STMLVGVFQG QKLEVPVTVV PKVMKLIAEP
     NSEKLTVGEE AYWKVKAIYD TGEAVNVTFS VTFVPSNANV KVERGRVKGV SKGSTSVKLT
     FGGKSTSMRI SVK
//