ID A0A1R1CJ39_9BACL Unreviewed; 1760 AA.
AC A0A1R1CJ39;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=Attaching and effacing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BK133_27115 {ECO:0000313|EMBL:OMF22085.1};
OS Paenibacillus sp. FSL H8-0548.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1920422 {ECO:0000313|EMBL:OMF22085.1, ECO:0000313|Proteomes:UP000187405};
RN [1] {ECO:0000313|EMBL:OMF22085.1, ECO:0000313|Proteomes:UP000187405}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FSL H8-0548 {ECO:0000313|EMBL:OMF22085.1,
RC ECO:0000313|Proteomes:UP000187405};
RA Beno S.M.;
RT "Paenibacillus species isolates.";
RL Submitted (NOV-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMF22085.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MRTK01000059; OMF22085.1; -; Genomic_DNA.
DR STRING; 1920422.BK133_27115; -.
DR Proteomes; UP000187405; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR001322; Lamin_tail_dom.
DR InterPro; IPR036415; Lamin_tail_dom_sf.
DR InterPro; IPR011044; Quino_amine_DH_bsu.
DR InterPro; IPR001119; SLH_dom.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR NCBIfam; NF038117; choice_anch_I; 1.
DR PANTHER; PTHR46928; MESENCHYME-SPECIFIC CELL SURFACE GLYCOPROTEIN; 1.
DR PANTHER; PTHR46928:SF1; MESENCHYME-SPECIFIC CELL SURFACE GLYCOPROTEIN; 1.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF00932; LTD; 2.
DR Pfam; PF00395; SLH; 3.
DR SMART; SM00635; BID_2; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF74853; Lamin A/C globular tail domain; 2.
DR SUPFAM; SSF50969; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR PROSITE; PS51841; LTD; 2.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000187405};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1760
FT /note="Attaching and effacing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012255198"
FT DOMAIN 29..227
FT /note="LTD"
FT /evidence="ECO:0000259|PROSITE:PS51841"
FT DOMAIN 343..485
FT /note="LTD"
FT /evidence="ECO:0000259|PROSITE:PS51841"
FT DOMAIN 1571..1630
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1632..1695
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1698..1760
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
SQ SEQUENCE 1760 AA; 183927 MW; 8B9BFD8F26D5682E CRC64;
MLAAEITASM ALAGGPVMAA AGLEQAGTPY TANQVYSHAV PHVIINQVYG GGLAVDTDVP
ISHGFIELYN PTDSDIDLSG WSLQYADRGN KDTTGPTKAW EKLNLTGTIK AYSSFLIKGA
ATGKQAAKVD LTNKGDLSWE RFINNKGLKV ALMSNQTLLT EANPFLTLPD GYVDMVGSGS
NDAGSNIDGY ENEFPTGDNE GTSKQKAIRR TDFIDTDNNK KDFKQIDYKG AAVESLPSVA
PRSGADGPWG AGVVLPLAIS TTVLSDGYVG SAYSAALQAT GGTAPYTFTA TGLPEGLSLQ
ANGELAGTPV TAVSQAAVNI TVTDSTSGTP LTVDKTFTLT IKSEIAAANH LLINEVYGGG
GKINKETPPV AAPFLYDFIE LYNPTAQPIS LAGYHLRYSN KGATTVQGFD FSEDAVIGPN
DYYLVRLEAT WGNTGDKNYG SAYYADAYAS TKEQSIGMSD TDGTVELFQG AYSSSAVIDA
VGFGAVKTLL HEGTPIGDGA SLPAAVKGVR RINYQDSNNN AADFTIVDPS PTSSGQAEGD
VEVITDFVKL IGSLQKLTAD TSVTVEGLVT TPPVKSDNSQ AEAVRYIQSF TGAIAVEGMD
PSIPVGAEVR VSGVAGLHEG EVRVKGNPVV ARLNNNVYSL TVDDTFDGSM LVVDKLSSVT
TERYGRRVTT TGKVEVVDAA AKTIKLDNDM ILYVNGAFPS TAIGDTLEAT GVIGVYSSNV
RLAIANASAD LVIKPASAEF NDTLNISKIG EYSVGLSNKD GGVAEIVKFN KDNGKFYLVN
GSGNPPSLDI VSLGSGNGNL NKEKTIAVQQ LAETEGFLYG DLTSVDINTT TKRVSVTVQE
ADALKSGKIL VLDYDGNLLA SYMAGVQPDM IKSTSDGKYI LTADEAEPRS GTTDPKGSVT
IVNTETNTVT QVLFDDPSVI EDGVHIRGLA DPADGKIKTS GTKADAIFDF EPEYITLSED
NKTAYVSLQE NNAIAIIDIA TGKVTAVKAL GFKDYNDVRN SLDLVKDGAI KLENVPFKGM
YMPDGIASHT INGQTYLFTA NEGDVTEWPN RTNGSTIGAL KGSLDPASAA AIFLSGKTAY
DGVEVASDMG NDSIYMYGGR SFSVWNADTM EQVYDSGNDF ETITAARLPA YFNTSNSKTA
LDDRSGKKGP EPEDIKTGKV GNKVLAFVGL ERIGGFMTYD VTDPANATFA NYTNTREFKD
GQGKDNLDTD TGPEGLEFIP ADISPTGMPL LLVAYEVGGK VGVYQLNVTK VTVDKKALSM
KVGDASSKLN AVARPAAGSA SSVIWSSSNA AVAKVDGNGN VTAVSAGSAV ITAVSADGYG
LAETIVTVLP ASTGTGPDTG TVVPTPTPTP EQAGTSIVSG DTVKAVTVIK AETDNDSNTT
AEVTASQMAE TLAALEKAAN GKPGAVEFTV DRNANAESAT IRFEEQAIAL IASKGLVSLS
INGGLGAVSF DSKAIDTLTA AAAAGAGELS FAIKKEDVSK LSDSERAAIG SHPAYELTIT
AGNTAVTDLL GGKVRISIPY TLQAGEDSQA IVIYYFAEDG QPVVVSNSVY DALTGQLYFT
VNHFSTYGIG YNHVTFTDTA ASFAKDSIMY LSSRNIIGGM GNDRFVPKAN ISRADFTLIL
ARIAGAELDS YTSSSFTDVS PSDYYAGAVA WASDKGITGG TGNGVFEPQA NISREQLVAM
IVRFADVMSF TLPSNTNAQS FVDDSSISAF AKEAAAAVQQ AGIINGKPAG NNEGNSFAPK
DAATREEAAK MLAKLIQLMA
//