ID V2YJ11_9FIRM Unreviewed; 1308 AA.
AC V2YJ11;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=Attaching and effacing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=N510_002743 {ECO:0000313|EMBL:USF27786.1}, N510_00460
GN {ECO:0000313|EMBL:ESL15191.1};
OS Firmicutes bacterium ASF500.
OC Bacteria; Bacillota.
OX NCBI_TaxID=1378168 {ECO:0000313|EMBL:ESL15191.1};
RN [1] {ECO:0000313|EMBL:ESL15191.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ASF500 {ECO:0000313|EMBL:ESL15191.1};
RX PubMed=24723722;
RA Wannemuehler M.J., Overstreet A.M., Ward D.V., Phillips G.J.;
RT "Draft genome sequences of the altered Schaedler flora, a defined bacterial
RT community from gnotobiotic mice.";
RL Genome Announc. 2:e00287-e00214(2014).
RN [2] {ECO:0000313|EMBL:USF27786.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=ASF500 {ECO:0000313|EMBL:USF27786.1};
RA Proctor A., Parvinroo S., Richie T., Jia X., Lee S.T.M., Karp P.D.,
RA Paley S., Kostic A.D., Pierre J.F., Wannemuehler M.J., Phillips G.J.;
RT "Resources to Facilitate Use of the Altered Schaedler Flora (ASF) Mouse
RT Model to Study Microbiome Function.";
RL Submitted (MAY-2022) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYJP01000004; ESL15191.1; -; Genomic_DNA.
DR EMBL; CP097573; USF27786.1; -; Genomic_DNA.
DR STRING; 1378168.N510_00460; -.
DR PATRIC; fig|1378168.3.peg.480; -.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_272460_0_0_9; -.
DR OrthoDB; 43070at2; -.
DR Proteomes; UP000017395; Chromosome.
DR Gene3D; 2.60.40.1080; -; 4.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR001119; SLH_dom.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF00395; SLH; 3.
DR SMART; SM00635; BID_2; 4.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 3.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS51272; SLH; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017395};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1308
FT /note="Attaching and effacing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038419102"
FT DOMAIN 38..149
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 1186..1249
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1254..1308
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 106..130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1308 AA; 139198 MW; 973ED1E264D89214 CRC64;
MKTTLGRRWL SLLLTLVLCL GLSQAVWAED PPEGAQVPEI TLDPDTLALT VGETGTLAAT
VKENGTAVPN PAVTWKSDNE AIAAVANGTV TAVAAGTANI TATYTYTVPD TTPDSGDSSG
DSSSVTRAGE EKTVSAACAV TVSAAPTLTG LSVTLEPTSL SMFTGESQKL TATVKPAWSD
NGTHDLGDVT YTWTSKNTSV ATVDGTGGTV TVKPVSAGST EIEVTASCGG TTAAAKCNVK
VEERIEGLRL DKSGPFTMDV GKYEYIKATA DPESAVVSWE SKDEAVVEAS SSDSKGREGI
LYARAPGKTE VTVSVGSPGN LKTRTIQVEV SGLVLDERRL EVKENETAGL PKLTVYGAAK
NGKVVWQSAD PNVAQISGNS VAGRGPGTTT ITASVSGSYQ VSVTVTVSAD KETTIGPLDM
KVSDRLNFGD SLVEEIRDQA SERKLSHVTG MFVSPSQGTL YYKYTSPDEP NAGVAQRENY
YYAPSSGQKA LGDITFVPNP QFSGTQAVIS YTVVSTTNQT SSGRILINLE KDSKAVINLG
TSNSTPVFFS GNLFNRQCQQ KTGSTLDYVI FSLPPANKGT LYFGYVDANN YGGTVTAGAM
YRLAQLDSIV FVPAEGVPRD GKSETVTVYY TARSMAGGSV SSYAGQVDIN VTRENTGHGT
DVYYSISKGA TKTLDDTDFT DFYGDEVLSY IRFDSLPASR EGVLYHGYRS ASNVGTAVKT
DTNYYSGTRN PRLDRITFVP AEDFTGSVYI PFTGWDQNGN RFPGTLEINV KGGGDGYGDI
LYSCAPGRTV NFAEKDFREL CDDLTDKTLS YIILQDLPDR SLEGSIYHKS TRVSSAGTRY
NNGSGTYRIS NLSFRAVSDF SGTVEIPFVG YTTGTSSTST TFNGVIIIEA TGGSYGDTTI
SYYTDYDSAA VFDRDDFDEI SLNETGEKVK SVKFSIPTAS RGDLYQNYRS SSSKGSKITS
KNTSISASSL DRVAFIPASA YTGTVYIEYT ATAQSDGGTF EGTIEIEVER PSAAVTVRYG
TKADPVDFDA EDFRRSGYTL RSVKFTTLPT SSEGKLYYQY TSPSQYTRLA STSTSYNVSG
SNKIADLTFV PRAGYTGTVV LPYTGTNSSG STFSGEVIIT VSPVIAHSFT DLGGYSDQQR
AAVSYIYDRG ITAGMSSTEY GPELPIRRGD FARMIYIAFG FSPSGGSWVF NDVPPNEYYA
QAVNTLYARG VVSGVGNGDF SPSANVSRQD AICMIQRALR AVGQSAPDGA YSALSSYSDA
GYVSDYAKGA MALAVQRGYL PTSGNYLKPG DPLTRIDMAD ILHRVLTY
//