ID A0A4U8S448_9HELI Unreviewed; 514 AA.
AC A0A4U8S448;
DT 31-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 31-JUL-2019, sequence version 1.
DT 13-SEP-2023, entry version 16.
DE RecName: Full=Flagellin {ECO:0000256|RuleBase:RU362073};
GN ORFNames=LS68_007395 {ECO:0000313|EMBL:TLD80555.1};
OS Helicobacter sp. MIT 05-5293.
OC Bacteria; Campylobacterota; Epsilonproteobacteria; Campylobacterales;
OC Helicobacteraceae; Helicobacter.
OX NCBI_TaxID=1548149 {ECO:0000313|EMBL:TLD80555.1, ECO:0000313|Proteomes:UP000029872};
RN [1] {ECO:0000313|EMBL:TLD80555.1, ECO:0000313|Proteomes:UP000029872}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MIT 05-5293 {ECO:0000313|EMBL:TLD80555.1,
RC ECO:0000313|Proteomes:UP000029872};
RX PubMed=25428971;
RA Sheh A., Shen Z., Fox J.G.;
RT "Draft genome sequences of eight enterohepatic helicobacter species
RT isolated from both laboratory and wild rodents.";
RL Genome Announc. 2:e01218-e01214(2014).
CC -!- FUNCTION: Flagellin is the subunit protein which polymerizes to form
CC the filaments of bacterial flagella. Important for motility and
CC virulence. {ECO:0000256|ARBA:ARBA00025143}.
CC -!- SUBUNIT: Heteromer of FlaA and FlaB. FlaB is located proximal to the
CC hook while the remainder of the filament is composed of the predominant
CC FlaA. {ECO:0000256|ARBA:ARBA00025928}.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|RuleBase:RU362073}.
CC Bacterial flagellum {ECO:0000256|RuleBase:RU362073}.
CC -!- SIMILARITY: Belongs to the bacterial flagellin family.
CC {ECO:0000256|ARBA:ARBA00005709, ECO:0000256|RuleBase:RU362073}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TLD80555.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JROZ02000003; TLD80555.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A4U8S448; -.
DR STRING; 1548149.LS68_04290; -.
DR OrthoDB; 9796789at2; -.
DR Proteomes; UP000029872; Unassembled WGS sequence.
DR GO; GO:0009288; C:bacterial-type flagellum; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005198; F:structural molecule activity; IEA:UniProtKB-UniRule.
DR Gene3D; 3.30.70.2120; -; 1.
DR Gene3D; 1.20.1330.10; f41 fragment of flagellin, N-terminal domain; 1.
DR Gene3D; 6.10.10.10; Flagellar export chaperone, C-terminal domain; 1.
DR InterPro; IPR001492; Flagellin.
DR InterPro; IPR046358; Flagellin_C.
DR InterPro; IPR042187; Flagellin_C_sub2.
DR InterPro; IPR010810; Flagellin_hook_IN_motif.
DR InterPro; IPR001029; Flagellin_N.
DR PANTHER; PTHR42792; FLAGELLIN; 1.
DR PANTHER; PTHR42792:SF2; FLAGELLIN; 1.
DR Pfam; PF00700; Flagellin_C; 1.
DR Pfam; PF07196; Flagellin_IN; 2.
DR Pfam; PF00669; Flagellin_N; 1.
DR PRINTS; PR00207; FLAGELLIN.
DR SUPFAM; SSF64518; Phase 1 flagellin; 1.
PE 3: Inferred from homology;
KW Bacterial flagellum {ECO:0000256|ARBA:ARBA00023143,
KW ECO:0000256|RuleBase:RU362073};
KW Cell projection {ECO:0000313|EMBL:TLD80555.1};
KW Cilium {ECO:0000313|EMBL:TLD80555.1};
KW Flagellum {ECO:0000313|EMBL:TLD80555.1};
KW Secreted {ECO:0000256|RuleBase:RU362073};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 5..141
FT /note="Flagellin N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00669"
FT DOMAIN 428..513
FT /note="Flagellin C-terminal"
FT /evidence="ECO:0000259|Pfam:PF00700"
SQ SEQUENCE 514 AA; 54409 MW; AEEB7C12FF36BC71 CRC64;
MSFRINTNIS ALSAHTIGVQ NNRSLHSSLE KLSSGLRLNK AADDASGMAI ADSLRSQSES
LGQAVRNAND AIGMIQIADK AMDEQIKILD TIKNKAIQAA QDGQSNETRK ALQSDIIRLL
EELDNIANTT SYNGQQMLSG AFSNKEFQIG AYSNTTIKAS IGPTSSDKIG HVRMESSSFG
GIGMLASGAA SNLSEVMLKF RQVDGKHDFE IETVKISTSA GTGLGVLTEV INKYADTLGV
RASWNCMATG DVPVLSGTVK GLVINGVTIG NVNDVRKNDS DGRLINAINS IKERTGCEAY
TDITGRINVR SLDGRAISIQ TEGDSGKVFG GGNFAGISGT EHAIIGRLTL IRTDARDIIV
SGVNFSHIGF HSAQGIAEYT ANLRSVRGEM DANIASACGA NPNVAQANLH FDGIGAGVTS
LRGAMVVMDM AESARIQLDK LRADIGSAQI QLVTTINNVS VTQVNVKSAE SQIRDVDFAE
ESATFSKHNI LAQSGNFAMA QANAVQQNVL RLLQ
//