ID A0A329ZH42_9HELI Unreviewed; 514 AA.
AC A0A329ZH42;
DT 10-OCT-2018, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=Flagellin {ECO:0000256|RuleBase:RU362073};
GN ORFNames=CCY97_05535 {ECO:0000313|EMBL:RAX54579.1};
OS Helicobacter sp. 10-6591.
OC Bacteria; Campylobacterota; Epsilonproteobacteria; Campylobacterales;
OC Helicobacteraceae; Helicobacter.
OX NCBI_TaxID=2004998 {ECO:0000313|EMBL:RAX54579.1, ECO:0000313|Proteomes:UP000251062};
RN [1] {ECO:0000313|EMBL:RAX54579.1, ECO:0000313|Proteomes:UP000251062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=10-6591 {ECO:0000313|EMBL:RAX54579.1,
RC ECO:0000313|Proteomes:UP000251062};
RX PubMed=29225701; DOI=10.1186/s13099-017-0220-y;
RA Feng Y., Mannion A., Madden C.M., Swennes A.G., Townes C., Byrd C.,
RA Marini R.P., Fox J.G.;
RT "Cytotoxic Escherichia coli strains encoding colibactin and cytotoxic
RT necrotizing factor (CNF) colonize laboratory macaques.";
RL Gut Pathog. 9:71-71(2017).
CC -!- FUNCTION: Flagellin is the subunit protein which polymerizes to form
CC the filaments of bacterial flagella. Important for motility and
CC virulence. {ECO:0000256|ARBA:ARBA00025143}.
CC -!- SUBUNIT: Heteromer of FlaA and FlaB. FlaB is located proximal to the
CC hook while the remainder of the filament is composed of the predominant
CC FlaA. {ECO:0000256|ARBA:ARBA00025928}.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|RuleBase:RU362073}.
CC Bacterial flagellum {ECO:0000256|RuleBase:RU362073}.
CC -!- SIMILARITY: Belongs to the bacterial flagellin family.
CC {ECO:0000256|ARBA:ARBA00005709, ECO:0000256|RuleBase:RU362073}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RAX54579.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NHYK01000014; RAX54579.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A329ZH42; -.
DR OrthoDB; 9796789at2; -.
DR Proteomes; UP000251062; Unassembled WGS sequence.
DR GO; GO:0009288; C:bacterial-type flagellum; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005198; F:structural molecule activity; IEA:UniProtKB-UniRule.
DR Gene3D; 3.30.70.2120; -; 1.
DR Gene3D; 1.20.1330.10; f41 fragment of flagellin, N-terminal domain; 1.
DR Gene3D; 6.10.10.10; Flagellar export chaperone, C-terminal domain; 1.
DR InterPro; IPR001492; Flagellin.
DR InterPro; IPR046358; Flagellin_C.
DR InterPro; IPR042187; Flagellin_C_sub2.
DR InterPro; IPR010810; Flagellin_hook_IN_motif.
DR InterPro; IPR001029; Flagellin_N.
DR PANTHER; PTHR42792; FLAGELLIN; 1.
DR PANTHER; PTHR42792:SF2; FLAGELLIN; 1.
DR Pfam; PF00700; Flagellin_C; 1.
DR Pfam; PF07196; Flagellin_IN; 2.
DR Pfam; PF00669; Flagellin_N; 1.
DR PRINTS; PR00207; FLAGELLIN.
DR SUPFAM; SSF64518; Phase 1 flagellin; 1.
PE 3: Inferred from homology;
KW Bacterial flagellum {ECO:0000256|ARBA:ARBA00023143,
KW ECO:0000256|RuleBase:RU362073};
KW Cell projection {ECO:0000313|EMBL:RAX54579.1};
KW Cilium {ECO:0000313|EMBL:RAX54579.1};
KW Flagellum {ECO:0000313|EMBL:RAX54579.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000251062};
KW Secreted {ECO:0000256|RuleBase:RU362073};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 5..141
FT /note="Flagellin N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00669"
FT DOMAIN 428..513
FT /note="Flagellin C-terminal"
FT /evidence="ECO:0000259|Pfam:PF00700"
SQ SEQUENCE 514 AA; 53942 MW; E98FC76CAE0701EC CRC64;
MSFRINTNIS ALNAHSIGVM NNRGLHNSLE KLSSGLRLNK AADDASGMAI ADGLRSQSEG
LGQAIRNAND AIGMIQVADK AMDEQLRIID TIKTKAIQAA QDGQTSESRK ALQSDILRLL
EELDNIANTT SFNGQQMLSG AFANKEFQIG AYSNTTIKAS VGPTSSDKIG HIRMESASFS
ASGMLASGAA SNLTEVMFHV KEVDGKNSFT LETVKISNSA GTGIGVLSEV INKYSDKLGV
RASWSVVGTG SLPVQSGTVH GLVINGVTIG TINDVRKNDA DGRLINAINS VKERTGAEAY
IDITGRVNLR STDGRAISLQ TYSSSGAVFG GGNFAGISGS THAIIGRLTL VRTDARDIII
SGTNFSHVGF HTAQGIAEYT VNLRSVRGEM DANIASAAGA NANVAQAELN AGGIGTGVTS
LRGAMVVMDM AESARIQLDK LRADMGSVQI QLIATINNIS VTQVNVKSAE SQIRDVDFAM
ESSTFSKHSI LAQSGSFAMA QANAVQQNVL RLLQ
//