ID A0A2G3E0U2_9FIRM Unreviewed; 462 AA.
AC A0A2G3E0U2;
DT 31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT 31-JAN-2018, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=Flagellin {ECO:0000256|ARBA:ARBA00020110, ECO:0000256|RuleBase:RU362073};
GN ORFNames=CSX02_11345 {ECO:0000313|EMBL:PHU36760.1};
OS Agathobacter ruminis.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Agathobacter.
OX NCBI_TaxID=1712665 {ECO:0000313|EMBL:PHU36760.1, ECO:0000313|Proteomes:UP000224563};
RN [1] {ECO:0000313|EMBL:PHU36760.1, ECO:0000313|Proteomes:UP000224563}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JK623 {ECO:0000313|EMBL:PHU36760.1,
RC ECO:0000313|Proteomes:UP000224563};
RA Sheridan P.O., Walker A.W., Duncan S.H., Scott K.P., Toole P.W.O., Luis P.,
RA Flint H.J.;
RT "Resolving the taxonomy of Roseburia spp., Eubacterium rectale and
RT Agathobacter spp. through phylogenomic analysis.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PHU36760.1, ECO:0000313|Proteomes:UP000224563}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JK623 {ECO:0000313|EMBL:PHU36760.1,
RC ECO:0000313|Proteomes:UP000224563};
RA Banno H., Chua N.-H.;
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Flagellin is the subunit protein which polymerizes to form
CC the filaments of bacterial flagella. {ECO:0000256|RuleBase:RU362073}.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|RuleBase:RU362073}.
CC Bacterial flagellum {ECO:0000256|RuleBase:RU362073}.
CC -!- SIMILARITY: Belongs to the bacterial flagellin family.
CC {ECO:0000256|ARBA:ARBA00005709, ECO:0000256|RuleBase:RU362073}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHU36760.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PDYG01000123; PHU36760.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2G3E0U2; -.
DR Proteomes; UP000224563; Unassembled WGS sequence.
DR GO; GO:0009288; C:bacterial-type flagellum; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005198; F:structural molecule activity; IEA:UniProtKB-UniRule.
DR Gene3D; 1.20.1330.10; f41 fragment of flagellin, N-terminal domain; 2.
DR Gene3D; 6.10.10.10; Flagellar export chaperone, C-terminal domain; 1.
DR InterPro; IPR001492; Flagellin.
DR InterPro; IPR046358; Flagellin_C.
DR InterPro; IPR042187; Flagellin_C_sub2.
DR InterPro; IPR001029; Flagellin_N.
DR PANTHER; PTHR42792; FLAGELLIN; 1.
DR PANTHER; PTHR42792:SF2; FLAGELLIN; 1.
DR Pfam; PF00700; Flagellin_C; 1.
DR Pfam; PF00669; Flagellin_N; 1.
DR PRINTS; PR00207; FLAGELLIN.
DR SUPFAM; SSF64518; Phase 1 flagellin; 1.
PE 3: Inferred from homology;
KW Bacterial flagellum {ECO:0000256|ARBA:ARBA00023143,
KW ECO:0000256|RuleBase:RU362073};
KW Cell projection {ECO:0000313|EMBL:PHU36760.1};
KW Cilium {ECO:0000313|EMBL:PHU36760.1}; Coiled coil {ECO:0000256|SAM:Coils};
KW Flagellum {ECO:0000313|EMBL:PHU36760.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000224563};
KW Secreted {ECO:0000256|RuleBase:RU362073}.
FT DOMAIN 3..140
FT /note="Flagellin N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00669"
FT DOMAIN 376..461
FT /note="Flagellin C-terminal"
FT /evidence="ECO:0000259|Pfam:PF00700"
FT COILED 73..127
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 462 AA; 48396 MW; 99CBA8CF0E8F2DB4 CRC64;
MVVQHNMQAM NANRMLNITT GAQSKSTEKL SSGYKINRAA DDAAGLSISE KMRKQIRGLD
QASTNAEDGV SAVQTAEGAL NEVQSMLQRM NELATQAANG TNSESDREAI QNEISQLTTE
IDRVAETTKF NETYLLKGTN GTQTNYLKGH DAGVVGTDAN NVTFVDGSDK ATLTVELKAG
KSVTIAGKEY NITEGSANTA DDSIDTLKTA LDSASSVVIN GTTYTKGADN KFDAGTGTLY
SADEIKEKVV DGAKIKVGNG TEHVARDYSL TSKNIDAATA KSKIETALKQ ANNIGVDTGT
VSATGAQDGS TNNYKFTINK GKATVAKDLS FALHVGADAD MSNKITVGIS SMSSASLGVQ
GLNVKDDSGM AATYAIDAIA DAVAKVSAQR SALGAIQNRL EHTIDNVDNV VENTTAAESR
IRDTDMAQEM VNYSKNNILA QAGQSMLAQA NSSNQGVLSL LG
//