ID G6AY97_9BACT Unreviewed; 2378 AA.
AC G6AY97;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=FG-GAP repeat protein {ECO:0000313|EMBL:EHJ39473.1};
DE Flags: Fragment;
GN ORFNames=HMPREF0673_01606 {ECO:0000313|EMBL:EHJ39473.1};
OS Leyella stercorea DSM 18206.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Leyella.
OX NCBI_TaxID=1002367 {ECO:0000313|EMBL:EHJ39473.1, ECO:0000313|Proteomes:UP000004407};
RN [1] {ECO:0000313|EMBL:EHJ39473.1, ECO:0000313|Proteomes:UP000004407}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 18206 {ECO:0000313|EMBL:EHJ39473.1,
RC ECO:0000313|Proteomes:UP000004407};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Hou S., Chen J., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA Chinwalla A., Mardis E.R., Wilson R.K.;
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHJ39473.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFZZ01000145; EHJ39473.1; -; Genomic_DNA.
DR eggNOG; COG3209; Bacteria.
DR HOGENOM; CLU_000302_0_0_10; -.
DR Proteomes; UP000004407; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR Gene3D; 2.180.10.10; RHS repeat-associated core; 1.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR003284; Sal_SpvB.
DR InterPro; IPR022045; TcdB_toxin_mid/N.
DR InterPro; IPR006530; YD.
DR NCBIfam; TIGR01643; YD_repeat_2x; 1.
DR Pfam; PF05593; RHS_repeat; 1.
DR Pfam; PF03534; SpvB; 1.
DR Pfam; PF12256; TcdB_toxin_midN; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 2.
PE 4: Predicted;
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..2378
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003485624"
FT DOMAIN 1532..1655
FT /note="Insecticide toxin TcdB middle/N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12256"
FT NON_TER 2378
FT /evidence="ECO:0000313|EMBL:EHJ39473.1"
SQ SEQUENCE 2378 AA; 260082 MW; 4E8136905795894B CRC64;
MKNYIQKVAW ALVLLLAATL SLSAKDGTAV RKLFKKGVSD SISVGGSKLV VLQKDLIRNR
SLSVNSIGEE NVPELDFAMT NVTAGGHGYR FLPHGTHFTG EGATVKIKYD RTRIPSGYTE
DDIRTYYYDP AEKHWVALER VRVDKKEECV VSKTTHFTDM INGVIQAPES PQTEGFAPTM
MNDIKAADPT AKLNVIAPPS ANNRGSANLQ YPFEMPPARN GMQPSLGLQY SSEGGSGWLG
EGWNVSVPSI TLDTRWGVPR YDLSKETETY LLSGSMLSTM DDNGQMGVAH RGEKMNRKAD
RQFYTRQGGD FNRIIRKGDS PANYYWEVTD KQGVKYIYGG DGAVVKGNVT DASGNTREVI
AEWKLKRVEE LHGDYIEYVY DIVDEDVRGG LKAKAAYLKE VHAGNAGQEP HTVVLFDGNK
VKQVKTNNAR YGFLASSNRL LEKVTVNFQG ETLRSYAFDY KEGAFHKEML TGVRQYDNTG
KEVAFQNFDY YDDVQADKGY VPFKDDSEKW NTHDDGLDAG FVNPLKTVSK RFSDKPTALG
GTTSSSVSGS FYAGVGPWDG SKWKSNTIGG SYSYSSDTSK GLSAFVDLNG DGLPDKVYKS
GGSVYYRPQV KTDNGEVVYG EPVKVKGISN FSTTKSSTNS FGAKAVVGWN VLTAVVGTDK
STTKTKTTQY FSDINGDGLV DLVSNGKVYF NHLEFDQSGN AVPTFTLSSA DTPSPIIYGG
KVDTSVMEVS KDEQAEAIKN SPMEDIVRVW QAPKDGTVSV TGQVSLIAPT DDYDADEYQK
ADGVRIAIQK GGNELWNKTI AKGDNSPYSA AVSSVAVKRG DRIYFRVQSG SEETSNGSFD
KVSWSPTITY AGESNILPNG LSSTEYKPED GAIYDVNTSA NVENGSSVEV RSAFHKPVTT
DDVVLCIIGS NEKKDSDGND NPNYIEKTVF ARTLKANEAF GGDSLNISLE NTEKLTNFSF
EISSTSNVDW KNIGWQASVT YKDSANVEQT MAVPAHYKIF ANALKEGKPY LTTAADTALV
VSPVLALSDN ALNGEVVLTA KTVDALVGKK IFKIQNGILQ SDTLRLSNFG GKKIWFEYSY
PSTISDGALT SASVSVQKDA AGLVVESVPA GFYAESDNNG FGMLYRGWGG FVYNAAEGRY
AKPIDESLLK LPENEDDKVD PLTMAFTPLG TDQTSMDKWV GQRQDIYLTA TEASTARLTE
QDVLLSNPLE NDTEVAGLAG EYLQGTGAAA VSQVMSSKST VVQGGALGIT HNDASGNAKT
EVTMMDMNGD GFPDIIAGGT IQYTNSLGGL SGEIYKGIGS SNSDNASQAW GYGGNPVASA
SSITNLISKG KATILNQQTA WLAQFSISGS APKNTDEAVE SFIDINGDGL PDKILSDKKV
KLNLGYAFTE PIDWELDRVQ DGKSLSYNIG ASGNTPEAGW GKFQDEKYKD INKASGSFSA
GFGIVTSESE EEYNLIDINS DGLPDKVWKD GDGITVALNT GNGFDEPISW KGASALSESA
STSESANAAF TLTINIPVIS IKISTNPGAS TSHSINRPTY SLQDVDGDGY LDIVESEKES
ELKVTRSAIG RTNMLKSVTN SLGGTFTLDY AHTTPTYGLP GGKWVMAALT VDDGIHDDGP
VMTTAFEYKD GKRDRHEREF LGFGEVITKN LDTEKGNSVY RQAVENYDVA NYYTQGNVTA
TSVEDANGNK YTETKNRYDS YYLTADGDKY TFAKRDVNLW SDRASAFVPL RYTANLQYEG
AANGVVTSEA WNEYYLNGYH GELKSYKYSD KGSLGEDGNG KFDYATAIKY TDNASKHIFG
LPVDVTVTGG DGSLYHHVTA KYNTNYANHI TQITQQLNDG EAVTDYKYDS YGNIIQKTLP
ANGKGQRMWY KYRYEPEMNM YVERIDDAWG YRSEQGNFDY RYGIAKEHRD LNNFYYETDV
DDLGRITGVR GPNELATGVP YAIAFEYQPL ATFGESGITA PAYAVTKHYD IQHPNDDLET
ITFVDGFGRA VQVKKDGVVT SASKGNSAKD ENVMIVSGRN VYDAFGRVAK AFYPTTEGTG
SKSTFSKSFD NVSPTVTVYD VLDRAASVTL PDNSTTTTAY TVDNGSHALV TTVTDALHNV
QATHTNGSGK TLKTIQKSGP DGEITTSFEY DGIQRLVRVT DTEGNVTTST YDMGDRRTEV
NHPASGITSF TYDALGNVLT KQTANLAKEG KFITYDYDYQ RLTGINYPDH PENNVKYYYG
GRNASQNRIG RLMLREDGTG AIEYFYGKMG EVTKTRRTMI VPNQAIATYV TQWTYDSHNR
LLEMIYPDEE KITYSYNLGG QLEKVHGYKS YGYDYVSKIG YDKFEQRTYL KYCNGAETFY
TYDPQRRRLQ NLTVNSGGNT IMDNAYTYDA VSNVLSVV
//