ID L1NGU3_9BACT Unreviewed; 1700 AA.
AC L1NGU3;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE SubName: Full=FG-GAP repeat protein {ECO:0000313|EMBL:EKY02490.1};
DE Flags: Fragment;
GN ORFNames=HMPREF9151_00760 {ECO:0000313|EMBL:EKY02490.1};
OS Hoylesella saccharolytica F0055.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Hoylesella.
OX NCBI_TaxID=1127699 {ECO:0000313|EMBL:EKY02490.1, ECO:0000313|Proteomes:UP000010433};
RN [1] {ECO:0000313|EMBL:EKY02490.1, ECO:0000313|Proteomes:UP000010433}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0055 {ECO:0000313|EMBL:EKY02490.1,
RC ECO:0000313|Proteomes:UP000010433};
RA Weinstock G., Sodergren E., Lobos E.A., Fulton L., Fulton R., Courtney L.,
RA Fronick C., O'Laughlin M., Godfrey J., Wilson R.M., Miner T., Farmer C.,
RA Delehaunty K., Cordes M., Minx P., Tomlinson C., Chen J., Wollam A.,
RA Pepin K.H., Bhonagiri V., Zhang X., Suruliraj S., Warren W., Mitreva M.,
RA Mardis E.R., Wilson R.K.;
RL Submitted (MAY-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKY02490.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMEP01000049; EKY02490.1; -; Genomic_DNA.
DR STRING; 1127699.HMPREF9151_00760; -.
DR HOGENOM; CLU_240895_0_0_10; -.
DR Proteomes; UP000010433; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR003284; Sal_SpvB.
DR InterPro; IPR022045; TcdB_toxin_mid/N.
DR Pfam; PF03534; SpvB; 1.
DR Pfam; PF12256; TcdB_toxin_midN; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000010433};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 1544..1673
FT /note="Insecticide toxin TcdB middle/N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12256"
FT REGION 1322..1400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1323..1395
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EKY02490.1"
FT NON_TER 1700
FT /evidence="ECO:0000313|EMBL:EKY02490.1"
SQ SEQUENCE 1700 AA; 185160 MW; E7F2552D23E0CBE2 CRC64;
ANLQDFRKRT VSKYFTGTLA DSLSVGRAEL KVPRGSMEHA KILSITPLRK GELPHLPAGM
VNVTADRSNP TVTAHSKDSI AGYRFLPHGE HFVHSPASIT VPYDSTLIPQ GYTAEDIHTY
YYDELKAQWV MLRHKALDKG RELVMAETSH FTDVINGIIK VPESPETQNY VPTGISELKA
ADPSAGITAV SAPTANQSGT ASLSYPFELP KGRAGMQPSV GLQYSSDGSS SYVGYGWSLP
LQSIDIETRW GVPRFDADKE SESYLLMGSK LNDRTYRTTN APARTKDKRF YPLVEGGFAK
IIRKGDNPQN YTWEVTSKDG SVSYFGGVDG MVDEGAVLKD GNGNILRWAL CKTQDTHGNF
VSYKYLKKGN NLYPDTYHYT GSKEGEGIHS VNFTYTATER KDITSGARMG VLQYDSLLLK
KVSVLYKDEL LRAYDLNFEE GEFGKTLLKS INQKDSKDHL VATQSFDYYN DIKNGMFGKG
EQWTAEQDGR DVYIRQIGHK IDKCSDELTM LGGGYSKGKT YGGGLMVGFG VSIGTMNVGA
SYTQSKNNSV GKNVLIDIDG DGLPDKVFQS GGGLRYRKNL FGTTGKNVFG KSIAIKNIGE
FSRSTSWSKS VNADVALDLV VFTPGVSYSK TWDNTETPVY FSDFNNDGLV DIAKNGTVWF
NKIDADGVPN FTPSTTGTGN PIIGQNAEID KNFIPDYKAI RDSLEKEYPL HDVVRVWCAP
FKGTVSISSK IEKTTTYGDG IIYSIQKEKN VLKKDSILGE GIKQDNLTAS VNAGDRIFFR
LQSRYSGVSD SVGWAPQITY TQIIGNASSY LGQDFAHYDA KEDFLEGMTT GLPLQKDGRV
EIKAPYKKDK TNDDVTLIIR RKDVHGENVV EKRTPPANAP ATGVLTRSLD ILEKDSVQLS
FEIQTVGALN WKKVEWAPTI KYASDPDTVR VTPFKQMYNK PLVIKASKPL SGNLGTSTDY
DPGITLVSKL KVIRKDAGED KDTASVWMHI NREDGTLLHK RHYTLSKGDT LVVDTAKITD
AALLAEFSTG KLQTTFNIPN ELQSVDTAVV QILRDSLVYT TDSAGVKHFD HKEKVLLDTL
VASVFSGYNS LKFGHLYKGW GQFGYNGNGE YANEAIDPDV LQIKTDDYKD MADKFKNSHD
PKDLDGLMET NKQRFFIMAY DVARKVYISA TDSAYIGNAF QCSSRMGESE IKIDSIQYVA
GEGLSAPVLK TKATGEGFAV SAGADVGVVS FGVSGSKSWQ TSYTKVAAMD INGDGYPDWI
DENNDHICTQ YTAQTGTLSN LRLDTDVPLP KFNSDASTVG ANIGGAAKGK GKGAIAVSIC
SKGKPAPSST SGNTTSGDAG GGNEGGNSGG NNNSGSNSSG QNEDQNAGDG NSISSFSVSA
SGDFTSGTST TRRDWLDWNG DGLPDMLIGD KVRYNLGYGF TGEIQRSTGN MESSSNSTWG
AGLGTSINIL GPANISFGFN GTKTTTLTEF SYADLNGDGL PDMVSRDGDK VKVSINTGTG
FINDVYHGAG SPGRSLATSV SGYGNTAVKF SIHLLFLKFS LTPSIKAAAS EGVSRTENAL
VDIDGDGYPD FVESDGVDKL IVHRNLTGRT NLLKGVTLPF GGHVSVEYKQ TEPSYNTPGR
RWVMASVETT GGYAENGATK MRNEFEYEGG YRDRRERDFY GFEKVITKQI DTQNGNAVYR
KQVAEYGHNR HFYMHDLVTA
//