ID A0A1B6B987_9FIRM Unreviewed; 1482 AA.
AC A0A1B6B987;
DT 02-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE SubName: Full=Large exoproteins involved in heme utilization or adhesion {ECO:0000313|EMBL:GAU76296.1};
GN ORFNames=F3D3_0893 {ECO:0000313|EMBL:GAU76296.1};
OS Fusibacter sp. 3D3.
OC Bacteria; Bacillota; Clostridia; Eubacteriales;
OC Eubacteriales Family XII. Incertae Sedis; Fusibacter.
OX NCBI_TaxID=1048380 {ECO:0000313|EMBL:GAU76296.1, ECO:0000313|Proteomes:UP000095197};
RN [1] {ECO:0000313|EMBL:GAU76296.1, ECO:0000313|Proteomes:UP000095197}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3D3 {ECO:0000313|EMBL:GAU76296.1,
RC ECO:0000313|Proteomes:UP000095197};
RA Serrano A.E., Escudero L.V., Encalada O., Tebes C.J., Fernandez S.,
RA Demergasso C.S.;
RT "Draft genome sequence of Fusibacter ascotence 3D3 isolated from the high
RT arsenic level Ascotan salt flat in northern Chile.";
RL Submitted (SEP-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAU76296.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BDHH01000003; GAU76296.1; -; Genomic_DNA.
DR RefSeq; WP_069871549.1; NZ_BDHH01000003.1.
DR STRING; 1048380.F3D3_0893; -.
DR OrthoDB; 6372180at2; -.
DR Proteomes; UP000095197; Unassembled WGS sequence.
DR Gene3D; 2.160.20.110; -; 2.
DR Gene3D; 2.60.40.3440; -; 2.
DR InterPro; IPR011493; GLUG.
DR InterPro; IPR001119; SLH_dom.
DR NCBIfam; NF012211; tand_rpt_95; 2.
DR Pfam; PF17963; Big_9; 3.
DR Pfam; PF07581; Glug; 2.
DR Pfam; PF00395; SLH; 2.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000095197};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1291..1349
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1350..1413
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1418..1481
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 1027..1062
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1046..1062
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1482 AA; 159087 MW; 92E93A35A635E73B CRC64;
MNRKLIQLIF VLILLVGVGS ASYAMEIYNL NSEQINYIYT VEDFDKIMND SAGFFVLMND
LDLSTSKSIG QIAFAGILHG NGHVLSNWSA PNGLFSVINN AEIYDLGLES GSATNYVLGS
QITNSKLENV YVDHIVTTSS SLTGILLNST LTNMYVTLPG GFATLKSGNT ETSVYYDADK
FTGTTTVGAG LTTAAMMNQV NYTGWDFNHI WTVEESVTNP NFGQYNSLRL MGSSVHYNKA
EGKVAIKLVF NQTPSIGTGS IHLLRKVDDS EVVTFASSSS VLEGKVATLS SEILLEENVE
YYITVDASAF DNGSGGTMII LGRVSKAFEI LHFELGDGTS EHPYEIRNFE ALNKIREGLE
YHYKLMDNMD LEGKSMAPIG NAATPFIGGF DGSDHVISNY TYEALDGSYA GLFGYSKGSI
KNLGVNNLEI KIQGSASSSV SLSGSYVGGL VGYNTGRIEN CYTTGVVLGK SNVGGLVGNN
QGVITLCYSE ANVTSSDYLS SGYNNEGFYT GGLSGRNALG GTISLSYATG DVAGCEQVGG
LVGNAFKGGL ISDSYASGDV IGYYDVGGFM GRNYSGSTAE NVYSTGAVSL YTGGTHVGGL
IGTQASAGTL TNGYYNQETS GQNDDLGNGS PRMTSEMFLE STYVGFDFNT IWTIIENRSM
PYLNLNEAAA LPGLINQAPT LTTTAIYTLE DTVVSGNLLG VDPDGDAINY TLVQVTSSGS
INVTTSGAYT YMPNPNFNGE DQFLVILTDG FVTTTSAAIT VHVESVNDAP SVSNGTISLD
EDSSFVGQLV STDADENDVL EYTLITEPVH GTVTLNHLTG YFVYSSVANY YGSDQFTWQA
SDGVTSQQAT ISVAIASVND SPVTTHKTFD MRYNEVLSTD LALLSTDVEG DALTFSLNSS
PIFGTAHILM GSSLLSYTPV HIGIETLVYS VSDLNTTTVG SITIHIVNAE QSAPTGLTAE
RASQNKSDGA IIGTDSTMAY KKVDATEWTE VTSDKITGLT AGTYAVRFKA KLGYNAGENA
YVTVEIDEDS TGAGGTEDAG GSGDRDEGSS YNSPETSGEI LSGSIQPVSA SIVIKTTDTT
EVIDGKTVTQ VGVETNEIRA YLKSETPNGE NQNLVSFKTA QTSDSVQFNL TGEAVDLFSK
NQIDYAFETP LVAYQFTSNA IDIKRIAMAL GLSESQYANI KIGVNMKRTS QEIRPDGVDG
KVLMSPIQFE ITASYTSESG EIKTISVHAF NHYVERTFTL SDEIDQALVT TGVVINSDGR
YAHIPTIVIK ENGKWKVKMN SMTNSEYTLI YNDVKVSSVS GHWSETVVNR MASDLVLVDY
ETFKPNEAVT RGEFIDYVVR GLGLYRDTVD FETKLNDIEN SRYAQSIQIA NAWGLVSGYE
NNTFRPNHQI SREEAMMIFA NALTLIQFEH TGTVNHDISQ YIDKTEISNW AVDAVNRSLK
VGVFSGRSNQ QLAPKENLTC AEAVVAVHNL LIKTELIAGK TN
//