ID A0A0Q9TWJ8_9BACL Unreviewed; 3216 AA.
AC A0A0Q9TWJ8;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN ORFNames=ASG93_03015 {ECO:0000313|EMBL:KRF43901.1};
OS Paenibacillus sp. Soil787.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1736411 {ECO:0000313|EMBL:KRF43901.1, ECO:0000313|Proteomes:UP000051948};
RN [1] {ECO:0000313|EMBL:KRF43901.1, ECO:0000313|Proteomes:UP000051948}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil787 {ECO:0000313|EMBL:KRF43901.1,
RC ECO:0000313|Proteomes:UP000051948};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KRF43901.1, ECO:0000313|Proteomes:UP000051948}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil787 {ECO:0000313|EMBL:KRF43901.1,
RC ECO:0000313|Proteomes:UP000051948};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRF43901.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMSP01000001; KRF43901.1; -; Genomic_DNA.
DR RefSeq; WP_056828670.1; NZ_LMSP01000001.1.
DR STRING; 1736411.ASG93_03015; -.
DR OrthoDB; 273314at2; -.
DR Proteomes; UP000051948; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 1.
DR CDD; cd08983; GH43_Bt3655-like; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.40.1080; -; 3.
DR Gene3D; 3.30.1920.20; -; 2.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 3.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.1180; Golgi alpha-mannosidase II; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR039514; 6GAL-like.
DR InterPro; IPR039743; 6GAL/EXGAL.
DR InterPro; IPR046780; aBig_2.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR033452; GH30_C.
DR InterPro; IPR013780; Glyco_hydro_b.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR PANTHER; PTHR42767; ENDO-BETA-1,6-GALACTANASE; 1.
DR PANTHER; PTHR42767:SF1; GLYCO_HYDR_30_2 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF20578; aBig_2; 3.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF14587; Glyco_hydr_30_2; 1.
DR Pfam; PF17189; Glyco_hydro_30C; 1.
DR Pfam; PF13385; Laminin_G_3; 2.
DR SMART; SM00635; BID_2; 2.
DR SMART; SM00060; FN3; 5.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 3.
DR SUPFAM; SSF51011; Glycosyl hydrolase domain; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR PROSITE; PS50853; FN3; 3.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..3216
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006384777"
FT DOMAIN 551..642
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 728..815
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1395..1484
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 626..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1667..1686
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3163..3182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3216 AA; 345941 MW; 3F18A900B1819C1D CRC64;
MLKRTLAMIT SVLVAVSPIA STATHAADVN RTLVNINASS TFQTIDNFGA SDAWSMDPIG
KEWSESNKEK IADLLFSQDK GIGLSGWRFN IGAGSSDTDE GIITIPWRRA ESFKKDENSA
YDWSKQAGQQ WFLKAANDRG VKDLIGFVNS PPVWMTKNGH AQPDASVGST NLKSGYEGKF
AGFLADVMDH FDKSGIHFNY VSPINEPTWD WNGAGQEANR YNNTDIINVI NSLYAELQNK
NLMTKISAPD GVEILSLLDD AKYAQFMATS QNPNDNKLQY QGGSNKLNVG KYREYIKDLL
GNPEIASKIS NKIASHSYWS DTVSDRNGDR LGELRKLLWE NIQSTLPGSS YWMSEYCILG
DVGEMQGNGR DLGIDPALLI ARTIHYDLSV ANASAWQWWT AVSKGDYKDG LIYTDYGMPG
DEQSIYTSKM LWALGNYSKF IRPGAKRIAM SGLAENDPKG LMGSAYLHES NHKLTSVYVN
YTNEDKPVTV QLNSLPDGKK VYHLTPYVTN ANESLAQHDM VTADADGTFQ YTIPARSIVT
LDGSYVSTDQ APEKPGIDSV KSLNKAAEIK IHEVAGAESY KIVYGTSPDN MNMTAGPVTG
TNYTLLGLEN QSAYYLQAIA VNRNGDSSPS DTVSVTPKLS PPQQVSANPV DGGVEIGFSG
DPHVPHYLAK WGTASGTYIG TVEIAGQDGL YKGSLTGLTN GSSYYLVIQA QDGAESSTLS
PEMTVMPTVQ PPSKLITIPG DGKVQLEFPE VQGVTRYSLE MTTENGEPTI LELPTNSATV
DHLTNGQSYS FRVSSVGVGG VGQPTANIQA TPQAEAITWE DDFQNSSMSS YNPDVSVWNI
ENGLLKHQSG GDNQGELSIK NVNIIDGTIS VTAKHSELGA DWGIVFRGTD YNHAYSFVFE
NDALFLRKNG TNLTKPKAFS AKLNQLYQLK IVLDGPHIQA YSDGELIFDV LDSTYSSGMV
GLHSWSNAQF SYLKVTRADG LMKAPEIYTV HIGNHRIDLK YAEVDGATAY TVQYGTTSDN
LQNTLAVSGG NAAVTGLSNN QVYYFKVVAT NGSIQTESSV KSGVPRDIQE PQLLYYVDAG
DGSPGTLEDG ESFGALNGVE DQAYGVDPVT GNMWGYAADG DATWARTDAS GNFETIRQYD
GNDLNKGLAY KFNLPNGTYR VTVGFFDPWH DSNRKMDLTI NGTTKLTDYV IGNSQEAKQF
DSIPVSNGQL EVKVIKKAGP KPMLSWIKVE KDLLYYVDAG DSSPVKLEAG EEFGVRNGVE
DQAYQEDPVT GYKWGYDADD SQTWANQNED RWNSIRQYDG STNGKGLSYR FEVPNGMYNL
DLGFDDPWDS SDRVMDLEVE GQKVLTNYLT GSGQDVKRVV GISVTDGELD VKITKQGNSK
PLISWLTVQH DNGIPVQPVI KRVVEGDSKV TLYFDETLNA TYDLAYGTDQ GSLTHHVSVS
GNVNHYTAQN LTNGTPYYFA LATVRGDLSS PLSDVKTSTP VGPADPNLYY FVDAGATAVQ
NGVTLGVKQS VPDQIFGVDP ISGKSWGYVA DDGKTWAKSD ETDPYLSIRQ YDGNDNGKGI
AYKFEVPHGT YTVQMGFDDP WDSSSRLMDI SIQNDLKLQN YVVGDKREIK EFTVDVDTDQ
LEVKLVKAGS DKPAVGWISV RYVKPLPPPV NTGPKLWYKF NEGSGTTVGD STGNGHTGTL
SEGASWGTAN NGSGAVTLNG VGGFVQLPNG ILSDITDVTV ATNVFIDPTV SNPYWIFTFG
SADDPASAPG TKYFGMLTDG SGSSRVSITN DRWSAEQNVS KGSSIAKGVW KNVAVTLSGT
TMAFYEDGNK IAEKSNVTLS PKDMEATIAN YIGKPAYPAD HYLKGQISDF RLYNRALSAD
EIKHVWMESL TDSEAVQAVK DGLNLGDTTA VVSNLTLPIT YAAGVNITWS SDRTDIITVD
GKVTRPLDAD TTVHLTASIS KGSATDTKTI TVTVLKAGDV VGLDSRSYVL FFNQAHETKV
TVTHPNGSVE DVTGLATYES SDSNVAKVDG SGRVMGLHKG TAVITVTYKG ETYPVTVTVQ
NEMLLWYKLD ETSGTSAIDS SGNGNDGVLK NGATWGQGIG GSSLQLSGGY NGAYVQMPNN
LLQGVDDLTI SAFMKMDATA TAPQWLATFS NTTNGYIYFA PTLSGKFRYA ITPTNWSGES
GVASSSIPVN SWRQVAVTYS SVTGTATLYV DGVAVQTNEI SKKPSSLEPT NANFIGKPWP
NYGDPYFKGN VSDFRLYSRA LNASEIQVIY GSKLGDMFIT DQMDLTLGDT SAVQDNLTLP
SRGLYGSAIT WTSSNEAVVN SQTGAVTRPA AVSGNATVTL TATLSNGQAT ATKTFTVTVM
KQLADAEKLK HDAEKLTVHN IDDVRGNLTL PVQGDYGSLI SWHSEDAAVV TPTGEVKRPV
SGSGDVEIKL TATLRLNDEV LTKAFLAHIK ELPAKPDYAG YLFAYFIGEG TKDGEQLYMA
SSNGNNPLNW NNLNNGKPVF TSHLGEKGVR DPSIVRSPDG DKFYMIATDL NIYSNGDWTR
AQTTGSTSLL IWESSDLIHW SNQRLVKVSS DLAGNTWAPE AFYDKTTGEY VVFWASKIYS
DASKSGSPNE RMMYAKTRDF YTFTEAQEYY NPGYSVIDTT MIENNGKIYR FTKDERDFNA
TTAPNGKMVF EESGNAIFGN FTMIKEGIGK GSIPAGEGPL VFKANGENKW YLFIDYFSGG
GYKPFYTTDL ASGIWTPASS GYSLPTPAPR HGTVLPITAE ELARITGTLP VEVAPAVSEV
TNVTLDQQAL ALKPGQTAGL KATVVPDLAG NKTVLWSSSN ESVAIVGNTG NVTAIGSGTA
AITATTVDGG RIATAQVTVL PVIPPTTSDD APAGWVNKDV TVTLTASDSG SGVAGTYYTV
DGGAEQQGTS VTITAEGNHM ISYWSVDKIG NAESPHTAVV QLDKTSPTIT GAPTTSANAS
GWYPSDVTVH FTCTDALSGT ASCAQDSTIS TEGANQRVKG DAADSAGNTA SVTVNGISID
KTAPTTSDNA PAGWSNKDVT VTFTASDSGS GVAGTYYTVD GGAEQQGTSV AITVEGNHTI
SYWSVDKVGN TESPHTAVVQ IDKTAPSLNL VLDKTTLWPP NHQLITVTAS VYTNDSLSGI
SSIVLTSITS NEPDNGLGDG DKTNDIQGAQ FGTLDTEFML RAEQSDNDNN KDNHNDKNKG
KDKGRIYTVT YTAIDFAGNK TTNTATVKVS NNQSSK
//