ID F3AD84_9FIRM Unreviewed; 1760 AA.
AC F3AD84;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN ORFNames=HMPREF0992_01029 {ECO:0000313|EMBL:EGG79384.1};
OS Lachnospiraceae bacterium 6_1_63FAA.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=658083 {ECO:0000313|EMBL:EGG79384.1, ECO:0000313|Proteomes:UP000004368};
RN [1] {ECO:0000313|EMBL:EGG79384.1, ECO:0000313|Proteomes:UP000004368}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=6_1_63FAA {ECO:0000313|EMBL:EGG79384.1,
RC ECO:0000313|Proteomes:UP000004368};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Ambrose C., Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., White J.,
RA Yandava C., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 6_1_63FAA.";
RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGG79384.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACTV01000009; EGG79384.1; -; Genomic_DNA.
DR HOGENOM; CLU_001365_0_0_9; -.
DR OrthoDB; 9802600at2; -.
DR Proteomes; UP000004368; Unassembled WGS sequence.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.10.270.20; -; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 1.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR049053; AFCA-like_C.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR027414; GH95_N_dom.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR PANTHER; PTHR31084; ALPHA-L-FUCOSIDASE 2; 1.
DR PANTHER; PTHR31084:SF19; GLYCOSYL HYDROLASE FAMILY 95 N-TERMINAL DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF01473; Choline_bind_1; 1.
DR Pfam; PF19127; Choline_bind_3; 3.
DR Pfam; PF07554; FIVAR; 1.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF14498; Glyco_hyd_65N_2; 1.
DR Pfam; PF21307; Glyco_hydro_95_C; 1.
DR SMART; SM00635; BID_2; 1.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS51170; CW; 8.
DR PROSITE; PS50853; FN3; 2.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1760
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003290960"
FT DOMAIN 1384..1471
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1478..1564
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REPEAT 1583..1602
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1603..1622
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1623..1642
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1643..1662
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1663..1682
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1683..1702
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1703..1722
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 1723..1742
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 100..126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1760 AA; 196784 MW; 984AD3FCED2A7B75 CRC64;
MKSKKLKKAF AGLCTVALTV SSVPVFAQTE QTKAPQKESG LRLWYNEPAS KGKNILNGGS
FGTTEEDNTW QQHTLPIGNS FMGANVYGEI GQERLTFNQK TLWNGGPSEN RPDYDGGNKE
TADNGQKMSD VYKEIIELYK EGNDAQANEL AKKLTGEVNG YGAYQSWGDI YVDFGLKEEQ
AENYVRDLNL ENAVASVDFD YQDTKMHREY FISYPDNVLA MKFTAEGSEK LDFDISFPID
NAEGVADKKL GKSVETTVED DTITVSGEMQ DNQLQLNGKL KVETEGGKVQ EKDGDKLHVS
GASEAVVYVS ADTDYLNKYP DYRTGETAQE LDASVERAVD KASKKGYEKV KKEHIKDYSE
IFSRVQLDLG QNVPDKTTDI LLKDYNAGKN TEAENRALEV ILFQYGRYLT IASSRAGDLP
SNLQGVWQNR VGDHNRIPWA SDYHMNVNLQ MNYWPTYSTN MAECATPLID YINSLVEPGK
VTAKTYFGVE NGGFTAHTQN TPFGWTCPGW DFSWGWSPAA LPWILQNCWE YYEYTGDVKY
MEEHIYPMLK EAALLYDQIL IEDEKTGRLV SAPAYSPEHG PVTAGNTYEQ SLIWQLYEDA
ATAAEILSKD EEKAKEWRQR QQKLKPIEIG ESGQIKEWYT ETTLGSMGEK GHRHMSHLLG
LFPGDLISVD NAEYMDAAIV SLKERGEKST GWGMGQRINA WARTGDGNQA HKLIQNLFHD
GIYPNLWDTH TPFQIDGNFG MTSGVSEMLM QSNMGYINML PSLPDVWANG SVKGLVARGN
FEVSMKWADK NLTEATLLSR NGGTATVQTK NASLATVFDE KGNPVEMKAL TADRISFETE
KGKTYIIKNI PQSVKVPTGL TAERLDEQRV ELTWDEAEAD GICFNVYRKV GNGDVQLIAS
NVKTNSYKDV TAYEKLGTMK YQITAVTSEL ESEKTEFVTI KDKGVTAGMI DDTDSHIIYE
GAWGDWKEEV NYNGTIKYLN TPQGNESVSL NFVGTGIEVI TCTNHDRGMM EVLIDDKSYG
EIDTYSAQTK RQQKIFEKDD LPYGNHTITL KVLNKSSQGA GKSTKVELDA FRVLDNTMSI
PASVQVSAVS GITIIGKANS TVQMKAEILP EDVQDKSVTW SSSDTSIAEV NENGLVTVKE
KNGEVKITAV SKADTTKSGE VTLKVALKQG NANTVTEIED AVQENGKWKP NPSITWSEGW
STWDGEADRH HGGTKTEATG AGKYFEYTFT GNQIEVYVQK HQNCGNFEIY LDDEKVDTYS
FDTEIGTGED QALLFKSEEL SNEEHKIKCV ITDRGGKQQA NLDYLKVYAP ADSSVLDKSN
LQDVITEASV LSEEAYPQDK WAELQKVYKE AVAVMNKDDA KQDEIDKAAE ALEAAISALG
KAQPPVVKQE KGKAILVESK QIVLEWDKVQ GAKSYEIIDN EHQVKAETTD TFARVTGLEP
GTTYNFKIYA VNEGGKSEKA IEINEVTTTN PNGENTIPPV TDIAKKVTGK ESVKLTWKAP
ADTKTAGYIV YVDGAKKGVT EKEAFTLQGL VKNQIYVVKI IAFDEEGHKS VPAQFAFSFE
EEENEGWVHT GNGWEYYENG NKAIGWKDIS GTWYYFNGNG IMETGWEEVN GHWYYLNDSG
AMCTGWVYVD GHWYYMDQWG AMQTGWVSVS GHWYYMDQWG AMCTGWEEVN SHWYYMDQQG
AMQTGWVSVS GHWYYMDQWG AMQTGWMFVN GHWYYMDQWG AMCTGWVFVG GYWYYLNTDG
AMAANQWIDG YYVGNSGQMA
//