ID F3AV52_9FIRM Unreviewed; 1966 AA.
AC F3AV52;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN ORFNames=HMPREF1025_01606 {ECO:0000313|EMBL:EGG85423.1};
OS Lachnospiraceae bacterium 3_1_46FAA.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=665950 {ECO:0000313|EMBL:EGG85423.1, ECO:0000313|Proteomes:UP000005376};
RN [1] {ECO:0000313|EMBL:EGG85423.1, ECO:0000313|Proteomes:UP000005376}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_46FAA {ECO:0000313|EMBL:EGG85423.1,
RC ECO:0000313|Proteomes:UP000005376};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Ambrose C., Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., White J.,
RA Yandava C., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3_1_46FAA.";
RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGG85423.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACWP01000033; EGG85423.1; -; Genomic_DNA.
DR eggNOG; COG1554; Bacteria.
DR eggNOG; COG3250; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_001365_0_0_9; -.
DR OrthoDB; 9802600at2; -.
DR Proteomes; UP000005376; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00063; FN3; 3.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR Gene3D; 2.70.98.50; putative glycoside hydrolase family protein from bacillus halodurans; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR049053; AFCA-like_C.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR011081; Big_4.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR027414; GH95_N_dom.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR PANTHER; PTHR31084; ALPHA-L-FUCOSIDASE 2; 1.
DR PANTHER; PTHR31084:SF19; GLYCOSYL HYDROLASE FAMILY 95 N-TERMINAL DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF07532; Big_4; 2.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF14498; Glyco_hyd_65N_2; 1.
DR Pfam; PF21307; Glyco_hydro_95_C; 1.
DR SMART; SM00635; BID_2; 1.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS50853; FN3; 2.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1966
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003295776"
FT TRANSMEM 1940..1960
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1429..1533
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1538..1626
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 94..117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1222..1258
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1913..1934
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1966 AA; 214234 MW; 064D3C7D20730321 CRC64;
MKRKLSKAGG KALSVALAMA TVVSAAPFSV AAEDNVALAQ TQETEGNALR LWYDEEAPNS
YTGWEQWALP LGNSAIGASV FGGVQTERIQ LNEKSLWSGG PSDSRPEYNG GNIESKGQNG
KVMAQLKEKL KSGQGFDSNL AGQLIGVSDD AGVQGYGYYL SYGNMYLDFK NVTKNNVSGY
SRDLDLRTAV AGVNYDLNGA HYTRENFVSY PDNVLVTRLT ATDGGTLDFD VRVEPDEEKG
GSQNKPEADS YARTFDKKVS DNAIAIDGQL TDNQLKFSSY TKVIKDDGTA GQIKDDSKNG
KITVSGAKAI TIITSIGTDY KNDYPKYRTG ETKEQLAALV KGYVSGAEAK VKAGGYETLK
EDHVNDYDHI FGRLDLNIGQ AVSDKTTDKL LEAYKKGTAS ETEKRYLELM LFQYGRYLTM
GSSRETPVNE DGTKNERRAT LPSNLQGIWV GANNSAWHSD YHMNVNLQMN YWPTYTTNMA
ECAEPLIDYV DSLREPGRIT AKIYAGVEST EANPENGFMA HTQNNPYGWT NPGWVFDWGW
SPAGVPWILQ NCWEYYEFTG DTEYMQTHIY PMMKEEATLY DQMLMRDSEG KLVSVPSYSP
EHGPRTAGNT YEHSLIWQLY EDTITAAETL GVDEAKVAQW KQNQADLKGP IEIGDSGQIK
EWYNETTLNT DENGQKMGEG YGHRHISHML GLYPGDLIAQ NDEWLAAAKV SMQNRTDVTT
GWAMAQRVAT WARLAEGDKA YDVLSKMITN NKIMTNLWDT HAPFQIDGNF GYTAAVAEML
VQSNMGHIDL MPAVPKAWGT GNVKGLLARG NFAVDMAWAD NKLTEASIHS NNGGEAVVQY
ANLSLATVKD SDGNLVEITP VTSDRISFNT EAGKTYTITA IPDNTLAAAP TGLKVTKIKD
GETVLTWDAV KARTEVSYNV YRQIEGGEWV QVQTELKATE WTDTDAWDVL GTLKYKVTAV
IDGKESEFSS EATVDDCRNM IGMIDDQDER IKYTGSWGNW ISQEGNYGGS VKYLEDPTGT
ETAELTFCGT GIEVFSCTQN DRGKVEISID GEVWATADTY SATQKRQAKI FSTEEHKDKT
LEYGIHTVKV RAMGEKNASA SRDKVELDAF NVLNTEAARV EKVEVTSASG MTTVSKAGST
LQMKAAVTPA EAINKGVTWS VTTKSGSAAA TIDENGLLSA GTANGVVTVK ATSKENAEIS
GTCEITVAIP DVTAQSEIIE DCSDDKNSKN PKITWSETGW SNPYTGEADK HHGGTKTETS
TEGAWFSYTF TGTGIKFYAQ KNNTQGKFTV ELDGKAESDA VLWEEKSGGS PQQCVFEKTD
LENTEHTIKF TAATDHGNIN VNIDYLEVFT PAAASVDKSA LQAAIEKYSG LNKDDYAENL
WNEFKTAYDN AVKGMNDDKT TKEQADKLAA DLNAAGAKLE AAEIPAPTIP EDAKATVMGV
TSTEATVSWT KLPRAESYKV YAVEHEGTAK ALKAITDTVP DNAGEPVTTT ETTVRFTNLK
PGTSYVFMIV GVNHNKVESE KALCSEPFTT VAAADEKAPA QVSGISVVKE GEKDVKITWA
PSEDPEGSEV TYIVYVDGVI VSESDKTEYT LKNADESKGY SIRIVAVDAS GNKAVPAVIN
LTVDDWNAKT VVGVEKPADL KVKKGTAADK LDLPKTVNVT LEGGRVADVL EVKWTTDNYK
ADQIGTQTLS GELQKKKGVK IPDKFKTVTI NVIVEESEVN PPQPEDKVVT KVEELTDLTV
EFGTSVEKLA LPTEVKVTLD NETEDKLSVE WNTDAYKADE AGTYELTGTL QAKEGIKLPD
NFKTVTINVT VEEKGGEVDP PQPEDKVVIA VEKLADLTVK FGTPAEKLAL PKEVKVTLDN
KAKDKLSIVW NTDVYKADKA GTYKLTGTLQ TKEGITIPEE FRKVTVNVTV EEKGKVPPTS
GKEEGKNNSG AVQTGDDKNV WLPIAGLVAA IAVVAAVLII KNKKKK
//