ID A0A1S2VPP8_9BACT Unreviewed; 1633 AA.
AC A0A1S2VPP8;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN ORFNames=BLX24_01190 {ECO:0000313|EMBL:OIN60747.1};
OS Arsenicibacter rosenii.
OC Bacteria; Bacteroidota; Cytophagia; Cytophagales; Spirosomataceae;
OC Arsenicibacter.
OX NCBI_TaxID=1750698 {ECO:0000313|EMBL:OIN60747.1, ECO:0000313|Proteomes:UP000181790};
RN [1] {ECO:0000313|EMBL:OIN60747.1, ECO:0000313|Proteomes:UP000181790}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SM-1 {ECO:0000313|EMBL:OIN60747.1,
RC ECO:0000313|Proteomes:UP000181790};
RA Huang K.;
RT "Arsenicibacter rosenii gen. nov., sp. nov., an efficient arsenic-
RT methylating bacterium isolated from an arsenic-contaminated paddy soil.";
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OIN60747.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MORL01000001; OIN60747.1; -; Genomic_DNA.
DR RefSeq; WP_071501250.1; NZ_MORL01000001.1.
DR Proteomes; UP000181790; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:UniProtKB-KW.
DR CDD; cd02850; E_set_Cellulase_N; 1.
DR CDD; cd00063; FN3; 3.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 3.
DR Gene3D; 2.60.40.10; Immunoglobulins; 6.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR004197; Cellulase_Ig-like.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR PANTHER; PTHR47135; FIBRONECTIN TYPE III DOMAIN-CONTAINING PROTEIN 7; 1.
DR PANTHER; PTHR47135:SF2; FIBRONECTIN-LIKE; 1.
DR Pfam; PF02927; CelD_N; 1.
DR Pfam; PF00041; fn3; 2.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SMART; SM00060; FN3; 5.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 4.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS50853; FN3; 5.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000181790};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1633
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010369936"
FT DOMAIN 640..725
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 881..967
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 975..1061
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1071..1157
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1305..1388
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
SQ SEQUENCE 1633 AA; 177287 MW; B84B3A1A43FAE033 CRC64;
MKQQILWVLV WLTTLYNAWA APPVTNANIR VDQFGYLQNA RKVAVIINPQ VGANAGQTFT
ASTGTNQYQV RRWSDDAVVF SGTLQAWNNG ATQASSGDKG WWFDFSMVNT PGAYYIFDTG
NNVGSYRFDI GDQVYRDVLK AAVRTYYYQR INFAKVAPYT DPKWADAATH EWPGQDREAR
SAKDKTNART AKDLHGGWMD AGDMNKYVTF AEEPVALLLE AYRLNPAVFG DDFNIPESGN
GVPDILDEVK YELEFIKRMQ DATGTDGLLL KVGVDTWDEF DRIGGLPPSA DKRPRYYLPE
CTSSTLSGAA MFAVGAAVYR QIPSQVAYAQ DLLARAERAF ARARATTNYF STFDTNCDDL
DIRAGDADKP IAIQKQSAVV AAVYLFEVTG KAEYKSFIDN NYNQIQPMAN EWWGPYLMHV
HAALLRYAAN PQATPAVANA IRAMKSQQNG VLSINDYTAK ADLYRAYMKD DQYHWGSNQV
RGNVGMANLD FVNFGINPRQ KALYREVAGE YLHWFHGVNA QGKVMLSNMG AYGAEDSMDE
IYHTWFQDGT DWDNAKTSPK GPAPGYVPGG PNKLDEYNGT EGYLRTEPLQ KRYKDWNGGF
PENSWFLTEV AIYNQAPYVS LLSRMMIPTS DPNDTEPPTT PTNLVASGLS PYSVKLTWNG
STDNRGVTAY EVYQNGVRID ETPDLFLNLN SLSPGNSYTF VVKAVDFSAN RSEASNVVIV
NTPSPSPNDF VVYGDAYKST IAPWSWNTTD NPNNTSPVKS GQKSIRADVT QAWGALSLRN
SEIINTAYYP GGLQFWFYGS EKGVRVSIHT TETAPASDTY RIPAEPEMWT LIRIPWTEFG
NPTQIQRISI GDASGGGQTF YLDDIRLVAG PDTPDTQAPT APANLTATNI TQTSMRLAWD
AATDNTGAVT YTVLRSGAVI LSGLTSTSVD LTGLTCNSLH ELAIQAKDPS GNLSPKSNTI
WIMSASCYDG QAPTVPTSLA VSNITTTGLT LTWTASTDNV AVTGYEVYRN GTLIADNVSG
TTLGVNGLTC NTAYAFTVRA KDGAGNRSAL SNPVSATTTA CAQADTQAPT VPTNLVATNI
TSTGLTLTWN ASADNIGVVA YEVYQNGVLL NGNVTGTSLA VNNLTCGTAY PFTVLAKDAA
GNKSAQSAAV SATTPVCPPL PPTGSEVIYD EVLNSSWQEW GWSVTTNYAN ASPVKVAQKS
LAIQYAEGWG GWAITRATPY VPTPNTTIRF WVYATTNKTL GVCTASENES GFSPYVSFKP
TPNTWQEVVV TMPQLVNPAR IKRLVIQTQE SGANLIYVDN VRFEGSFNAS VSNVTPVGLR
LNWGAVEQAA TYAVYQNEVP VNANVSGTQL DVTGLTCGTG YTYSVVAKNA AGTVLATSVP
VSATTAFCAG SSSVEMIYDE AFNTAGWDDY SWGLTKNYAS TTTVKAGQKS AALTFEGWGA
WSIFRKTALP VTSQTTIRFW IYATTNKTLA LTTYAENDSQ ASRTLNLTPT PNIWQEVVVT
MPQLGNPTRI KRLVINLPTS GPNPVFLDNV RIETPNGSAR EAASPETGPA GRLMVSPNPA
DGPVRVQVTT TAVEAGGFTL VSQNGTPVMT ENRPVAAAEH EWMPDVSKLA PGLYIVQWQS
ATQRLTGRLM VIR
//