ID R5GUZ6_9BACT Unreviewed; 1603 AA.
AC R5GUZ6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=FG-GAP repeat protein {ECO:0000313|EMBL:CCY16557.1};
GN ORFNames=BN773_00034 {ECO:0000313|EMBL:CCY16557.1};
OS Prevotella sp. CAG:755.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262935 {ECO:0000313|EMBL:CCY16557.1, ECO:0000313|Proteomes:UP000018353};
RN [1] {ECO:0000313|EMBL:CCY16557.1, ECO:0000313|Proteomes:UP000018353}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:755 {ECO:0000313|Proteomes:UP000018353};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY16557.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAXR010000176; CCY16557.1; -; Genomic_DNA.
DR STRING; 1262935.BN773_00034; -.
DR Proteomes; UP000018353; Unassembled WGS sequence.
DR CDD; cd00063; FN3; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR028994; Integrin_alpha_N.
DR PANTHER; PTHR44103; PROPROTEIN CONVERTASE P; 1.
DR PANTHER; PTHR44103:SF1; PROPROTEIN CONVERTASE P; 1.
DR Pfam; PF13517; FG-GAP_3; 2.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 4.
DR PROSITE; PS50853; FN3; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018353};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1603
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004380746"
FT DOMAIN 435..544
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 961..1079
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
SQ SEQUENCE 1603 AA; 171570 MW; 6F0C2209AE1A5DA9 CRC64;
MKKLFLSLSA VLAALFAPSV AAGQTLTDLH APRDARHGCT VDGVRHYVEY KSDARSFILY
DQKGNAVRTL TGDDFTDASL YAATDLNGDG RFDLVYRDGD HQLAVSLQQP DGTFRRVAQT
ADPSGDAPVI ADLDADGLTD VYHREGSRHS LLRQQSGDVW TNVSLPVTTD TLVTRSSGVG
VSSAFGSPYQ GGTIWGGATQ PDAGYGVNDA FDALSVALDV NDDGYPDLID TDAGGVMLSV
GDGRVYPADF GGRVTVHDFN GDAAPDYLVY DPDADRVSVR LSSPAGYTER ELLTNGQITG
VYCRDLNLDG APDVLLTLDY TGSYSYLVFL VGDGRGGFTT VERFFEDEYY FINLHCLGDR
WGVVANPERY GSYTNGVLLT WDDAWNVSST PLPAPFLDGS YFSDLDGDGV LEGVNYSNRK
VWRFDDIPLP AAPQAPGAPR LIADEANSRL RLEWTPAEAD GTRTADFTYA LRIERPDGTV
ISGGADADGR RYAFGDGNQG HNRTAWLHTG GWPAGTYRIA VQAVDPLGRG SQWSEPATYN
LRSAALGLTL DRVTLSTGDT LHATLSDGAA EERAYAFDID GGEVIACDGH TADIVFTTPG
RHALTVTATG ASGTSAARRD VEVTTYSPGT TDRNAMALTD FDADGRPEAY NQGIYVSDGR
GTFTRHPSLF NADVYNMSGL PVDLDMDGLP DLWGQVSKNG TYYTRLLNAG GLNFSVASPD
VFVDNGDGQF VRDEGLSGPY STPDILFDLD NDGRADFCQR DNAGTYVYLN LGDNRFRRHA
LPGKYFDLMA ASDYDRDGYV DLYGRDGDGV CVLRHRGDLT FERIALPAEL DDTEVYFADL
DGDGRLDACV RGYSERVMRF YYAAEDFRTA HSWPVEFLRP IFLDLDNDGR TELVDRGRNL
YYPQADGTLR TEPYDATTLP HPFSSSQMGN TARPPYQADI DGDGRPDFEG TVLRSGVLNT
PPTVPTAVSA VTTDEGVRLN WTPSTDAESR SEALRYNVSL RRAGASGAGA YVLSPLTGGS
AEAVPSEPYA RYFREAPTLL VPLSRFEAGA TYELSVQAVD PWGARSAFSD VFTFTVEASA
AVRMPAEAGE GRTVAVTYDA TLGTTPTWDW AGATATATAT GWDVVWNEPG VKTVACTVGG
KTYRRSIRIV EEPDIDLDLP WKVAAGSTFT FALPRVFQEK PECISLSVPD GLTLTMDERR
ATAAVTVAAN ATATGYSIGV GYADELFHLG QKSGFQVGGY GVRPDIGLVT ADAATGHNVV
TWQIAIPESD KDLFDSVHIY KETSVAGQYA LIGRALIGDG RFVDTQSDPT VRRSRYRLTL
GTFVASETAP GTPHSSVHMM INRAAGGGYN LLWTPYEGRA IGSYVLLRGS SPDNLQPFAT
LSGNDMSFTD RTAGDGATYY ALSYVPAEQA EARRAPTQEG GTSNVVCTTD APEVTSVEAI
AIQSVTGGSE LSEALPALQL VAVVSPLHAT FKQVKWSIAE GQSLASVNTE GLLTLLPNET
GGTVVVRAEA VDGSGVAAVA TFKAGRATAI ATPSASGLRL LPDASGGVRV EGVTAPVALR
VLTPDGRTVR LVRFEADGLL PAAAMPRGVV IITADGASLK LIR
//