GenomeNet

Database: UniProt
Entry: R5GUZ6_9BACT
LinkDB: R5GUZ6_9BACT
Original site: R5GUZ6_9BACT 
ID   R5GUZ6_9BACT            Unreviewed;      1603 AA.
AC   R5GUZ6;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 21.
DE   SubName: Full=FG-GAP repeat protein {ECO:0000313|EMBL:CCY16557.1};
GN   ORFNames=BN773_00034 {ECO:0000313|EMBL:CCY16557.1};
OS   Prevotella sp. CAG:755.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC   Prevotella.
OX   NCBI_TaxID=1262935 {ECO:0000313|EMBL:CCY16557.1, ECO:0000313|Proteomes:UP000018353};
RN   [1] {ECO:0000313|EMBL:CCY16557.1, ECO:0000313|Proteomes:UP000018353}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MGS:755 {ECO:0000313|Proteomes:UP000018353};
RA   Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA   Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA   Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA   Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA   Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA   Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA   Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA   Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA   Wang J., Brunak S., Ehrlich S.D.;
RT   "Dependencies among metagenomic species, viruses, plasmids and units of
RT   genetic variation.";
RL   Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCY16557.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAXR010000176; CCY16557.1; -; Genomic_DNA.
DR   STRING; 1262935.BN773_00034; -.
DR   Proteomes; UP000018353; Unassembled WGS sequence.
DR   CDD; cd00063; FN3; 1.
DR   Gene3D; 2.60.40.1080; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR   InterPro; IPR013517; FG-GAP.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR028994; Integrin_alpha_N.
DR   PANTHER; PTHR44103; PROPROTEIN CONVERTASE P; 1.
DR   PANTHER; PTHR44103:SF1; PROPROTEIN CONVERTASE P; 1.
DR   Pfam; PF13517; FG-GAP_3; 2.
DR   SMART; SM00060; FN3; 2.
DR   SUPFAM; SSF49265; Fibronectin type III; 2.
DR   SUPFAM; SSF69318; Integrin alpha N-terminal domain; 4.
DR   PROSITE; PS50853; FN3; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000018353};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..1603
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004380746"
FT   DOMAIN          435..544
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          961..1079
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
SQ   SEQUENCE   1603 AA;  171570 MW;  6F0C2209AE1A5DA9 CRC64;
     MKKLFLSLSA VLAALFAPSV AAGQTLTDLH APRDARHGCT VDGVRHYVEY KSDARSFILY
     DQKGNAVRTL TGDDFTDASL YAATDLNGDG RFDLVYRDGD HQLAVSLQQP DGTFRRVAQT
     ADPSGDAPVI ADLDADGLTD VYHREGSRHS LLRQQSGDVW TNVSLPVTTD TLVTRSSGVG
     VSSAFGSPYQ GGTIWGGATQ PDAGYGVNDA FDALSVALDV NDDGYPDLID TDAGGVMLSV
     GDGRVYPADF GGRVTVHDFN GDAAPDYLVY DPDADRVSVR LSSPAGYTER ELLTNGQITG
     VYCRDLNLDG APDVLLTLDY TGSYSYLVFL VGDGRGGFTT VERFFEDEYY FINLHCLGDR
     WGVVANPERY GSYTNGVLLT WDDAWNVSST PLPAPFLDGS YFSDLDGDGV LEGVNYSNRK
     VWRFDDIPLP AAPQAPGAPR LIADEANSRL RLEWTPAEAD GTRTADFTYA LRIERPDGTV
     ISGGADADGR RYAFGDGNQG HNRTAWLHTG GWPAGTYRIA VQAVDPLGRG SQWSEPATYN
     LRSAALGLTL DRVTLSTGDT LHATLSDGAA EERAYAFDID GGEVIACDGH TADIVFTTPG
     RHALTVTATG ASGTSAARRD VEVTTYSPGT TDRNAMALTD FDADGRPEAY NQGIYVSDGR
     GTFTRHPSLF NADVYNMSGL PVDLDMDGLP DLWGQVSKNG TYYTRLLNAG GLNFSVASPD
     VFVDNGDGQF VRDEGLSGPY STPDILFDLD NDGRADFCQR DNAGTYVYLN LGDNRFRRHA
     LPGKYFDLMA ASDYDRDGYV DLYGRDGDGV CVLRHRGDLT FERIALPAEL DDTEVYFADL
     DGDGRLDACV RGYSERVMRF YYAAEDFRTA HSWPVEFLRP IFLDLDNDGR TELVDRGRNL
     YYPQADGTLR TEPYDATTLP HPFSSSQMGN TARPPYQADI DGDGRPDFEG TVLRSGVLNT
     PPTVPTAVSA VTTDEGVRLN WTPSTDAESR SEALRYNVSL RRAGASGAGA YVLSPLTGGS
     AEAVPSEPYA RYFREAPTLL VPLSRFEAGA TYELSVQAVD PWGARSAFSD VFTFTVEASA
     AVRMPAEAGE GRTVAVTYDA TLGTTPTWDW AGATATATAT GWDVVWNEPG VKTVACTVGG
     KTYRRSIRIV EEPDIDLDLP WKVAAGSTFT FALPRVFQEK PECISLSVPD GLTLTMDERR
     ATAAVTVAAN ATATGYSIGV GYADELFHLG QKSGFQVGGY GVRPDIGLVT ADAATGHNVV
     TWQIAIPESD KDLFDSVHIY KETSVAGQYA LIGRALIGDG RFVDTQSDPT VRRSRYRLTL
     GTFVASETAP GTPHSSVHMM INRAAGGGYN LLWTPYEGRA IGSYVLLRGS SPDNLQPFAT
     LSGNDMSFTD RTAGDGATYY ALSYVPAEQA EARRAPTQEG GTSNVVCTTD APEVTSVEAI
     AIQSVTGGSE LSEALPALQL VAVVSPLHAT FKQVKWSIAE GQSLASVNTE GLLTLLPNET
     GGTVVVRAEA VDGSGVAAVA TFKAGRATAI ATPSASGLRL LPDASGGVRV EGVTAPVALR
     VLTPDGRTVR LVRFEADGLL PAAAMPRGVV IITADGASLK LIR
//
DBGET integrated database retrieval system