ID R5ZHG4_9FIRM Unreviewed; 591 AA.
AC R5ZHG4;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Bacterial group 2 Ig-like protein {ECO:0000313|EMBL:CDA28352.1};
GN ORFNames=BN504_01891 {ECO:0000313|EMBL:CDA28352.1};
OS Eubacterium sp. CAG:156.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=1262880 {ECO:0000313|EMBL:CDA28352.1, ECO:0000313|Proteomes:UP000018124};
RN [1] {ECO:0000313|EMBL:CDA28352.1, ECO:0000313|Proteomes:UP000018124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:156 {ECO:0000313|Proteomes:UP000018124};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA28352.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBBN010000027; CDA28352.1; -; Genomic_DNA.
DR AlphaFoldDB; R5ZHG4; -.
DR STRING; 1262880.BN504_01891; -.
DR Proteomes; UP000018124; Unassembled WGS sequence.
DR Gene3D; 2.60.40.1080; -; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 3.40.50.1110; SGNH hydrolase; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR032616; DUF4886.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR036514; SGNH_hydro_sf.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF16227; DUF4886; 1.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF52266; SGNH hydrolase; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018124};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..591
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039264433"
FT DOMAIN 499..591
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 39..97
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..97
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 591 AA; 65144 MW; 6F1B6CCB85316519 CRC64;
MGNQVKKMGK ALMLILAMVV CFNIGVMAQE IEATTAPDNV KEVETTKPSG ETETTKPAGE
VETTTKANET ETTKPSGETE TTTPSGNNNG TNTTQPNVRI LFVGNSSTYY NDMPQMVKGL
AVADGMNVTI QALTAANYKL YQFANEKDTY GSQLISALKN YKWDYVILQD HREMIITDIA
KTQTAIETLK PLIQSSGAKM LLYDTQADYI GRSFTATGYS YYLSHNEIQH YMIKGYYMMG
NKYGAQVISS GVNFERCLKQ FPDIKLYNAD NIHPTPTGSY LAACTIYGTI FNTTALENKY
LPESEYDSNG LLKKVTKEAA LKMQAIADPR LTIDTSYVEV NKGFNSKVTA TLIANEKNEI
LKDYKNEIQY SSSNDTAISV NKYTGSYNAI GVGDSMIMAT TDSGLITMCT ISVKQPSTSF
AINETGIAKL YKGDTIQYTT KIAPSDTTDT VKWTSNNTSV ATVSDTGLVT AKKVGTAKIT
ATTTSGIKAV RYIRVKLIKP KNVKVKKLST KAKGSRYANI RITWTKNTNA VKYYVYRSSG
SGYKKIATTK KPKYTDKNKK KGVKYYYKVR AVYSNTKCNS TLSETKSISL K
//