ID R7I810_9FIRM Unreviewed; 1392 AA.
AC R7I810;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE RecName: Full=Ig-like domain-containing surface protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN770_01321 {ECO:0000313|EMBL:CDE47641.1};
OS Faecalibacterium sp. CAG:74.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Faecalibacterium.
OX NCBI_TaxID=1262897 {ECO:0000313|EMBL:CDE47641.1, ECO:0000313|Proteomes:UP000018328};
RN [1] {ECO:0000313|EMBL:CDE47641.1, ECO:0000313|Proteomes:UP000018328}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:74 {ECO:0000313|Proteomes:UP000018328};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDE47641.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBJB010000055; CDE47641.1; -; Genomic_DNA.
DR Proteomes; UP000018328; Unassembled WGS sequence.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 4.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF13306; LRR_5; 7.
DR SUPFAM; SSF52058; L domain-like; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018328};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1392
FT /note="Ig-like domain-containing surface protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039526713"
SQ SEQUENCE 1392 AA; 149778 MW; F37BB4F903749B90 CRC64;
MKRRLCLVLF LLMLVTCVTA SGEAADWNYD ANYGILRGYN GAGGDVVVPG ELDGFTVDVI
GVSVFRGETI TSLTLPETVL ELRSNAISTC DNLTRVSLPQ SLVVINRMNF FSCTALTEVT
IPAGVRYIGD TSFRYCDSLR KITFEGVCPA IDIDCFSLLP EGATAYVPDD QLDAYIAAFE
NAGSEVSVQP SGKNAVIVEN NGYVESEFDF DASTGTITSY NGYATYLAIP ETIGGAPVKA
IGPEAFAQHT YLALLELPEG LETIGDRAFY NCETLARVHF PSTLKFIGDS AFYNAYKSSV
LELPEGLEHI GAYAFYFAGI KGFLTLPEGL KTIGESAFES CSNMGGNLYL PSTLESIGSR
AFKGDYNIQY IVLESLTAPT LGEDVFAGCD YLYDIDLNAH GTRQEMQQWQ AYVDALGLPC
RVWRAQDPTA QSPEKGAYRY ENRVLTEYTG TRTRIHPHLT VSKEAVVGLG DGVFKGSQTI
EYFSVAHNDE FTTIGAEAFM NSSLRNVDLF DSVTTIGARA FAGCTQLEAL TLPDSLTTIG
EGALDGLTGL KKLVIQCDPA IIPAGVFANL PALSDVTVEF GAIPAHMFEG SGVTVLTLGA
GVTEIGDSAF ANTALTTAEM QNVTAIGAGA FANTALTSAE IQNVTAIGAG AFEGSALERV
RLNAAASVGE RAFANTKLTK MVIPTAGSFP LSAVEGTSVE LRLPADATDE QLAAWNETLQ
RPWYDPMLRE GEVSKFVKMP FEPTPAENFE FDPDTGLIAA YIGTDVDVVV PREINGVTVV
GFKNYNAFDA CHDYTDSSVT SDRTEWVRLR TLVLPETIKE LPDMMLAYCQ QLETFVCYAP
LESTGGNQFM LCRSLNNVIF VNGVREIDNY AFDSAGPLGN LYFGEHLIRI GQQAFNFAGL
SSFVADAERV EYGAFTECKN LTSLHFTSKM KDFGENCIIN CPNLAEICFD GCDLTASPMG
LMMNVAPKLT VRVPEDMSEE NIKHAQNCQS WSENRSEVTV ITEPCAHAQP TLPDLPALLP
TLKLDASVET AAPAQPGTFV AAEPEVAPEP TPIPEPVAQN AGIPDEYLGV WYGVRMEIGG
TPYPLADMGL DVTFTLGADG AAEMNTNGDV DSIHCAMQDG MLMADGISFT LQDALLTVSM
DDVTMAFSRE KPQASSAAVP SINESAIIDD FRGFWTLARV TADGITLPAE AAEMAGDTLV
IYGTVCDLTL HGTTLDGLSC RMDNFTLLIS ILNGEAAVTL REDGTLSLEM SDLTLWYERT
GDAPEAAAEP TAEFNPAVTP EPAAGAEIML EKKYVMTDAD VSGYNMTAAQ LGNYEYSLLF
HEDGTVKLVI AGADIPGLTW VFGKVPTEAG EVDGIVINYY TQALYIVPTE KGCDMDYFGS
MLIHFAPEES AK
//