ID R6X2R6_9BACT Unreviewed; 2634 AA.
AC R6X2R6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=CARDB domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN673_01062 {ECO:0000313|EMBL:CDD00787.1};
OS Prevotella sp. CAG:474.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262926 {ECO:0000313|EMBL:CDD00787.1, ECO:0000313|Proteomes:UP000018116};
RN [1] {ECO:0000313|EMBL:CDD00787.1, ECO:0000313|Proteomes:UP000018116}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:474 {ECO:0000313|Proteomes:UP000018116};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD00787.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBGF010000126; CDD00787.1; -; Genomic_DNA.
DR STRING; 1262926.BN673_01062; -.
DR Proteomes; UP000018116; Unassembled WGS sequence.
DR Gene3D; 2.60.40.2030; -; 1.
DR Gene3D; 2.60.40.680; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR044060; Bacterial_rp_domain.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR011635; CARDB.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR032812; SbsA_Ig.
DR Pfam; PF13205; Big_5; 1.
DR Pfam; PF07705; CARDB; 1.
DR Pfam; PF18998; Flg_new_2; 1.
DR SUPFAM; SSF141072; CalX-like; 3.
DR SUPFAM; SSF49478; Cna protein B-type domain; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018116};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2634
FT /note="CARDB domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004424046"
FT DOMAIN 708..802
FT /note="CARDB"
FT /evidence="ECO:0000259|Pfam:PF07705"
FT DOMAIN 2124..2212
FT /note="SbsA Ig-like"
FT /evidence="ECO:0000259|Pfam:PF13205"
FT DOMAIN 2265..2312
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
SQ SEQUENCE 2634 AA; 286675 MW; A621F22B5CB74F62 CRC64;
MKMKEKAIYI LLLLVAMTLG AKAQNTISVA HLTGGQGKDV MIPISMDNNE DVVALQFDLQ
LPFAKTSGKQ PTLTNRNTNG HTVSVRGMGS NRYRVVIVNM SNKPIAGSGG VLLNFPMTVP
TGLNPESVHA ITLSDVVITN RNGDNIQTGS TNGSYTIQRA PSPDLEVSDV AIRQTTLTPG
ERVSVSWKVS NVGNADTRSG WTEKVYLVNT ETEEAIYIGN VYFNNTMLQG GHTARSAEFV
LSQTVGVDGE VAAKVVVEPN SNMGEYATDR LNNTAIGGKA TVGKLLFLTA PATRLKEGQS
MRLQLKRSGN RATDETYSVA TSLPDHVSMT TQVTIKAGQS TATFDVTVPD NDYVNSYKAA
SVTVTKAHGY PADVAVSFDI EDNELLPLSL QLDKSEYNEG ESIKLRVSVP YRIEGEELTV
SLSIEKPRRF RLPLTFTFPD GATEAVIDIP IVQDNLPAND ETIKIIVSAD HHLTASTLFI
LHDDDVPAIS MTLQPTTVSE AAGYNAVYAT ITRNSAVNSK ITLKLSDDSN NELYYTQTIT
MNEGVEEVTF PIGVKDNQKV DGTRTVKFTA AVYITDCGCS AIGNKQTTVT ETITITDNDG
PTLNITSDKT TILEGDATGA QLTISRNDDT SQPLTVALSA KGNDLDFAQT VTIPAGKESV
TTPFVALSNS TTEGNRTISV MASSEGYSPG TVWLLISDQT LPDAEMQKPE VAAGVEAGSQ
VKVMLHIKNV GAIAMPQSTP ITVTLADATT TLQTDKPIPP GQTYDTWAEV KAPAVPGTYR
VTAHINPDGK VTELQTMNNS STIDVKVVAA YTFTIAAAKD KYLFGDKVVL TGSVKRTDGT
AAAGIEVEPY IVYARSRTRL AAVSDAAGNF SIDYTIPAGM GGEFGFGVCT PGENLATEQG
NFKVYGFSRT TTDYITNKLF VGEPFVGKIM LKNMSDLDLH NVRVACADAG AYTVSFNTVA
VLPGNGTAEI TYTLLSTALS KTKDWDKLVF AITTDEGARL DVTTYNYTNE HTPTLVVETN
SINTTVTKNK TRLYPLVITN TGLAPTGKIT VDLPKALSKF ISLATPATMP SMATGDSATV
MLRFNAADFY VNIIQKGSIA INCEKGDGKQ VFFNVKVVSE DKGSLKVRVQ DENTIYGNKN
GERPYVSGAT VRLTDYNTGA LVMSDVTGDD GYVLFEHLNE GYYHLNVTAE KHDSYSQNVL
VSPGDVTTHL ATISYQAITV NWDVEETEVE DEYEVVTKVT FETYVPVPVV DITAPDMLIL
KDIQPGTTTL MNVVLRNRGL IAAQDVNYYP PTAHGFTFMP MVEHTGLTLA PEQSYVIPVL
VMHTEDFDNP QFAPHVKRLM PRRASSSEKN CKGKMGTDYK WPCGAGSKYS YLEKPIQWAY
GNDSDCKNTT NEGVEWTPPT QMGRPGGPGG PGYNATVTIN PGGEAVAKAV ITLMCEVCNC
VCVDPVAWLP CATSGIMVAQ GSWDGVQGAK GCGNELVQWA VGMALGPAGK LSCLVKPFDP
DKAPMAKAPA DDDSKPTVLP ALLESSGRKQ VMYWRYYESF FNYNAELTGA REALATEGFY
DELVSALADI DFALKKMQND GDLWDFDLST IPATTTVDDK SQGLGAYLTS LMPNKRANVA
DFSLRSYVER IRNKWRKLEG MDYDSDNHPD EDKLARIVRE RDDIVAQMVD MGFATLDDLM
NSARKDWLLY QETASQNTCA EVKLEIKQKL VLTRQAFRGT LTIDNGSNSD MSHIMVNVNA
TNMETGAIAT RHEMEIQVEK IEGFGGEKDG EWTLPAGKKG VATFLFIPTK YAAPENVTTY
SFGGTLAFND GATDQTRSLF PVSLQVKPSP DLDLTYFMQR DIYGDNPLTE DVIEPVVPAE
FSVLIHNKGN GDANNVRMIT QQPQIVDNQK GLMIDFAILS SSLNGGEKTM ALSSDVATQF
GTIKAGEASY ATWELTSSLL GHFTQYDVSV THVTDYGNPD LSLLDRVSIH ELIHSINARI
GEKTYRAWIT NDYPDAFDDP DHIYFANGTD EDLVALRDAT RIEALGDSKY RITVDVAQRQ
WFYTSVANPA GKYAKILSIK NETTGETLDA DNFWTTDYTL QDGIDPLLDY RLHIADLSSG
PGAVKYIVEF EPMPELRLDV VSIETVPEDD QIAEKPITEL TVKFNKDIDT QTFTRDDIVL
RLEGKPQTTA LPITPATADN KREFKINTAA LTDNGFYALQ VKSENIRDAE GFMGAEGKQV
RWMLFKDGLV HFNVKVLPLP ECGNVDCVIE GEEGRKVTTR KAPLKAGTAA IASTAPYGRT
MTFTATPNTG YKFVHWKSNA DDQIVSTDEV FTTEARSTKD FAAVFQAESY KVDVNCNSAE
GDIDASTGYY DYNTKLVLDA RAKDNFRLVG YRINGTLKEM TPYTLTVEGP TEVEVVYRDM
TPVNVLLNEG ADYTPEDVEA AKVSLYRSFH KGTWNTICLP CAVDNPEAVF GQGTLVAQMT
GVSGTTLMFA PVNTMEANTP YLIKPTKINS PAYAHDENPT MLYDLGVTTT QKPERGVPAD
SKDGYSFIGA YSVYPIATDD GNYYISSDKF YYVDAAASVN TTRFRGYFHA DNGSASPISL
GIGSATGITP PVSITDSRNA VYDLNGVMVR QPGESLSGLK PGVYITRDKK IIIK
//