ID R6FSD8_9CLOT Unreviewed; 648 AA.
AC R6FSD8;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE SubName: Full=Cell wall anchor domain protein {ECO:0000313|EMBL:CDB14970.1};
GN ORFNames=BN542_00268 {ECO:0000313|EMBL:CDB14970.1};
OS Clostridium sp. CAG:221.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262780 {ECO:0000313|EMBL:CDB14970.1, ECO:0000313|Proteomes:UP000018176};
RN [1] {ECO:0000313|EMBL:CDB14970.1, ECO:0000313|Proteomes:UP000018176}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:221 {ECO:0000313|Proteomes:UP000018176};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDB14970.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBDC010000037; CDB14970.1; -; Genomic_DNA.
DR AlphaFoldDB; R6FSD8; -.
DR Proteomes; UP000018176; Unassembled WGS sequence.
DR CDD; cd06920; NEAT; 3.
DR Gene3D; 2.60.40.1850; -; 4.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR006635; NEAT_dom.
DR InterPro; IPR037250; NEAT_dom_sf.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF05031; NEAT; 4.
DR SMART; SM00725; NEAT; 4.
DR SUPFAM; SSF158911; NEAT domain-like; 4.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
DR PROSITE; PS50978; NEAT; 4.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..648
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004406669"
FT DOMAIN 43..170
FT /note="NEAT"
FT /evidence="ECO:0000259|PROSITE:PS50978"
FT DOMAIN 172..297
FT /note="NEAT"
FT /evidence="ECO:0000259|PROSITE:PS50978"
FT DOMAIN 331..456
FT /note="NEAT"
FT /evidence="ECO:0000259|PROSITE:PS50978"
FT DOMAIN 492..617
FT /note="NEAT"
FT /evidence="ECO:0000259|PROSITE:PS50978"
FT DOMAIN 617..648
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 290..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..487
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 290..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..333
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 460..481
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 648 AA; 71060 MW; 2CE164607353EFCD CRC64;
MLRKKICRIL SAIMIATSVF GAITPVAKAD VINASVLINK EQLQSGIYTV KNTTKYVEEG
NATGEGMARK VLKEDSKIVV QNDKVLLTMY FDEGMFKMLS DIKVSMDDTE LIAEVNNDEK
SVTMEVPSVD SKVKVDVTVS MMGRQVSFYV ENNMDTLTLE EAFEEETGAV VLGDGKYKAS
NITRYSTDSE HGNSSARRAI ESETLISVEN GRTYLTITFS SMYSMLENIS VSLDGKKLDI
VEDSNARTVT FEVPSVDTEV LLGMNIKGMN HPVDFYVIND MDTLAKISEE TDNPAEDDKN
DTAPEVPETP EVKPETTPDS GNNGEGSETI PGTKVYTIEN QVYHENATGM TMARQYLNST
SKVEEVNGKY YVTMTFTGVE YMNNHQIYVN GSKVDAEIVE STSSKIALRF AVESLSDSIK
VGTYVIPMGR TIDFDVKMLE DTLTLVDGNV EEDKEEDSTN NSTGNNSSST NNSTSNSADN
KVEEETNVKE TVTVKVYNVQ NNVTHHNATG VAMARKYLNS NTEVKEINGK YYVTLTFTGT
EFMQNHEIYV NGSKVSHSIV SSTGDSVSIK FTIASLNDAI SVKTYVVPMA RDVEFGVELL
LDTMTLVNEY TIEADSLPET GATTSSALAL SMGLMAIGSG ALLRKKVK
//