ID C5NV12_9BACL Unreviewed; 1087 AA.
AC C5NV12;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 01-SEP-2009, sequence version 1.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=LPXTG-motif cell wall anchor domain protein {ECO:0000313|EMBL:EER69038.1};
GN ORFNames=GEMHA0001_1389 {ECO:0000313|EMBL:EER69038.1};
OS Gemella haemolysans ATCC 10379.
OC Bacteria; Bacillota; Bacilli; Bacillales; Gemellaceae; Gemella.
OX NCBI_TaxID=546270 {ECO:0000313|EMBL:EER69038.1, ECO:0000313|Proteomes:UP000006004};
RN [1] {ECO:0000313|EMBL:EER69038.1, ECO:0000313|Proteomes:UP000006004}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 10379 {ECO:0000313|EMBL:EER69038.1,
RC ECO:0000313|Proteomes:UP000006004};
RA Fulton L., Clifton S., Chinwalla A.T., Mitreva M., Sodergren E.,
RA Weinstock G., Clifton S., Dooling D.J., Fulton B., Minx P., Pepin K.H.,
RA Johnson M., Bhonagiri V., Nash W.E., Mardis E.R., Wilson R.K.;
RL Submitted (JAN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EER69038.1, ECO:0000313|Proteomes:UP000006004}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 10379 {ECO:0000313|EMBL:EER69038.1,
RC ECO:0000313|Proteomes:UP000006004};
RA Sebastian Y., Madupu R., Durkin A.S., Torralba M., Methe B., Sutton G.G.,
RA Strausberg R.L., Nelson K.E.;
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EER69038.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACDZ02000005; EER69038.1; -; Genomic_DNA.
DR RefSeq; WP_004263326.1; NZ_ACDZ02000005.1.
DR AlphaFoldDB; C5NV12; -.
DR eggNOG; COG0154; Bacteria.
DR OrthoDB; 9811471at2; -.
DR Proteomes; UP000006004; Unassembled WGS sequence.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR Gene3D; 3.90.1300.10; Amidase signature (AS) domain; 1.
DR InterPro; IPR000120; Amidase.
DR InterPro; IPR020556; Amidase_CS.
DR InterPro; IPR023631; Amidase_dom.
DR InterPro; IPR036928; AS_sf.
DR InterPro; IPR019931; LPXTG_anchor.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR PANTHER; PTHR11895:SF180; AMIDASE AMIC-RELATED; 1.
DR PANTHER; PTHR11895; TRANSAMIDASE; 1.
DR Pfam; PF01425; Amidase; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR SUPFAM; SSF75304; Amidase signature (AS) enzymes; 1.
DR SUPFAM; SSF101898; NHL repeat; 1.
DR PROSITE; PS00571; AMIDASES; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1087
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039286096"
FT DOMAIN 1056..1087
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 30..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1087 AA; 117952 MW; 7B63ECC8D7660252 CRC64;
MKKNKIKYVL IPAFAAATLF PVIANDNQAK ASDDKITTSS NVTTTNEKPT SIDTTANNVV
PTPTNTTTAN DVTASTTNSE PEINKNNETT NRTNVENKES TKPTIPFTVA EYKQKSALEL
AKLIREKKVT STELVDLAYK VIAEENPKLN AVLTTENGKI PNALVNEAYK TAKEIDDRIK
AGNLAANPIN WEEQPFLGVP TLIKGLDLVK DGDSSNGVYF NKGKVSKFSG AVAKEFAKLG
FVILGQTNYP ELGTRNITDS KLFGPAGNPW DPSRNTGGSS GGSAGAVASG MVSIASGSDA
GGSIRIPASW TGLIGLKPTG HVVKFPLVKT IEDAKTYFDK TQISKPKTLE EVPTDLKKLK
IAYSLKTPLK DVELSEDGKK AVLKAVDFLR KQGFTVEEVT EFPIDGYEGI RTYTIGAIGG
GYTAPAKAAT EDNKRDLDPA TYALGTSSYR GKTANTDVTS AKPAAEYINQ MNEFYKKYDL
FLMATNAVTA PSNDKKTDPY VDPTVEEKLY NINKIKDPKE RFDLLVKQWE PMMRRTPFTW
LFNLTGNPSI SLPVYKSKNN LPLGVMFAGR NNSEKILLEM GQLFQNNNQF ILHPDVRNTV
TPENGYKVRE NEYGTKYEYS IPSDALVAEA LKPFTGKVDG LSENGNKVVV DKDGNVSEYS
LPSDALVAEA LKPFTGKVDG LSENGNKVVV DKDGNVSEYS LPTDSLVAKA LKPFTGKVDG
LSENGNKVVV DEDENVSEFN TPTVAPISEA LKPFTGKIDG LSENGNKVVV DKDGNVSEFN
TPTDSTISEA LKPFTDKVDG LSENGNKVIV DKDGNVSEFN TPTVAPTSET PKAFTGKVDG
LSENGNKVVV DKDGNISEAN APSGAPIVSE LSELNINNNK ITVTIKSKNK DISVDINTAD
AKDITLEAKD ITKDVNLEDL KDKIIADANN NFKDKKDITS IRVVDLELQN DNKTVKLDIP
RTVKVALLQN EQNKDILVYH IKEDNSIELI PSSVTNNSLQ FTVDHFSKFA IIAKIKTAFA
GSEKDINSFA VQEEKKFTII DTRGANSTPT NAPKTLPKTG ETTNKSLVVI GLSLLALIGL
LKRRKNN
//