ID A0A109RF72_9LACT Unreviewed; 440 AA.
AC A0A109RF72;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=CBS domain-containing protein {ECO:0000313|EMBL:PKZ23404.1};
DE SubName: Full=Thioesterase {ECO:0000313|EMBL:AMB94600.1};
GN ORFNames=AWM72_07455 {ECO:0000313|EMBL:AMB94600.1}, CYJ28_02300
GN {ECO:0000313|EMBL:PKZ23404.1};
OS Aerococcus sanguinicola.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Aerococcaceae; Aerococcus.
OX NCBI_TaxID=119206 {ECO:0000313|EMBL:AMB94600.1, ECO:0000313|Proteomes:UP000069912};
RN [1] {ECO:0000313|EMBL:AMB94600.1, ECO:0000313|Proteomes:UP000069912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCUG43001 {ECO:0000313|EMBL:AMB94600.1,
RC ECO:0000313|Proteomes:UP000069912};
RX PubMed=27103727;
RA Carkaci D., Dargis R., Nielsen X.C., Skovgaard O., Fuursted K.,
RA Christensen J.J.;
RT "Complete Genome Sequences of Aerococcus christensenii CCUG 28831T,
RT Aerococcus sanguinicola CCUG 43001T, Aerococcus urinae CCUG 36881T,
RT Aerococcus urinaeequi CCUG 28094T, Aerococcus urinaehominis CCUG 42038 BT,
RT and Aerococcus viridans CCUG 4311T.";
RL Genome Announc. 4:0-0(2016).
RN [2] {ECO:0000313|Proteomes:UP000069912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCUG43001 {ECO:0000313|Proteomes:UP000069912};
RA Carkaci D., Dargis R., Nielsen X.C., Skovgaard O., Fuursted K.,
RA Christensen J.J.;
RT "Six Aerococcus type strain genome sequencing and assembly using PacBio and
RT Illumina Hiseq.";
RL Submitted (JAN-2016) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:PKZ23404.1, ECO:0000313|Proteomes:UP000234239}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UMB0139 {ECO:0000313|EMBL:PKZ23404.1,
RC ECO:0000313|Proteomes:UP000234239};
RA Thomas-White K., Wolfe A.J.;
RT "Phylogenetic diversity of female urinary microbiome.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP014160; AMB94600.1; -; Genomic_DNA.
DR EMBL; PKGY01000001; PKZ23404.1; -; Genomic_DNA.
DR RefSeq; WP_067975640.1; NZ_PKGY01000001.1.
DR AlphaFoldDB; A0A109RF72; -.
DR GeneID; 69592608; -.
DR KEGG; asan:AWM72_07455; -.
DR OrthoDB; 1790451at2; -.
DR Proteomes; UP000069912; Chromosome.
DR Proteomes; UP000234239; Unassembled WGS sequence.
DR CDD; cd04596; CBS_pair_DRTGG_assoc; 1.
DR Gene3D; 3.10.580.10; CBS-domain; 1.
DR Gene3D; 3.40.1390.20; HprK N-terminal domain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000644; CBS_dom.
DR InterPro; IPR046342; CBS_dom_sf.
DR InterPro; IPR010766; DRTGG.
DR InterPro; IPR029069; HotDog_dom_sf.
DR InterPro; IPR028979; Ser_kin/Pase_Hpr-like_N_sf.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR43080:SF31; CBS DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR43080; CBS DOMAIN-CONTAINING PROTEIN CBSX3, MITOCHONDRIAL; 1.
DR Pfam; PF00571; CBS; 2.
DR Pfam; PF07085; DRTGG; 1.
DR SMART; SM00116; CBS; 2.
DR SUPFAM; SSF54631; CBS-domain pair; 1.
DR SUPFAM; SSF75138; HprK N-terminal domain-like; 1.
DR SUPFAM; SSF54637; Thioesterase/thiol ester dehydrase-isomerase; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS51371; CBS; 2.
PE 4: Predicted;
KW CBS domain {ECO:0000256|PROSITE-ProRule:PRU00703};
KW Reference proteome {ECO:0000313|Proteomes:UP000069912}.
FT DOMAIN 179..254
FT /note="CBS"
FT /evidence="ECO:0000259|PROSITE:PS51371"
FT DOMAIN 256..313
FT /note="CBS"
FT /evidence="ECO:0000259|PROSITE:PS51371"
SQ SEQUENCE 440 AA; 50187 MW; F6DFB389B9BA8795 CRC64;
MTTKHEQILN YIRDLPIDTK ISVRRIARDL KVSDGTAYRA IKEAENQNLV RTVERVGTVR
IEPYDYDQTK RLTIREIVRL TDCTVHGGER GLDGEITKFI IGAMQEEAVM TYLRPEALMI
VGDREDIQRV ALEHGMAVLI TGGFQPSQAN IDLANQKQIP IMSVAFDTYS TATVINKAMI
ERAIQQDIVL VEDIFIPFEK TFYLFTDSHV VDYRKLNEKT RHSRFPVVNK NYELQGIVTA
KDLLGKEDRE VIESCMTADP LVAKLSMSVV SVTHMMVWDG LELLPVVDDN NHLLGIVSRQ
DVLRTLEYKQ HHVNRNSRLE SMLEEKLECL DDSLDSPRYR MLSDATMSNQ LGTLSTGLLL
GVIQLVVEKY FADVVQKTAI IESVHFLNLR LVQLNSTLEI HPRVLTLGRR HAYVEVHVYH
GQQLMAKATL TMQIPLDMKE
//