ID R9MSI0_9FIRM Unreviewed; 746 AA.
AC R9MSI0;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EOS73723.1};
GN ORFNames=C819_03523 {ECO:0000313|EMBL:EOS73723.1};
OS Lachnospiraceae bacterium 10-1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=1235800 {ECO:0000313|EMBL:EOS73723.1, ECO:0000313|Proteomes:UP000014134};
RN [1] {ECO:0000313|EMBL:EOS73723.1, ECO:0000313|Proteomes:UP000014134}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=10-1 {ECO:0000313|EMBL:EOS73723.1,
RC ECO:0000313|Proteomes:UP000014134};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 10-01.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS73723.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTF01000035; EOS73723.1; -; Genomic_DNA.
DR AlphaFoldDB; R9MSI0; -.
DR STRING; 1235800.C819_03523; -.
DR eggNOG; COG1609; Bacteria.
DR eggNOG; COG1653; Bacteria.
DR HOGENOM; CLU_023027_0_0_9; -.
DR OrthoDB; 9770625at2; -.
DR Proteomes; UP000014134; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd01392; HTH_LacI; 1.
DR CDD; cd06267; PBP1_LacI_sugar_binding-like; 1.
DR Gene3D; 3.40.50.2300; -; 2.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR Gene3D; 3.40.190.10; Periplasmic binding protein-like II; 2.
DR InterPro; IPR001387; Cro/C1-type_HTH.
DR InterPro; IPR000843; HTH_LacI.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR InterPro; IPR028082; Peripla_BP_I.
DR InterPro; IPR006059; SBP.
DR PANTHER; PTHR30146:SF152; HTH-TYPE TRANSCRIPTIONAL REGULATOR EBGR-RELATED; 1.
DR PANTHER; PTHR30146; LACI-RELATED TRANSCRIPTIONAL REPRESSOR; 1.
DR Pfam; PF00356; LacI; 1.
DR Pfam; PF13416; SBP_bac_8; 1.
DR PRINTS; PR00036; HTHLACI.
DR SMART; SM00354; HTH_LACI; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR SUPFAM; SSF53822; Periplasmic binding protein-like I; 1.
DR SUPFAM; SSF53850; Periplasmic binding protein-like II; 1.
DR PROSITE; PS50943; HTH_CROC1; 1.
DR PROSITE; PS00356; HTH_LACI_1; 1.
DR PROSITE; PS50932; HTH_LACI_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000014134};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 2..56
FT /note="HTH lacI-type"
FT /evidence="ECO:0000259|PROSITE:PS50932"
FT DOMAIN 3..46
FT /note="HTH cro/C1-type"
FT /evidence="ECO:0000259|PROSITE:PS50943"
SQ SEQUENCE 746 AA; 85565 MW; C705ED01555B5199 CRC64;
MPTIKDVAKK SGVSIATVSN ILNNKASVSE EIYQRVYSAM EELDYKPNML ARNLKSNGVK
FVGIIVPAFL GIYQDIIEGI QRELSEYDFY LITRTTNDMI NKEKKLIDEF IGLGVCGIFV
VTSFRDMEYY KKAADAGISL VFIERTVDEL NYSSVVFDNR SMVRNLLRSL LTKTCKVEEL
WLITGDLSYS SERDFLEGAY LAVEEAGMQA KELKRSQVSF SELQAFADMM NLIGRADEIP
PYVILSSERI LESFMEMLSI YDRKDTHIFV PVGERWSSAG RSESICEIPR RAVYCGKRCA
QLMLEFVKKP ATNENTQIVI PTQEVKEEQL ISYIPALHRS LKILLTSNAM TTALLRLLPA
FMRESGIQVE TKIFTYQQEL YEEIMKQYES GSSEYDVYMM DSPWIEYFRQ IQCVLTLNPY
LEKEKEYVKS FVPEIWNKLN GGNGRIIGIP LVSMTQLLLY RRDLFKSPAL QRAYYRKNGL
ELGLPSTWTE YNFLAKFFTR EFTKDSPTEY GTCLLGKKPN GLIQEFFPRQ WSFHGAVMNK
NQVVIDSLQN VRAVRNLMEA YECSLPQIWD LMENEQIEAF AKGNIAFIST YSGHIKSILD
SQFGSIGDRL GYAVLPGKYS LVGSWLLAVN GNCQNPDTAF SFIKWITSGT RAMQCSVLGG
FMPKQTVNLS EQIKTIYPWN EHLEDYVKME RQRENIRNAR GTYINNYSIE GVIADGLQDV
LCKKCGIEEM LEDCKEQIKA MVREWK
//