ID A0A1H3X542_9FIRM Unreviewed; 277 AA.
AC A0A1H3X542;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE RecName: Full=SIMPL domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=SAMN02745687_01179 {ECO:0000313|EMBL:SDZ94506.1};
OS Lachnospiraceae bacterium NK3A20.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=877406 {ECO:0000313|EMBL:SDZ94506.1, ECO:0000313|Proteomes:UP000199449};
RN [1] {ECO:0000313|EMBL:SDZ94506.1, ECO:0000313|Proteomes:UP000199449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NK3A20 {ECO:0000313|EMBL:SDZ94506.1,
RC ECO:0000313|Proteomes:UP000199449};
RA de Groot N.N.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FNQX01000014; SDZ94506.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1H3X542; -.
DR OrthoDB; 9785192at2; -.
DR Proteomes; UP000199449; Unassembled WGS sequence.
DR Gene3D; 3.30.110.170; Protein of unknown function (DUF541), domain 1; 1.
DR Gene3D; 3.30.70.2970; Protein of unknown function (DUF541), domain 2; 1.
DR InterPro; IPR007497; SIMPL/DUF541.
DR PANTHER; PTHR34387; SLR1258 PROTEIN; 1.
DR PANTHER; PTHR34387:SF2; SLR1258 PROTEIN; 1.
DR Pfam; PF04402; SIMPL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000199449};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..277
FT /note="SIMPL domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011644888"
FT REGION 46..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 277 AA; 29201 MW; CE8EA26F740FF5B9 CRC64;
MKNQRLLASI VLAGMLVTIN GCASTVQSTG ADLSANSAAA TATVSAEATA EEPTAAAQMT
EEEASEKVGD NDTITVTAVS TVQETPDIAK ISFGVTTRAD TAAAAQKRNT GDVDKVIDKL
KDLGIDEKSI QTSGYSMYPD YDYNNNNAII GYNVETSLTI SDQKISDVGS IITECVNAGI
NNINDIEYTC STYDEKYLTA LQQAVAAADT KAAALAEACH RQLGLVTSMT EGYQDTSARY
KNANTSLDMA AAEESAVNVQ PGQMEINAQV TVTYQLD
//