ID W4M1P4_9BACT Unreviewed; 103 AA.
AC W4M1P4;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ETX03851.1};
GN ORFNames=ETSY2_32260 {ECO:0000313|EMBL:ETX03851.1};
OS Candidatus Entotheonella gemina.
OC Bacteria; Nitrospinae/Tectomicrobia group; Candidatus Tectomicrobia;
OC Entotheonella.
OX NCBI_TaxID=1429439 {ECO:0000313|EMBL:ETX03851.1, ECO:0000313|Proteomes:UP000019140};
RN [1] {ECO:0000313|EMBL:ETX03851.1, ECO:0000313|Proteomes:UP000019140}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TSY2 {ECO:0000313|Proteomes:UP000019140};
RX PubMed=24476823; DOI=10.1038/nature12959;
RA Wilson M.C., Mori T., Ruckert C., Uria A.R., Helf M.J., Takada K.,
RA Gernert C., Steffens U.A., Heycke N., Schmitt S., Rinke C., Helfrich E.J.,
RA Brachmann A.O., Gurgui C., Wakimoto T., Kracht M., Crusemann M.,
RA Hentschel U., Abe I., Matsunaga S., Kalinowski J., Takeyama H., Piel J.;
RT "An environmental bacterial taxon with a large and distinct metabolic
RT repertoire.";
RL Nature 506:58-62(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETX03851.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZHX01001375; ETX03851.1; -; Genomic_DNA.
DR AlphaFoldDB; W4M1P4; -.
DR PATRIC; fig|1429439.4.peg.5463; -.
DR HOGENOM; CLU_2258590_0_0_7; -.
DR Proteomes; UP000019140; Unassembled WGS sequence.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019140}.
SQ SEQUENCE 103 AA; 10806 MW; 6E205774496B64B7 CRC64;
MADFDQEVTK LGCTMGLTHG RITAIELEGL VVDFEGLGVV RFDGQLEIRG DHGPFSSGGD
SGSLIVNPAR QAVGLLFAGT DAGVTYANPI DAVLNTLNME LIQ
//