ID U2UM03_9FIRM Unreviewed; 675 AA.
AC U2UM03;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2013, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE RecName: Full=Peptidase S55 domain-containing protein {ECO:0000259|PROSITE:PS51494};
GN ORFNames=HMPREF1985_01646 {ECO:0000313|EMBL:ERL04137.1};
OS Mitsuokella sp. oral taxon 131 str. W9106.
OC Bacteria; Bacillota; Negativicutes; Selenomonadales; Selenomonadaceae;
OC Mitsuokella.
OX NCBI_TaxID=1321781 {ECO:0000313|EMBL:ERL04137.1, ECO:0000313|Proteomes:UP000016614};
RN [1] {ECO:0000313|EMBL:ERL04137.1, ECO:0000313|Proteomes:UP000016614}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=W9106 {ECO:0000313|EMBL:ERL04137.1,
RC ECO:0000313|Proteomes:UP000016614};
RA Weinstock G., Sodergren E., Wylie T., Fulton L., Fulton R., Fronick C.,
RA O'Laughlin M., Godfrey J., Miner T., Herter B., Appelbaum E., Cordes M.,
RA Lek S., Wollam A., Pepin K.H., Palsikar V.B., Mitreva M., Wilson R.K.;
RL Submitted (AUG-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ERL04137.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWVT01000014; ERL04137.1; -; Genomic_DNA.
DR AlphaFoldDB; U2UM03; -.
DR STRING; 1321781.HMPREF1985_01646; -.
DR PATRIC; fig|1321781.3.peg.1555; -.
DR eggNOG; COG3064; Bacteria.
DR HOGENOM; CLU_023510_1_0_9; -.
DR OrthoDB; 9765242at2; -.
DR Proteomes; UP000016614; Unassembled WGS sequence.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR008763; Peptidase_S55.
DR Pfam; PF05580; Peptidase_S55; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51494; SPOIVB; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000016614}.
FT DOMAIN 1..119
FT /note="Peptidase S55"
FT /evidence="ECO:0000259|PROSITE:PS51494"
FT REGION 127..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 127..161
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 207..221
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 675 AA; 71336 MW; 1B56317292234CD4 CRC64;
MDPILPFGSV EEGMMGKAYT VVDSSGEIRN FDVDIVGPID DGKGGSRMIM ARATGPVIEQ
TGGILQGMSG SPVYVDGNLV GAVAAGIKDM TPYTFFITPI EDMMTLWQMP DTKDKTRLPS
LSLKHYQKER EDARKAAEKA AEKGTGETDK PQAEEPAKAE GAAKESAQTE APAKAGETAK
GRESETETET GTGTGTGTGT AEAPAKAKEP VKAKEPAKAE GSATAEEPAK AGDKPQKGEA
KSVLYLAGFG RAGMDFLQKR LPMAEIRCVP MGTPSAVLQS QAIYNANLQP GSPVGVAVAY
GDFAVGATGT VTAVDGKRVL AFGHPFLHRG NVSYFMTDAN VVGTISGVSN GMKVASVGHI
IGRINQDRET GVAGILGEFP SVVPVRVHVA DKTLGASESY GTRIAYDEDL LSKLVGSVAY
AAMNKTSNTL GSATAKLHFA IRTNAVKSGM FERSNMYYST ADVGQIAVGE LIQAMDIISA
NTEKESDIVD VKVDVEMDGD RKTASLVSAA PDKLSVKPGE TVNLTTTIKP YRKEKETIVI
PYKVPETQKP GAMHLDLRGG AFSPAAPQLL LLSPTDAESL AEETSNRSTQ ERLAALSESS
PNNVITVMPS AQRKDLTARQ KRAALRAAEA NAQAHAKKIS LLGSGKKKEK PGETKFETNY
IIDNVIHATL QIEKK
//