GenomeNet

Database: UniProt
Entry: U2UM03_9FIRM
LinkDB: U2UM03_9FIRM
Original site: U2UM03_9FIRM 
ID   U2UM03_9FIRM            Unreviewed;       675 AA.
AC   U2UM03;
DT   13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2013, sequence version 1.
DT   24-JAN-2024, entry version 34.
DE   RecName: Full=Peptidase S55 domain-containing protein {ECO:0000259|PROSITE:PS51494};
GN   ORFNames=HMPREF1985_01646 {ECO:0000313|EMBL:ERL04137.1};
OS   Mitsuokella sp. oral taxon 131 str. W9106.
OC   Bacteria; Bacillota; Negativicutes; Selenomonadales; Selenomonadaceae;
OC   Mitsuokella.
OX   NCBI_TaxID=1321781 {ECO:0000313|EMBL:ERL04137.1, ECO:0000313|Proteomes:UP000016614};
RN   [1] {ECO:0000313|EMBL:ERL04137.1, ECO:0000313|Proteomes:UP000016614}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=W9106 {ECO:0000313|EMBL:ERL04137.1,
RC   ECO:0000313|Proteomes:UP000016614};
RA   Weinstock G., Sodergren E., Wylie T., Fulton L., Fulton R., Fronick C.,
RA   O'Laughlin M., Godfrey J., Miner T., Herter B., Appelbaum E., Cordes M.,
RA   Lek S., Wollam A., Pepin K.H., Palsikar V.B., Mitreva M., Wilson R.K.;
RL   Submitted (AUG-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ERL04137.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AWVT01000014; ERL04137.1; -; Genomic_DNA.
DR   AlphaFoldDB; U2UM03; -.
DR   STRING; 1321781.HMPREF1985_01646; -.
DR   PATRIC; fig|1321781.3.peg.1555; -.
DR   eggNOG; COG3064; Bacteria.
DR   HOGENOM; CLU_023510_1_0_9; -.
DR   OrthoDB; 9765242at2; -.
DR   Proteomes; UP000016614; Unassembled WGS sequence.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR008763; Peptidase_S55.
DR   Pfam; PF05580; Peptidase_S55; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51494; SPOIVB; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000016614}.
FT   DOMAIN          1..119
FT                   /note="Peptidase S55"
FT                   /evidence="ECO:0000259|PROSITE:PS51494"
FT   REGION          127..237
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        127..161
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        207..221
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   675 AA;  71336 MW;  1B56317292234CD4 CRC64;
     MDPILPFGSV EEGMMGKAYT VVDSSGEIRN FDVDIVGPID DGKGGSRMIM ARATGPVIEQ
     TGGILQGMSG SPVYVDGNLV GAVAAGIKDM TPYTFFITPI EDMMTLWQMP DTKDKTRLPS
     LSLKHYQKER EDARKAAEKA AEKGTGETDK PQAEEPAKAE GAAKESAQTE APAKAGETAK
     GRESETETET GTGTGTGTGT AEAPAKAKEP VKAKEPAKAE GSATAEEPAK AGDKPQKGEA
     KSVLYLAGFG RAGMDFLQKR LPMAEIRCVP MGTPSAVLQS QAIYNANLQP GSPVGVAVAY
     GDFAVGATGT VTAVDGKRVL AFGHPFLHRG NVSYFMTDAN VVGTISGVSN GMKVASVGHI
     IGRINQDRET GVAGILGEFP SVVPVRVHVA DKTLGASESY GTRIAYDEDL LSKLVGSVAY
     AAMNKTSNTL GSATAKLHFA IRTNAVKSGM FERSNMYYST ADVGQIAVGE LIQAMDIISA
     NTEKESDIVD VKVDVEMDGD RKTASLVSAA PDKLSVKPGE TVNLTTTIKP YRKEKETIVI
     PYKVPETQKP GAMHLDLRGG AFSPAAPQLL LLSPTDAESL AEETSNRSTQ ERLAALSESS
     PNNVITVMPS AQRKDLTARQ KRAALRAAEA NAQAHAKKIS LLGSGKKKEK PGETKFETNY
     IIDNVIHATL QIEKK
//
DBGET integrated database retrieval system