GenomeNet

Database: UniProt
Entry: A0A318XM20_9FIRM
LinkDB: A0A318XM20_9FIRM
Original site: A0A318XM20_9FIRM 
ID   A0A318XM20_9FIRM        Unreviewed;      1047 AA.
AC   A0A318XM20;
DT   10-OCT-2018, integrated into UniProtKB/TrEMBL.
DT   10-OCT-2018, sequence version 1.
DT   24-JAN-2024, entry version 14.
DE   SubName: Full=Trypsin {ECO:0000313|EMBL:PYG87752.1};
GN   ORFNames=LY28_01772 {ECO:0000313|EMBL:PYG87752.1};
OS   Ruminiclostridium sufflavum DSM 19573.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC   Ruminiclostridium.
OX   NCBI_TaxID=1121337 {ECO:0000313|EMBL:PYG87752.1, ECO:0000313|Proteomes:UP000248132};
RN   [1] {ECO:0000313|EMBL:PYG87752.1, ECO:0000313|Proteomes:UP000248132}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DSM 19573 {ECO:0000313|EMBL:PYG87752.1,
RC   ECO:0000313|Proteomes:UP000248132};
RA   Kyrpides N.;
RT   "Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial
RT   genomes (KMG-I) project.";
RL   Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PYG87752.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; QKMR01000009; PYG87752.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A318XM20; -.
DR   OrthoDB; 2087794at2; -.
DR   Proteomes; UP000248132; Unassembled WGS sequence.
DR   GO; GO:0003729; F:mRNA binding; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR003107; HAT.
DR   InterPro; IPR044624; Mbb1-like.
DR   InterPro; IPR027417; P-loop_NTPase.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR001254; Trypsin_dom.
DR   PANTHER; PTHR44917; PROTEIN HIGH CHLOROPHYLL FLUORESCENT 107; 1.
DR   PANTHER; PTHR44917:SF1; PROTEIN HIGH CHLOROPHYLL FLUORESCENT 107; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   SMART; SM00386; HAT; 7.
DR   SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR   SUPFAM; SSF48452; TPR-like; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022670};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000248132}.
FT   DOMAIN          43..207
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|Pfam:PF00089"
SQ   SEQUENCE   1047 AA;  120031 MW;  DE44E609ED0D6274 CRC64;
     MDIEEKLRLL AVRIEAPAGE DQRNSILGSG ILWAPQEQGE YMYVFTAAHV VYGHGRLIIR
     YWDEAGDTKT LNIDECDIQP HEQNNCHEHK NNRNTALKND VAVLRCKRKQ IKYIDYKLKA
     ARNIKQDEKL ILRGFPEKVS NDEFSLTLGR EYKARFVMEE KGKSCFLYKV EEVLKCEERN
     EELIGLSGSG IFLNNGQPVL AGIHSYAAGD VYLNQVTGMN IELIRDICQA KRWDMPQLVN
     REDTHTKPSV YKWEEINDDF FRNHGRYDDE KLQGFLKGES CTWGLIANNC TVKREVTERV
     VSIIGGKRLI GILGAGGEGK STILMQICKE LNNKGYTVYW NTEQITRTFN KLELLPTVDS
     VVLAIDDASG DAEFEKFALQ AVGKGYRIIF AARENEWNVE RVKAETAKLD RELEIIELSD
     VTEKEADSFS QLIAGKMNTG KGKSEIRKIF TENNNGFLLA AMLMAVYGRP LEEIVRDVLL
     KIRKQSENIL KVLAIICYVE KLEARINKNL GFTSELYRVL YSSYGIKKKE ISALLHKEVQ
     KTHLNIMRTR HPVISDIISG FLIRGDSSEF ELDDLLYDFI RCPLKGEKTV PAELIKNMYP
     MMIEILNDIY TDDTISPQQL AENIAEIYKR GDIWRLWALK ETNAGNVGQS AEGKYSARWI
     LKEGSRKCPF DGNIYIKWAE LESAAGNAGR SMEEENSARW ILKEGCEKCP SDGNVYIKWA
     ELEINEGNIG RDIREKNSAR WILKEGSEEC PSDGNIYIKW AELEINSGNI GRNINEEYSA
     RWILKEGSRK CPSNGNVHIK WAELEINEGN IGRTANEKNT ARWLLEEGIK REPDNCNTYI
     KWAELEIGEG NTGRDINERN SARWIYNEGS KRCPSQGNIY IKWAELEIKE NNLGKDTSEE
     NSARWIMNEG LKNCPYDGNI YMKKAELEIN FGNDEKVIEL LAESLKIDCL HNLSSLALIQ
     AKNKNFSPDD PYSAKCCIDK MLLQVRNANA FYTAYLCYKL YASEEAAIEY KKKLTQRDIE
     TLSQENFYKF RRWEQGWIER SRSQTIG
//
DBGET integrated database retrieval system