ID S9WQD4_CAMFR Unreviewed; 493 AA.
AC S9WQD4;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN ORFNames=CB1_000804057 {ECO:0000313|EMBL:EPY80732.1};
OS Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX NCBI_TaxID=419612 {ECO:0000313|EMBL:EPY80732.1, ECO:0000313|Proteomes:UP000030684};
RN [1] {ECO:0000313|EMBL:EPY80732.1, ECO:0000313|Proteomes:UP000030684}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX PubMed=23149746;
RG Bactrian Camels Genome Sequencing and Analysis Consortium;
RA Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA Bayaer T., Li Y., Meng H.;
RT "Genome sequences of wild and domestic bactrian camels.";
RL Nat. Commun. 3:1202-1202(2012).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC Evidence={ECO:0000256|ARBA:ARBA00036320};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB017062; EPY80732.1; -; Genomic_DNA.
DR AlphaFoldDB; S9WQD4; -.
DR Proteomes; UP000030684; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013106; Ig_V-set.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF57; TRYPSIN-1; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 2.
DR Pfam; PF07686; V-set; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00406; IGv; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000030684};
KW Serine protease {ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 66..159
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 222..490
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 493 AA; 52833 MW; A2084B5238516676 CRC64;
MVGRKYTVVR KDKPESGFLV PFLKEEERLP PKDADSSAAS PGARDKLVSA LVGFFVSLEL
GSAFGALVSQ KPSRAICQHG VSMMIQCEAD SQVISMYWYR QRPGQSLSLI ATANQGSQAT
YESGFAKEKF PISRPSLNFS VLTVSSVSPE DSSSYFCSAD TVLGPDHRSE QEPWLPPAGS
HPAGPDIPGR LVEAGNRSSQ ADAAACVLVG GFAVPTDDDD KIVGGYTCAE NSVPYQVSLN
SGYHFCGGSL ISDQWVVSAA HCYKSRIEVR LGENNIDVVE GTEQFISAAK VIRHPSYNSW
TLDNDILLIK LSSPAVLSTR VSTLALPTAC AAAGTQCLIS GWGNTLSSGV CKFTSSAYYT
QAPALGTRDM QGSALEYHLQ SWPDPRFPFL DLFLILKVNY PELLQCLDAP LLSQAECEAS
YPGQITDNMV CAGFLEGGKD SCQGDSGGPV ACNGELQGIV SWGYGCAQKN KPGVYTKVCN
YVDWIEETIV ANS
//