ID B0WIS3_CULQU Unreviewed; 264 AA.
AC B0WIS3;
DT 08-APR-2008, integrated into UniProtKB/TrEMBL.
DT 08-APR-2008, sequence version 1.
DT 27-MAR-2024, entry version 93.
DE RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN Name=6038908 {ECO:0000313|EnsemblMetazoa:CPIJ007078-PA};
GN ORFNames=CpipJ_CPIJ007078 {ECO:0000313|EMBL:EDS28650.1};
OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Culicinae; Culicini; Culex; Culex.
OX NCBI_TaxID=7176;
RN [1] {ECO:0000313|EMBL:EDS28650.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JHB {ECO:0000313|EMBL:EDS28650.1};
RG The Broad Institute Genome Sequencing Platform;
RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.,
RA Hannick L., Megy K., O'Leary S., Pearson M., Haas B.J., Mauceli E.,
RA Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., Amedeo P.,
RA Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., Camaro F.,
RA Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., Lawson D.,
RA Montgomery P., Nene V., Nusbaum C., Puiu D., Romero-Severson J.,
RA Severson D.W., Shumway M., Sisk P., Stolte C., Zeng Q., Eisenstadt E.,
RA Fraser-Liggett C., Strausberg R., Galagan J., Birren B., Collins F.H.;
RT "Annotation of Culex pipiens quinquefasciatus.";
RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:CPIJ007078-PA}
RP IDENTIFICATION.
RC STRAIN=JHB {ECO:0000313|EnsemblMetazoa:CPIJ007078-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC Evidence={ECO:0000256|ARBA:ARBA00036320};
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS231951; EDS28650.1; -; Genomic_DNA.
DR RefSeq; XP_001848607.1; XM_001848555.1.
DR AlphaFoldDB; B0WIS3; -.
DR STRING; 7176.B0WIS3; -.
DR MEROPS; S01.130; -.
DR EnsemblMetazoa; CPIJ007078-RA; CPIJ007078-PA; CPIJ007078.
DR GeneID; 6038908; -.
DR KEGG; cqu:CpipJ_CPIJ007078; -.
DR VEuPathDB; VectorBase:CPIJ007078; -.
DR VEuPathDB; VectorBase:CQUJHB016745; -.
DR eggNOG; KOG3627; Eukaryota.
DR HOGENOM; CLU_006842_7_0_1; -.
DR InParanoid; B0WIS3; -.
DR OMA; FSKSHIC; -.
DR OrthoDB; 2910936at2759; -.
DR Proteomes; UP000002320; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0007586; P:digestion; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000002320};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..264
FT /note="trypsin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011408265"
FT DOMAIN 38..263
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 264 AA; 27891 MW; 403F35213F90C50C CRC64;
MFNLLLLFSI ASGLSNGETP LQKPPTRPLH GSATANRIVG GVPISIADAP YQVSLQFSKS
HICGGSIISR RWILTAAHCI LTSSASSFHV RARSSKHASG GSLIRVRRVV VHPLYRDSGV
DYDYALVELR RGILLGKYAK AVALPEQGES VADGTICTVS GWGNTQSASE SGEILRAAKV
PIFNQMQCND AYAIFGGVTD RMICAGYPEG GKDSCQGDSG GPLVANGKLV GVISWGVECA
KPEMPGVYAR VAAVRDWIRS SSGV
//