GenomeNet

Database: UniProt
Entry: Q5TQD6_ANOGA
LinkDB: Q5TQD6_ANOGA
Original site: Q5TQD6_ANOGA 
ID   Q5TQD6_ANOGA            Unreviewed;       272 AA.
AC   Q5TQD6;
DT   07-DEC-2004, integrated into UniProtKB/TrEMBL.
DT   07-DEC-2004, sequence version 1.
DT   27-MAR-2024, entry version 121.
DE   RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE            EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN   Name=3291691 {ECO:0000313|EnsemblMetazoa:AGAP008276-PA};
GN   ORFNames=AgaP_AGAP008276 {ECO:0000313|EMBL:EAL39603.1};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAL39603.1};
RN   [1] {ECO:0000313|EMBL:EAL39603.1, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL39603.1,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAL39603.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL39603.1};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAL39603.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL39603.1};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAL39603.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL39603.1};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAL39603.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL39603.1};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|EnsemblMetazoa:AGAP008276-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP008276-PA};
RG   EnsemblMetazoa;
RL   Submitted (JAN-2021) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC         Evidence={ECO:0000256|ARBA:ARBA00036320};
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008964; EAL39603.1; -; Genomic_DNA.
DR   RefSeq; XP_555189.1; XM_555189.1.
DR   AlphaFoldDB; Q5TQD6; -.
DR   STRING; 7165.Q5TQD6; -.
DR   MEROPS; S01.117; -.
DR   PaxDb; 7165-AGAP008276-PA; -.
DR   EnsemblMetazoa; AGAP008276-RA; AGAP008276-PA; AGAP008276.
DR   GeneID; 3291691; -.
DR   KEGG; aga:AgaP_AGAP008276; -.
DR   VEuPathDB; VectorBase:AGAP008276; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_006842_7_0_1; -.
DR   InParanoid; Q5TQD6; -.
DR   OrthoDB; 3436169at2759; -.
DR   Proteomes; UP000007062; Chromosome 3R.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0007586; P:digestion; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF54; HEPATOCYTE GROWTH FACTOR ACTIVATOR-LIKE; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..272
FT                   /note="trypsin"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014586924"
FT   DOMAIN          39..259
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   272 AA;  29730 MW;  EFC60182059EBF91 CRC64;
     MCVLRQVGLL AVVLAAISLP ISSAQQQQEE RDDSATNMIV GGMKVDIEQV PYQAAILTLG
     QVHCGGSIIG PRWVLTAYHC VDWLLPNFYE VAVGSTNPYE GQRILVQELF VPLETLSDPN
     FDIALAKLAH TLQYSSTVQC IPLLTSDSSL IPDTPAYISG FGYTKERASD NILKAAQIKV
     LPWDYCQQAY PYLMREFMLC AGFKEGKVDS CQGDSGGPLI VNAKLAGVVF YGEGCARPHF
     PGVYISVPWF SDWIIEVVDQ QSTALEGELC AV
//
DBGET integrated database retrieval system