Database: UniProt
Entry: Q7PN20_ANOGA
ID   Q7PN20_ANOGA            Unreviewed;       319 AA.
AC   Q7PN20;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 4.
DT   27-MAR-2024, entry version 125.
DE   SubName: Full=AGAP007795-PA {ECO:0000313|EMBL:EAA12901.5};
DE   Flags: Fragment;
GN   ORFNames=AgaP_AGAP007795 {ECO:0000313|EMBL:EAA12901.5};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA12901.5};
RN   [1] {ECO:0000313|EMBL:EAA12901.5}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA12901.5};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA12901.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA12901.5};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA12901.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA12901.5};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA12901.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA12901.5};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA12901.5}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA12901.5};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAA12901.5}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008964; EAA12901.5; -; Genomic_DNA.
DR   RefSeq; XP_317712.4; XM_317712.4.
DR   AlphaFoldDB; Q7PN20; -.
DR   STRING; 7165.Q7PN20; -.
DR   PaxDb; 7165-AGAP007795-PA; -.
DR   GeneID; 1278167; -.
DR   KEGG; aga:AgaP_AGAP007795; -.
DR   VEuPathDB; VectorBase:AGAP007795; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_004497_6_1_1; -.
DR   InParanoid; Q7PN20; -.
DR   PhylomeDB; Q7PN20; -.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR   PANTHER; PTHR24252:SF7; HYALIN; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..319
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004289472"
FT   DOMAIN          41..293
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   NON_TER         319
FT                   /evidence="ECO:0000313|EMBL:EAA12901.5"
SQ   SEQUENCE   319 AA;  34682 MW;  80C0BC2C66A9E532 CRC64;
     MWKASALQLV LLALAIHWQI DSITTTNVLC GQRPIAAPGT ITYGRSSWPG QFPWHVALYR
     TEQPLTISYA CGGFIVGERV VITAAHCVTA PSGYQLAADE LTVRVGLYDL LTLARHSQEH
     RVGRIHRHGN FTTGSLRHDL ALLMLRTIVE FGDFVQPICL PREPDALKGV RTGTVSGWGL
     VEDDSPARTL RSATMPVVSY LSCLQSDSTL FGPVLYDGMF CAGWENGTNV CNGDSGGAFA
     ANVNGSWTAF GIVSFTGVRE HTDGQTPFRC DTKSLAGFIS IPMYLNWIES VAAVEAVQLD
     THREMPNSPL PKISDTSKI
//