Database: UniProt
Entry: Q7PZH5_ANOGA
ID   Q7PZH5_ANOGA            Unreviewed;       408 AA.
AC   Q7PZH5;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 4.
DT   27-MAR-2024, entry version 126.
DE   SubName: Full=AGAP011912-PA {ECO:0000313|EMBL:EAA00100.4};
DE   Flags: Fragment;
GN   ORFNames=AgaP_AGAP011912 {ECO:0000313|EMBL:EAA00100.4};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA00100.4};
RN   [1] {ECO:0000313|EMBL:EAA00100.4}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA00100.4};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA00100.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA00100.4};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA00100.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA00100.4};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA00100.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA00100.4};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA00100.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA00100.4};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAA00100.4}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008986; EAA00100.4; -; Genomic_DNA.
DR   RefSeq; XP_320615.4; XM_320615.4.
DR   AlphaFoldDB; Q7PZH5; -.
DR   STRING; 7165.Q7PZH5; -.
DR   MEROPS; S01.492; -.
DR   PaxDb; 7165-AGAP011912-PA; -.
DR   GeneID; 1280750; -.
DR   KEGG; aga:AgaP_AGAP011912; -.
DR   VEuPathDB; VectorBase:AGAP011912; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   InParanoid; Q7PZH5; -.
DR   PhylomeDB; Q7PZH5; -.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR   CDD; cd00041; CUB; 1.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR000859; CUB_dom.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR035914; Sperma_CUB_dom_sf.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   PANTHER; PTHR24252:SF8; ACROSIN; 1.
DR   PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR   Pfam; PF00431; CUB; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00042; CUB; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS01180; CUB; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..37
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           38..408
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004289857"
FT   DOMAIN          42..156
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          170..401
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:EAA00100.4"
SQ   SEQUENCE   408 AA;  43697 MW;  CB23E0CB0B76B769 CRC64;
     QRWGRHSTRY HLAMCGSNVL IAWLVALVAL AVSPAVGQFT GCDRQKTLAA GEVFYVESPS
     FPNYYARGTN CRWQLAAPAG NTIYVNCYDM YLAASTGCTA DRVEISLQND PTLAYATKYC
     GQRTFTLQST GNRAVFALRT TTTTSGGRFR CQVVAQAPKC SCGLRRTSKI VNGVPTLVNE
     FPMMAGLVDS SSRSVFCGAT IISDYHSITA AHCMRGRSLS ASGLLVGDHN LSVGTDTSYS
     VLMRLASITN HPQYVVSPSR NDIALVRTAD RIAFNAAVGP ACLPFRYSTS NFAGSIVEAT
     GWGTMDFGAP TSNVLRKVSL NVISEQSCQS SMPNILASHI CTYTPGKDTC QYDSGGPLLF
     TTGGRVYLVG VVNYGVSCAS SKPSVSSRIT SYLSWIQSVT PGVTYCTP
//
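
The coordinates in the feature table and the length declared on the SQ line above can be cross-checked mechanically. The following is a minimal sketch, assuming only the line codes visible in this entry (FT, SQ, //). The helper names parse_features and parse_sequence are hypothetical, not part of any UniProt or GenomeNet tool, and the regular expressions only cover simple "start..end" ranges like the ones shown here.

```python
import re

# Matches ranged feature lines such as "FT   DOMAIN          42..156".
# Qualifier lines ("FT                   /note=...") do not match, because
# exactly three spaces must be followed by the feature key.
FT_RANGE = re.compile(r"^FT\s{3}(\S+)\s+(\d+)\.\.(\d+)\s*$")


def parse_features(record: str) -> list[tuple[str, int, int]]:
    """Return (key, start, end) for every ranged FT line in a record."""
    features = []
    for line in record.splitlines():
        match = FT_RANGE.match(line)
        if match:
            key, start, end = match.groups()
            features.append((key, int(start), int(end)))
    return features


def parse_sequence(record: str) -> tuple[int, str]:
    """Return (declared length, residues) from the SQ block of a record."""
    declared = None
    residues = []
    in_sequence = False
    for line in record.splitlines():
        if line.startswith("SQ   "):
            # The SQ header carries the declared length, e.g. "408 AA;".
            declared = int(re.search(r"(\d+)\s+AA;", line).group(1))
            in_sequence = True
        elif line.startswith("//"):
            in_sequence = False
        elif in_sequence:
            # Sequence lines are blocks of ten residues separated by spaces.
            residues.append(line.replace(" ", ""))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(residues)


# Ranged and single-position feature lines copied from this entry.
FEATURE_LINES = """\
FT   SIGNAL          1..37
FT   CHAIN           38..408
FT   DOMAIN          42..156
FT   DOMAIN          170..401
FT   NON_TER         1
"""

for key, start, end in parse_features(FEATURE_LINES):
    # Coordinates are 1-based and inclusive, so length = end - start + 1.
    print(f"{key:8s} {start:4d}..{end:<4d} length={end - start + 1}")
```

Passing the full text of the entry to parse_sequence returns the declared length together with the concatenated residues, so the two can be compared with len(). Single-position features such as NON_TER are skipped by parse_features because they carry no range.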