Database: UniProt
Entry: Q7PQR9_ANOGA
ID   Q7PQR9_ANOGA            Unreviewed;       355 AA.
AC   Q7PQR9;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2007, sequence version 3.
DT   27-MAR-2024, entry version 139.
DE   RecName: Full=CLIP domain-containing serine protease {ECO:0000256|RuleBase:RU366078};
DE            EC=3.4.21.- {ECO:0000256|RuleBase:RU363034};
GN   Name=CLIPB2 {ECO:0000313|EMBL:EAA08404.4};
GN   Synonyms=1273920 {ECO:0000313|EnsemblMetazoa:AGAP003246-PA};
GN   ORFNames=AgaP_AGAP003246 {ECO:0000313|EMBL:EAA08404.4};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA08404.4};
RN   [1] {ECO:0000313|EMBL:EAA08404.4, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA08404.4,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA08404.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA08404.4};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA08404.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA08404.4};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA08404.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA08404.4};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA08404.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA08404.4};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|EnsemblMetazoa:AGAP003246-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP003246-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613,
CC       ECO:0000256|RuleBase:RU366078}.
CC   -!- DOMAIN: The clip domain consists of 35-55 residues which are 'knitted'
CC       together usually by 3 conserved disulfide bonds forming a clip-like
CC       compact structure. {ECO:0000256|RuleBase:RU366078}.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195, ECO:0000256|RuleBase:RU366078}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008879; EAA08404.4; -; Genomic_DNA.
DR   RefSeq; XP_312956.3; XM_312956.4.
DR   AlphaFoldDB; Q7PQR9; -.
DR   MEROPS; S01.507; -.
DR   PaxDb; 7165-AGAP003246-PA; -.
DR   EnsemblMetazoa; AGAP003246-RA; AGAP003246-PA; AGAP003246.
DR   GeneID; 1273920; -.
DR   KEGG; aga:AgaP_AGAP003246; -.
DR   CTD; 1273920; -.
DR   VEuPathDB; VectorBase:AGAP003246; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_006842_0_3_1; -.
DR   InParanoid; Q7PQR9; -.
DR   OMA; PSIGTCG; -.
DR   OrthoDB; 3680196at2759; -.
DR   Proteomes; UP000007062; Chromosome 2R.
DR   ExpressionAtlas; Q7PQR9; differential.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:UniProtKB-UniRule.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 3.30.1640.30; -; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR022700; CLIP.
DR   InterPro; IPR038565; CLIP_sf.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF131; CLIP DOMAIN-CONTAINING SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF12032; CLIP; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00680; CLIP; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51888; CLIP; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525, ECO:0000256|RuleBase:RU366078};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|RuleBase:RU366078}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|RuleBase:RU366078"
FT   CHAIN           20..355
FT                   /note="CLIP domain-containing serine protease"
FT                   /evidence="ECO:0000256|RuleBase:RU366078"
FT                   /id="PRO_5014587847"
FT   DOMAIN          25..78
FT                   /note="Clip"
FT                   /evidence="ECO:0000259|PROSITE:PS51888"
FT   DOMAIN          103..355
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   355 AA;  38380 MW;  9B25C366EE353015 CRC64;
     MGKAKVFPLV LALFGVSVAL EQGQRCVNPA RQTGKCVLVR ECASLLAIYS KRFTTPEETQ
     FLASSRCGEI GRKTLVCCAS EQQTRTSSFP TSPECGIQVT DRIIGGQTTE LEEFPWTALI
     EYRKPGNQYD FHCGGALINA RYILTAAHCV QSLPRGWQLN GVRLGEWDLS TANDCSDGIC
     SAGPIDLEIE SFVAHAGYDA ADTAHTNDIA LIRLRQDVAS SEMIRPICLP LTEPQRSRNR
     VGTVSFAAGW GKTESASASE RKLKVELTVQ DPSRCRQIYR GINIALKASQ MCAGGLQGKD
     TCTGDSGGPL MAKSAGAWYL IGVVSFGLSK CGTAGYPGVY TNVVEYLDWI ESNVQ
//