GenomeNet

Database: UniProt
Entry: Q7QKL2_ANOGA
LinkDB: Q7QKL2_ANOGA
Original site: Q7QKL2_ANOGA 
ID   Q7QKL2_ANOGA            Unreviewed;       372 AA.
AC   Q7QKL2;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   07-DEC-2004, sequence version 2.
DT   27-MAR-2024, entry version 136.
DE   RecName: Full=CLIP domain-containing serine protease {ECO:0000256|RuleBase:RU366078};
DE            EC=3.4.21.- {ECO:0000256|RuleBase:RU363034};
GN   Name=CLIPB1 {ECO:0000313|EMBL:EAA03566.2};
GN   Synonyms=1269160 {ECO:0000313|EnsemblMetazoa:AGAP003251-PA};
GN   ORFNames=AgaP_AGAP003251 {ECO:0000313|EMBL:EAA03566.2};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA03566.2};
RN   [1] {ECO:0000313|EMBL:EAA03566.2, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA03566.2,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA03566.2}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA03566.2};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA03566.2}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA03566.2};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA03566.2}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA03566.2};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA03566.2}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA03566.2};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|EnsemblMetazoa:AGAP003251-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP003251-PA};
RG   EnsemblMetazoa;
RL   Submitted (JAN-2021) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613,
CC       ECO:0000256|RuleBase:RU366078}.
CC   -!- DOMAIN: The clip domain consists of 35-55 residues which are 'knitted'
CC       together usually by 3 conserved disulfide bonds forming a clip-like
CC       compact structure. {ECO:0000256|RuleBase:RU366078}.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195, ECO:0000256|RuleBase:RU366078}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008794; EAA03566.2; -; Genomic_DNA.
DR   RefSeq; XP_307756.2; XM_307756.4.
DR   AlphaFoldDB; Q7QKL2; -.
DR   STRING; 7165.Q7QKL2; -.
DR   MEROPS; S01.316; -.
DR   PaxDb; 7165-AGAP003251-PA; -.
DR   EnsemblMetazoa; AGAP003251-RA; AGAP003251-PA; AGAP003251.
DR   GeneID; 1269160; -.
DR   KEGG; aga:AgaP_AGAP003251; -.
DR   CTD; 1269160; -.
DR   VEuPathDB; VectorBase:AGAP003251; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_006842_0_3_1; -.
DR   InParanoid; Q7QKL2; -.
DR   OMA; WGQTENK; -.
DR   OrthoDB; 3680196at2759; -.
DR   Proteomes; UP000007062; Chromosome 2R.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 3.30.1640.30; -; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR022700; CLIP.
DR   InterPro; IPR038565; CLIP_sf.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24256:SF537; RE37738P-RELATED; 1.
DR   PANTHER; PTHR24256; TRYPTASE-RELATED; 1.
DR   Pfam; PF12032; CLIP; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00680; CLIP; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51888; CLIP; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525, ECO:0000256|RuleBase:RU366078};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|RuleBase:RU366078}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|RuleBase:RU366078"
FT   CHAIN           26..372
FT                   /note="CLIP domain-containing serine protease"
FT                   /evidence="ECO:0000256|RuleBase:RU366078"
FT                   /id="PRO_5014588602"
FT   DOMAIN          31..85
FT                   /note="Clip"
FT                   /evidence="ECO:0000259|PROSITE:PS51888"
FT   DOMAIN          115..372
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   372 AA;  40888 MW;  11906ACC44906E35 CRC64;
     MSPQAGRFLL LLVCVFCGVI GLSDALNLQD ACETPDGKVG TCVYLRSCLS IRNVLLKKEN
     MTPEDRSLVM KSKCGQEGRS VLVCCPLVRK LTGRFDAPVE LPPPGECGKM QMDRIVGGGV
     SPIDGYPWLT RIQYYKGSNR YGFHCGGVLI HNQYVLTAAH CIEGVPSTWI VYQVRLGEFD
     TTTTIDCVED DCADPVRDVL INAYVVHPDY YKQNGADYND IALLQLSETV EFTDFIRPIC
     LPTSEESRTV NLTGKYATVA GWGQTENSTS STKKLHLRVP VVDNEVCADA FSSIRLEIIP
     TQLCAGGEKG KDSCRGDSGG PLMRYGDGRS STKSWYLIGL VSFGLEQCGT DGVPGVYTRM
     SEYMDWVLDT ME
//
DBGET integrated database retrieval system