Database: UniProt
Entry: F5HIQ8_ANOGA
ID   F5HIQ8_ANOGA            Unreviewed;       394 AA.
AC   F5HIQ8;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 1.
DT   27-MAR-2024, entry version 73.
DE   RecName: Full=CLIP domain-containing serine protease {ECO:0000256|RuleBase:RU366078};
DE            EC=3.4.21.- {ECO:0000256|RuleBase:RU366078};
GN   Name=CLIPB36 {ECO:0000313|EMBL:EGK96169.1};
GN   Synonyms=11175953 {ECO:0000313|EnsemblMetazoa:AGAP013184-PA};
GN   ORFNames=AgaP_AGAP013184 {ECO:0000313|EMBL:EGK96169.1};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EGK96169.1};
RN   [1] {ECO:0000313|EMBL:EGK96169.1, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK96169.1,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EGK96169.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK96169.1};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EGK96169.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK96169.1};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EGK96169.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK96169.1};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EGK96169.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK96169.1};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|EnsemblMetazoa:AGAP013184-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP013184-PA};
RG   EnsemblMetazoa;
RL   Submitted (JAN-2021) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613,
CC       ECO:0000256|RuleBase:RU366078}.
CC   -!- DOMAIN: The clip domain consists of 35-55 residues which are 'knitted'
CC       together usually by 3 conserved disulfide bonds forming a clip-like
CC       compact structure. {ECO:0000256|RuleBase:RU366078}.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195, ECO:0000256|RuleBase:RU366078}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008794; EGK96169.1; -; Genomic_DNA.
DR   RefSeq; XP_003436427.1; XM_003436379.1.
DR   AlphaFoldDB; F5HIQ8; -.
DR   PaxDb; 7165-AGAP013184-PA; -.
DR   EnsemblMetazoa; AGAP013184-RA; AGAP013184-PA; AGAP013184.
DR   GeneID; 11175953; -.
DR   KEGG; aga:AgaP_AGAP013184; -.
DR   CTD; 11175953; -.
DR   VEuPathDB; VectorBase:AGAP013184; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_006842_0_3_1; -.
DR   InParanoid; F5HIQ8; -.
DR   OMA; WDLESTV; -.
DR   OrthoDB; 3452443at2759; -.
DR   Proteomes; UP000007062; Chromosome 2R.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:UniProtKB-UniRule.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 3.30.1640.30; -; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR022700; CLIP.
DR   InterPro; IPR038565; CLIP_sf.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   PANTHER; PTHR24256:SF477; LD47230P-RELATED; 1.
DR   PANTHER; PTHR24256; TRYPTASE-RELATED; 1.
DR   Pfam; PF12032; CLIP; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00680; CLIP; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51888; CLIP; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|RuleBase:RU366078};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Protease {ECO:0000256|RuleBase:RU366078};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525, ECO:0000256|RuleBase:RU366078};
KW   Serine protease {ECO:0000256|RuleBase:RU366078};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        12..29
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          36..90
FT                   /note="Clip"
FT                   /evidence="ECO:0000259|PROSITE:PS51888"
FT   DOMAIN          97..388
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   394 AA;  42468 MW;  6569F0E0A2D6F3FA CRC64;
     MVNCLFEVTR YQWLGAIVLV LLVVPGYGGR HSFRQLCITE EQQRGRCVPV KQCDSVMTTL
     RKDTLTPEDI QYLYGTECGR TAEGKALVCC PQSSLVPVGG PMESGINTTI RVNAEEKQPT
     EPTLEVAPNA CGLQSAVKLT NLTIGHHPWT VLLHYGGQAH STQFNCSGTL IAPSYVLTSA
     SCVDDEAAWN NLTVRLGEWD LESTVDCILD PDSDDLVCAD PSYDVPVGQV ILHEAYTGRR
     NDIALLKLAQ PAQLNDWVSP ICLPESPVLN ETAKYGAAGW NQNTCADPSS RYKQLSSYDA
     LNQKACERYV PSVAGTSYGF VCVAVGEEQP LGDAGGGLTA VRTIDSAGRS VHELVGVLSS
     LSSCANFQGV SVYTRVAQYV DWIESKLVVP SDGA
//