GenomeNet

Database: UniProt
Entry: F5HMI1_ANOGA
LinkDB: F5HMI1_ANOGA
Original site: F5HMI1_ANOGA 
ID   F5HMI1_ANOGA            Unreviewed;       454 AA.
AC   F5HMI1;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 1.
DT   27-MAR-2024, entry version 72.
DE   SubName: Full=AGAP013020-PA {ECO:0000313|EMBL:EGK97503.1};
GN   ORFNames=AgaP_AGAP013020 {ECO:0000313|EMBL:EGK97503.1};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EGK97503.1};
RN   [1] {ECO:0000313|EMBL:EGK97503.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK97503.1};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EGK97503.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK97503.1};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EGK97503.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK97503.1};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EGK97503.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK97503.1};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EGK97503.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EGK97503.1};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EGK97503.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008987; EGK97503.1; -; Genomic_DNA.
DR   RefSeq; XP_003435910.1; XM_003435862.1.
DR   AlphaFoldDB; F5HMI1; -.
DR   STRING; 7165.F5HMI1; -.
DR   PaxDb; 7165-AGAP013020-PA; -.
DR   GeneID; 11175979; -.
DR   KEGG; aga:AgaP_AGAP013020; -.
DR   VEuPathDB; VectorBase:AGAP013020; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   HOGENOM; CLU_006842_16_1_1; -.
DR   InParanoid; F5HMI1; -.
DR   OMA; HTICAGQ; -.
DR   OrthoDB; 3149366at2759; -.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR031986; GD_N.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF153; GH19262P-RELATED; 1.
DR   Pfam; PF16030; GD_N; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
SQ   SEQUENCE   454 AA;  50449 MW;  CDD3CEC3FB152735 CRC64;
     MLAVHINVSR RCTVRSSPPS SLVLRTYAFL LLLGSLVYTP VEGQRSPCPD VFSYWVEEGT
     NQPFGYVKLE GLRANQAITL QVDLTIAATV SQNNIGSITL YKSSTETVRD IQNNRPAWYR
     VNFPFRNIKP SVLAIRVNGH TICAGQKVTG QIVTTINLQH TLYPSTQSLV STNDGTNVNV
     IQYQPPQTPV DCGLPDKGFS HYSINGVHAH KGMFPWAAPI FHTGSSSKPR YICGSTILTE
     RHLVTAAHCV YNSDGIKQNV SDLTVVPGMH NIDNFFEADL QERGVKKIFV HNDYFFEHGM
     LVDADIAVLL LDDPITYNKL VRPICMWSDS DNLEKIVGDE GFVSGWGVTE DGKAKIPSYV
     MATVVDRQTC NRNLDRLFAA KARIFCADGH GSVPCTGDSG SGFVIKRGPR YYIRGIVSFG
     QFDPKTLTCA TDKYVVYTDI APFRYWLTRV MKSQ
//
DBGET integrated database retrieval system