GenomeNet

Database: UniProt
Entry: Q7PG50_ANOGA
ID   Q7PG50_ANOGA            Unreviewed;       376 AA.
AC   Q7PG50;
DT   15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 4.
DT   27-MAR-2024, entry version 128.
DE   SubName: Full=AGAP000573-PA {ECO:0000313|EMBL:EAA45068.4};
GN   Name=CLIPC4 {ECO:0000313|EMBL:EAA45068.4};
GN   Synonyms=1271649 {ECO:0000313|EnsemblMetazoa:AGAP000573-PA};
GN   ORFNames=AgaP_AGAP000573 {ECO:0000313|EMBL:EAA45068.4};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA45068.4};
RN   [1] {ECO:0000313|EMBL:EAA45068.4, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA45068.4,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EAA45068.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA45068.4};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EAA45068.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA45068.4};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EAA45068.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA45068.4};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EAA45068.4}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EAA45068.4};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [6] {ECO:0000313|EnsemblMetazoa:AGAP000573-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP000573-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008846; EAA45068.4; -; Genomic_DNA.
DR   RefSeq; XP_310504.4; XM_310504.4.
DR   AlphaFoldDB; Q7PG50; -.
DR   SMR; Q7PG50; -.
DR   STRING; 7165.Q7PG50; -.
DR   MEROPS; S01.122; -.
DR   PaxDb; 7165-AGAP000573-PA; -.
DR   EnsemblMetazoa; AGAP000573-RA; AGAP000573-PA; AGAP000573.
DR   GeneID; 1271649; -.
DR   CTD; 1271649; -.
DR   VEuPathDB; VectorBase:AGAP000573; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   InParanoid; Q7PG50; -.
DR   OrthoDB; 3676963at2759; -.
DR   Proteomes; UP000007062; Chromosome X.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24258:SF148; SERINE PROTEASE P66; 1.
DR   PANTHER; PTHR24258; SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        12..36
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          134..372
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   376 AA;  40734 MW;  6B78716E28D08BC8 CRC64;
     MGRRRCATRW SVAVGVVPPV GAKAGLLVLV ILLGAVTRSA GNFSFPTDPD ILYEGDKCGL
     PGMRTGTCQK AADCPQGIRA NGARCEFSGN DPVVCCPTEA PGTGDGTSNR FTARIAKQEC
     ERFTSASANI IDHISGRAIE ALRGEFPFVA LVNFRGEEGE EVKLTRCGAS LIAPRFLLTA
     AHCLKDLNPV TVEIGFIQLS DTEKDEYEIK QVHLHEGHKS RRNDIALIEL KNNVTYKQDV
     GPICLNTDRP EIGPSINLTV MGWGADGDGQ RADKLMKGTV YEIPLDECVQ RFRDAKQRIS
     LGEDQLCALG EKVNDETTDA CQGDSGGPLV MTVRQKFYLV GVVSTGAVCG GSLPGIYTRV
     SRYLEWIEQR VWGSPN
//