ID J9HJ91_AEDAE Unreviewed; 1804 AA.
AC J9HJ91;
DT 31-OCT-2012, integrated into UniProtKB/TrEMBL.
DT 31-OCT-2012, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=AAEL016971-PA {ECO:0000313|EMBL:EJY57924.1};
GN ORFNames=AaeL_AAEL016971 {ECO:0000313|EMBL:EJY57924.1};
OS Aedes aegypti (Yellowfever mosquito) (Culex aegypti).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Culicinae; Aedini; Aedes; Stegomyia.
OX NCBI_TaxID=7159 {ECO:0000313|EMBL:EJY57924.1, ECO:0000313|Proteomes:UP000682892};
RN [1] {ECO:0000313|EMBL:EJY57924.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Liverpool {ECO:0000313|EMBL:EJY57924.1};
RA Loftus B.J., Nene V.M., Hannick L.I., Bidwell S., Haas B., Amedeo P.,
RA Orvis J., Wortman J.R., White O.R., Salzberg S., Shumway M., Koo H.,
RA Zhao Y., Holmes M., Miller J., Schatz M., Pop M., Pai G., Utterback T.,
RA Rogers Y.-H., Kravitz S., Fraser C.M.;
RL Submitted (OCT-2005) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EJY57924.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Liverpool {ECO:0000313|EMBL:EJY57924.1};
RX PubMed=17510324; DOI=10.1126/science.1138878;
RA Nene V., Wortman J.R., Lawson D., Haas B.J., Kodira C.D., Tu Z.J.,
RA Loftus B.J., Xi Z., Megy K., Grabherr M., Ren Q., Zdobnov E.M., Lobo N.F.,
RA Campbell K.S., Brown S.E., Bonaldo M.F., Zhu J., Sinkins S.P.,
RA Hogenkamp D.G., Amedeo P., Arensburger P., Atkinson P.W., Bidwell S.L.,
RA Biedler J., Birney E., Bruggner R.V., Costas J., Coy M.R., Crabtree J.,
RA Crawford M., DeBruyn B., DeCaprio D., Eiglmeier K., Eisenstadt E.,
RA El-Dorry H., Gelbart W.M., Gomes S.L., Hammond M., Hannick L.I.,
RA Hogan J.R., Holmes M.H., Jaffe D., Johnston S.J., Kennedy R.C., Koo H.,
RA Kravitz S., Kriventseva E.V., Kulp D., Labutti K., Lee E., Li S.,
RA Lovin D.D., Mao C., Mauceli E., Menck C.F., Miller J.R., Montgomery P.,
RA Mori A., Nascimento A.L., Naveira H.F., Nusbaum C., O'Leary S.B., Orvis J.,
RA Pertea M., Quesneville H., Reidenbach K.R., Rogers Y.-H.C., Roth C.W.,
RA Schneider J.R., Schatz M., Shumway M., Stanke M., Stinson E.O.,
RA Tubio J.M.C., Vanzee J.P., Verjovski-Almeida S., Werner D., White O.R.,
RA Wyder S., Zeng Q., Zhao Q., Zhao Y., Hill C.A., Raikhel A.S., Soares M.B.,
RA Knudson D.L., Lee N.H., Galagan J., Salzberg S.L., Paulsen I.T.,
RA Dimopoulos G., Collins F.H., Bruce B., Fraser-Liggett C.M., Severson D.W.;
RT "Genome sequence of Aedes aegypti, a major arbovirus vector.";
RL Science 316:1718-1723(2007).
RN [3] {ECO:0000313|EMBL:EJY57924.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Liverpool {ECO:0000313|EMBL:EJY57924.1};
RG VectorBase;
RL Submitted (SEP-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH477769; EJY57924.1; -; Genomic_DNA.
DR RefSeq; XP_011493608.1; XM_011495306.1.
DR MEROPS; S01.013; -.
DR PaxDb; 7159-AAEL016971-PA; -.
DR GeneID; 23687391; -.
DR KEGG; aag:23687391; -.
DR VEuPathDB; VectorBase:AAEL027317; -.
DR eggNOG; KOG3627; Eukaryota.
DR HOGENOM; CLU_000346_0_0_1; -.
DR OrthoDB; 3035825at2759; -.
DR PhylomeDB; J9HJ91; -.
DR Proteomes; UP000682892; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 6.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 7.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR015420; Peptidase_S1A_nudel.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF7; HYALIN; 1.
DR Pfam; PF09342; DUF1986; 1.
DR Pfam; PF00057; Ldl_recept_a; 5.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00192; LDLa; 8.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 7.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR PROSITE; PS01209; LDLRA_1; 3.
DR PROSITE; PS50068; LDLRA_2; 7.
DR PROSITE; PS50240; TRYPSIN_DOM; 2.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 308..546
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1301..1510
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 698..819
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 855..889
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 920..973
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 717..819
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 923..963
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 88..103
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 164..179
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 558..570
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 577..592
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1120..1135
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1528..1546
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1567..1579
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1574..1592
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1656..1671
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 1804 AA; 201056 MW; E6F395CEDB67D0DB CRC64;
MNVPLNIQNL SGKRTSDDPD LEGAQLVCSF VGPSSTEKPL AGNDSMMLQS SQELFVSSTR
ARGHHRHCPR GKVPCADGIQ CVLSSHLCDS RVDCFDGSDE SHCSCLSRLA DKRRCDGYVD
CPLAEDEMGC FGCDKFSFSC FNTFFEYQAS HHSETRCFTL IEKCDGFNNC MNRKDEQDCT
MLVRDLRSPL AFAVGHSVGV LHRNYKGKWY PVCHNPLNLA REACEAELGP ADRDPVILQH
HGDLPGPFIQ PSPRSHHVFQ PEFTDTCNGL INYVKCPAPK CGSSKQNEME NLRIKIRGKR
NATELVQIVG GTKAEPAAYP FIVGIFRDGK FHCGGSIFNE RWIVTAAHCC DNFPRHHYEL
RAGLLRRRSF SPQVQVSTVT HVFIHRGYSA QKMINDISLM HSDRPFQYNR WVRPICLPDR
HMTTNDRDWI WGPKPGTMCT AVGWGALREH GGSPDHLMQV TVPILPFCKH KNDRDGLAIC
AAEMSGGHDA CQGDSGGPFA CISVSSPHEW YLAGVVSHGE GCARPNEPGV YTRVALFNDW
IQRKTREVLT SSSTRQDCPG FQCSVGVSFC IPRQKRCNGK VDCLGGEDEL SCSLDQLLAE
SIQETTIATT PKNSTATSST TAAATTEAVT SKIDFLAENS EASDPATTVT EASSTETANI
ETILTTIETV SAKDVSDGIF QVSTEATTVE INTTLDISSS SQNVSASPEK VEESTVDETT
SSTTETSFTH PTTIESETTA DSVTALMSET TSPSLPPNEN NTTVDYTVTR STDQTTNTPF
TISPTTDSLD DSSAAYSSSA NDSEVNHSTT IEPKNTESSV HATTVIELIS VESTTNIEAT
SLADPSNASI TTLSSDLVEE TSSSSTTPST PDSTSSTQPD SSHSSTSSAP WMQINEAIAN
LKPRPPSTDR DMEMRQLELG DSVAATRTDN TTSTTISTAP SAATDSTSST PLSTTTVESS
STTHGDLEHS ETQIEPVEAE ATNASVEEHP FLREIHNLVE EKTRRLNQFR LSMHYLHTSL
RNQTQDANET SYQYRRFKCS KIRQSINIAH HCDRIIDCED GTDELRCTCR DYLKDKYDFL
ICDGKTDCLD LTDEADCFSC TAGQFPCRMS KVCIDEKKLC DAIPDCPLHE DELDCLALTD
GHKVYFDANN LSEFKYEGLV TKNTNGTWDL ICGAEINNKS VESIGKICSF LGFAGYESYY
QTVLTPLVNE TVDLDHQPLL IMSYRNISSE PNCKALHITC APFINATEHE ISHFENQHKE
QPVQVNIRPT HPIQNITSLT HITFQENAHI EFIENFGDDY DWPWNADIYL EGVFLCSAII
IEVNWIVVDS SCMRMINLKN DYLSVVAGGA KSYLKISGPY EQVVRVDCYH FLPEARVVML
HLAKNLTFTR HVLPTFIPEK NYNITDNQCL AVGQDKYGRT RTLRVHMNMT NCEPEDHICY
QLNPDNGIYH ADHCYTENAS RTGVVVCKSK VSGWYPVGFY QNKHGLCGFN EVVKMISLKE
FYTDIQHVLS HKKCDYEFPE PLCDGVRCWH GKCIGHSLVC DNKMDCDDNF DERPEACNAI
NDTSTACLPT QFRCGSHQCV DKSKFCDGRN DCGDLSDEPH ECSCYTYLKV TDPSKICDGV
RNCWDKSDEN PRLCKCAKTS FRCGDSEVCI PYDHVCDDEI DCPGEEDERY CYALQQNPAE
TNYGEVMQQS YGIWHSKCFP KDDKYDEQTI KEICHRVGYQ QVRKVYGRKV LPESRLRTSN
RTHDPVDRLR GAATKAVAFN KFFKVNINEK QAIFMKPSRP LYTLVNWDAE DEQKCDRLEI
NCGD
//