GenomeNet

Database: UniProt
Entry: A0A182XYH8_ANOST
LinkDB: A0A182XYH8_ANOST
Original site: A0A182XYH8_ANOST 
ID   A0A182XYH8_ANOST        Unreviewed;      2293 AA.
AC   A0A182XYH8;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 30.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS   Anopheles stephensi (Indo-Pakistan malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=30069 {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA, ECO:0000313|Proteomes:UP000076408};
RN   [1] {ECO:0000313|Proteomes:UP000076408}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Indian {ECO:0000313|Proteomes:UP000076408};
RX   PubMed=25244985; DOI=10.1186/preaccept-1262842421127991;
RA   Jiang X., Peery A., Hall A.B., Sharma A., Chen X.G., Waterhouse R.M.,
RA   Komissarov A., Riehle M.M., Shouche Y., Sharakhova M.V., Lawson D.,
RA   Pakpour N., Arensburger P., Davidson V.L., Eiglmeier K., Emrich S.,
RA   George P., Kennedy R.C., Mane S.P., Maslen G., Oringanje C., Qi Y.,
RA   Settlage R., Tojo M., Tubio J.M., Unger M.F., Wang B., Vernick K.D.,
RA   Ribeiro J.M., James A.A., Michel K., Riehle M.A., Luckhart S.,
RA   Sharakhov I.V., Tu Z.;
RT   "Genome analysis of a major urban malaria vector mosquito, Anopheles
RT   stephensi.";
RL   Genome Biol. 15:459-459(2014).
RN   [2] {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA}
RP   IDENTIFICATION.
RC   STRAIN=Indian {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 30069.A0A182XYH8; -.
DR   EnsemblMetazoa; ASTEI01264-RA; ASTEI01264-PA; ASTEI01264.
DR   VEuPathDB; VectorBase:ASTE004431; -.
DR   VEuPathDB; VectorBase:ASTEI01264; -.
DR   VEuPathDB; VectorBase:ASTEI20_039130; -.
DR   OMA; DCMSAFL; -.
DR   Proteomes; UP000076408; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00112; LDLa; 8.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 6.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR   InterPro; IPR036055; LDL_receptor-like_sf.
DR   InterPro; IPR023415; LDLR_class-A_CS.
DR   InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR015420; Peptidase_S1A_nudel.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24258:SF151; SERINE PROTEASE NUDEL; 1.
DR   PANTHER; PTHR24258; SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF09342; DUF1986; 1.
DR   Pfam; PF00057; Ldl_recept_a; 5.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00261; LDLRECEPTOR.
DR   SMART; SM00192; LDLa; 9.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF57424; LDL receptor-like module; 6.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR   PROSITE; PS01209; LDLRA_1; 1.
DR   PROSITE; PS50068; LDLRA_2; 8.
DR   PROSITE; PS50240; TRYPSIN_DOM; 2.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00124}; Reference proteome {ECO:0000313|Proteomes:UP000076408};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          972..1210
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          1782..2119
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DISULFID        748..763
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        824..839
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1222..1234
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1241..1256
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1544..1559
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1612..1627
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2056..2068
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2063..2081
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2145..2160
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ   SEQUENCE   2293 AA;  256623 MW;  7EB680DC6E226E65 CRC64;
     MAEHDQTVLD EELTITSPRI QKLTMGASQR TNLVCLMLIA VGGVIVMAAI AFGFYMTVPS
     SVLSEQTSNT IVLEPATVEV PDELEDIRKM YFKTEVQDIM MGSLELLELM QEFEPLLERQ
     RRRRDVGEAA PNRISRADDR SEEITYGVRK RVQREFDLGM VPIVYKGVVV GPEPARGSRT
     KRRAVSLDEL QRAKRMAQKD YQRLETEYHR CRKEAPNGKL CDQIYEKLQR LSEEMNARFM
     EMANLLQGFH EMGLTTEETT SYPEEFFSST EVGSKSVTST SSRPEKHVPK HDPRITTVEQ
     LVNDMNITQT VEGAWMSTTT EAMGASTSTR LPFDPSLHNP VEDSSIRSLD DLFDHMRLSD
     SFQSSTPSDK WTTTERSRDL FEAEDGYSSS SPPTTVKSSS KLTTTHRWTT TKSPYQSARN
     LHDVVDQLQL AQMLQPHPFH EDLLLNPPDN INEFVMSRSR NRDHMHHDPG HSEVNHDPSI
     DPLRSVHHQH QQQPQQPLVV GPSAPFLSLC DQLRSAAGNG GHMKPAHQGN FQHSPGIPIT
     GEATKASSQI IVNSAYGAGY VPNTVCFYQN APPVGAQTAY AFRPATATGP YVYPNGGYPP
     YQQQPPPPQH HHHHPGAGGK LPLNDPAIDA LIQPRIPEAS RIFIIGGARV PDAPVDPNGP
     YLLCSMMQND EAPSHADGHA TQPPPPPPGR HHHMGNESHP QSDDPEDLAS IFAAGRSLAA
     RAKGRHHCRR GWVPCFSSHQ CVKRSAWCDS KTDCMDGSDE SACSCVSRLP KRKLCDGYAD
     CPLGMDEMGC FGCEKFSFSC FHTQDEYRAA HRSGSMCYTL LEKCDGFDNC LNRKDEQDCT
     MLVRDLGHYL AYSVPHSSGV LHRNYRGKWY PVCNNPSQWA REACDAEVGP QEREPLLSHG
     HGSLPGPYIS QRANSAVSQP EFSEGCNGVY VQVKCPPVRC GTSKLQEQHF SRISVRSRRN
     TNESELVESV RIVGGSHAEP EAYPFIVGIF RDGKYHCGGS IYNEHWIISA AHCCDNFDQH
     YFEVRSGMLR KRSFAPQVQI TRVTNMIIHH AYSSTLMAND IALMRVETPF HYNRWVRPIC
     LPDRHRTTND REWMWGPKPG TVCTAIGWGA LRERGGAPDH LMQVSVPILG YCKHKSDRDS
     LQICAGEEDG GHDACQGDSG GPFVCQSHSN PAEWYLAGVV SHGEGCARAH EPGVYTRVAL
     FIDWIAEKVK SPPTGRTARA DCPGMRCIWG GGICLPPGKK CNGFVNCLGG EDESGCAMDQ
     MLRSMAHREG DEDSDEERQE QEIVTQRASN VHAGRQMDEP SSSTESITPG MFTSEESTTL
     TAHEASVEAQ ESMIVTHTAQ TDPVTTTTPL EQVPTTEPTA SEQPSTESSS TEPSTTELPT
     SELSPSSSTE QTTPTTTSAT TDQSAIIFPI HDTEQDPKGF EFVAPITSTT TEAIWKALEK
     TVKEVRGQAR AENGSSTVNS TTTEQVPLLI SEDSLDEEQP DHPFFNELEV MQEQKHKRVA
     EFRLTVHSLH TKTNVATVPP NDTAQYRQFS CRNITQRINV AHRCDRIVDC EDGSDELNCT
     CRDFLKDKFD FLICDGKTDC LDRTDELECM NCQVGQYACR ISQVCIPNAQ VCNGQPDCPL
     HEDELDCLAL TDGHRVYFDA NNLTLFRNTG IVTKNTNGTW EVLCGAVLTA KTEHAVEKIC
     SFLGFAGFQN YSLLSITPAE LPAGLLVNTA EHHYVNITVD ASCQALRVSC VQHINATEHD
     IAHFEHKHKQ EPVQVNIRPL NPIHRPHHMP QIVFQENAHI ELVENFGDDY DWPWNTNIYL
     DGVLICSGLI IDASWIIVAG SCTRLVNLKH QYLAVVAGGA KSYLHIEGPY EQVVRVDCYH
     YIPEAETVML HLATKLSFSR HVLPTFVPEN ENVTDSECLA VGQDKYGRTK TLRVHLNGTN
     CQGDRVRCYH KDLKQPYYHH ESCYTPEATR SGVIVCKTNR SGWYPVGFYQ HKRGLCGFNE
     VVRVTSLVES YQKIQNVLHN EQCGDQLYEE PKCAGKRCRY GKCVGEKLLC DRKPDCGDGS
     DESPALCATR NQTSSCLPHQ LRCANGRCID KSSFCDRKND CGDSTDEPHD CSCYTYLKIT
     DPGKICDGVR NCWDKSDENP RVCRCHSTSF RCGESDICVP YDFVCDKERD CPEGQDEQYC
     YALQQNSYEA GYGELMEQSY GIWHSKCFPK TAQFDDEYMR RICEQLGYSQ VRKIYGRAIV
     EGARLRTANE TESSVDKLRR AATKTIVQNK FSKVVINQNH TFYMKPSRPM FKVINWNYED
     EQNCHRLELL CAA
//
DBGET integrated database retrieval system