ID A0A182XYH8_ANOST Unreviewed; 2293 AA.
AC A0A182XYH8;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS Anopheles stephensi (Indo-Pakistan malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=30069 {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA, ECO:0000313|Proteomes:UP000076408};
RN [1] {ECO:0000313|Proteomes:UP000076408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Indian {ECO:0000313|Proteomes:UP000076408};
RX PubMed=25244985; DOI=10.1186/preaccept-1262842421127991;
RA Jiang X., Peery A., Hall A.B., Sharma A., Chen X.G., Waterhouse R.M.,
RA Komissarov A., Riehle M.M., Shouche Y., Sharakhova M.V., Lawson D.,
RA Pakpour N., Arensburger P., Davidson V.L., Eiglmeier K., Emrich S.,
RA George P., Kennedy R.C., Mane S.P., Maslen G., Oringanje C., Qi Y.,
RA Settlage R., Tojo M., Tubio J.M., Unger M.F., Wang B., Vernick K.D.,
RA Ribeiro J.M., James A.A., Michel K., Riehle M.A., Luckhart S.,
RA Sharakhov I.V., Tu Z.;
RT "Genome analysis of a major urban malaria vector mosquito, Anopheles
RT stephensi.";
RL Genome Biol. 15:459-459(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA}
RP IDENTIFICATION.
RC STRAIN=Indian {ECO:0000313|EnsemblMetazoa:ASTEI01264-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 30069.A0A182XYH8; -.
DR EnsemblMetazoa; ASTEI01264-RA; ASTEI01264-PA; ASTEI01264.
DR VEuPathDB; VectorBase:ASTE004431; -.
DR VEuPathDB; VectorBase:ASTEI01264; -.
DR VEuPathDB; VectorBase:ASTEI20_039130; -.
DR OMA; DCMSAFL; -.
DR Proteomes; UP000076408; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00112; LDLa; 8.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 6.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR015420; Peptidase_S1A_nudel.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24258:SF151; SERINE PROTEASE NUDEL; 1.
DR PANTHER; PTHR24258; SERINE PROTEASE-RELATED; 1.
DR Pfam; PF09342; DUF1986; 1.
DR Pfam; PF00057; Ldl_recept_a; 5.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00192; LDLa; 9.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 6.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR PROSITE; PS01209; LDLRA_1; 1.
DR PROSITE; PS50068; LDLRA_2; 8.
DR PROSITE; PS50240; TRYPSIN_DOM; 2.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; Reference proteome {ECO:0000313|Proteomes:UP000076408};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 972..1210
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1782..2119
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DISULFID 748..763
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 824..839
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1222..1234
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1241..1256
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1544..1559
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1612..1627
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2056..2068
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2063..2081
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2145..2160
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 2293 AA; 256623 MW; 7EB680DC6E226E65 CRC64;
MAEHDQTVLD EELTITSPRI QKLTMGASQR TNLVCLMLIA VGGVIVMAAI AFGFYMTVPS
SVLSEQTSNT IVLEPATVEV PDELEDIRKM YFKTEVQDIM MGSLELLELM QEFEPLLERQ
RRRRDVGEAA PNRISRADDR SEEITYGVRK RVQREFDLGM VPIVYKGVVV GPEPARGSRT
KRRAVSLDEL QRAKRMAQKD YQRLETEYHR CRKEAPNGKL CDQIYEKLQR LSEEMNARFM
EMANLLQGFH EMGLTTEETT SYPEEFFSST EVGSKSVTST SSRPEKHVPK HDPRITTVEQ
LVNDMNITQT VEGAWMSTTT EAMGASTSTR LPFDPSLHNP VEDSSIRSLD DLFDHMRLSD
SFQSSTPSDK WTTTERSRDL FEAEDGYSSS SPPTTVKSSS KLTTTHRWTT TKSPYQSARN
LHDVVDQLQL AQMLQPHPFH EDLLLNPPDN INEFVMSRSR NRDHMHHDPG HSEVNHDPSI
DPLRSVHHQH QQQPQQPLVV GPSAPFLSLC DQLRSAAGNG GHMKPAHQGN FQHSPGIPIT
GEATKASSQI IVNSAYGAGY VPNTVCFYQN APPVGAQTAY AFRPATATGP YVYPNGGYPP
YQQQPPPPQH HHHHPGAGGK LPLNDPAIDA LIQPRIPEAS RIFIIGGARV PDAPVDPNGP
YLLCSMMQND EAPSHADGHA TQPPPPPPGR HHHMGNESHP QSDDPEDLAS IFAAGRSLAA
RAKGRHHCRR GWVPCFSSHQ CVKRSAWCDS KTDCMDGSDE SACSCVSRLP KRKLCDGYAD
CPLGMDEMGC FGCEKFSFSC FHTQDEYRAA HRSGSMCYTL LEKCDGFDNC LNRKDEQDCT
MLVRDLGHYL AYSVPHSSGV LHRNYRGKWY PVCNNPSQWA REACDAEVGP QEREPLLSHG
HGSLPGPYIS QRANSAVSQP EFSEGCNGVY VQVKCPPVRC GTSKLQEQHF SRISVRSRRN
TNESELVESV RIVGGSHAEP EAYPFIVGIF RDGKYHCGGS IYNEHWIISA AHCCDNFDQH
YFEVRSGMLR KRSFAPQVQI TRVTNMIIHH AYSSTLMAND IALMRVETPF HYNRWVRPIC
LPDRHRTTND REWMWGPKPG TVCTAIGWGA LRERGGAPDH LMQVSVPILG YCKHKSDRDS
LQICAGEEDG GHDACQGDSG GPFVCQSHSN PAEWYLAGVV SHGEGCARAH EPGVYTRVAL
FIDWIAEKVK SPPTGRTARA DCPGMRCIWG GGICLPPGKK CNGFVNCLGG EDESGCAMDQ
MLRSMAHREG DEDSDEERQE QEIVTQRASN VHAGRQMDEP SSSTESITPG MFTSEESTTL
TAHEASVEAQ ESMIVTHTAQ TDPVTTTTPL EQVPTTEPTA SEQPSTESSS TEPSTTELPT
SELSPSSSTE QTTPTTTSAT TDQSAIIFPI HDTEQDPKGF EFVAPITSTT TEAIWKALEK
TVKEVRGQAR AENGSSTVNS TTTEQVPLLI SEDSLDEEQP DHPFFNELEV MQEQKHKRVA
EFRLTVHSLH TKTNVATVPP NDTAQYRQFS CRNITQRINV AHRCDRIVDC EDGSDELNCT
CRDFLKDKFD FLICDGKTDC LDRTDELECM NCQVGQYACR ISQVCIPNAQ VCNGQPDCPL
HEDELDCLAL TDGHRVYFDA NNLTLFRNTG IVTKNTNGTW EVLCGAVLTA KTEHAVEKIC
SFLGFAGFQN YSLLSITPAE LPAGLLVNTA EHHYVNITVD ASCQALRVSC VQHINATEHD
IAHFEHKHKQ EPVQVNIRPL NPIHRPHHMP QIVFQENAHI ELVENFGDDY DWPWNTNIYL
DGVLICSGLI IDASWIIVAG SCTRLVNLKH QYLAVVAGGA KSYLHIEGPY EQVVRVDCYH
YIPEAETVML HLATKLSFSR HVLPTFVPEN ENVTDSECLA VGQDKYGRTK TLRVHLNGTN
CQGDRVRCYH KDLKQPYYHH ESCYTPEATR SGVIVCKTNR SGWYPVGFYQ HKRGLCGFNE
VVRVTSLVES YQKIQNVLHN EQCGDQLYEE PKCAGKRCRY GKCVGEKLLC DRKPDCGDGS
DESPALCATR NQTSSCLPHQ LRCANGRCID KSSFCDRKND CGDSTDEPHD CSCYTYLKIT
DPGKICDGVR NCWDKSDENP RVCRCHSTSF RCGESDICVP YDFVCDKERD CPEGQDEQYC
YALQQNSYEA GYGELMEQSY GIWHSKCFPK TAQFDDEYMR RICEQLGYSQ VRKIYGRAIV
EGARLRTANE TESSVDKLRR AATKTIVQNK FSKVVINQNH TFYMKPSRPM FKVINWNYED
EQNCHRLELL CAA
//