ID A0A084WAT8_ANOSI Unreviewed; 1500 AA.
AC A0A084WAT8;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN ORFNames=ZHAS_00015360 {ECO:0000313|EMBL:KFB47332.1};
OS Anopheles sinensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB47332.1};
RN [1] {ECO:0000313|EMBL:KFB47332.1, ECO:0000313|Proteomes:UP000030765}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT of mosquito competence for malaria parasites.";
RL BMC Genomics 15:42-42(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASIC015360-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATLV01022266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KE525331; KFB47332.1; -; Genomic_DNA.
DR STRING; 74873.A0A084WAT8; -.
DR EnsemblMetazoa; ASIC015360-RA; ASIC015360-PA; ASIC015360.
DR VEuPathDB; VectorBase:ASIC015360; -.
DR VEuPathDB; VectorBase:ASIS009700; -.
DR VEuPathDB; VectorBase:ASIS010287; -.
DR OMA; LAECASH; -.
DR Proteomes; UP000030765; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 5.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24260; -; 1.
DR PANTHER; PTHR24260:SF131; CLIP DOMAIN-CONTAINING SERINE PROTEASE-RELATED; 1.
DR Pfam; PF00089; Trypsin; 5.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 5.
DR PROSITE; PS50240; TRYPSIN_DOM; 5.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 2.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1500
FT /note="Peptidase S1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001784727"
FT DOMAIN 61..309
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 341..579
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 661..908
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 957..1176
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1290..1498
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 1500 AA; 168955 MW; 27314F44C39F7DB3 CRC64;
MAAQYRAFLF VFLVVSNCQA DLDPVFTDID EELPANTMMP NERETIADCH LRFFWQGGRG
IVGPAFAKPA FLTEFAHIGA IGWTGANGQI EWACGGSLIW SNFILTVAHC AENDNNVQPD
VVRFGDLDLY SADDDTYAQQ LRIVKIIRHP LHRYSARYYD IALMQLERNV TIHETVAPAC
LWFDEEVRFT DLESAGWGRT GFGENKTPIL LKVNLKPLTN QKCTDHYNPT STRGLRNGLD
QHHICAGDAK MDTCPGDSGG PLQVRLQHNY RVTPFLVGLT SFGLPCGQSH PGVYTRVAHF
RSWIVETLQR NGASEVSDEL FEPQSCALRY VRLRQLAISR VVANESGVFE TFDISRQYIT
EEIMQKVVQL SWPGTTVPAR NNCMGAIIDH DTIVTLADCA TNAGIAPSEV VYLFRKDFDK
YEERRYDIKE IHVHPNHTDR SYYNNIALLK IDGSIDEMPA CIWNSQPLHD NQMELFANGR
ADLNKYQYAD KVILAKQYLD RLDQGLLDQH LCFGNDPFLV PQVCELAGGG SLQRSVSRLH
RFFKYVQGLS LFGRDCGFGE PAVAVRLQAH LGWLESVLLP NSPQLLQAKS TNDSLLYIDP
DLELFDRCDF PDASVGLCVP VERCRGVRNR LQRKEGLIFC SNGTIVCYCH MRYWKEGAQG
LVAPAYGQPA LLREFAHIAA IGWTREDGSV DWACGGSLIW ENYILTAAHC AANDDDVAPD
MARMGDLNIY SDEDDDYAQE RKIVKIIRHD QHRFSARYYD VALLKLDQQI TVHETVAPAC
LWLDDEVRFP KLFSAGWGRT GFGEGKTNVL LKVDLTPISN ENCSKFYKVG DRGLRAGLHA
HHLCAGDEKM DTCPGDSGGP LHVKLLHNAK MSPFLVGVTS FGKPCGQANP GVYARVSSFA
DWIIETLQKE GETEATPVKF QPWACALRYV HVREYEDDVV VSRSKNFETY NSDNAHLVRG
DSRHRVTLQW PNSLQPGRDN CSGTMFERDA VVTLADCATH MGATPSRVIL SNGKYLEVTE
TIVHPNYTQA GGRYYNNIAV LKLVGAASFI PACVWYNDTI PDPQFEVLGR DPSNVPMSPR
VTLRSNTDCQ LARQYREQLT RGLQNEHVCF QNKPFLVPQT CNQMFGAPIE REMWRFGRYF
NYVYGFNLFG RDCGFGEPAV AVRLNAHRPW LESVMLPNVQ RDTTNKDAVI FINPDLELSD
RCSYAGGVSG VCVEQERCPG IRQRMNSNLP VTLCSSGSIV CCPQADIKKP LTPLEKEFNE
CEERYRHLRK QRQQRWDGLQ PLSKRLSHVV ELAWENGPEL SFRCFGYLIS TKGIVASASC
LVAQEVIPAV ARLGGLYSHS RPDIAIIPVE SAVIHPEFNQ TTFMNNIALV KLTMAVQPTA
TKFPGCVWQN VTHTPVELEL HQTAQSVPIH PMYRRDCDAR YVRPFTDPRQ ICMVPGVTGP
GEHCYASGSP IVYKKYEEKN LFTEYLVNIY SHGQCNSTTL RIVQRVSMYI DWFKEELEET
//