Database: UniProt
Entry: A0A084WAT8_ANOSI
ID   A0A084WAT8_ANOSI        Unreviewed;      1500 AA.
AC   A0A084WAT8;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   27-MAR-2024, entry version 36.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN   ORFNames=ZHAS_00015360 {ECO:0000313|EMBL:KFB47332.1};
OS   Anopheles sinensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB47332.1};
RN   [1] {ECO:0000313|EMBL:KFB47332.1, ECO:0000313|Proteomes:UP000030765}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA   Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA   Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA   Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT   "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT   of mosquito competence for malaria parasites.";
RL   BMC Genomics 15:42-42(2014).
RN   [2] {ECO:0000313|EnsemblMetazoa:ASIC015360-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ATLV01022266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KE525331; KFB47332.1; -; Genomic_DNA.
DR   STRING; 74873.A0A084WAT8; -.
DR   EnsemblMetazoa; ASIC015360-RA; ASIC015360-PA; ASIC015360.
DR   VEuPathDB; VectorBase:ASIC015360; -.
DR   VEuPathDB; VectorBase:ASIS009700; -.
DR   VEuPathDB; VectorBase:ASIS010287; -.
DR   OMA; LAECASH; -.
DR   Proteomes; UP000030765; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 2.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 5.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF131; CLIP DOMAIN-CONTAINING SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 5.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 5.
DR   PROSITE; PS50240; TRYPSIN_DOM; 5.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 2.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1500
FT                   /note="Peptidase S1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001784727"
FT   DOMAIN          61..309
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          341..579
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          661..908
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          957..1176
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          1290..1498
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   1500 AA;  168955 MW;  27314F44C39F7DB3 CRC64;
     MAAQYRAFLF VFLVVSNCQA DLDPVFTDID EELPANTMMP NERETIADCH LRFFWQGGRG
     IVGPAFAKPA FLTEFAHIGA IGWTGANGQI EWACGGSLIW SNFILTVAHC AENDNNVQPD
     VVRFGDLDLY SADDDTYAQQ LRIVKIIRHP LHRYSARYYD IALMQLERNV TIHETVAPAC
     LWFDEEVRFT DLESAGWGRT GFGENKTPIL LKVNLKPLTN QKCTDHYNPT STRGLRNGLD
     QHHICAGDAK MDTCPGDSGG PLQVRLQHNY RVTPFLVGLT SFGLPCGQSH PGVYTRVAHF
     RSWIVETLQR NGASEVSDEL FEPQSCALRY VRLRQLAISR VVANESGVFE TFDISRQYIT
     EEIMQKVVQL SWPGTTVPAR NNCMGAIIDH DTIVTLADCA TNAGIAPSEV VYLFRKDFDK
     YEERRYDIKE IHVHPNHTDR SYYNNIALLK IDGSIDEMPA CIWNSQPLHD NQMELFANGR
     ADLNKYQYAD KVILAKQYLD RLDQGLLDQH LCFGNDPFLV PQVCELAGGG SLQRSVSRLH
     RFFKYVQGLS LFGRDCGFGE PAVAVRLQAH LGWLESVLLP NSPQLLQAKS TNDSLLYIDP
     DLELFDRCDF PDASVGLCVP VERCRGVRNR LQRKEGLIFC SNGTIVCYCH MRYWKEGAQG
     LVAPAYGQPA LLREFAHIAA IGWTREDGSV DWACGGSLIW ENYILTAAHC AANDDDVAPD
     MARMGDLNIY SDEDDDYAQE RKIVKIIRHD QHRFSARYYD VALLKLDQQI TVHETVAPAC
     LWLDDEVRFP KLFSAGWGRT GFGEGKTNVL LKVDLTPISN ENCSKFYKVG DRGLRAGLHA
     HHLCAGDEKM DTCPGDSGGP LHVKLLHNAK MSPFLVGVTS FGKPCGQANP GVYARVSSFA
     DWIIETLQKE GETEATPVKF QPWACALRYV HVREYEDDVV VSRSKNFETY NSDNAHLVRG
     DSRHRVTLQW PNSLQPGRDN CSGTMFERDA VVTLADCATH MGATPSRVIL SNGKYLEVTE
     TIVHPNYTQA GGRYYNNIAV LKLVGAASFI PACVWYNDTI PDPQFEVLGR DPSNVPMSPR
     VTLRSNTDCQ LARQYREQLT RGLQNEHVCF QNKPFLVPQT CNQMFGAPIE REMWRFGRYF
     NYVYGFNLFG RDCGFGEPAV AVRLNAHRPW LESVMLPNVQ RDTTNKDAVI FINPDLELSD
     RCSYAGGVSG VCVEQERCPG IRQRMNSNLP VTLCSSGSIV CCPQADIKKP LTPLEKEFNE
     CEERYRHLRK QRQQRWDGLQ PLSKRLSHVV ELAWENGPEL SFRCFGYLIS TKGIVASASC
     LVAQEVIPAV ARLGGLYSHS RPDIAIIPVE SAVIHPEFNQ TTFMNNIALV KLTMAVQPTA
     TKFPGCVWQN VTHTPVELEL HQTAQSVPIH PMYRRDCDAR YVRPFTDPRQ ICMVPGVTGP
     GEHCYASGSP IVYKKYEEKN LFTEYLVNIY SHGQCNSTTL RIVQRVSMYI DWFKEELEET
//