ID A0A084VUB4_ANOSI Unreviewed; 952 AA.
AC A0A084VUB4;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN ORFNames=ZHAS_00009158 {ECO:0000313|EMBL:KFB41558.1};
OS Anopheles sinensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB41558.1};
RN [1] {ECO:0000313|EMBL:KFB41558.1, ECO:0000313|Proteomes:UP000030765}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT of mosquito competence for malaria parasites.";
RL BMC Genomics 15:42-42(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASIC009158-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATLV01016713; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KE525103; KFB41558.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A084VUB4; -.
DR STRING; 74873.A0A084VUB4; -.
DR EnsemblMetazoa; ASIC009158-RA; ASIC009158-PA; ASIC009158.
DR VEuPathDB; VectorBase:ASIC009158; -.
DR VEuPathDB; VectorBase:ASIS002157; -.
DR VEuPathDB; VectorBase:ASIS012084; -.
DR VEuPathDB; VectorBase:ASIS021307; -.
DR OMA; GTCRIEH; -.
DR Proteomes; UP000030765; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 4.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24260; -; 1.
DR PANTHER; PTHR24260:SF136; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00089; Trypsin; 3.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 3.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 3.
DR PROSITE; PS50240; TRYPSIN_DOM; 3.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..952
FT /note="Peptidase S1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001784098"
FT DOMAIN 75..320
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 385..597
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 721..951
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 952 AA; 105940 MW; 0BF7036BDC7C9643 CRC64;
MAGIYKSLRK PESNRRLLLA ALFGFLHVAH CRIQPAERIF PTDINKAFMP TVRQTLNDCH
LRYHKYGDYY LVYPIYGVPV RQGEFAHMAA IGWPLANGSI SFDCGGSLIT PRHVLTAAHC
AVNDDGNPPT IVRLGVIDIT AGLYDPENQF AQEFGIESFK RHPEHEFRAE YHDIALVTMD
RAATLTSAVL PGCLWTGSRI PFRRMEAAGF GLTRFAGERT PILLKVELSP VSNAACNQFY
PAARRRRQGL IEQQMCASDP RMDTCYGDSG GPLQIKLMAN NRLVPFIVGV TSFGRFCGTG
TPAVYTRVSS YISWIQTETG NSYDSRACSA RHLNQREIET AMIANRIGDK LFVEPEKSYM
DLETVAKHRV YLGYSTASGR IQWNCGGVLI NENYVLTVAH CDRFVLNKTP DYVKVGDLDI
FQDHPQAQVI KIERFIKHPN YRNDAGVEND IALVKLQTNV KIQPNALPAC IMNTESVELP
FYEMAGLGPY NMNNFVMEEA SSSTNNTLVL TRMRADSSSC ELQSSNKVMC TRNNQSLVPG
TCRIEHGGPL EREIWHHDRY FSYVFGLTVA GDDCGFGSEA YYVKIASHID WIEGIVLGDR
NQRTQSRSRR QVWFPQSEED DDGVQQQQSC ALPDARRGVC TPYSSCSRQL MGPVVSICRH
GTEPIVCCPQ TPNVLTRPTR LSGSVAQTTR PIRQGGYTLN SCVSYWRQYR RVPTEEYEYI
PSEGRPAGAD EYPHIVAIGN GQQNLWPCTG VLVSDLYVIS AASCLLSIRG TRSVRLGQNS
ATVYGVSEIL NHPSYGGRSG DPYDLVLTRL DRRVQFSSTI IPACLWVKSE MVPLKLYALG
SSSTQQGQIF VYPRSAMYNA DCRSLPNVRG TIRDAENVCV ENYYATDTVC PDVAGNPVEG
AIEHNGTRIP LVVGLASYTV GCPVTSSSVT QTVTVLSRLA EHLTWIKEVV ER
//