ID A0A182QM54_9DIPT Unreviewed; 910 AA.
AC A0A182QM54;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS Anopheles farauti.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=69004 {ECO:0000313|EnsemblMetazoa:AFAF012961-PA, ECO:0000313|Proteomes:UP000075886};
RN [1] {ECO:0000313|Proteomes:UP000075886}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FAR1 {ECO:0000313|Proteomes:UP000075886};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles farauti FAR1 (V2).";
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AFAF012961-PA}
RP IDENTIFICATION.
RC STRAIN=FAR1 {ECO:0000313|EnsemblMetazoa:AFAF012961-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AXCN02002049; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A182QM54; -.
DR STRING; 69004.A0A182QM54; -.
DR EnsemblMetazoa; AFAF012961-RA; AFAF012961-PA; AFAF012961.
DR VEuPathDB; VectorBase:AFAF012961; -.
DR OrthoDB; 3431105at2759; -.
DR Proteomes; UP000075886; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 4.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24260; -; 1.
DR PANTHER; PTHR24260:SF131; CLIP DOMAIN-CONTAINING SERINE PROTEASE-RELATED; 1.
DR Pfam; PF00089; Trypsin; 3.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 3.
DR PROSITE; PS50240; TRYPSIN_DOM; 3.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..910
FT /note="Peptidase S1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008133005"
FT DOMAIN 64..285
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 333..588
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 710..910
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 910 AA; 102196 MW; 49C2B30CC9B7293C CRC64;
MRCRVMLLGA LAALMTALLP CGWAKLFPTN PKLPEDNEDM MMPFDKISLD DCHMRNWKDG
YLGLVAPAYG NPAYLREFAH IAAIGWSRPD GTVDWACGGS LIWENFILTA AHCAANDDDI
APDVARMGDL NIYSDEDDEF PQQLRIVKII RHKQHRFSAK YYDIRFPKLY AAGWGRTGFA
DDKTRILLKV DLTPMSNAEC GRFYTTAERG LRNGLHAHHL CAGDERMDTC PGDSGGPLHV
KLLHNAKMTP FLVGVTSFGK PCGQANPGVY ARVSSFVGWI IETLQQEGEL ATAEKFAPWS
CALRYVHVRE YEDDVVVSRA NNFETYNSDN AHLVTGDSIH RVALEWPDTL LPVRENCSGT
LIERDVVATL AECASHMGSN PVRVRFTNGK FVNVSETIVH PRYDPSVGRY YNNIAIMKLA
YRVLSVPACV WYKDTLPEPE FEVLGHGRAD LSPYNRDEVV TGLGMRPHEP ASPTNWCLTT
SVLADPRIIS ISPRATYNAS CQLSDQFRSR LGRGLQREHI CFQNKPFLVP ATCEQHFGGP
IEREMWRFTK YFNYVYGMNL FGRDCGFGEP AVAVSFNAHR AWLESVLLPE KTAMSARDPV
IFINPDLELN DRCSYGGGVD GVCVGHASCP NIKSRMANKQ PVTLCSKGSV VCCPRQDIKG
PSSAIEKELD ECEQRYRHLR QQRQARWDGF QPLNRRLSHV AEVGWEDGSQ ISFRCLGYLI
STRAVVAAAS CLLNSEYEPS IVRVGALWSN QAPTDIAFLT IGSLVFHPGF NDTTYDNNIG
LLMLTAPLQP MVTAFPGCLW QNTTHNPVET EVFSSGRFDP IHPVYQRECN ERFSNRFSSP
AITCMVPGVD GPDEFCYPQG APIVYRKHHE KNLFTEYLVN LYSHGRCNST NLRVVTRMAM
YIEWFKEVLK
//