GenomeNet

Database: UniProt
Entry: A0A182QM54_9DIPT
LinkDB: A0A182QM54_9DIPT
Original site: A0A182QM54_9DIPT 
ID   A0A182QM54_9DIPT        Unreviewed;       910 AA.
AC   A0A182QM54;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 27.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS   Anopheles farauti.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=69004 {ECO:0000313|EnsemblMetazoa:AFAF012961-PA, ECO:0000313|Proteomes:UP000075886};
RN   [1] {ECO:0000313|Proteomes:UP000075886}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=FAR1 {ECO:0000313|Proteomes:UP000075886};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA   Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles farauti FAR1 (V2).";
RL   Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:AFAF012961-PA}
RP   IDENTIFICATION.
RC   STRAIN=FAR1 {ECO:0000313|EnsemblMetazoa:AFAF012961-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AXCN02002049; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A182QM54; -.
DR   STRING; 69004.A0A182QM54; -.
DR   EnsemblMetazoa; AFAF012961-RA; AFAF012961-PA; AFAF012961.
DR   VEuPathDB; VectorBase:AFAF012961; -.
DR   OrthoDB; 3431105at2759; -.
DR   Proteomes; UP000075886; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 4.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF131; CLIP DOMAIN-CONTAINING SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 3.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 3.
DR   PROSITE; PS50240; TRYPSIN_DOM; 3.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..910
FT                   /note="Peptidase S1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5008133005"
FT   DOMAIN          64..285
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          333..588
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          710..910
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   910 AA;  102196 MW;  49C2B30CC9B7293C CRC64;
     MRCRVMLLGA LAALMTALLP CGWAKLFPTN PKLPEDNEDM MMPFDKISLD DCHMRNWKDG
     YLGLVAPAYG NPAYLREFAH IAAIGWSRPD GTVDWACGGS LIWENFILTA AHCAANDDDI
     APDVARMGDL NIYSDEDDEF PQQLRIVKII RHKQHRFSAK YYDIRFPKLY AAGWGRTGFA
     DDKTRILLKV DLTPMSNAEC GRFYTTAERG LRNGLHAHHL CAGDERMDTC PGDSGGPLHV
     KLLHNAKMTP FLVGVTSFGK PCGQANPGVY ARVSSFVGWI IETLQQEGEL ATAEKFAPWS
     CALRYVHVRE YEDDVVVSRA NNFETYNSDN AHLVTGDSIH RVALEWPDTL LPVRENCSGT
     LIERDVVATL AECASHMGSN PVRVRFTNGK FVNVSETIVH PRYDPSVGRY YNNIAIMKLA
     YRVLSVPACV WYKDTLPEPE FEVLGHGRAD LSPYNRDEVV TGLGMRPHEP ASPTNWCLTT
     SVLADPRIIS ISPRATYNAS CQLSDQFRSR LGRGLQREHI CFQNKPFLVP ATCEQHFGGP
     IEREMWRFTK YFNYVYGMNL FGRDCGFGEP AVAVSFNAHR AWLESVLLPE KTAMSARDPV
     IFINPDLELN DRCSYGGGVD GVCVGHASCP NIKSRMANKQ PVTLCSKGSV VCCPRQDIKG
     PSSAIEKELD ECEQRYRHLR QQRQARWDGF QPLNRRLSHV AEVGWEDGSQ ISFRCLGYLI
     STRAVVAAAS CLLNSEYEPS IVRVGALWSN QAPTDIAFLT IGSLVFHPGF NDTTYDNNIG
     LLMLTAPLQP MVTAFPGCLW QNTTHNPVET EVFSSGRFDP IHPVYQRECN ERFSNRFSSP
     AITCMVPGVD GPDEFCYPQG APIVYRKHHE KNLFTEYLVN LYSHGRCNST NLRVVTRMAM
     YIEWFKEVLK
//
DBGET integrated database retrieval system