ID A0A182FLU3_ANOAL Unreviewed; 1094 AA.
AC A0A182FLU3;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS Anopheles albimanus (New world malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7167 {ECO:0000313|EnsemblMetazoa:AALB007502-PA, ECO:0000313|Proteomes:UP000069272};
RN [1] {ECO:0000313|EnsemblMetazoa:AALB007502-PA}
RP IDENTIFICATION.
RC STRAIN=STECLA/ALBI9_A {ECO:0000313|EnsemblMetazoa:AALB007502-PA};
RG EnsemblMetazoa;
RL Submitted (AUG-2022) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RED family. {ECO:0000256|ARBA:ARBA00006660}.
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182FLU3; -.
DR STRING; 7167.A0A182FLU3; -.
DR EnsemblMetazoa; AALB007502-RA; AALB007502-PA; AALB007502.
DR VEuPathDB; VectorBase:AALB007502; -.
DR VEuPathDB; VectorBase:AALB20_027820; -.
DR VEuPathDB; VectorBase:AALB20_031475; -.
DR VEuPathDB; VectorBase:AALB20_037778; -.
DR Proteomes; UP000069272; Chromosome 3R.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00190; Tryp_SPc; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR039896; Red-like.
DR InterPro; IPR012492; RED_C.
DR InterPro; IPR012916; RED_N.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR12765:SF5; PROTEIN RED; 1.
DR PANTHER; PTHR12765; RED PROTEIN IK FACTOR CYTOKINE IK; 1.
DR Pfam; PF07807; RED_C; 1.
DR Pfam; PF07808; RED_N; 1.
DR Pfam; PF00089; Trypsin; 2.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR PROSITE; PS50240; TRYPSIN_DOM; 2.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 45..279
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 324..554
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 1094 AA; 120413 MW; 7E7CFF995C6E3199 CRC64;
MAPSAAGFIS RCLRQTRDIF PLPRHRLTMA MKMGSSRKPS QLAKIVNGQT AQEGQFPWQV
SIRAALGRSV TVCGGSLIDA QWVLTAAHCA HDYNVFQIGL GSIHLNMARL TMLAVVKHVH
PEFDPSKLTN DVALIRLPSG VPYSLEIYPV QLPLSIAADD TFVGRRVIVS GFGRTSDAIQ
SISTELKYES LRIISNAQCA IVYGSSIIRN TTLCAVGWDR SNQNVCQGDS GGPMVIQQAN
GWVQIGIVSF VSSRGCSTGD PSGYIRTVNY LDWIARTTAL RSLASVLIQV DPAATMQLSA
GLVLLIAVVC ATASENGQNR NPRIVNGINA QPTPYNAYVL YLNSANAGFF GGGSLISDRH
VLTAAQNIQG FSRWEIGLGS TVFGQLIKQV STQAISHPSF NSPNRANDIG IVVLPSPVIF
SAAIAPIALP PLPTLNRPMP MENEEGMVVG FGFITANDQA PSSFLKAAYQ RVIGDNRCSG
TYQIQLPNHF CAEDTAQRGN VCNGDLGAGF TILDRRVETL AGIASLITAS CDSLTPTGYT
RVSVYRQWIR DTTGPKFNRR SRMHSDEPET PATQRLTNDD FRKLLMTPRV PAGAGHTVGS
IRDAMSSKSS ASAVPQSGSS VSASSSTIKT PSSERNEARR KKKNFYAQLK KQEDNKLAEL
AEKYRDRARE RRDGANPDYQ NLDSSSTTSA YRAVAPDAKS GLDAAERRRQ QIQESKFLGG
DMEHTHLVKG LDYALLQKVR SEIVAKEQEQ EEEMEKLVDI EATAALPSSG QRPATVSLLT
EKDLVTSEEI EFHTVLGSNI FRFLEKQRSR VIERNEMFAP GRMAYHIELE DENVDTDIPT
TVIRSKAEVP LDVCDTQTLT INDIVINKLA QILSYLRQGG RGKKNKRRDK DKPLFKIPGE
SGGKEVDESI YGDIGDYRPS RRYEDVATSS KLEGKHNYFG KGDSENTVDH EASISAIPPP
PKLSTQLITK LTAEPEGYAE CYPGLQEMND AIDDSDDEVD YTKMDLGNKK GPIGRWDFDT
QEEYSEYMSS KEALPKAAFQ YGVKMQDGRK TRKHKTEKNE KAELDREWQQ IQNIIQKRKA
KTGNDDAEIK KSKY
//