ID Q9I7V4_DROME Unreviewed; 364 AA.
AC Q9I7V4;
DT 01-MAR-2001, integrated into UniProtKB/TrEMBL.
DT 01-MAR-2001, sequence version 1.
DT 27-MAR-2024, entry version 194.
DE SubName: Full=MIP24941p {ECO:0000313|EMBL:ADM26844.1};
GN Name=SP22 {ECO:0000313|EMBL:AAG22193.1};
GN Synonyms=CG18735-RA {ECO:0000313|EMBL:ADM26844.1}, Dmel\CG18735
GN {ECO:0000313|EMBL:AAG22193.1};
GN ORFNames=CG18735 {ECO:0000313|EMBL:AAG22193.1,
GN ECO:0000313|FlyBase:FBgn0042098}, Dmel_CG18735
GN {ECO:0000313|EMBL:AAG22193.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RA Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S.,
RA Svirskas R., Rubin G.;
RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases.
RN [8] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [9] {ECO:0000313|EMBL:AAG22193.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
RN [10] {ECO:0000313|EMBL:ADM26844.1}
RP NUCLEOTIDE SEQUENCE.
RA Carlson J., Booth B., Frise E., Sandler J., Wan K., Yu C., Celniker S.;
RL Submitted (AUG-2010) to the EMBL/GenBank/DDBJ databases.
RN [11] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109357; DOI=10.1534/g3.115.018929;
RG FlyBase Consortium;
RA Matthews B.B., Dos Santos G., Crosby M.A., Emmert D.B., St Pierre S.E.,
RA Gramates L.S., Zhou P., Schroeder A.J., Falls K., Strelets V., Russo S.M.,
RA Gelbart W.M.;
RT "Gene Model Annotations for Drosophila melanogaster: Impact of High-
RT Throughput Data.";
RL G3 (Bethesda) 5:1721-1736(2015).
RN [12] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109356; DOI=10.1534/g3.115.018937;
RG FlyBase Consortium;
RA Crosby M.A., Gramates L.S., Dos Santos G., Matthews B.B., St Pierre S.E.,
RA Zhou P., Schroeder A.J., Falls K., Emmert D.B., Russo S.M., Gelbart W.M.;
RT "Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.";
RL G3 (Bethesda) 5:1737-1749(2015).
RN [13] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25589440;
RA Hoskins R.A., Carlson J.W., Wan K.H., Park S., Mendez I., Galle S.E.,
RA Booth B.W., Pfeiffer B.D., George R.A., Svirskas R., Krzywinski M.,
RA Schein J., Accardo M.C., Damia E., Messina G., Mendez-Lago M.,
RA de Pablos B., Demakova O.V., Andreyeva E.N., Boldyreva L.V., Marra M.,
RA Carvalho A.B., Dimitri P., Villasante A., Zhimulev I.F., Rubin G.M.,
RA Karpen G.H., Celniker S.E.;
RT "The Release 6 reference sequence of the Drosophila melanogaster genome.";
RL Genome Res. 25:445-458(2015).
RN [14] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RG FlyBase;
RL Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
RN [15] {ECO:0000313|EMBL:AAG22193.1}
RP NUCLEOTIDE SEQUENCE.
RG Berkeley Drosophila Genome Project;
RA Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R.,
RA Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., Yu C.,
RA Rubin G.;
RT "Drosophila melanogaster release 4 sequence.";
RL Submitted (MAY-2020) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE013599; AAG22193.1; -; Genomic_DNA.
DR EMBL; BT125606; ADM26844.1; -; mRNA.
DR RefSeq; NP_652645.1; NM_144388.2.
DR AlphaFoldDB; Q9I7V4; -.
DR SMR; Q9I7V4; -.
DR IntAct; Q9I7V4; 3.
DR STRING; 7227.FBpp0071658; -.
DR MEROPS; S01.B38; -.
DR PaxDb; 7227-FBpp0071658; -.
DR DNASU; 59137; -.
DR EnsemblMetazoa; FBtr0071744; FBpp0071658; FBgn0042098.
DR GeneID; 59137; -.
DR KEGG; dme:Dmel_CG18735; -.
DR UCSC; CG18735-RA; d. melanogaster.
DR AGR; FB:FBgn0042098; -.
DR FlyBase; FBgn0042098; CG18735.
DR VEuPathDB; VectorBase:FBgn0042098; -.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00940000167642; -.
DR HOGENOM; CLU_006842_0_0_1; -.
DR InParanoid; Q9I7V4; -.
DR OMA; RDSCACS; -.
DR OrthoDB; 3059419at2759; -.
DR BioGRID-ORCS; 59137; 0 hits in 1 CRISPR screen.
DR GenomeRNAi; 59137; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0042098; Expressed in pupa and 3 other cell types or tissues.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0017171; F:serine hydrolase activity; HDA:FlyBase.
DR GO; GO:0004252; F:serine-type endopeptidase activity; ISM:FlyBase.
DR GO; GO:0006508; P:proteolysis; ISM:FlyBase.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF7; HYALIN; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 2: Evidence at transcript level;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034, ECO:0000313|EMBL:AAG22193.1};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..364
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5015099776"
FT DOMAIN 83..316
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 322..364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..349
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 364 AA; 39668 MW; 185769E510B07493 CRC64;
MCNFHLLLIL ATALGDLACA TPSLRSASDP EKILNNLAQL RQSSFLDWIQ SILGPEVPAE
WSSPAKRECA ECSCGNINTR HRIVGGQETE VHEYPWMIML MWFGNFYCGA SLVNDQYALT
AAHCVNGFYH RLITVRLLEH NRQDSHVKIV DRRVSRVLIH PKYSTRNFDS DIALIRFNEP
VRLGIDMHPV CMPTPSENYA GQTAVVTGWG ALSEGGPISD TLQEVEVPIL SQEECRNSNY
GESKITDNMI CAGYVEQGGK DSCQGDSGGP MHVLGSGDAY QLAGIVSWGE GCAKPNAPGV
YTRVGSFNDW IAENTRDACS CAQPEAAGEP ASPMETTEQG DQENTTANGA AEADPEVEEA
NKLI
//