ID B4L078_DROMO Unreviewed; 776 AA.
AC B4L078;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN Name=Dmoj\GI12326 {ECO:0000313|EMBL:EDW18024.1};
GN ORFNames=Dmoj_GI12326 {ECO:0000313|EMBL:EDW18024.1};
OS Drosophila mojavensis (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila.
OX NCBI_TaxID=7230 {ECO:0000313|EMBL:EDW18024.1, ECO:0000313|Proteomes:UP000009192};
RN [1] {ECO:0000313|EMBL:EDW18024.1, ECO:0000313|Proteomes:UP000009192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TSC#15081-1352.22 {ECO:0000313|EMBL:EDW18024.1}, and Tucson
RC 15081-1352.22 {ECO:0000313|Proteomes:UP000009192};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
RN [2] {ECO:0000313|EMBL:EDW18024.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=TSC#15081-1352.22 {ECO:0000313|EMBL:EDW18024.1};
RX PubMed=18057021; DOI=10.1093/bioinformatics/btm542;
RA Zimin A.V., Smith D.R., Sutton G., Yorke J.A.;
RT "Assembly reconciliation.";
RL Bioinformatics 24:42-45(2008).
RN [3] {ECO:0000313|EMBL:EDW18024.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=TSC#15081-1352.22 {ECO:0000313|EMBL:EDW18024.1};
RG FlyBase;
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC Evidence={ECO:0000256|ARBA:ARBA00036320};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH933809; EDW18024.1; -; Genomic_DNA.
DR EMBL; CH933809; KRG05906.1; -; Genomic_DNA.
DR EMBL; CH933809; KRG05907.1; -; Genomic_DNA.
DR RefSeq; XP_002007548.1; XM_002007512.2.
DR RefSeq; XP_015017558.1; XM_015162072.1.
DR RefSeq; XP_015017559.1; XM_015162073.1.
DR AlphaFoldDB; B4L078; -.
DR MEROPS; S01.B77; -.
DR EnsemblMetazoa; FBtr0163051; FBpp0161543; FBgn0135083.
DR EnsemblMetazoa; FBtr0423845; FBpp0381748; FBgn0135083.
DR EnsemblMetazoa; FBtr0432861; FBpp0390023; FBgn0135083.
DR GeneID; 6581826; -.
DR KEGG; dmo:Dmoj_GI12326; -.
DR eggNOG; KOG3627; Eukaryota.
DR HOGENOM; CLU_022212_0_0_1; -.
DR InParanoid; B4L078; -.
DR OMA; AMRRMHT; -.
DR OrthoDB; 5404167at2759; -.
DR Proteomes; UP000009192; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR InterPro; IPR000884; TSP1_rpt.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SMART; SM00209; TSP1; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000009192};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..776
FT /note="trypsin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014298833"
FT DOMAIN 535..768
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 157..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..214
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 239..254
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..332
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 776 AA; 89932 MW; 2536D91A75BD6D71 CRC64;
MRLICSFIKL LALLQLLAPC QSGHVQEAKR HLRAELRDVL GHPSYQGSHW HSGHRVHRLR
MPHAHQRWTG RRRSINHHHP FSRWSQWTTC DEDCIRRRER FCKAKRKCGH TKHVEQHKCL
HCHPAVKWHS TSNVPQPNFP PHYPNPPPII INAAAAAAGG SSYQPHPTAT YAPPQPEPHP
QPQPPSYFEP HPQPQPHPHP HPPRHPPRQP LPDPYPTSDE ESVEVVHRKP PNWPSIESST
EEQEEEDEAQ PPFQDDEGDF FVLKRHRRRQ KPRHSSRHSS EQNSRQRNRQ KPRLDYIDHN
SRQRHRQRPR VDYTEDDSRQ RHRQKPRFDD IDDDFEDYLE GKSLNIGQRS LGSVENGGLE
GDIFQYEDFD FEPEAQYPDS EEFTDARNIS WLPPGKERNT PEGLMPRHRR VYSKWSRWTK
CSPKCTTRRY KKCRISEQCG REVLREIAYC YTENSFCQQW LQAQVQKMPA YESRPGPVTA
MRRMHTESGS GALSRNDVSV IMSGRGYRGP EYTPLKLQCG LVRTGHRNMY NLLKIIGGKA
ARKGEWPWQV AIFNRFKEAF CGGTLIAPLW VLTAAHCVRK VLYVRFGEHN LDYEDGTEVQ
LRVLKSYKHP SFDKRTVDSD VALLRLPRPV NATSWIGYSC LPRPYQPLPK NVDCTVIGWG
KRRNRDVVGT SVLHQADVPI IPMENCRSVY HDYTITKNMF CAGHRRGRID TCAGDSGGPL
LCRDSTKPNH PWTIFGITSF GDGCAKRNKF GIYAKVPNYV DWVWSVVNCN GNCKLH
//