ID B4HRM2_DROSE Unreviewed; 529 AA.
AC B4HRM2;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=GM21018 {ECO:0000313|EMBL:EDW46904.1};
GN Name=Dsec\GM21018 {ECO:0000313|EMBL:EDW46904.1};
GN ORFNames=Dsec_GM21018 {ECO:0000313|EMBL:EDW46904.1};
OS Drosophila sechellia (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292};
RN [1] {ECO:0000313|EMBL:EDW46904.1, ECO:0000313|Proteomes:UP000001292}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Rob3c / Tucson 14021-0248.25
RC {ECO:0000313|Proteomes:UP000001292};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH480816; EDW46904.1; -; Genomic_DNA.
DR RefSeq; XP_002032891.1; XM_002032855.1.
DR AlphaFoldDB; B4HRM2; -.
DR STRING; 7238.B4HRM2; -.
DR MEROPS; S01.B63; -.
DR EnsemblMetazoa; FBtr0204003; FBpp0202495; FBgn0175899.
DR GeneID; 6608145; -.
DR KEGG; dse:6608145; -.
DR HOGENOM; CLU_006842_2_0_1; -.
DR OMA; YEQCNHQ; -.
DR OrthoDB; 2877984at2759; -.
DR PhylomeDB; B4HRM2; -.
DR Proteomes; UP000001292; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24258:SF134; AGAP011908-PA; 1.
DR PANTHER; PTHR24258; SERINE PROTEASE-RELATED; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000001292};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..529
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002805941"
FT DOMAIN 23..142
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 275..520
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 142..179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 162..179
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 529 AA; 58032 MW; D8E48BE7B9B7E19B CRC64;
MWTWLLFLPL LKASSTNAIF EQCNHQVKLQ SGQKLLINSP YYPALYPVGT SCRYAVEAPK
DHVLQFKCEL QLRTLSTDLK CRTEVFHFNS EGDEVLTSSE YFCGSGKFER KSFLNRAVIS
YISSGHSEPP SAVKLKDKLH AVPTTSTTTE APQAVSIDEQ DEEEKREAEE EPEEADEDLS
DEVLHVLQDV GVSDALAYVA SLLYDVEESR KSRTTIYLDK LPIRSSSKTS AIHARIVSSY
PPAAPSVGHG GGRFSCLVEA FQPTCSCGWS RIPRIASPTN EEAVLHEFPP MTGVLTKKHG
KVFCGAAIIH HHYLLSAAHC FQGPETNSAA KLRVVVGEHD LANSFETFAT RRYDLDALIL
HEDFSQASGQ PKNDIAMLKT RTAIVWSQHV GPACLPLQPA EDGRKLPLAG QQVVAAGWGT
TSYGGPQTHR LLKATLDVID GRRCRQALSS AGGVPPHTFC TYTPGRDTCQ YDSGGALYER
INGRLMAVGI VSFGQACAAQ QPSVNTRVAS FIKWIRTKSP EVTYCPGRT
//