ID W5MT02_LEPOC Unreviewed; 704 AA.
AC W5MT02;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Mannan-binding lectin serine peptidase 1 {ECO:0000313|Ensembl:ENSLOCP00000011511.1};
GN Name=MASP1 {ECO:0000313|Ensembl:ENSLOCP00000011511.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000011511.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000011511.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC and asparagine is (R) stereospecific within EGF domains.
CC {ECO:0000256|PIRSR:PIRSR001155-3}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01012908; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5MT02; -.
DR STRING; 7918.ENSLOCP00000011511; -.
DR Ensembl; ENSLOCT00000011528.1; ENSLOCP00000011511.1; ENSLOCG00000009432.1.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00950000183084; -.
DR HOGENOM; CLU_006842_14_1_1; -.
DR InParanoid; W5MT02; -.
DR OMA; QHWVAQG; -.
DR Proteomes; UP000018468; Linkage group LG14.
DR Bgee; ENSLOCG00000009432; Expressed in liver and 7 other cell types or tissues.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0006956; P:complement activation; IEA:InterPro.
DR GO; GO:1904888; P:cranial skeletal system development; IEA:Ensembl.
DR GO; GO:0007507; P:heart development; IEA:Ensembl.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR GO; GO:0001755; P:neural crest cell migration; IEA:Ensembl.
DR GO; GO:0031638; P:zymogen activation; IBA:GO_Central.
DR CDD; cd00033; CCP; 2.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR PANTHER; PTHR24255:SF13; MANNAN-BINDING LECTIN SERINE PROTEASE 1; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF00084; Sushi; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001155; C1r_C1s_MASP; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00032; CCP; 2.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00181; EGF; 1.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50923; SUSHI; 2.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PIRSR:PIRSR001155-4};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR001155-2};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000018468};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..704
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004866514"
FT DOMAIN 17..136
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 183..295
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 297..362
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 363..432
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 447..704
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT ACT_SITE 487
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 549
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 647
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT BINDING 66
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 74
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 119
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 121
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 137
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 138
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 140
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 157
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 158
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 161
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 233
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 243
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 280
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 282
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT MOD_RES 157
FT /note="(3R)-3-hydroxyasparagine"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT DISULFID 71..89
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 141..155
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 151..164
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 166..179
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 183..210
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2,
FT ECO:0000256|PROSITE-ProRule:PRU00059"
FT DISULFID 240..258
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 299..347
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 327..360
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 365..412
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 395..430
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 434..569
FT /note="Interchain (between heavy and light chains)"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 615..632
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 643..673
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ SEQUENCE 704 AA; 79280 MW; CCE7F0D2805498B5 CRC64;
MRILIGFFIL CLLVHGCRGI ELTGMYGIIQ SPNFPESYPK DSEVAWNISV PDGFKIKLYF
MHFDLEPSYL CEYDYVKIEA DQGLQATFCG REDTDTEQTP REQIIMSHRN KLSVTFRSDF
SNEERFTGFE AHYTAVDVDE CKDRNDEELV CDHYCHNYIG GYYCSCRFGY ILHSDNRTCR
VECSDNVFNE RTGIVTSSDF PNPYPKSSDC LYRIELEEGF FINLEFDDNF DIEDHPEVTC
PYDYLKIKAG KREFGPLCGD RSPGRIETGS NSVQILFHSD NSGENIGWRL SYAAVGNSCP
NPQPPANGKI EPVLVQYSFK DQILVTCDTG YNVLKDDIEI EHYQMECLKD GSWSSSVPSC
KIVDCKAPVE IEYGYVIYDS SKNLTTYGST IQYSCKEPRY QMYPKINGTY TCGEKGAWIN
KEVDTKLPSC MPVCGTPQFS RTKLQRISNG NYATRGVSPW IAMLSRNGLP FCGGSLIGDR
WIVTAAHCLH EPLDSEDLVL RSSHIVGHTS FHIVLGKHST RRKDNTEQLF EAKSIILHPA
YQAATFEFDV ALLELNKPAA LNNYVIPVCL PDASEHSVLT EGDMVIVSGW GKQFLRRLPD
SLMEIEIPIV DHTICESGYG PLGRKVTDEM ICAGEKEGGK DACQGDSGGP MVTLNPHNHR
WFLVGVVSWG DGCGTNDRYV PCCSQTVCRE EIQTRGSSSS SWFR
//