ID G1N6P6_MELGA Unreviewed; 722 AA.
AC G1N6P6;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 79.
DE SubName: Full=Mannan binding lectin serine peptidase 1 {ECO:0000313|Ensembl:ENSMGAP00000007781.3};
GN Name=MASP1 {ECO:0000313|Ensembl:ENSMGAP00000007781.3};
OS Meleagris gallopavo (Wild turkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Meleagridinae; Meleagris.
OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000007781.3, ECO:0000313|Proteomes:UP000001645};
RN [1] {ECO:0000313|Ensembl:ENSMGAP00000007781.3, ECO:0000313|Proteomes:UP000001645}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20838655; DOI=10.1371/journal.pbio.1000475;
RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A.,
RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K.,
RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C.,
RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A.,
RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., de Jong P.,
RA Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., Lee M.K., Lee T.,
RA Mane S., Marcais G., Marz M., McElroy A.P., Modise T., Nefedov M.,
RA Notredame C., Paton I.R., Payne W.S., Pertea G., Prickett D., Puiu D.,
RA Qioa D., Raineri E., Ruffier M., Salzberg S.L., Schatz M.C., Scheuring C.,
RA Schmidt C.J., Schroeder S., Searle S.M., Smith E.J., Smith J.,
RA Sonstegard T.S., Stadler P.F., Tafer H., Tu Z.J., Van Tassell C.P.,
RA Vilella A.J., Williams K.P., Yorke J.A., Zhang L., Zhang H.B., Zhang X.,
RA Zhang Y., Reed K.M.;
RT "Multi-platform next-generation sequencing of the domestic turkey
RT (Meleagris gallopavo): genome assembly and analysis.";
RL PLoS Biol. 8:E1000475-E1000475(2010).
RN [2] {ECO:0000313|Ensembl:ENSMGAP00000007781.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC and asparagine is (R) stereospecific within EGF domains.
CC {ECO:0000256|PIRSR:PIRSR001155-3}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_010715063.1; XM_010716761.2.
DR AlphaFoldDB; G1N6P6; -.
DR MEROPS; S01.132; -.
DR Ensembl; ENSMGAT00000008563.3; ENSMGAP00000007781.3; ENSMGAG00000007637.3.
DR GeneID; 100551064; -.
DR KEGG; mgp:100551064; -.
DR CTD; 5648; -.
DR GeneTree; ENSGT00950000183084; -.
DR HOGENOM; CLU_006842_14_1_1; -.
DR InParanoid; G1N6P6; -.
DR TreeFam; TF330373; -.
DR Proteomes; UP000001645; Chromosome 11.
DR Bgee; ENSMGAG00000007637; Expressed in liver and 9 other cell types or tissues.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0005509; F:calcium ion binding; IEA:Ensembl.
DR GO; GO:0048306; F:calcium-dependent protein binding; IEA:Ensembl.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:Ensembl.
DR GO; GO:0001867; P:complement activation, lectin pathway; IEA:Ensembl.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00033; CCP; 2.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR PANTHER; PTHR24255:SF13; MANNAN-BINDING LECTIN SERINE PROTEASE 1; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF00084; Sushi; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001155; C1r_C1s_MASP; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00032; CCP; 2.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00181; EGF; 1.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50923; SUSHI; 2.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|PIRSR:PIRSR001155-4};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR001155-2};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000001645};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT DOMAIN 2..128
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 175..287
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 289..354
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 355..424
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 440..708
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT ACT_SITE 487
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 543
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 656
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT BINDING 58
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 66
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 111
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 113
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 129
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 130
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 132
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 149
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 150
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 153
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 225
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 235
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 272
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 274
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT MOD_RES 149
FT /note="(3R)-3-hydroxyasparagine"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT DISULFID 63..81
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 133..147
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 143..156
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 158..171
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 175..202
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2,
FT ECO:0000256|PROSITE-ProRule:PRU00059"
FT DISULFID 232..250
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 291..339
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 319..352
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 357..404
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 387..422
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 426..563
FT /note="Interchain (between heavy and light chains)"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 622..641
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 652..684
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ SEQUENCE 722 AA; 81240 MW; ADD535F9CADFA76F CRC64;
MCAWLLGVVG AVELTEMFGE IRSPNFPDSY PSDSEVTWNI SVPEGFKIKL YFMHFDLESS
YLCEYDYVKI EAEDQELATF CGRETTDTEQ APGQQVILSP GPYMGLTFRS DFSNEERFTG
FDAHYTAVDV DECLEKSDEE LACDHYCHNY IGGYYCSCRF GYILHSDNRT CKVECSDNLY
TQRSGVVTSA DFPSPYPKSS DCLYRIELED GFFITLSFED SFDVEDHPEV TCPYDYIKIK
AGQREFGPFC GEKSPGRIET QTNSVQILFH SDNSGENRGW KLSYTAIGNP CPLVQPPING
KIEPSQAKYT FKDQVVISCN TGYKVLKDNL ESDSFQIECL KDGTWSNKIP ICKIADCQAP
PELEHGFVTF LSRNNLTTYR AAIQYHCQHP YYHMAPNSTA TYTCDASGVW RSEELGTALP
SCRPVCGRPA RALPGIIKRI IGGRNAEPGF FPWQALIVVE DMSRVPNDKW FGSGALLSDS
WVLTAAHVLR SQRRDKTVIP VSKEHVTVYL ALHDVRNKLE AVNRTVERII LHEDFDIQNY
NHDIALVKLK EKVTMGNYVM PICLPQFEHE LEGPHPNTLG LVAGWGISNP NITVDEIISS
GMRTLSDILQ YVKLPVVLHA ECKTSYESRS GNYSVTENMF CAGYYEGGKD TCLGDSGGAF
VIQDPGTRRW VAQGLVSWGG PEECGSKQVY GVYTKVSNYV DWVEKNTGSS ERWTFLDPDL
ER
//