ID K7FHI4_PELSI Unreviewed; 686 AA.
AC K7FHI4;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 77.
DE SubName: Full=Mannan binding lectin serine peptidase 2 {ECO:0000313|Ensembl:ENSPSIP00000007494.1};
GN Name=MASP2 {ECO:0000313|Ensembl:ENSPSIP00000007494.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000007494.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000007494.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC and asparagine is (R) stereospecific within EGF domains.
CC {ECO:0000256|PIRSR:PIRSR001155-3}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01113095; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01113096; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006127114.1; XM_006127052.2.
DR AlphaFoldDB; K7FHI4; -.
DR STRING; 13735.ENSPSIP00000007494; -.
DR MEROPS; S01.229; -.
DR Ensembl; ENSPSIT00000007535.1; ENSPSIP00000007494.1; ENSPSIG00000006750.1.
DR GeneID; 102453425; -.
DR KEGG; pss:102453425; -.
DR CTD; 10747; -.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00950000183084; -.
DR HOGENOM; CLU_006842_14_1_1; -.
DR OMA; HPEAQCP; -.
DR OrthoDB; 5394076at2759; -.
DR TreeFam; TF330373; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048306; F:calcium-dependent protein binding; IEA:Ensembl.
DR GO; GO:0001855; F:complement component C4b binding; IEA:Ensembl.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:Ensembl.
DR GO; GO:0001867; P:complement activation, lectin pathway; IEA:Ensembl.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00033; CCP; 2.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR PANTHER; PTHR24255:SF10; MANNAN-BINDING LECTIN SERINE PROTEASE 2; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF07645; EGF_CA; 1.
DR Pfam; PF00084; Sushi; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001155; C1r_C1s_MASP; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00032; CCP; 2.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00181; EGF; 1.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50923; SUSHI; 2.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|PIRSR:PIRSR001155-4};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR001155-2};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Signal {ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..686
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003901820"
FT DOMAIN 14..134
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 177..293
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 295..360
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 361..430
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 443..683
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT ACT_SITE 483
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 531
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 633
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT BINDING 64
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 72
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 117
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 119
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 135
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 136
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 138
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 155
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 156
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 159
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 231
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 241
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 278
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 280
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT MOD_RES 155
FT /note="(3R)-3-hydroxyasparagine"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT DISULFID 69..87
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 139..153
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 149..162
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 164..177
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 181..208
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 238..256
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 297..345
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 325..358
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 363..410
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 393..428
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 432..551
FT /note="Interchain (between heavy and light chains)"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 597..618
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 629..659
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ SEQUENCE 686 AA; 76720 MW; 6919E324C17F5E72 CRC64;
MRLYLLLVAL WHGMGGSIQL QKMYGRITSP DFPNMYPNNK ERTWNISVPT GYTIRIYFTH
FNMELSYQCD YDYVKISSGG KLVATLCGRE STDTEEAPGN TTYHSIDNTL TVTFRSDYSN
ENQFTGFEAF YAAEDVDECK QLIDGEPLCD HHCHNYLGGF YCSCSIGYVL HEDKRTCSAQ
CQNSAFTKRS GEITSPNYPK PYPKLSTCSY SIRVEDGFMI ILEFVETFNV ESHTEATCPY
DTLKIKTPKK EYGPFCGQNV PPKIETGSNA VDITFSTDIS GDHTGWKIKY TTTALQCPGP
KAPPHGHINP VQAKYILKDY YTLSCDTGYV VLENDVVLKS FKAECQKDGS WNKPMAKCII
VNCGPPENID NGMVTYITGP DVTTYKAEIQ YKCESTFYTM KASNNGRYQC GAEGFWQNSK
GEKALPVCEP VCGIQTRTSL ERIFGGKMAK PGQFPWQVLL IAEDAGAGGG ALLYDKWVLT
AAHVVARQRD PSTLIIKMGL LNKNSVHYHQ AWAEEIFVHE DYNDGISFNN DIALIKLKYK
VPINVNITPI CLPRKEARFH VKTDDMGTVA GWGRTDKRRL SPYLLYVELV VIDHKKCKEA
YAKMPHGKSQ LVTENMMCAG VEEGGKDSCQ GDSGGPLTFL DSQTKKWFVG GIVSWGLGCA
VAGQYGVYTQ VINYISWIEN TILKNS
//