GenomeNet

Database: UniProt
Entry: K7FHI4_PELSI
LinkDB: K7FHI4_PELSI
Original site: K7FHI4_PELSI 
ID   K7FHI4_PELSI            Unreviewed;       686 AA.
AC   K7FHI4;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 77.
DE   SubName: Full=Mannan binding lectin serine peptidase 2 {ECO:0000313|Ensembl:ENSPSIP00000007494.1};
GN   Name=MASP2 {ECO:0000313|Ensembl:ENSPSIP00000007494.1};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC   Trionychidae; Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000007494.1, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX   PubMed=23624526; DOI=10.1038/ng.2615;
RA   Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA   White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA   Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA   Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA   Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT   "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT   into the development and evolution of the turtle-specific body plan.";
RL   Nat. Genet. 45:701-706(2013).
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000007494.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC       and asparagine is (R) stereospecific within EGF domains.
CC       {ECO:0000256|PIRSR:PIRSR001155-3}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01113095; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01113096; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_006127114.1; XM_006127052.2.
DR   AlphaFoldDB; K7FHI4; -.
DR   STRING; 13735.ENSPSIP00000007494; -.
DR   MEROPS; S01.229; -.
DR   Ensembl; ENSPSIT00000007535.1; ENSPSIP00000007494.1; ENSPSIG00000006750.1.
DR   GeneID; 102453425; -.
DR   KEGG; pss:102453425; -.
DR   CTD; 10747; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   GeneTree; ENSGT00950000183084; -.
DR   HOGENOM; CLU_006842_14_1_1; -.
DR   OMA; HPEAQCP; -.
DR   OrthoDB; 5394076at2759; -.
DR   TreeFam; TF330373; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0048306; F:calcium-dependent protein binding; IEA:Ensembl.
DR   GO; GO:0001855; F:complement component C4b binding; IEA:Ensembl.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:Ensembl.
DR   GO; GO:0001867; P:complement activation, lectin pathway; IEA:Ensembl.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00033; CCP; 2.
DR   CDD; cd00041; CUB; 2.
DR   CDD; cd00054; EGF_CA; 1.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR   Gene3D; 2.10.25.10; Laminin; 1.
DR   Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR000859; CUB_dom.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR035914; Sperma_CUB_dom_sf.
DR   InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR   InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR   PANTHER; PTHR24255:SF10; MANNAN-BINDING LECTIN SERINE PROTEASE 2; 1.
DR   Pfam; PF00431; CUB; 2.
DR   Pfam; PF07645; EGF_CA; 1.
DR   Pfam; PF00084; Sushi; 2.
DR   Pfam; PF00089; Trypsin; 1.
DR   PIRSF; PIRSF001155; C1r_C1s_MASP; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00032; CCP; 2.
DR   SMART; SM00042; CUB; 2.
DR   SMART; SM00181; EGF; 1.
DR   SMART; SM00179; EGF_CA; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF57535; Complement control module/SCR domain; 2.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS01180; CUB; 2.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS01187; EGF_CA; 1.
DR   PROSITE; PS50923; SUSHI; 2.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Calcium {ECO:0000256|PIRSR:PIRSR001155-4};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW   ECO:0000256|PIRSR:PIRSR001155-2};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW   ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW   Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW   Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW   ProRule:PRU00302}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..686
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003901820"
FT   DOMAIN          14..134
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          177..293
FT                   /note="CUB"
FT                   /evidence="ECO:0000259|PROSITE:PS01180"
FT   DOMAIN          295..360
FT                   /note="Sushi"
FT                   /evidence="ECO:0000259|PROSITE:PS50923"
FT   DOMAIN          361..430
FT                   /note="Sushi"
FT                   /evidence="ECO:0000259|PROSITE:PS50923"
FT   DOMAIN          443..683
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   ACT_SITE        483
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT   ACT_SITE        531
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT   ACT_SITE        633
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT   BINDING         64
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="1"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         72
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="1"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         117
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="1"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         119
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="1"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         135
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         136
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         138
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         155
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         156
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         159
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="2"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         231
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="3"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         241
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="3"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         278
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="3"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   BINDING         280
FT                   /ligand="Ca(2+)"
FT                   /ligand_id="ChEBI:CHEBI:29108"
FT                   /ligand_label="3"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT   MOD_RES         155
FT                   /note="(3R)-3-hydroxyasparagine"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT   DISULFID        69..87
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        139..153
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        149..162
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        164..177
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        181..208
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        238..256
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        297..345
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        325..358
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        363..410
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        393..428
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        432..551
FT                   /note="Interchain (between heavy and light chains)"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        597..618
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT   DISULFID        629..659
FT                   /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ   SEQUENCE   686 AA;  76720 MW;  6919E324C17F5E72 CRC64;
     MRLYLLLVAL WHGMGGSIQL QKMYGRITSP DFPNMYPNNK ERTWNISVPT GYTIRIYFTH
     FNMELSYQCD YDYVKISSGG KLVATLCGRE STDTEEAPGN TTYHSIDNTL TVTFRSDYSN
     ENQFTGFEAF YAAEDVDECK QLIDGEPLCD HHCHNYLGGF YCSCSIGYVL HEDKRTCSAQ
     CQNSAFTKRS GEITSPNYPK PYPKLSTCSY SIRVEDGFMI ILEFVETFNV ESHTEATCPY
     DTLKIKTPKK EYGPFCGQNV PPKIETGSNA VDITFSTDIS GDHTGWKIKY TTTALQCPGP
     KAPPHGHINP VQAKYILKDY YTLSCDTGYV VLENDVVLKS FKAECQKDGS WNKPMAKCII
     VNCGPPENID NGMVTYITGP DVTTYKAEIQ YKCESTFYTM KASNNGRYQC GAEGFWQNSK
     GEKALPVCEP VCGIQTRTSL ERIFGGKMAK PGQFPWQVLL IAEDAGAGGG ALLYDKWVLT
     AAHVVARQRD PSTLIIKMGL LNKNSVHYHQ AWAEEIFVHE DYNDGISFNN DIALIKLKYK
     VPINVNITPI CLPRKEARFH VKTDDMGTVA GWGRTDKRRL SPYLLYVELV VIDHKKCKEA
     YAKMPHGKSQ LVTENMMCAG VEEGGKDSCQ GDSGGPLTFL DSQTKKWFVG GIVSWGLGCA
     VAGQYGVYTQ VINYISWIEN TILKNS
//
DBGET integrated database retrieval system