ID G7NYN6_MACFA Unreviewed; 748 AA.
AC G7NYN6;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 64.
DE RecName: Full=Mannan-binding lectin serine protease 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=EGM_11193 {ECO:0000313|EMBL:EHH51755.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1] {ECO:0000313|EMBL:EHH51755.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH51755.1};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC and asparagine is (R) stereospecific within EGF domains.
CC {ECO:0000256|PIRSR:PIRSR001155-3}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001277; EHH51755.1; -; Genomic_DNA.
DR AlphaFoldDB; G7NYN6; -.
DR MEROPS; S01.132; -.
DR Proteomes; UP000009130; Chromosome 2.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006956; P:complement activation; IEA:InterPro.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00033; CCP; 2.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR PANTHER; PTHR24255:SF13; MANNAN-BINDING LECTIN SERINE PROTEASE 1; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF00084; Sushi; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001155; C1r_C1s_MASP; 2.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00032; CCP; 2.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50923; SUSHI; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|PIRSR:PIRSR001155-4};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR001155-2};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT DOMAIN 4..127
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 174..286
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 288..353
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 414..680
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT ACT_SITE 461
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 517
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 628
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT BINDING 57
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 65
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 110
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 112
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 128
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 129
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 131
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 148
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 149
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 152
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 224
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 234
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 271
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 273
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT MOD_RES 148
FT /note="(3R)-3-hydroxyasparagine"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT DISULFID 62..80
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 132..146
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 142..155
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 157..170
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 174..201
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2,
FT ECO:0000256|PROSITE-ProRule:PRU00059"
FT DISULFID 231..249
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 290..338
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 318..351
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 594..613
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 624..656
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ SEQUENCE 748 AA; 83583 MW; 4E496631B5C62D8D CRC64;
MVMCSASAHT VELNDMFGQI QSPGYPDSYP SDSEVTWNIT VPDGFRIKLY FMHFNLESSY
LCEYDYVKVE TEDQVLATFC GRETTDTEQT PGQEVVLSPG SFMSITFRSD FSNEERFTGF
DAHYMAVDVD ECKEREDEEL SCDHYCHNYI GGYYCSCRFG YILHTDNRTC RVECSDNLFT
QRTGVITSPD FPNPYPKSSE CLYTIELEEG FMVNLQFEDI FDIEDHPEVP CPYDYIKIKV
GPKVLGPFCG EKAPEPINTQ SHSVLILFHS DNSGENRGWR LSYRAAGNEC PELQPPVHGK
IEPSQAKYSF KDQVLISCDT GYKVLKDNVE MDTFQIECLK DGTWSNKIPT CKIVDCRAPG
ELEHGLVTFS TRNNLTTYKS EIRYSCQEPY YKMLNNITEC GQPSRSLPSL VKRIIGGRNA
EPGLFPWQAL IVVEDTSRVP NDKWFGSGAL LSESWILTAA HVLRSQRRDT TVIPVSKEHV
TVYLGLHDVR DKSGAVNSSA ARVVLHPDFN IQNYNHDIAL VQLQEPVPLG PHVMPVCLPR
LEPEGPSPHM LGLVAGWGIS NPNVTVDEII SSGTRTLSDV LQYVKLPVVP HAECKTSYES
RSGNYSVTEN MFCAGYYEGG KDTCLGDSGG AFVILDDLSQ RWVVQGLVSW GGPEECGSKQ
VYGVYTKVSN YVDWVWEQMG SPQGRAPGGT GISSPNVTLD EISSSGMQTL SDVLQYVKLH
VVPHAECRTS YESCSGNYSV TENMFCAS
//