ID A0A151MCR6_ALLMI Unreviewed; 744 AA.
AC A0A151MCR6;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=complement subcomponent C1r {ECO:0000256|ARBA:ARBA00011907};
DE EC=3.4.21.41 {ECO:0000256|ARBA:ARBA00011907};
GN Name=C1R {ECO:0000313|EMBL:KYO22326.1};
GN ORFNames=Y1Q_0002923 {ECO:0000313|EMBL:KYO22326.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO22326.1};
RN [1] {ECO:0000313|EMBL:KYO22326.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO22326.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- FUNCTION: C1r B chain is a serine protease that combines with C1q and
CC C1s to form C1, the first component of the classical pathway of the
CC complement system. {ECO:0000256|ARBA:ARBA00002304}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Selective cleavage of Lys(or Arg)-|-Ile bond in complement
CC subcomponent C1s to form the active form of C1s (EC 3.4.21.42).;
CC EC=3.4.21.41; Evidence={ECO:0000256|ARBA:ARBA00001057};
CC -!- SUBUNIT: C1 is a calcium-dependent trimolecular complex of C1q, C1r and
CC C1s in the molar ration of 1:2:2. C1r is a dimer of identical chains,
CC each of which is activated by cleavage into two chains, A and B,
CC connected by disulfide bonds. {ECO:0000256|ARBA:ARBA00025829}.
CC -!- PTM: The iron and 2-oxoglutarate dependent 3-hydroxylation of aspartate
CC and asparagine is (R) stereospecific within EGF domains.
CC {ECO:0000256|PIRSR:PIRSR001155-3}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO22326.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03006231; KYO22326.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151MCR6; -.
DR STRING; 8496.A0A151MCR6; -.
DR eggNOG; KOG3627; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:UniProtKB-EC.
DR GO; GO:0006958; P:complement activation, classical pathway; IEA:UniProtKB-KW.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00033; CCP; 2.
DR CDD; cd00041; CUB; 2.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024175; Pept_S1A_C1r/C1S/mannan-bd.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24255:SF25; COMPLEMENT C1R SUBCOMPONENT; 1.
DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF14670; FXa_inhibition; 1.
DR Pfam; PF00084; Sushi; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001155; C1r_C1s_MASP; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00032; CCP; 2.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00181; EGF; 1.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50923; SUSHI; 2.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|PIRSR:PIRSR001155-4};
KW Complement pathway {ECO:0000256|ARBA:ARBA00022875};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR001155-2};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278,
KW ECO:0000256|PIRSR:PIRSR001155-3}; Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Metal-binding {ECO:0000256|PIRSR:PIRSR001155-4};
KW Phosphoprotein {ECO:0000256|PIRSR:PIRSR001155-3};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT DOMAIN 38..167
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 218..330
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 332..398
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 399..469
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 484..740
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT ACT_SITE 522
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 588
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT ACT_SITE 692
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-1"
FT BINDING 92
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 100
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 145
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 147
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 168
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 171
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 192
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 193
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 196
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 278
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 315
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT BINDING 317
FT /ligand="Ca(2+)"
FT /ligand_id="ChEBI:CHEBI:29108"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-4"
FT MOD_RES 192
FT /note="(3R)-3-hydroxyasparagine"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT MOD_RES 231
FT /note="Phosphoserine; by CK2"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-3"
FT DISULFID 97..115
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 172..190
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 186..199
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 201..214
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 218..245
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2,
FT ECO:0000256|PROSITE-ProRule:PRU00059"
FT DISULFID 275..293
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 334..383
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 363..396
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 401..449
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 431..467
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 471..608
FT /note="Interchain (between heavy and light chains)"
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 653..677
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
FT DISULFID 688..718
FT /evidence="ECO:0000256|PIRSR:PIRSR001155-2"
SQ SEQUENCE 744 AA; 83563 MW; 966EDB24D7ABCDEE CRC64;
MHSQGLARTF GGERGRDIHT SDKPRTMLFP FLLWESLLFG VISSSQTPKK LFGEITSPSY
PKPYPNNNIS SWDIVVPKGF VVKLTFWQFD LEPSESCFYD YVKITADKKD LGRYCGQPGS
AKGNHPGTRE FVSKGNQMRL KFQSDFSNEE NGTLIPYKGF LAYYQAVDLD ECTHGNAVEE
EEGPQCQHTC HNYIGGYFCS CHPGYQLQKD QHSCKVECSN ELFTEASGYL SSPEYPEPYP
AELQCKYSIR LDKGLSITLK FLEPFDIDEH QQVHCPYDQL KVQAGGRLLK EFCGKVSPGI
IDTNSNEVDI LFFTDESGFS RGWKIHYSSE RIRCPQPVPR DEFTVIKNPQ PVYQFRDYFI
VSCQKGYRLT ESGQKLASFT AVCQDDGTWH RPMPQCEIVN CGDPKLLNNG HFNYVSKPGN
NEYQSAITYH CKKPYYHLVT KGRSDTYTCS AQGSWENQDG HVDIPACLPV CGKPDNPVTS
IQRIVGGSRA NPGNFPWQAK TLIHGLGGGA LLGDRWILTA AHTIYPKNKE QGMEAIGRDG
EALLTGDEEV YLGHTNVTEI LKLGNRPVRK IFVHPGYKPG NSHNFDGDIA LLELKEPVKL
GRDVLPICLP DIKNNTFYTT GYMGYVSGFG VEKNILPDHL KYVALPVADQ TICQQWLQNN
NKKTMNKTMV FSENMFCAGS PSANKDTCQG DSGSVFTVLD PESERWIATG IVSWGIDCAK
GYGFYTKIIN YLDWIKKITG EKTL
//