ID W5PID9_SHEEP Unreviewed; 548 AA.
AC W5PID9;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE RecName: Full=Complement component C9 {ECO:0000256|ARBA:ARBA00018261};
GN Name=C9 {ECO:0000313|Ensembl:ENSOARP00000010207.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000010207.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000010207.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000010207.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000010207.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}. Secreted
CC {ECO:0000256|ARBA:ARBA00004613}. Target cell membrane
CC {ECO:0000256|ARBA:ARBA00004276}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004276}.
CC -!- SIMILARITY: Belongs to the complement C6/C7/C8/C9 family.
CC {ECO:0000256|ARBA:ARBA00009214}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01035133; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01035134; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01035135; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_004017075.2; XM_004017026.2.
DR AlphaFoldDB; W5PID9; -.
DR SMR; W5PID9; -.
DR STRING; 9940.ENSOARP00000010207; -.
DR PaxDb; 9940-ENSOARP00000010207; -.
DR Ensembl; ENSOART00000010356.1; ENSOARP00000010207.1; ENSOARG00000009510.1.
DR GeneID; 101105026; -.
DR KEGG; oas:101105026; -.
DR CTD; 735; -.
DR eggNOG; ENOG502QWHM; Eukaryota.
DR HOGENOM; CLU_032453_2_0_1; -.
DR OMA; FSVRKCH; -.
DR OrthoDB; 4572257at2759; -.
DR Proteomes; UP000002356; Chromosome 16.
DR Bgee; ENSOARG00000009510; Expressed in caecum and 18 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005579; C:membrane attack complex; IEA:UniProtKB-KW.
DR GO; GO:0044218; C:other organism cell membrane; IEA:UniProtKB-KW.
DR GO; GO:0006957; P:complement activation, alternative pathway; IEA:UniProtKB-KW.
DR GO; GO:0006958; P:complement activation, classical pathway; IEA:UniProtKB-KW.
DR GO; GO:0031640; P:killing of cells of another organism; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 1.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 1.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR001862; MAC_perforin.
DR InterPro; IPR020864; MACPF.
DR InterPro; IPR020863; MACPF_CS.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR45742; COMPLEMENT COMPONENT C6; 1.
DR PANTHER; PTHR45742:SF3; COMPLEMENT COMPONENT C9; 1.
DR Pfam; PF00057; Ldl_recept_a; 1.
DR Pfam; PF01823; MACPF; 1.
DR PRINTS; PR00764; COMPLEMENTC9.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00457; MACPF; 1.
DR SMART; SM00209; TSP1; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 1.
DR PROSITE; PS01209; LDLRA_1; 1.
DR PROSITE; PS50068; LDLRA_2; 1.
DR PROSITE; PS00279; MACPF_1; 1.
DR PROSITE; PS51412; MACPF_2; 1.
DR PROSITE; PS50092; TSP1; 1.
PE 3: Inferred from homology;
KW Complement alternate pathway {ECO:0000256|ARBA:ARBA00023162};
KW Complement pathway {ECO:0000256|ARBA:ARBA00022875};
KW Cytolysis {ECO:0000256|ARBA:ARBA00022852};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Membrane attack complex {ECO:0000256|ARBA:ARBA00023058};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Target cell membrane {ECO:0000256|ARBA:ARBA00022537};
KW Target membrane {ECO:0000256|ARBA:ARBA00023298};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane beta strand {ECO:0000256|ARBA:ARBA00022452}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..548
FT /note="Complement component C9"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004868971"
FT DOMAIN 136..514
FT /note="MACPF"
FT /evidence="ECO:0000259|PROSITE:PS51412"
FT DISULFID 105..123
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 117..132
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 548 AA; 61940 MW; 2183B5E1B23D83AA CRC64;
MSAGQRFAFA ICLLEISLLR AGPTPSYNPE ERHGTPLPID CRMSSWSEWS KCDPCLKQMF
RSRSIEIFGQ FNGRRCVDAV GDRRQCVPTE ACEDPEDDCG NNFQCGTGRC IKNQLLCNDD
NDCGDYSDED NCERDPRPPC RNRVVEESEL ARTAGYGINI LGMDPLSTPF DNQYYNGLCD
RVRDGNTLTY YRKPWNVASL SYDTKVDKNF RTENHEEEIQ VLRTIIEEKK LNVNADLTIK
YTPVEAIEKH KCTDLEHSDQ KNVSSPSKLA AEATFRFTYS KEDIYRLLSS YSAKKEKAFL
HVKGKVHLGR FVMRSRDVML QTTFLDSINA LPTAYEKGEY FAFLETYGTH YSSSGSLGGL
YELIYVLDKK SMEQKGIEVR DVQRCLGFDL DLSLKAGVEV TGNVDSKLCS KKGMGQMDAK
PEADLFDDVI TFIRGGTRKY ATELKEKLLR GAKTINVTDF VNWASSLNDA PVLISQKLAP
IYDLIPVKMK DAHLKKQNLE RAIEDYINEF SSRKCEPCQN GGTVVLLDGE CVCSCLKEFK
GVACEIKK
//