ID K7F7C7_PELSI Unreviewed; 597 AA.
AC K7F7C7;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=Complement C8 alpha chain {ECO:0000313|Ensembl:ENSPSIP00000003937.1};
GN Name=C8A {ECO:0000313|Ensembl:ENSPSIP00000003937.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000003937.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000003937.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the complement C6/C7/C8/C9 family.
CC {ECO:0000256|ARBA:ARBA00009214}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01155445; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01155446; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006133556.1; XM_006133494.2.
DR RefSeq; XP_014434269.1; XM_014578783.1.
DR AlphaFoldDB; K7F7C7; -.
DR STRING; 13735.ENSPSIP00000003937; -.
DR Ensembl; ENSPSIT00000003957.1; ENSPSIP00000003937.1; ENSPSIG00000003720.1.
DR GeneID; 102453101; -.
DR KEGG; pss:102453101; -.
DR CTD; 731; -.
DR eggNOG; ENOG502QT87; Eukaryota.
DR GeneTree; ENSGT00940000160126; -.
DR OMA; CQPGVTI; -.
DR OrthoDB; 4572257at2759; -.
DR TreeFam; TF330498; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005579; C:membrane attack complex; IEA:UniProtKB-KW.
DR GO; GO:0006957; P:complement activation, alternative pathway; IEA:UniProtKB-KW.
DR GO; GO:0006958; P:complement activation, classical pathway; IEA:UniProtKB-KW.
DR GO; GO:0031640; P:killing of cells of another organism; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 2.
DR InterPro; IPR048831; C8A_B_C6_EGF-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR001862; MAC_perforin.
DR InterPro; IPR020864; MACPF.
DR InterPro; IPR020863; MACPF_CS.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR45742; COMPLEMENT COMPONENT C6; 1.
DR PANTHER; PTHR45742:SF1; COMPLEMENT COMPONENT C8 ALPHA CHAIN; 1.
DR Pfam; PF21195; C8A_B_C6_EGF-like; 1.
DR Pfam; PF00057; Ldl_recept_a; 1.
DR Pfam; PF01823; MACPF; 1.
DR PRINTS; PR00764; COMPLEMENTC9.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00192; LDLa; 1.
DR SMART; SM00457; MACPF; 1.
DR SMART; SM00209; TSP1; 2.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01209; LDLRA_1; 1.
DR PROSITE; PS50068; LDLRA_2; 1.
DR PROSITE; PS00279; MACPF_1; 1.
DR PROSITE; PS51412; MACPF_2; 1.
DR PROSITE; PS50092; TSP1; 2.
PE 3: Inferred from homology;
KW Complement alternate pathway {ECO:0000256|ARBA:ARBA00023162};
KW Complement pathway {ECO:0000256|ARBA:ARBA00022875};
KW Cytolysis {ECO:0000256|ARBA:ARBA00022852};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunity {ECO:0000256|ARBA:ARBA00022859};
KW Innate immunity {ECO:0000256|ARBA:ARBA00022588};
KW Membrane attack complex {ECO:0000256|ARBA:ARBA00023058};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..597
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003901364"
FT DOMAIN 148..511
FT /note="MACPF"
FT /evidence="ECO:0000259|PROSITE:PS51412"
FT DISULFID 109..121
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 128..143
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 597 AA; 67640 MW; E5798FA9DCFBB9F2 CRC64;
MWWSLNEVSP LILLIAYLIS CQDTVTAHSK VTGLPSQRRA SRDINTPAPV NCRLSPWSEW
TGCFPCQEKK YRYRTLLQPA KFKGSKCIGH LWDEAACQPT ETCTASQQCG NDFQCQETGR
CIKRHLLCNG EPDCRDGSDE DNCEDEDIES LCKNLQPIPG SEKAVQGYNV LTQEEKQFIY
DPKYYGGQCE YVYNGEWREL KYDAACEHLY YGDDEKYFRK PYNFHVYQFL AHADSGFSSE
FYEDVKDLID AIKRDNSRQY GFTFGIGFAD FPIHLQLGFS LSYGKGSLKN FTEYDEKNIG
FIRGITKVQT ARFKMRRENL VLDEDMLQSL MELPESYQYG MYAKFINDYG THFMTSGTMG
GIFEYILVVN KNEMSRKAIT SETVSSCFGL SAGLIFQKEG VDISAGLAYE KCKAEGFLKT
DEKSNSALVE DIIPRIQGGD VSSSGGLMNI WNANLYRRWG RSLKYNPAVI DFELQPIHEI
LRRSNLGAIE TKRQNLKRAL DEYLAEFSAC RCGPCQNNAE PMLLGNACVC QCTQGFEGPA
CEKTRRQGTK THGSWSCWSP WTPCQSGTKR RTRQCNNPAP QNGGTLCAGK CVQTETC
//