ID K7GDB0_PELSI Unreviewed; 1664 AA.
AC K7GDB0;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Complement C3 {ECO:0000313|Ensembl:ENSPSIP00000018271.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000018271.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000018271.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01098627; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098628; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098629; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098630; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098631; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098632; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098633; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098634; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01098635; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 13735.ENSPSIP00000018271; -.
DR Ensembl; ENSPSIT00000018357.1; ENSPSIP00000018271.1; ENSPSIG00000015357.1.
DR eggNOG; KOG1366; Eukaryota.
DR GeneTree; ENSGT00940000154063; -.
DR HOGENOM; CLU_001634_4_0_1; -.
DR OMA; QATNTMQ; -.
DR TreeFam; TF313285; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0050896; P:response to stimulus; IEA:UniProt.
DR CDD; cd00017; ANATO; 1.
DR CDD; cd02896; complement_C3_C4_C5; 1.
DR CDD; cd03583; NTR_complement_C3; 1.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.20.130.20; -; 1.
DR Gene3D; 2.40.50.120; -; 1.
DR Gene3D; 2.60.120.1540; -; 1.
DR Gene3D; 2.60.40.1930; -; 3.
DR Gene3D; 2.60.40.1940; -; 1.
DR Gene3D; 6.20.50.160; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR Gene3D; 1.20.91.20; Anaphylotoxins (complement system); 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR047565; Alpha-macroglob_thiol-ester_cl.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR000020; Anaphylatoxin/fibulin.
DR InterPro; IPR018081; Anaphylatoxin_comp_syst.
DR InterPro; IPR041425; C3/4/5_MG1.
DR InterPro; IPR049466; C3_CUB1.
DR InterPro; IPR048848; C3_CUB2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR019742; MacrogloblnA2_CS.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR040839; MG4.
DR InterPro; IPR001134; Netrin_domain.
DR InterPro; IPR018933; Netrin_module_non-TIMP.
DR InterPro; IPR035815; NTR_complement_C3.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR InterPro; IPR008993; TIMP-like_OB-fold.
DR PANTHER; PTHR11412:SF81; COMPLEMENT C3; 1.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF07677; A2M_recep; 1.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF21406; C3_CUB1; 1.
DR Pfam; PF21308; C3_CUB2; 1.
DR Pfam; PF17790; MG1; 1.
DR Pfam; PF01835; MG2; 1.
DR Pfam; PF17791; MG3; 1.
DR Pfam; PF17789; MG4; 1.
DR Pfam; PF01759; NTR; 1.
DR Pfam; PF07678; TED_complement; 1.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SMART; SM01361; A2M_recep; 1.
DR SMART; SM00104; ANATO; 1.
DR SMART; SM00643; C345C; 1.
DR SMART; SM01419; Thiol-ester_cl; 1.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR SUPFAM; SSF47686; Anaphylotoxins (complement system); 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
DR SUPFAM; SSF50242; TIMP-like; 1.
DR PROSITE; PS00477; ALPHA_2_MACROGLOBULIN; 1.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS50189; NTR; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Thioester bond {ECO:0000256|ARBA:ARBA00022966}.
FT DOMAIN 688..723
FT /note="Anaphylatoxin-like"
FT /evidence="ECO:0000259|PROSITE:PS01178"
FT DOMAIN 1517..1662
FT /note="NTR"
FT /evidence="ECO:0000259|PROSITE:PS50189"
FT REGION 643..662
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1664 AA; 186655 MW; 58767626CBF082EE CRC64;
MVGLTTSRVL NTNSITHLSL FAVPSYSLIT PNVLRVESEE NVVVEAHGLD GPMAVTVTVQ
DFPLRKNILY QVPVELNPNT GMIGTAVIKV PTKDIKKDSK QNQYVVVQAK FPQKTLEKVV
LVHFHSGYIF IQTDKTIYTP GSTVLFRIFT MGHKLEPLSK TVIVEFETPE GIIVKQIPVS
APLKTGIFSL NHFLPEIISL GTWKIVARYE DSPQQSFSAQ FDVKEYVLPS FEVILEPSEK
FLYIDGEDDF RVSITARYLY GKKLHGTAFV LFGVKLDDEK RSIPQSLKRI PITDGEGEAT
LTREMLQARF ANLRELIGHS LYISVTVLTD TGSDMVEAER TGINIVESPY QILFTKTPKY
FKPGMPFELM VYVTNPDGSP APRVPVRTEQ FQSQGSTQGD GTAKLIINMP GDQPKVHITV
KTAHPNLPAN RQAEKTMVAE TYETQGNSQN FLHLAVTASE LKPGDNLPVN FHLKSNNAAV
LNGLKYFTYI ILNKGKIIRA GRQAKEAGQS LVTMYLPITP DLIPSFRIVG YYQVADSEIV
ADSVWLDIKD TCMGTLVVKG ASEEDRAIHA PGTQMKLKLE GDHKAYVGLV AVDKGVYVLN
KKHKITQSKI WDSVEKSDIG CTPGSGKDNK GVFTDAGLAL ETSGGVSTPQ RTDPACPQPA
TRRRRSVQLI EYKANKTAEY HDRKLKKCCE DGMYENPMGH SCEKRLGYIT DTEECKQAFL
ECCTYIKTVR DKMQRELHLQ LSRSDLDEDF LSDEDITSRS QFPESWLWQV EHLTERPNDL
GISSKTLQIF LKDSITTWEV LAVSLSETKG ICVADPYEIT VMKDFFIDLR LPYSVVRNEQ
VEIRAILYNY RAEQITVRVD LLYNPAFCSA ATSKTKYRQM LKIQGRSSLA VPFIIVPLLT
GSHEIEVKAA VWGSFVADGV KKKLKVVPEG MRMTKTIKSV VLDPSGKGNN GVQEEVVHAA
DIDDMVPNTE SETKVSIQGN PVAIIVENSI DGANLKHLIV TPSGCGEQNM IGMTPPVIAT
HYLDSTEQWE RVGAERRAEA IKLIMPGYTQ QMVYKKPDYS YAAFKDRQAS TWLTAYVAKV
FAMAKKLVPL ENQVICGAVK WLILEKQKPD GVFQEDAPVI HGEMVGGYKG AEPEASLTAF
VLIALEEAKE ICKDQVNSLE GSINKAADYL SQKYQSLSRP YTVALASYAL AMVGRLNTEK
ALMKVSTDLK DTNPKARKGG TKRMQLNETE QSSYALLALL KMKKYELTSP IVKWLREQNY
YGGGYGSTQA TIMVFQALAQ YQIDIPQLAE LNLDVSILLP RRASPIKYRI MNQNAMVSRS
AETKWNEDFT VKAEGKGQGT LTVMTIYNAQ LREDASLCKK FDLRVSVEEA RGAKKPEGAM
RSVNIRICVR FLGQVDATMS IIDVSMLTGF SPDVEDLKRL SEGVDKYMSK FDIDKAPSDR
GNLIIYLDKV SHREDECWQF KAHQFFEVGL IQPASVTVYD YYSIDDRCTR FYHPSKQSGL
LSKICHGDIC RCAEENCFMQ QKIEGPINLN KRIEEACEPG VDYVYKTRLV RTSESKDGYD
SYIMEILQII KAGTDENPLG KTRPFISHEK CRKSLSLEVN KDYLIWGLST DLWPRKSELT
YIIGKDTWIE KWPNEDECQE PDFQSLCQQF LEFSEAMTMF GCPT
//