ID K7F7N8_PELSI Unreviewed; 739 AA.
AC K7F7N8;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Collagen type VIII alpha 1 chain {ECO:0000313|Ensembl:ENSPSIP00000004048.1};
GN Name=COL8A1 {ECO:0000313|Ensembl:ENSPSIP00000004048.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000004048.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000004048.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01137060; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01137061; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01137062; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01137063; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01137064; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01137065; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006130736.1; XM_006130674.2.
DR RefSeq; XP_006130737.1; XM_006130675.2.
DR RefSeq; XP_006130738.1; XM_006130676.2.
DR RefSeq; XP_014432795.1; XM_014577309.1.
DR AlphaFoldDB; K7F7N8; -.
DR STRING; 13735.ENSPSIP00000004048; -.
DR Ensembl; ENSPSIT00000004071.1; ENSPSIP00000004048.1; ENSPSIG00000003821.1.
DR GeneID; 102463098; -.
DR KEGG; pss:102463098; -.
DR CTD; 1295; -.
DR eggNOG; ENOG502QRFR; Eukaryota.
DR GeneTree; ENSGT00940000158272; -.
DR HOGENOM; CLU_001074_21_0_1; -.
DR OMA; PMGKEMP; -.
DR OrthoDB; 4272636at2759; -.
DR TreeFam; TF334029; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR001073; C1q_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF903; COLLAGEN ALPHA-1(VIII) CHAIN; 1.
DR Pfam; PF00386; C1q; 1.
DR Pfam; PF01391; Collagen; 1.
DR PRINTS; PR00007; COMPLEMNTC1Q.
DR SMART; SM00110; C1Q; 1.
DR SUPFAM; SSF49842; TNF-like; 1.
DR PROSITE; PS50871; C1Q; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..739
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003901383"
FT DOMAIN 606..739
FT /note="C1q"
FT /evidence="ECO:0000259|PROSITE:PS50871"
FT REGION 111..574
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 119..134
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..264
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..394
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..526
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..572
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 739 AA; 73389 MW; 94786A25877CD45D CRC64;
MAVLLAPVQL LTVAVTVYLE LVRAIQGGVY YGIKQLPPQI PQYQALGQQV PHLPLGKEGI
PMQHMGKELP HMQYGKEYPH LPQYVKEIPQ LPMLGKEMVP KKEKEIPLAS LRGEQGPPGE
PGPRGPPGPP GLPGHGVPGA KGKPGPQGYP GIGKPGMPGM PGKPGAMGLP GTRGEMGPKG
EIGSIGMPGP QGPPGPSGLP GIGKPGGQGL PGQPGIKGEP GMKGPPGLPG LQGLKGEKGI
GIPGLPGLKG PPGLPGPPGP VGLPGIGKPG LTGFPGPQGP IGKPGLPGER GPQGLLGAPG
IQGPPGLPGV GKPGQDGIPG QPGFPGGKGE QGLPGLPGPP GLPGIGKPGF PGLKGDRGMG
GFPGALGPKG EKGHMGPPGM GGPPGEPGQP GLPGVMGPPG AVGFPGPKGE GGPIGPQGPS
GPKGEPGLQG FPGKPGFPGE VGPPGLRGLP GPMGPKGEAG HKGLPGLPGV PGQLGPKGEP
GIPGDQGYQG PSGIPGIAGP SGPIGPPGLP GPKGEPGVPG PPGFPGLGKP GVSGLQGPPG
KPGALGPPGQ PGLQGPPGPP GPPGPPVIIS PTPPAIGQYL PEMGPGIDGV KTPPGYMGKK
GKTGGAVYEM PAFTAELTTP FPRVGVPVKF DKLLYNGRQN YNPQTGIFTC EIPGIYYFAY
HVHCKGANVW VALFKNNEPL MYTYDEYKKG FLDQASGSTV IQLMPGDRVY IQMPSEQAAG
LYAGQYVHSS FSGYLLYPM
//