GenomeNet

Database: UniProt
Entry: K7GJW5_PELSI
LinkDB: K7GJW5_PELSI
Original site: K7GJW5_PELSI 
ID   K7GJW5_PELSI            Unreviewed;      1289 AA.
AC   K7GJW5;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   28-JAN-2026, entry version 72.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0000256|ARBA:ARBA00074723};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC   Trionychidae; Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000020576.1, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX   PubMed=23624526; DOI=10.1038/ng.2615;
RA   Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA   White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA   Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA   Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA   Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT   "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT   into the development and evolution of the turtle-specific body plan.";
RL   Nat. Genet. 45:701-706(2013).
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000020576.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [4] {ECO:0000313|Ensembl:ENSPSIP00000020576.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- FUNCTION: Restin potently inhibits angiogenesis.
CC       {ECO:0000256|ARBA:ARBA00058706}.
CC   -!- FUNCTION: Structural protein that stabilizes microvessels and muscle
CC       cells, both in heart and in skeletal muscle.
CC       {ECO:0000256|ARBA:ARBA00058695}.
CC   -!- SUBUNIT: Interacts moderately with EFEMP2.
CC       {ECO:0000256|ARBA:ARBA00065596}.
CC   -!- SUBUNIT: Trimer; disulfide-linked. {ECO:0000256|ARBA:ARBA00061770}.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01019129; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019130; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019131; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019132; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019133; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019134; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019135; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019136; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019137; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01019138; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 13735.ENSPSIP00000020576; -.
DR   Ensembl; ENSPSIT00000020673.1; ENSPSIP00000020576.1; ENSPSIG00000018207.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; KOG3546; Eukaryota.
DR   GeneTree; ENSGT00940000164061; -.
DR   HOGENOM; CLU_004003_1_0_1; -.
DR   OMA; RATEDQC; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-ARBA.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.40.1620.70:FF:000002; Collagen alpha 1 (XV) chain; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..1289
FT                   /note="Collagen alpha-1(XV) chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003903167"
FT   DOMAIN          38..226
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          87..225
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          226..252
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          296..319
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          331..693
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          779..802
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          893..917
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          929..1030
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        242..252
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        341..357
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        419..430
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        475..484
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        508..517
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        562..571
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        663..672
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        930..947
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        964..977
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        978..990
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        997..1010
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1019..1028
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1289 AA;  132842 MW;  A49DB0831BEDB03F CRC64;
     MLSRHCWWPW NLLLLFWGSF LPVVLTAQVI EERRSKGHLD LTELIGVPLP PSVSFITGYG
     GFPAYSFGPN ANIGRLTKTL IPQTFYRDFA IVVTVKPNSD DGGVLFAITD AFQKTIYLGM
     RLSPVDDSTQ RIIIYYTEPR SSISREAASF KVPVMTNKWN RFTVTVQGND IVLFMDCEEY
     NRVQFRRSAK ALQFESSSGI FVGNAGATGL EKFTGSIQQL MIKPDPRATE DQCEDDDPYA
     SGDSSGAGSI QEQEGILETQ EVIASSPPLP AEITAEPVEA PPTVSAQLEE IDFSGHHISD
     ETSESPTIKK QGNDQHERDT TTIAQEILKQ ETGSGAIILQ GEREVQGQKG DRGEKGEQGP
     PGPPGPPGKS GLEDMQTGIQ GTLGPPGSPG KPGRDGEPGI PGKDGLPGER GLQGLPGLKG
     EDGLKGEKGE PGVGLPGLPG PPGLSAPFRG PSRLEPEGSG SGDLDRDNEI LRGLPGPPGP
     PGLPGAPGKP GSNAEPPGSP GKDGPSGEPG PPGPQGPPGL DGMVGPPGKK GEKGDQGLPG
     AAGPKGDAGD IGSPGPKGQA GDDGQPGKPG LQGPPGPPGP GYGFGFEDME GSGDISLLSE
     PRFPGSRMPN GPVGETGQRG PMGPKGEKGD TGPPGLAGLK GAQGADGKPG FPGVAGRPGD
     AGLKGEKGDM GPKGEPGQDG ASIVGPPGPP GPPGPIVAVS QLLLNDTDGV SNFTGIKGLV
     GPPGPDGRPG LPGFPGPKGP KGDIGLTGLQ GPKGQRGEKG EPGAIISADG SLRELVGRQG
     QKGDRGIIGP PGPMGPEGPS GSKGELGIPG RPGRPGLNGL KGVKGERGVT LYGLPGLPGP
     PGPPGPPGAI IYIKGTVFPI SPRPYCKMPV GTAHPGNQDA NIHGMKGELG AWGPLGPPGL
     KGEKGEQGSP GLPGPSLPSS YFSHFVNSMK GEKGDNGETG FKGEKGEPSG GFFMSGPPGP
     PGRPGLVGPK GDSIVGPRGP PGLPGLPGPP GYGIVGPPGP PGPPGPPGPP SIYGSAAFPG
     PPGPPGEPGL PGARNLVTTF RNIDSMLQKV HLVAEGTLIY LSESSEVFIR VRGGWRRLQL
     GELIPIPADS PPPPAISGYG FQPLPALRPV STIRKPTLHL VALNLPFSGA VRADYQCFQQ
     ARATGLTSTY RAFLSSHLQD LATVVRKTDR YNLPIVNLKG EVLFNNWESV FSGSGGQFNI
     QIPIYSFDGR NVMTDPSWPQ KVIWHGSTAN GIRLVNNYCE AWRTADMTVM GQASALTTGK
     LLDQKPYSCN NKFIVLCIEN SFVSDIQRK
//
DBGET integrated database retrieval system