GenomeNet

Database: UniProt
Entry: A0A452HIR9_9SAUR
LinkDB: A0A452HIR9_9SAUR
Original site: A0A452HIR9_9SAUR 
ID   A0A452HIR9_9SAUR        Unreviewed;      1384 AA.
AC   A0A452HIR9;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   28-JAN-2026, entry version 27.
DE   RecName: Full=Collagen type XV alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS   Gopherus agassizii (Agassiz's desert tortoise).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC   Testudinoidea; Testudinidae; Gopherus.
OX   NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000014804.1, ECO:0000313|Proteomes:UP000291020};
RN   [1] {ECO:0000313|Proteomes:UP000291020}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=28562605;
RA   Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA   Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT   "The Agassiz's desert tortoise genome provides a resource for the
RT   conservation of a threatened species.";
RL   PLoS ONE 12:e0177708-e0177708(2017).
RN   [2] {ECO:0000313|Ensembl:ENSGAGP00000014804.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSGAGP00000014804.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 38772.ENSGAGP00000014804; -.
DR   Ensembl; ENSGAGT00000016916.1; ENSGAGP00000014804.1; ENSGAGG00000011213.1.
DR   Proteomes; UP000291020; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1034; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1384
FT                   /note="Collagen type XV alpha 1 chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019167700"
FT   DOMAIN          119..307
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          168..306
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          307..332
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          377..783
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          863..917
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          982..1003
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1020..1040
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1069..1121
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..443
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        502..513
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        522..533
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        561..570
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        572..582
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        657..669
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        752..761
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        889..910
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1020..1036
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1089..1102
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1112..1121
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1384 AA;  141781 MW;  067A476AFAF44E9D CRC64;
     MFSKLAWIIF VMQSGFIEGQ WWWQWLWGTT QTTTPPVATA VNTIALNIRT TVSARAPAGS
     ITSTASSGAT GVSYESTLPG GAAVTNFSRE KALSGLELKT GKKISQGKHK KTERGSKGHL
     DLTELIGVPL PLSVSFITGY GGFPAYSFGP DANIGRLTST LIPLTFYNDF AIIVTVKPNS
     DDGGVLFAIT DAFQKTIYLG MRLSPVDDGT QRIIMYYTEP GSYISREAAS FKVPVMTNKW
     NRFTVTVQGN DTVLFMDCEE YNRVQFQRSS QALQFGSNSG IFVGNAGATG LEKFTGSIQQ
     LMIKPDPRAT EDQCEDDDPY ASGDSSGTGS IQELEGLPET EEVIASSQPL PEETTAGPVG
     APPTISAQSE EMDFSGHHIL DETPEPPTIK EQGSTSAGND QHESDATTVA QEILKSETGS
     GAIVLQGVSR EKGQKGERGE KGEQGPQGPP GPPGKSKLEK MQTGIQGPPG PLGKPGRDGE
     PGIPGKDGLP GERGLQGLPG LKGEDGLKGE KGEPGVGLPG PQGLPGPPGP PAPFRGLSRL
     EPEGSGSGDL DRDREVLRGL PGPPGPPGLP GAPGKPDSNS GPPGSPGKDG KDGPSGEPGP
     PGPQGHPGLD GMVGPPGKKG EKGDQGLPGA VGPKGDAGDI GSPGPEGQAG ADGQPGKPGP
     QGPPGPPGPG YGFGFEDMEG SGSISLLSEP RIPGSRGPNG PAGETGQRGP MGPKGEKGDT
     GPPGIAGLKG EQGADGKPGF PGVAGRPGDA GPKGDKGDTG SKGEPGQDGA SIVGPPGPPG
     PPGPIIAVPQ LLLNDTDGIS NFTGVKGLLG PPGPDGRPGL PGFPGPRGPK GAIGLTGLQG
     PKGQRGEKGE PGFIISANGS LRELTGRQGQ KGERGAMGPP GPMGPVGPSG PKGELGIPGR
     PGRPGLNGLK GVKGERGVTL YGPPGLPGPP GPPGPPGAVI HIKGTVFPIS PRPHCKMPVG
     TAHPGNQEAN IYGMKGEPGS WGLHGPPGLK GEKGERGSPG LPGAPLPSAY FSHLVNSMKG
     EKGDNGETGF KGEKGEPSGG FFMSGPPGPP GLPGRPGLVG PKGDSIVGPR GPPGLPGLPG
     SPGYGIVGPP GPPGPPGPPG PPAIYGSAAA VPGPPGPPGE PGLPGTRNLV TTFRNIDGML
     RKVHLVAEGT LIYLSESSEV FIRVRGGWRR LQLGELIPIP ADSPPPPAIS GYGFQSLPAL
     RPVSAINHGK PTLHLVALNL PLSGAMRADY QCFQQARAAG LMSTYRAFLS SHLQDLSTVV
     RKSDRFNLPI VNLKGEILFN NWESVFTGSG GQFNIQIPIY SFDGRNIMTD PSWPHKIIWH
     GSTANGSRLV SNYCEAWRTA DMTVMGQASP LTTGKLLDQK PYSCSNKFIV LCIENSFVSD
     IRRK
//
DBGET integrated database retrieval system