GenomeNet

Database: UniProt
Entry: A0A452HIS3_9SAUR
LinkDB: A0A452HIS3_9SAUR
Original site: A0A452HIS3_9SAUR 
ID   A0A452HIS3_9SAUR        Unreviewed;      1370 AA.
AC   A0A452HIS3;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   28-JAN-2026, entry version 27.
DE   RecName: Full=Collagen type XV alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS   Gopherus agassizii (Agassiz's desert tortoise).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC   Testudinoidea; Testudinidae; Gopherus.
OX   NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000014807.1, ECO:0000313|Proteomes:UP000291020};
RN   [1] {ECO:0000313|Proteomes:UP000291020}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=28562605;
RA   Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA   Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT   "The Agassiz's desert tortoise genome provides a resource for the
RT   conservation of a threatened species.";
RL   PLoS ONE 12:e0177708-e0177708(2017).
RN   [2] {ECO:0000313|Ensembl:ENSGAGP00000014807.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSGAGP00000014807.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSGAGT00000016919.1; ENSGAGP00000014807.1; ENSGAGG00000011213.1.
DR   Proteomes; UP000291020; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1370
FT                   /note="Collagen type XV alpha 1 chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019562798"
FT   DOMAIN          119..307
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          168..306
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          307..332
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          377..783
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          863..916
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1005..1112
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..443
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        502..513
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        522..533
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        561..570
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        572..582
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        657..669
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        752..761
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        889..910
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1005..1022
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1075..1088
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1098..1107
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1370 AA;  140189 MW;  511E692AD0B916B3 CRC64;
     MFSKLAWIIF VMQSGFIEGQ WWWQWLWGTT QTTTPPVATA VNTIALNIRT TVSARAPAGS
     ITSTASSGAT GVSYESTLPG GAAVTNFSRE KALSGLELKT GKKISQGKHK KTERGSKGHL
     DLTELIGVPL PLSVSFITGY GGFPAYSFGP DANIGRLTST LIPLTFYNDF AIIVTVKPNS
     DDGGVLFAIT DAFQKTIYLG MRLSPVDDGT QRIIMYYTEP GSYISREAAS FKVPVMTNKW
     NRFTVTVQGN DTVLFMDCEE YNRVQFQRSS QALQFGSNSG IFVGNAGATG LEKFTGSIQQ
     LMIKPDPRAT EDQCEDDDPY ASGDSSGTGS IQELEGLPET EEVIASSQPL PEETTAGPVG
     APPTISAQSE EMDFSGHHIL DETPEPPTIK EQGSTSAGND QHESDATTVA QEILKSETGS
     GAIVLQGVSR EKGQKGERGE KGEQGPQGPP GPPGKSKLEK MQTGIQGPPG PLGKPGRDGE
     PGIPGKDGLP GERGLQGLPG LKGEDGLKGE KGEPGVGLPG PQGLPGPPGP PAPFRGLSRL
     EPEGSGSGDL DRDREVLRGL PGPPGPPGLP GAPGKPDSNS GPPGSPGKDG KDGPSGEPGP
     PGPQGHPGLD GMVGPPGKKG EKGDQGLPGA VGPKGDAGDI GSPGPEGQAG ADGQPGKPGP
     QGPPGPPGPG YGFGFEDMEG SGSISLLSEP RIPGSRGPNG PAGETGQRGP MGPKGEKGDT
     GPPGIAGLKG EQGADGKPGF PGVAGRPGDA GPKGDKGDTG SKGEPGQDGA SIVGPPGPPG
     PPGPIIAVPQ LLLNDTDGIS NFTGVKGLLG PPGPDGRPGL PGFPGPRGPK GAIGLTGLQG
     PKGQRGEKGE PGFIISANGS LRELTGRQGQ KGERGAMGPP GPMGPVGPSG PKGELGIPGR
     PGRPGLNGLK GVKGERGVTL YGPPGLPGPP GPPGPPGAVI HIKGVGTAHP GNQEANIYGM
     KGEPGSWGLH GPPGLKGEKG ERGSPGLPGA PLPSAYFSHL VNSMKGEKGD NGETGFKGEK
     GEPSGGFFMS GPPGPPGLPG RPGLVGPKGD SIVGPRGPPG LPGLPGSPGY GIVGPPGPPG
     PPGPPGPPAI YGSAAAVPGP PGPPGEPGLP GTRNLVTTFR NIDGMLRKVH LVAEGTLIYL
     SESSEVFIRV RGGWRRLQLG ELIPIPADSP PPPAISGYGF QSLPALRPVS AINHGKPTLH
     LVALNLPLSG AMRADYQCFQ QARAAGLMST YRAFLSSHLQ DLSTVVRKSD RFNLPIVNLK
     GEILFNNWES VFTGSGGQFN IQIPIYSFDG RNIMTDPSWP HKIIWHGSTA NGSRLVSNYC
     EAWRTADMTV MGQASPLTTG KLLDQKPYSC SNKFIVLCIE NSFVSDIRRK
//
DBGET integrated database retrieval system