GenomeNet

Database: UniProt
Entry: A0A452J0D2_9SAUR
LinkDB: A0A452J0D2_9SAUR
Original site: A0A452J0D2_9SAUR 
ID   A0A452J0D2_9SAUR        Unreviewed;      1658 AA.
AC   A0A452J0D2;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   28-JAN-2026, entry version 27.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS   Gopherus agassizii (Agassiz's desert tortoise).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC   Testudinoidea; Testudinidae; Gopherus.
OX   NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000033816.1, ECO:0000313|Proteomes:UP000291020};
RN   [1] {ECO:0000313|Proteomes:UP000291020}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=28562605;
RA   Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA   Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT   "The Agassiz's desert tortoise genome provides a resource for the
RT   conservation of a threatened species.";
RL   PLoS ONE 12:e0177708-e0177708(2017).
RN   [2] {ECO:0000313|Ensembl:ENSGAGP00000033816.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSGAGP00000033816.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSGAGT00000038305.1; ENSGAGP00000033816.1; ENSGAGG00000024061.1.
DR   Proteomes; UP000291020; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          340..528
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          250..273
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          535..1082
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1108..1334
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1388..1470
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        616..634
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        649..658
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        711..721
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        726..742
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        795..807
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        832..841
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        876..885
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        886..899
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        901..919
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1151..1160
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1221..1239
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1292..1305
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1315..1327
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1432..1448
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1452..1463
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1658 AA;  170455 MW;  8064AA9062534C41 CRC64;
     MASEWVWLRN KWQNASPECE TRVQCCWLPG RSCGRVWAKD KTPGVLRSQF PVGVLPCRGR
     RAGNRRGWLV GSRAWQGLWL ATPPFSFVHA LRSCRGQTNV SAPGCCLLQD TFPGTVFFMA
     PAEQRLCLLL LLICCCSLAE TRFLDWLWGT QKNPTVTTLP PTESQTTELL TSPVATTTQA
     AVPATAQPGD QVWSTLSVTR QAPAVETAVP ALAKPAMPQG KEENIAGVGA EILDVAEGIR
     SLVQLWDERT TQRTGNTGDP TAVAPLTNPP TSLAAPNSAM GLADHGNMTA NSTGNGHSLL
     GTGQPEASLP TPAGPLPAWN RTQALLQKPA AALPEHFSTE VSLLQLIGDP PPEQITQVNG
     PDNTHAYVFG PDANTGQVAR YHLPSPFYRD FSLLFHVQPT TNKAGVLFAI TDASQTVIYV
     GVKLSAMQDR KQQIIFYYTE PGSESSYAAA TFSVPSLANE WTRFAISVED DEVILYLNCE
     EFHRVRFERS PDEMELEDGS GLFVAQAGGA DPDKYQGMIA DLKVTGDHQA ADLQCEEQEE
     DTDVASGDFG SGAEERPHLS GRKQGIPAVS RLPEPPPVTS PPTAGETVGK VSVGSQLPTD
     QIKVEGLQQV STEARVGTKG EKGAPGEKGE HGPKGDSGSG RVLATGSTKG DKGEKGDLGV
     KGSAGFGYPG SKGQKGEPGA QGSPGPAGLP GPPGSTLQRL DGSVVEQVAG PPGPTGPPGP
     PGKDGLPGKD GEPGDPGEDG KPGDVGPQGF PGTPGEPGLK GQKGEPGVGA RGPPGLPGPP
     GTPGLSSKQD KLGPRGPPGP PGPPGVPGLP GEPGRFGMNR TELPGPPGLP GLPGRDGVPG
     AQGPPGPPGK SGLAGQPGLQ GERGDPGDLG LPGAPGPKGG KGEMGLAGAP GETGLAGLPG
     PMGPRGPPGP PGPAGPPGPG YEAGFGDMEG SGMPVLSAVP GERGPKGPQG PPGLPGLKGD
     AGSPGLPGLP GQKGDHGAPG VDGRPGLEGF PGPQGPKGDK GSQGAKGERG QDGVGSPGPP
     GPPGQVIYAL GEEKALASLP GPEGKEGHAG FPGPMGPKGD LGDPGFPGTP GPKGEKGEPG
     VIIGPDGTGI SAGTKGDKGE QGLVGPMGPA GPHGRPGQKG EIGFPGRPGR PSMNGLKGEK
     GDPGGLGLQG PPGPPGPPGA PGNVASIYDN NAFSESGPPG PPGLPGYQGA PGQKGERGET
     GAPGPPGQFP YGLNEFGSTF RGERGDRGDP GLKGEKGEPG EGGLYGPSVS GPPGPQGYPG
     LPGPKGDSIR GLPGPPGPQG PPGIGYEGRQ GPPGPPGPPG PPSFPGPHRQ TVSIPGPPGP
     PGPPGPPGTS GSLTSYGLRI LPAYQSMLST AQEVPEGWLI FVQDREELYI RVRSGFRRVR
     LEEHTSISSP GLDNEVYDKP PSVHYSHGGT ASSGFLHPRR EHNPSSTAQP WRADDSIANH
     HRLPDHLPHG AQPQQEPLDS FSPNRRGESI PSAVHRHHDF QPALHLVALN APLSGSMRGI
     RGADFQCFQQ ARAVGLTGTF RAFLSSRLQD LYSIVRRADR SGMPIVNLRD EVLFNNWEAL
     FSGAEAQFRA GVRILSFDGR DVLRDSAWPQ KNVWHGSDAK GHRLTESYCE TWRTDDSVVT
     GQASSLASGK LLEQKVSSCR NAFIVLCIEN SFMTSSKK
//
DBGET integrated database retrieval system