GenomeNet

Database: UniProt
Entry: A0A7J6DEF4_9TELE
LinkDB: A0A7J6DEF4_9TELE
Original site: A0A7J6DEF4_9TELE 
ID   A0A7J6DEF4_9TELE        Unreviewed;      1265 AA.
AC   A0A7J6DEF4;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
GN   ORFNames=G5714_001757 {ECO:0000313|EMBL:KAF4117204.1};
OS   Onychostoma macrolepis.
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Acrossocheilinae; Onychostoma.
OX   NCBI_TaxID=369639 {ECO:0000313|EMBL:KAF4117204.1, ECO:0000313|Proteomes:UP000579812};
RN   [1] {ECO:0000313|EMBL:KAF4117204.1, ECO:0000313|Proteomes:UP000579812}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=SWU-2019 {ECO:0000313|EMBL:KAF4117204.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:KAF4117204.1};
RA   Wang D.;
RT   "Chromosome-level genome assembly of a cyprinid fish Onychostoma macrolepis
RT   by integration of Nanopore Sequencing, Bionano and Hi-C technology.";
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAF4117204.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAAMOB010000002; KAF4117204.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A7J6DEF4; -.
DR   Proteomes; UP000579812; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000579812};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          41..229
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          229..579
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          606..690
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          760..994
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1042..1064
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        308..322
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        345..359
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        384..399
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        428..439
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        469..484
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        507..520
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        558..573
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        642..652
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        654..670
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        677..686
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        803..813
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        825..838
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        846..855
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        951..961
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        965..975
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        976..985
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1043..1055
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1265 AA;  129944 MW;  BABC39080642ABE8 CRC64;
     MLKLLRKMKL WLLSWLIGAH LALSYRSSAL LVMEERGSKG HLVLTELVGV PLPPSVSFIT
     GYEGFPAYNF GPNANVGRLT QSFVPEPFFM DFSIIVTVKP SNSRGGVLFA ITDPSQRIVH
     LGLALTPVED KTQRIVLYYS EPGLPDTMEV ASFKVPDMTQ QWNRFTLTVE HEEVRLYMDC
     EEYHSAPLKR SQQPLSFKQG SGIFVANAGS TGLERFVGSI QQLVIKPDPR AAEEQCEEDD
     PSLQASGDRS GDGDYDDEEE HGRHEVIFGQ TNVREDKEKT HRPTYPVQAP PTVSPDMDEG
     EFSGHVTPID ERLLRGTYKT DETGESTGDG SGQGQKGERG EPGSAGPPGP PGPPGPSLPP
     THSGQPGQRG PQGPMGPPGR QGRPGKDGQS GRKGEEGKPG QRGPPGLPGL PGESGVKGEK
     GDSGVGLPGP PGPPGPAGPS KPIKVPYGFD ALGSGFEDVD IDTERLRGPP GPPGPPGKPG
     PSGPNGPLGA LLPGPPGTPG KDGRDGQPGL RGLPGPDGLT GQQGPKGAKG EQGTRGPSGP
     KGENGDSGLI GPMGPRGVPG PPGIPGPPGP PGPLSNNFMM DTLKDLESSG ESGLFLGAGI
     SKGYLGPPGL PGPPGSQGLP GADGAPGLSV KGEPGSPGQD GIPGLAGLPG ARGPKGDKGS
     PGEKGERGRD GLSITGPPGP PGPPGPTINL QDLLLNDTAA KLNLTKIRGP PGPMGPEGLP
     GRAGFPGPRG PKGEIGFPGI QGPPGLKGEK GEPGVSIAAD GSVIPGLRGP RGPKGMKGDI
     GLSGRPGIVG PIGPPGQKGE YGLPGRPGRA GIAGRKGDKG DSSGPPGPPG PPGPPGPPGR
     VIGLNGVNNN NSQQGDGRAS LGVKGDKGDV GNPGLPGTAV PLFPHGYVGA KGDNGYKGQK
     GEKGDPGLPG PPGMPGRTGL VGPKGDSIVG PPGDTGSPGA PGQPGYGIPG PQGPPGPPGP
     PGTLSAYGSA ASLSGPPGPP GPPGAPGHGN PVRTYKNSQT LIRETSQAAE GTLAFVIDKS
     ELYIRVRGGW KKVELGELIP VPQDSSSSAL TQGLSRPSDR SVPRVHSQEL KSFLPDYHAF
     LQNAHSMPAL HLVALNAPFS GDMHGIRGAD YQCYQQARAR GLTSTYRAFL SSHLQDLSSI
     VKKGDRFGLP VVNLKGDVLF SSWMAMFSGD GAVYDPLTPI YSFDGRNVMT EQAWPQKLVW
     HGSNTMGIRM TTSYCEAWRT GDMAVTGQAS LLQTGRLLGQ HTRSCSNHFI VLCIENSYIQ
     NPGRN
//
DBGET integrated database retrieval system