GenomeNet

Database: UniProt
Entry: A0AA88P2U2_9TELE
LinkDB: A0AA88P2U2_9TELE
Original site: A0AA88P2U2_9TELE 
ID   A0AA88P2U2_9TELE        Unreviewed;      1216 AA.
AC   A0AA88P2U2;
DT   27-MAR-2024, integrated into UniProtKB/TrEMBL.
DT   27-MAR-2024, sequence version 1.
DT   28-JAN-2026, entry version 9.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
GN   ORFNames=Q8A67_024090 {ECO:0000313|EMBL:KAK2869698.1};
OS   Cirrhinus molitorella (mud carp).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Labeoninae; Labeonini; Cirrhinus.
OX   NCBI_TaxID=172907 {ECO:0000313|EMBL:KAK2869698.1, ECO:0000313|Proteomes:UP001187343};
RN   [1] {ECO:0000313|EMBL:KAK2869698.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Prfri {ECO:0000313|EMBL:KAK2869698.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:KAK2869698.1};
RA   Liu H.;
RT   "Chromosome-level Genome Assembly of mud carp (Cirrhinus molitorella).";
RL   Submitted (AUG-2023) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAK2869698.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAUYZG010000024; KAK2869698.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0AA88P2U2; -.
DR   Proteomes; UP001187343; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP001187343};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..1216
FT                   /note="Thrombospondin-like N-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5041646276"
FT   DOMAIN          40..228
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          224..645
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          686..945
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        258..271
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        294..305
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..331
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        343..355
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        356..366
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        419..428
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        460..472
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        502..517
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        583..598
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        634..645
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        748..769
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        785..796
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        829..843
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        911..921
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        930..939
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1216 AA;  124994 MW;  260EBB1BD321AAE9 CRC64;
     MGRASDSSPR IWCWTFIAVC ICSSGTALLD TDSERDSSGQ LDLTELIGVP LPPSVAFITG
     FEGFPAYSFG PDANVGRLTR SFIPDPFFRD FAIVVTAKPG TRRGGVLFAI TDALQKVVHL
     GVALAPVEDG SQRVILYYTE PGATRTQEAA SFKMGDLTGR WARFTLAVQG EEVKLYMDCE
     EYHRVAFSRS ADGLTFQPSS GIFIGNAGGT RLERFVGSIQ QLLLTPNPSA PNEQCEEDDP
     YASGYGSGDD IYDDTETRDE VKKVVEEREY TMPEDVESGP MKAPPTEPPS FSTETDDEDL
     EEGSGEDIVV ISGPEEPIRS EGASQDRNVM TMQKGEKGER GPQGPPGPAG PPAPRTPGES
     GARGPQGPSG PPGEPGKDGQ PGDSGKDGAP GEVGPPGFPG LPGNPGLKGD KGDPGVGVPG
     PPGPPGPPGR SGTYTYPDGI EGSGSSFVDL DSDTELIRGP PGPPGPPGPP GPSVGAHPGP
     IGPPGVPGKD GETGKPGLPG ENGRDGPAGK DGERGQKGEP GVPGIAGPKG DPGQPGLPGE
     TGAEGPRGQP GPPGPPGKGF SFDMMDLEGS GLDGGSGFRP VLPRGPPGLP GLPGPPGPQG
     KEGAVGPPGV SVKGEPGAKG DDGQPGSAGL PGRQGERGEK GDMGLKGERG LDGIGLPGPP
     GPPGPVINLQ ELMLNDTEGI FNLSGIFEPQ GPLGPRGPKG DTGAPGLQGP AGLKGQKGEP
     GIVSGVHAPQ GPKGLKGDSG VPGPPGQTGP IGPAGPKGEF GFPGRPGRPG MHGRKGEKGD
     SDGLPGPPGP PGPPGRPGIF GCPQGTVFPI PPRPGCKKPV NSDSTQEGAS SSSSASSSAS
     SSSDTLGTKV TAAKGDQGFR GEKGEAGMPG LPGRSGLAGP KGESIVGPPG HPGPRGPTGT
     PGFGRTGVAG PPGPPGPPGP PGQGSGVMIP GPPGPPGPPG RAAEASSAVR RYVSLQAMRQ
     SSVDEAGTLC FVIDTSKLYI KVPGGWREVQ LGGLVEMYSS PLLSQDDAEP LMLTPQFTHT
     ATIHSANTLR LVALNSPLTG NLGSIHSVNK LCRTQAQAMG IRDDYKAFLS HHLQDLIDIM
     QPMYRSNMPI VNLRGEVLFK NWDSIFSNHL LPVGVPLYSF DGRDVMSDPF WPQKAVWHGS
     SERGKRLSAL NCESWRAGDM AITGQASFLY SGLLNQQTRS CSNHFIVLCI ETSHQHTSVQ
     ELHKAQYRHR RWHHRY
//
DBGET integrated database retrieval system