ID A0A087XA17_POEFO Unreviewed; 1312 AA.
AC A0A087XA17;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 08-OCT-2025, sequence version 3.
DT 28-JAN-2026, entry version 58.
DE RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000002620.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000002620.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSPFOP00000002620.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01006278; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01006279; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000002620; -.
DR Ensembl; ENSPFOT00000002624.2; ENSPFOP00000002620.2; ENSPFOG00000002297.2.
DR eggNOG; KOG3544; Eukaryota.
DR eggNOG; KOG3546; Eukaryota.
DR GeneTree; ENSGT00940000164061; -.
DR OMA; CHCSSAY; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1..187
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 470..677
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 691..739
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 814..877
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 946..1030
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 487..499
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 534..546
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 565..575
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 611..628
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 661..673
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..715
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 727..738
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 819..828
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 840..852
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 862..874
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 946..956
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 958..973
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 999..1011
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1020..1030
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1312 AA; 136510 MW; 4D89B59B5401C4D2 CRC64;
MDLTELIGVP LPPSVSFVTG FEGYPAYSFG PGANVGRLTK SFIPDPFHHD FAITVMAKPT
TRRGGVLFAI TDAFQKVVQL GVALSEVEDG AQNVILYYTD PETRGGTREA ASFKMGEVTG
RWARFTLTVQ GAEIRLYMDC EEYHRVAFTR SAQPLTFQTS SGIFVGNAGG TGLPRFVVSL
LASLFTTTTR DPRMKDHMCL SAVFSVTAAS ALLDVKLWSV IDVIKRKYFF YLPNHQSIYP
PVDRKKLSSV SSSEVFMSFP PIAPGAAPSQ SNFLLVGRCD LSLYSCHSLS SVIVSSNLEG
KNGSTASSDC SVFELAAQSE IISQRAITGL IQQHRLSAAV ASPWLWMNTQ EEIEISQAMP
CSIKNGCLSS GFIFTSTNLA ESLSSSPSAG LHPETAAELG SHGSSWSNMR VVPLCLGSFE
EFCDKITVFS FLELCFKKIL TVDTRKNVAV KYQIIFQVTV KKKEVSVLQP DPTYGGPVQA
PPTEPSLIDD EDGDDEESSG QELERQAAGG GGDPGPRGPQ GPKGPAGPTG LPGKDGEPGM
KGERGLPEGS GFEDFDSDTE VVRGPPGPPG PPGLPGLPGS SAGGVTPGLP GPPGPPGKDG
TNGEPGLAGV DGKDGDPGPA GEKGDKGEPG ASGPPGPKGD QGPAGFPGLP GSPGTEGQPG
PRGPPGPPGP PGPSGSRFAV ALELSQDLEG SGLLEDFGGS SGPQGPPGVP GPPGPKGERG
RDGVGVPGPP GLPGPPGPVI NLQDLLLNAT DGAFNFSGIF QTQGPKGPKG DVGLQGLQGP
PGIKGEKGEP GFLTGPDGSL MSDLAGALGT KGIKGDNGVP GVPGVSGPVG PPGPKGEIGF
PGRQGRPGLL GPKGEKGDPH GLPGPPGPPG PPGKPGVNIH QTVFPIPPRP HCKMPVGYSH
VNEDASEILL HAAWEQREGN CHQGAKGEKG ERGLPGLPAP QIIQNKTQQQ KPNNIKLTKG
DQGMKGEKGE KGEAGFPGQP GIPGRSGLVG PKGESVLGPP GPPGMPGPPG APGYGRPGAV
GPPGPPGPPG LPLRYGSAVA IAGPPGPPGP PGAPGTSILT SVKLKTFSTR ESMMQQTSRD
EEGTLAYVKA TGNLFLKVPQ GWKQIQLGSL IYLSSNIIPQ DEVHAALFIS ILTATWRIRA
ILNLVALNQP HSGNMMGLDM ADRMCYEQAK AMGLSPNYRA FISSHKQDLV HVVYPGFRDS
LPVTNLRGDV MFRNWQSIFI GNGGPVNPRI PIYSFDGRDV LADPFWPKKS IWHGSTSRGL
RVVDKHCETW QADDFSVMGQ SSSLTSGLLL GQQTRSCSTE FIVLCIETYK NS
//