ID A0A087YA40_POEFO Unreviewed; 1364 AA.
AC A0A087YA40;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000014893.1, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000014893.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01015902; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01015903; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000014893; -.
DR Ensembl; ENSPFOT00000014915.1; ENSPFOP00000014893.1; ENSPFOG00000014232.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000164160; -.
DR OMA; PGANSEC; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 9.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1..141
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 260..352
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 387..622
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 643..827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 849..1198
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1228..1356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 391..411
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..453
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 480..497
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 853..867
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1232..1246
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1364 AA; 133774 MW; C2D8472B5C06A9B2 CRC64;
PRSFPDEYSF MTTFRMIKNT VSKVWNVWQI VDEDGHKQAG LRLNGDQQAL EYFLMGADGN
LQTVTFLGLS VLFNTKWHKV MIGVERDQVT LYVDCQPVDQ KPIKGKGPIN TEGDTLIGRL
DADPDASVVF ELQWMLIHCD PKRAQRESCN ELPAAEVQKQ GLRSHANCLS VCTVIPFQLQ
FRKPVRCTSC CISYRGFKTG CKVGGAPLGW PGEGLAMATG CGPLKNLDLI LNLHMDFSFQ
RFGKKPFQTF GKTLIVVSRG HQRSTNQPTF NRASQLSQPD KSLQGPPGPP GPEGPRGATG
ESGRDGFPGT PGIPGAPGSK GEKGEVGLPG QRGLPGLPGS PTPDHVRLKL GRSGTRVQNV
GVRVCGTLSV KGTAILILQG PIGPLGPRGP VGGLPGFPGP PGPPGPPGKP GTPGDEGNIG
PEKLQGPPGP RGGIGTPGPP GPPGPPGPVG PPGANSECPA ACPAGPQGLH GVPGMKKKTG
HKGLPGEPGK DGEPGEKGQT GETGPQGPPG RMGDDGQRGP IGAPGTPGEK GDRGAPGYNG
VPGPTGPPGA PGVEGVRGRQ GVPGLKGDNS EKGAVGPQGQ AGVKGEKGPV VAPQGDPGRD
GMDGLPGDNG TKGEPGAPGD VGLRGIVGIP ICNGNTAGVC SHGDKTKQGL PGKQGVAGPP
GEKGPMGPVG LPGKTGGPGK DGADGKPGEK GSTGETGKQG PVGVAGPQGI QGVKGEKGAT
GAKGFKGHTG HKGDQGPTGP VGPKGSQGDP GIAGDPGIKG DQGPPGVTGI KGERGEKGQR
GATGAEGRPG QPGHEGVTGP MGARGLEGDP GIPGPPGPRG LPGPKVSDEK LREMCSAVVE
ELLRRPAALS APGAPGPAGP PGPPGPAGAA GSAGVPGPMG PKGHPGFFGL PGAPGTKGET
GQKGDKGDKG EDGVGTKGSP GVPGVPGETG FMAENLQNGR PQPGAPGLPG VAKDGKNGER
GETGSPGVPG PAGPKGPSGP PGLCSPGEES SYGKRGEPGA PGDVGLRGIV GIPGLPGKQG
VAGPPGEKGE TGSEGPMGPV GLPGKTGGPG KDGADGKPGE KGSTGETGKQ GPVGVAGPQG
IQGVKGEKGA TGAKGFKGHT GHKGDQGPTG PVGPKGSQGD PGIAGDPGIK GDQGPPGVTG
IKGERGEKGQ RGATGAEGRP GQPGHEGVTG PMGARGLEGD PGIPGPPGPR GLPGPKVSDE
KLREMCSAVV EEQLEEFMKE LLRRPAALSA PGAPGPAGPP GPPGPAGAAG SAGVPGPMGP
KGHPGFFGLP GAPGTKGETG QKGDKGDKGE DGVGTKGSPG VPGVPGAPGL PGVAKDGKNG
ERGETGSPGV PGPAGPKGPS GPPGLCDPST CLSRLPPLYM VSGK
//