ID A0A3Q1HYI5_ANATE Unreviewed; 1362 AA.
AC A0A3Q1HYI5;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 28-JAN-2026, entry version 32.
DE RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS Anabas testudineus (Climbing perch) (Anthias testudineus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Anabantaria; Anabantiformes; Anabantoidei; Anabantidae; Anabas.
OX NCBI_TaxID=64144 {ECO:0000313|Ensembl:ENSATEP00000014157.1, ECO:0000313|Proteomes:UP000265040};
RN [1] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Wellcome Sanger Institute Data Sharing;
RL Submitted (APR-2021) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_026232922.1; XM_026377137.1.
DR Ensembl; ENSATET00000014386.3; ENSATEP00000014157.1; ENSATEG00000009884.3.
DR GeneID; 113173652; -.
DR CTD; 564123; -.
DR GeneTree; ENSGT00940000165423; -.
DR OMA; NQWKGER; -.
DR Proteomes; UP000265040; Chromosome 21.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000265040};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1362
FT /note="Thrombospondin-like N-terminal domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018602647"
FT DOMAIN 30..219
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 221..904
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 918..1027
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1117..1185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..296
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 332..341
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 347..358
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 373..385
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..418
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..449
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 451..460
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 465..475
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..515
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 525..538
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..596
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 598..613
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 631..645
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 688..705
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 708..717
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 725..735
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 736..745
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 801..813
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 841..851
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 853..868
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 918..930
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 941..959
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 965..974
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 983..997
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1009..1021
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1121..1137
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1140..1152
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1165..1177
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1362 AA; 139751 MW; D981485B382FC3E2 CRC64;
MRCSPWLKRL VCCVLVLVGP ATSQWPEDSG VSLLQLIGDP PPSEITQIDG PDNSPGYVFG
PDANAGQLAH AHLPSPFYRN FALIFNLKPT TDRGGVIFSV TDITQQIIYV GVKLSAVQGG
YQHVILYYTE PDSQTSYEAA RFRAPSMRDT WTRFAIAVRD DKVMFYLNCD IDPQVKRFER
SPDEMELEAG AGVFVGQAGG ADPEKFLGVI GELRVVGDPR AAERHCEEEE DDSDMASGDG
SGHEEVRPTK PAVERHRWTT APASFRPIQV PPHTRREEVS VSRETASDSQ QVSVSVETGA
GVPGPSGPAG PKGEKGDRGD RGEKGDRGPI GPKGESSASS GSQGGIRGEK GEPGEKGAKG
SAGFGYQGKK GEPGPPGPPG PPGPSGPATE LAVGSDGSVV SRVPGPRGPP GPPGPQGPPG
AEGEPGDPGE DGKTGPEGPP GFPGTPGDPG PRGEKGDRGE GQPGPRGPPG PPGPPGQGFR
STFVDMEGSG FLDMESVRGL PGPPGPPGPP GPPGPSTTGT ALSSGAFGPP GKEGAPGKPG
LPGLPGTDGS PGTPGPKGEK GDSGELGLPG AVGEKGARGE PGLPGTAGEP GLAGLPGPIG
PVGPPGPPGP PGPSYRVGFD DMEGSGGGFI SGRPGVRGPE GRQGPPGVPG LPGKPGLPGT
PGQKGSEGPA GSDGQPGLDG FPGAQGPRGD RGDKGERGEP GRDGTGRPGP PGPPGPPGQI
IYKTSGDSDG VSGSAGPQGG PGLPGKAGFP GPMGPKGDRG DSGLPGYGIK GEKGEPGLVL
GPDGNPLYLE RLTGLKGERG PPGPVGPTGP YGPPGLKGEI GMPGRPGRPG VNGYKGEKGE
PGGGAGYGLP GVPGSPGPPG PPGPPGPAIP LDRFSRYDET SRNYPAIKGE KGERGLPGIP
GTSSNFDIYT LKNELKGARG EPGVKGEKGE SGGGYYDPRF GVQGQPGPPG KQGLPGPKGD
SIMGPPGPQG PQGPPGIGYD GRPGPPGPPG PPGPPSQPGA YRPNYPVSVP GPPGPPGPPG
SPGLSSGVTV LRSYDTMIAT ARRQAEGSLI YIIDKADLYL RVRDGVRQVM LGDYNPFFRD
LENEVAEAQP PPVILYPESH DQSHNNGAGH YSQGVSVIQP IEPPPQPHVD PVYPPQYDPR
FPDPRHTGHT DGRLPTQQTE SRHPVTPPKR PSPPVPQPGG HVETSHSGLH LVALNAPQTG
NMRGIRGADF LCFQQARAVG LRGTFRAFLS SKLQDLYTIV RRSDRDSVPI VNLKDQVLFS
SWESLFTDEN KMRENVPVYS FDGRDILRDS AWPEKMVWHG SSNKGHRQTD HYCETWRAGD
RAVTGLASSL QSGHLLQQTS SSCSGSYIVL CIENAITSHF KK
//