GenomeNet

Database: UniProt
Entry: A0A3Q1HYI5_ANATE
LinkDB: A0A3Q1HYI5_ANATE
Original site: A0A3Q1HYI5_ANATE 
ID   A0A3Q1HYI5_ANATE        Unreviewed;      1362 AA.
AC   A0A3Q1HYI5;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 1.
DT   28-JAN-2026, entry version 32.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS   Anabas testudineus (Climbing perch) (Anthias testudineus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Anabantaria; Anabantiformes; Anabantoidei; Anabantidae; Anabas.
OX   NCBI_TaxID=64144 {ECO:0000313|Ensembl:ENSATEP00000014157.1, ECO:0000313|Proteomes:UP000265040};
RN   [1] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Wellcome Sanger Institute Data Sharing;
RL   Submitted (APR-2021) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSATEP00000014157.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_026232922.1; XM_026377137.1.
DR   Ensembl; ENSATET00000014386.3; ENSATEP00000014157.1; ENSATEG00000009884.3.
DR   GeneID; 113173652; -.
DR   CTD; 564123; -.
DR   GeneTree; ENSGT00940000165423; -.
DR   OMA; NQWKGER; -.
DR   Proteomes; UP000265040; Chromosome 21.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265040};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1362
FT                   /note="Thrombospondin-like N-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018602647"
FT   DOMAIN          30..219
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          221..904
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          918..1027
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1117..1185
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        241..257
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        282..296
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        312..327
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        332..341
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        347..358
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        373..385
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        406..418
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        437..449
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        451..460
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        465..475
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        501..515
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        525..538
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        580..596
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        598..613
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        631..645
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        688..705
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        708..717
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        725..735
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        736..745
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        801..813
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        841..851
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        853..868
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        918..930
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        941..959
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        965..974
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        983..997
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1009..1021
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1121..1137
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1140..1152
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1165..1177
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1362 AA;  139751 MW;  D981485B382FC3E2 CRC64;
     MRCSPWLKRL VCCVLVLVGP ATSQWPEDSG VSLLQLIGDP PPSEITQIDG PDNSPGYVFG
     PDANAGQLAH AHLPSPFYRN FALIFNLKPT TDRGGVIFSV TDITQQIIYV GVKLSAVQGG
     YQHVILYYTE PDSQTSYEAA RFRAPSMRDT WTRFAIAVRD DKVMFYLNCD IDPQVKRFER
     SPDEMELEAG AGVFVGQAGG ADPEKFLGVI GELRVVGDPR AAERHCEEEE DDSDMASGDG
     SGHEEVRPTK PAVERHRWTT APASFRPIQV PPHTRREEVS VSRETASDSQ QVSVSVETGA
     GVPGPSGPAG PKGEKGDRGD RGEKGDRGPI GPKGESSASS GSQGGIRGEK GEPGEKGAKG
     SAGFGYQGKK GEPGPPGPPG PPGPSGPATE LAVGSDGSVV SRVPGPRGPP GPPGPQGPPG
     AEGEPGDPGE DGKTGPEGPP GFPGTPGDPG PRGEKGDRGE GQPGPRGPPG PPGPPGQGFR
     STFVDMEGSG FLDMESVRGL PGPPGPPGPP GPPGPSTTGT ALSSGAFGPP GKEGAPGKPG
     LPGLPGTDGS PGTPGPKGEK GDSGELGLPG AVGEKGARGE PGLPGTAGEP GLAGLPGPIG
     PVGPPGPPGP PGPSYRVGFD DMEGSGGGFI SGRPGVRGPE GRQGPPGVPG LPGKPGLPGT
     PGQKGSEGPA GSDGQPGLDG FPGAQGPRGD RGDKGERGEP GRDGTGRPGP PGPPGPPGQI
     IYKTSGDSDG VSGSAGPQGG PGLPGKAGFP GPMGPKGDRG DSGLPGYGIK GEKGEPGLVL
     GPDGNPLYLE RLTGLKGERG PPGPVGPTGP YGPPGLKGEI GMPGRPGRPG VNGYKGEKGE
     PGGGAGYGLP GVPGSPGPPG PPGPPGPAIP LDRFSRYDET SRNYPAIKGE KGERGLPGIP
     GTSSNFDIYT LKNELKGARG EPGVKGEKGE SGGGYYDPRF GVQGQPGPPG KQGLPGPKGD
     SIMGPPGPQG PQGPPGIGYD GRPGPPGPPG PPGPPSQPGA YRPNYPVSVP GPPGPPGPPG
     SPGLSSGVTV LRSYDTMIAT ARRQAEGSLI YIIDKADLYL RVRDGVRQVM LGDYNPFFRD
     LENEVAEAQP PPVILYPESH DQSHNNGAGH YSQGVSVIQP IEPPPQPHVD PVYPPQYDPR
     FPDPRHTGHT DGRLPTQQTE SRHPVTPPKR PSPPVPQPGG HVETSHSGLH LVALNAPQTG
     NMRGIRGADF LCFQQARAVG LRGTFRAFLS SKLQDLYTIV RRSDRDSVPI VNLKDQVLFS
     SWESLFTDEN KMRENVPVYS FDGRDILRDS AWPEKMVWHG SSNKGHRQTD HYCETWRAGD
     RAVTGLASSL QSGHLLQQTS SSCSGSYIVL CIENAITSHF KK
//
DBGET integrated database retrieval system