GenomeNet

Database: UniProt
Entry: A0A8X7X0Z5_POLSE
LinkDB: A0A8X7X0Z5_POLSE
Original site: A0A8X7X0Z5_POLSE 
ID   A0A8X7X0Z5_POLSE        Unreviewed;      1396 AA.
AC   A0A8X7X0Z5;
DT   14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 1.
DT   08-OCT-2025, entry version 12.
DE   SubName: Full=COIA1 protein {ECO:0000313|EMBL:KAG2459231.1};
DE   Flags: Fragment;
GN   Name=Col18a1 {ECO:0000313|EMBL:KAG2459231.1};
GN   ORFNames=GTO96_0019283 {ECO:0000313|EMBL:KAG2459231.1};
OS   Polypterus senegalus (Senegal bichir).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Polypteriformes; Polypteridae; Polypterus.
OX   NCBI_TaxID=55291 {ECO:0000313|EMBL:KAG2459231.1, ECO:0000313|Proteomes:UP000886611};
RN   [1] {ECO:0000313|EMBL:KAG2459231.1, ECO:0000313|Proteomes:UP000886611}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bchr_013 {ECO:0000313|EMBL:KAG2459231.1};
RX   PubMed=33545088;
RA   Bi X., Wang K., Yang L., Pan H., Jiang H., Wei Q., Fang M., Yu H., Zhu C.,
RA   Cai Y., He Y., Gan X., Zeng H., Yu D., Zhu Y., Jiang H., Qiu Q., Yang H.,
RA   Zhang Y.E., Wang W., Zhu M., He S., Zhang G.;
RT   "Tracing the genetic footprints of vertebrate landing in non-teleost ray-
RT   finned fishes.";
RL   Cell 0:0-0(2021).
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAG2459231.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAATIS010005477; KAG2459231.1; -; Genomic_DNA.
DR   Proteomes; UP000886611; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000886611};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..1396
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5036496811"
FT   DOMAIN          33..221
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          219..246
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          273..1066
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1141..1212
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        228..241
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        275..314
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        326..338
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        387..399
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        418..430
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        465..474
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        479..492
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        517..531
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        615..628
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        630..639
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        712..727
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        733..742
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        846..855
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        877..887
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        894..905
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        942..963
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1019..1034
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1047..1061
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1141..1154
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KAG2459231.1"
FT   NON_TER         1396
FT                   /evidence="ECO:0000313|EMBL:KAG2459231.1"
SQ   SEQUENCE   1396 AA;  142965 MW;  2CB2000CDFBD5FC5 CRC64;
     MGKRCLARLQ PLLCFVLLWS IPPVASQKGD EGDVSLLQII GDPPPEPILK VEGPDNIAGY
     MFTPEVNTGQ LARAHFPAIF YRDFSLLFHI KPTSVKPGVI FAVTDFQQKL MYVGVKLSAA
     KGDIRNVLFY YTEPDAESSY QAASFKISSP IDEWVRFAIK VKDDEVTFFL NCQQAEVTRI
     ERSPDEMELE GGAGVFVAQA GGADPDKFEG AIAELRISGD PDLADRQCDE DEGGDDSDDA
     SGDYGSGIEE KIKVQTESTI TVKPLVLPPV TKPDISVKKE KDKEKEKHKP PVAIETEKKH
     VFEKVESLPP RVKEGIPAQI GLAGPKGEKG DRGEKGSRGD QGPTGPKGDN GAISTGTQGG
     KGEPGEKGMK GSAGFGYPGP KGDTGAPGPP GPPGPPGPAP EVVKRGDGTV VQQVSGPQGP
     PGKPGQPGAP GPAGADGEPG DPGEDGKPGT VGPPGFPGLP GDTGPKGEKG EKGEGQPGPR
     GPPGPPGLPG PPRSDKQTFF DMEGSAFPDL DSIRGPPGLP GLPGPPGPPG PLSTTGTLGP
     VGPPGPPGSN GKDGAPGPRG PAGLPGPPGK DGSAGQPGAK GDKGESGEYG LPGPSGEKGS
     KGDTGTPGNP GEIGLAGLPG PMGPVGLPGL PGPPGPPGPG FAFGIDDTEG SAFGVSGPDG
     KPGLPGVPGM PGLKGDRGIP GLPGLKGESG IPGAAGKDGQ QGLDGFPGPE GLKGDKGDPG
     ERGEPGRDGA GLPGPPGPPG PPGRTIYEAS ENHGFNGVPG PEGKQGVNGQ AGFPGPMGPK
     GDRGDRGPPG YGIKGEKGEP GSLIGPDGNI MNGYLSRAAK GEKGDRGPPG PFGPVGATGH
     KGEIGLPGRP GRPGLNGFKG EKGEPADPSG RYAVVGPPGP PGPPGPPGIA GYDETGGAGR
     VLTGGKGERG FPGYPGEKGE PGEPGQPGLP GQPGSLSNFD VHSLKRELKG ERGDVGLKGE
     KGEPGGGGGY YDSRFSGIQG PPGKPGPQGL PGPKGDSIRG PPGPPGHQGP PGVGYDGRPG
     PPGQPGLPGP PGPSASPGAS RHSISIPGPP GPQGPPGPPG YPSGVNIVRN LQTMMSTARN
     VQEGTLIMIL EKGDLYIRVR DGYRQIMLGE YTPFFGPGGG IFENEVAAVQ PPPVILYPHS
     HPSQSAAETI SQGGQDIKQD LPPHSPVRQP DYGHNVYPSV PDPPRRNPLL PQVSYPDSPQ
     HRPHLSESSS SAIHLHGTGP ALHLIALNSP QTGDMRGIRG ADFQCFQQAR AVGLTGTFRA
     FLSSKLQDLY SIVRRADRNT LPIVNLKDEV LFDSWESLYN GLDGQMKPNV HIYSFDGRDV
     LKDNTWTQKM VWHGSNNKGQ RMTDHYCETW RTGVHELTGF ASSLEAGRLL QQRPVSCSSS
     LIVLCIENSF MTQAKK
//
DBGET integrated database retrieval system