GenomeNet

Database: UniProt
Entry: A0A8X7WYR2_POLSE
LinkDB: A0A8X7WYR2_POLSE
Original site: A0A8X7WYR2_POLSE 
ID   A0A8X7WYR2_POLSE        Unreviewed;      1287 AA.
AC   A0A8X7WYR2;
DT   14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 1.
DT   28-JAN-2026, entry version 13.
DE   SubName: Full=COFA1 protein {ECO:0000313|EMBL:KAG2458303.1};
DE   Flags: Fragment;
GN   Name=Col15a1 {ECO:0000313|EMBL:KAG2458303.1};
GN   ORFNames=GTO96_0018239 {ECO:0000313|EMBL:KAG2458303.1};
OS   Polypterus senegalus (Senegal bichir).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Polypteriformes; Polypteridae; Polypterus.
OX   NCBI_TaxID=55291 {ECO:0000313|EMBL:KAG2458303.1, ECO:0000313|Proteomes:UP000886611};
RN   [1] {ECO:0000313|EMBL:KAG2458303.1, ECO:0000313|Proteomes:UP000886611}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bchr_013 {ECO:0000313|EMBL:KAG2458303.1};
RX   PubMed=33545088;
RA   Bi X., Wang K., Yang L., Pan H., Jiang H., Wei Q., Fang M., Yu H., Zhu C.,
RA   Cai Y., He Y., Gan X., Zeng H., Yu D., Zhu Y., Jiang H., Qiu Q., Yang H.,
RA   Zhang Y.E., Wang W., Zhu M., He S., Zhang G.;
RT   "Tracing the genetic footprints of vertebrate landing in non-teleost ray-
RT   finned fishes.";
RL   Cell 0:0-0(2021).
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAG2458303.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAATIS010007298; KAG2458303.1; -; Genomic_DNA.
DR   Proteomes; UP000886611; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000886611};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..1287
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5036474764"
FT   DOMAIN          36..224
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          224..251
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          313..766
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          821..855
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          869..1027
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        229..251
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        328..346
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        361..370
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        372..382
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        383..395
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        483..499
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        519..531
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        575..590
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        591..603
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        685..697
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        705..714
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        838..850
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        920..940
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        967..998
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1008..1021
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KAG2458303.1"
FT   NON_TER         1287
FT                   /evidence="ECO:0000313|EMBL:KAG2458303.1"
SQ   SEQUENCE   1287 AA;  132618 MW;  5EC87445E36BC767 CRC64;
     MALGDCRGPL ALLLILWSCH LSEAAQVLEE RGSRGHLDLT ELIGVPLPPS VSFITGYEGF
     PAYSFGQDAN IGRLTKTFIP QPFYRDFAVI VTIKPTTNRG GVLFAITDAF QKIIYLGIEL
     TPVEDDTQRI IMYYTESSTS KSQQVASFKV QEMTNKWNRF TLAVQDNKIT LFMDCEEYHR
     ILFERTSQQL QFESSSGIFV GNAGGTGRPK FIGSIQQLLI KPDPRAAEEQ CEDDDPYASG
     DESGDSPQEE INVDEAKKNV EEVEYFPSPS ELEHSVPVPA PPTEVSIIKE TDEFSGGLST
     PTEQLLISSA IPEHTDEVGL ESSVSQKEVP RERGHGVDAS HGEKTESSSF RFGPKGEKGD
     PGPPGPPGPS GQPGQSNGVT PLAGPPGPAG PPGQPGTPGR DGQPGLSGKD GLPGEKGPQG
     FPGLPGESGI KGEKGDPGVG LPGPPGPPGA PGRAFSYRLM NENDAEGSGL RDIDSDAEIM
     RGPPGPPGPP GKPGPPGPSN PGGSLNDAVT GPPGPPGIPG KDGAPGKPGL PGFHGPPGRN
     GSIGPKGEHG DKGDIGPQGL PGEKGDPGMP GEAGPPGLSG TQGQTGLPGP QGLPGPPGPP
     GPPGRGYNFG LEDLEGSTAF GGLSEPGPRG PPGIAGLPGP VGSKGEQGDI GPPGATGQKG
     EPGLTGVDGL PGIAGQPGPV GQKGEMGDKG QRGERGLDGV SITGPPGPPG PPGPVISLQD
     FMLNDTDGIF NFTGIQGPPG PPGPRGPKGD HGFPGVQGPP GVKGEKGEPG VVIAADGSMM
     TGVIGPKGMK GDRGMSGPHG PMGPSGAKGE FGFPGRPGRP GMTGKKGEKG EVLNIQGPPG
     PPGPPGPPGP CLSIKGTVFP VPPRLHCKTP VNGNEETDRT LSFSVIKGQK GSLGTNGRKG
     EKGANGTPGK PGPADSLLPD DFKGDKGDAG YKGDKGEKGE AGLPGPPGLP GRSGLVGPKG
     EAVIGPRGPP GFPGLPGPPG FGRPGPPGPP GPPGPPGPSS MFGSSIPLPG PPGPPGQPGP
     PGIGTSNAVI TYRNLEGLLR DSYRTAEGTL IYITDKSELY IRVRQGWRKV QLGDLIPITG
     DSPPLPGSPV LEMQSFLPGH SLLSHTVHTH PGLHLVALNA PLSGDMMGIR GADNQCFQQA
     RAMGLVATYR AFLSSQLQDL ATIVKKADRF NTPIVNLKGE ILFNNWMEIF ATNGQFNPQI
     PIYSFDGRNV LTDPSWPQKM IWHGSSPGGI RLMSSYCEAW RAGDMAVTGQ ASFLQRGRLL
     EQHTRSCSNN FIVLCIENSY LQDHRRK
//
DBGET integrated database retrieval system