GenomeNet

Database: UniProt
Entry: A0A493TZH9_ANAPP
LinkDB: A0A493TZH9_ANAPP
Original site: A0A493TZH9_ANAPP 
ID   A0A493TZH9_ANAPP        Unreviewed;      1163 AA.
AC   A0A493TZH9;
DT   05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT   05-JUN-2019, sequence version 1.
DT   28-JAN-2026, entry version 26.
DE   SubName: Full=Collagen type XV alpha 1 chain {ECO:0000313|Ensembl:ENSAPLP00000031282.1};
GN   Name=COL15A1 {ECO:0000313|Ensembl:ENSAPLP00000031282.1};
OS   Anas platyrhynchos platyrhynchos (Northern mallard).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC   Anatinae; Anas.
OX   NCBI_TaxID=8840 {ECO:0000313|Ensembl:ENSAPLP00000031282.1, ECO:0000313|Proteomes:UP000016666};
RN   [1] {ECO:0000313|Ensembl:ENSAPLP00000031282.1, ECO:0000313|Proteomes:UP000016666}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Hou Z.-C., Zhou Z.-K., Zhu F., Hou S.-S.;
RT   "A new Pekin duck reference genome.";
RL   Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSAPLP00000031282.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSAPLP00000031282.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A493TZH9; -.
DR   STRING; 8840.ENSAPLP00000031282; -.
DR   Ensembl; ENSAPLT00000046444.1; ENSAPLP00000031282.1; ENSAPLG00000014374.2.
DR   GeneTree; ENSGT00940000158302; -.
DR   OMA; RATEDQC; -.
DR   Proteomes; UP000016666; Chromosome 2.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005783; C:endoplasmic reticulum; IEA:Ensembl.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000016666};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..1163
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019785847"
FT   DOMAIN          38..226
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          223..251
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          337..566
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          583..631
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          646..713
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          755..906
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        370..379
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        394..403
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        445..460
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        553..562
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        589..605
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        607..622
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        667..688
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        704..713
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        771..780
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        781..790
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        868..881
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        891..903
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1163 AA;  121273 MW;  55D432BC8EEF61BA CRC64;
     MLSWHAWWTW DLLLLLLGLS IHSGSAAEII EERGSKGSLD LTELIGVPLP PSVYFVTGYG
     GFPAYSFGID ANIGRLTSAI IPLSFYRDFA IVVTVKPNSD RGGVLFAITD AFQKTIYLGM
     RISPVDDSTQ RIIMYYTEPG SRISQEAASF KVPVMTNKWN RFTVTVQGNY IALFMDCEEY
     QRVPFQRSAE ALVFDSDSGI FVGNAGATGL DKFTGSIQHL TIRSDPRATE DHCEDDDPYA
     SGDMSGNGSI QEHENIPETQ EALSPSHLPI RLEDTLAEPV EAPPTILSFL EESNFSGHHR
     TEETSETAKI REQDSAVMET GQGNSQSTTV TQKILGEEDA SGFSVLPTES GEKKGQNGED
     GPTGPPGRPG IEDVQRKDPG SPGPSAKHGP PGLPGLDGVV GPPGQKGEKG DRGLPGSIGP
     KGSTGDTGLI GPKGEAGPHG SPGKQGPPGP PGPPGPPGPP GSTGLSYSSG FEDMEGSGII
     GLLSESRITG SRVPKGERGN IGPPGSKGLP GTKGKPGFPG IDGHPGNMGP KGEKGDPGPQ
     GERGQDGNSI VGPPGPPGPP GPIIAIPELL LNDTDGIFNF SSVEELVGPP GPDGRPGLPG
     FPGPQGPKGD IGLPGPKGPK GQQGEKGEPG AIITADGSLT ELLGRKGQKG EAGVVGPVGP
     RGPIGPAGPK GELGFPGRPG RPGLNGLRGV KGERGEALNG MPGLPGPPGP PGRPGRIVYI
     KGTVFPVSRR PHCKMTVSAT YPGNEEALNV QGAKGNRDCW GLQSTGGSPD LKGEKGDRGD
     PGPPGPPLPP SYFSHFINSI KGEKGDNGNT GVKGEKGEPN RGFYLTGPPG PPGRPGLIGP
     KGDSVVGPRG PPGLPGLPGL PGYGKTGPPG PPGPPGPPGP PAIYGSASAM PGPPGPPGEP
     GPPVSRNLIT TFQNIAGMLE KVHFVAEGTL IYLSETSEVF IRVRNGWRKL QLGELIPIPD
     DSLPPPAISS HGFQSLPAPS PISNMNNGKP VLHLVALNLP FSGDMRADFQ CFQQAQLAGL
     TATYRAFLSS HLQDLATVVR KTDRYNLPIV NLKGEMLFHN WESIFKGNGG EFNINVPIYS
     FDGRNVMTDP SWPQKVMWHG STANGIRMVS NYCEAWHTSD LAAMGQASPL KKGKLLDQKA
     YSCSNQFIVL CIENSFVSDP QGK
//
DBGET integrated database retrieval system