ID A0A4W5N809_9TELE Unreviewed; 652 AA.
AC A0A4W5N809;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 28-JAN-2026, entry version 27.
DE RecName: Full=Collagen type XVIII alpha 1 chain a {ECO:0008006|Google:ProtNLM};
OS Hucho hucho (huchen).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Hucho.
OX NCBI_TaxID=62062 {ECO:0000313|Ensembl:ENSHHUP00000045675.1, ECO:0000313|Proteomes:UP000314982};
RN [1] {ECO:0000313|Proteomes:UP000314982}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Macqueen D.J., Gundappa M.K.;
RT "Genome assembly of Danube salmon.";
RL Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSHHUP00000045675.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSHHUP00000045675.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A4W5N809; -.
DR STRING; 62062.ENSHHUP00000045675; -.
DR Ensembl; ENSHHUT00000047365.1; ENSHHUP00000045675.1; ENSHHUG00000027869.1.
DR GeneTree; ENSGT00940000165423; -.
DR Proteomes; UP000314982; Unassembled WGS sequence.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR050938; Collagen_Structural_Proteins.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000314982}.
FT DOMAIN 236..283
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 478..647
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..45
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 127..235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 284..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..29
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..43
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 127..140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..227
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 329..346
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 347..363
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..392
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..418
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..459
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 652 AA; 71093 MW; D9319E0D75FA5F2B CRC64;
MPGRPGRPGL NGVKGERGDS AGGSGGYGGP PGPPGPPGPP GPAVPLDRFN RYDDVSRLYP
GKSTHSHWPV SHHDVKRCHV FSYMLAFFLH PSDSKGEKGD RGVPGIPGVP GLTSNFDIYT
FKKEMKGERG VSGMKGEKGE PAGGYHYSGQ GGQPGPPGLP GPQGESIIGP SGPQGPPGNP
GRGYEGRQGN PGPPGPPGPS GSSSSPGAYR PTQTISIPGP PGPPGPPGTD GHSSGVMVLR
SYDTMTATAR RQAEGTLVYL VDQTDFYIRV RDGFRKIQLG PYIALPPDQG NELAAVDPPP
VVYYQPDQPS NTATEQPPRQ LDPHQPQPEG HHPVYPDPRN PTHPDPRYPA QPDPRYPAQP
DPRYPAHPDP RYPAQPDPRY PAQPDPRYPA QPDPRYHSHP DPHYPSHSDP RYPSHTDPRY
PSYTDRQHNP DQVQPVQPQP APVPQNPVYS DTRYPVTPQR RPRPPETPSH QHTSGPSIHL
VALNAPQEGN MRGIRGADFL CFNQARAIGL KGTFRAFLSS KLQDLYSIVR KSDRDRMPIV
NLKDEVLFDS WEAIFSDSEG KVKDNVPIYS FDGKDIFTDD TWPDKMIWHG STSRGHGQVD
NYCETWRIGE QALTGMASSL QGGQLLQQRT SSCHSSYAVL CIENSYIGQF KR
//