GenomeNet

Database: UniProt
Entry: A0AAZ3RV54_ONCTS
LinkDB: A0AAZ3RV54_ONCTS
Original site: A0AAZ3RV54_ONCTS 
ID   A0AAZ3RV54_ONCTS        Unreviewed;      1343 AA.
AC   A0AAZ3RV54;
DT   05-FEB-2025, integrated into UniProtKB/TrEMBL.
DT   05-FEB-2025, sequence version 1.
DT   28-JAN-2026, entry version 5.
DE   RecName: Full=Collagen alpha-1(XVIII) chain-like {ECO:0008006|Google:ProtNLM};
GN   Name=LOC112260532 {ECO:0000313|Ensembl:ENSOTSP00005144998.1};
OS   Oncorhynchus tshawytscha (Chinook salmon) (Salmo tshawytscha).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC   Salmonidae; Salmoninae; Oncorhynchus.
OX   NCBI_TaxID=74940 {ECO:0000313|Ensembl:ENSOTSP00005144998.1, ECO:0000313|Proteomes:UP000694402};
RN   [1] {ECO:0000313|Proteomes:UP000694402}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=29621340;
RA   Christensen K.A., Leong J.S., Sakhrani D., Biagi C.A., Minkley D.R.,
RA   Withler R.E., Rondeau E.B., Koop B.F., Devlin R.H.;
RT   "Chinook salmon (Oncorhynchus tshawytscha) genome and transcriptome.";
RL   PLoS ONE 13:e0195461-e0195461(2018).
RN   [2] {ECO:0000313|Ensembl:ENSOTSP00005144998.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSOTSP00005144998.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSOTST00005127031.1; ENSOTSP00005144998.1; ENSOTSG00005042394.2.
DR   GeneTree; ENSGT00940000164061; -.
DR   Proteomes; UP000694402; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1019; COLLAGEN ALPHA-5(IV) CHAIN ISOFORM X1; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000694402};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..1343
FT                   /note="Collagen alpha-1(XVIII) chain-like"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5044199017"
FT   DOMAIN          110..300
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          159..299
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          301..362
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          387..765
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          778..825
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          867..1069
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1123..1146
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..348
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        389..400
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        417..430
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        539..553
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        589..601
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        643..652
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        679..688
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        728..743
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        749..760
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        902..914
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        915..927
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        963..978
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1027..1041
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1049..1058
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1343 AA;  138817 MW;  4A686DD5910C2190 CRC64;
     MALPRLYSLL LLMAPVHGQW WSAFLGKAQE MTTQPLTTVP TTTQVLWTTE GTEVGQVGTQ
     TEVFTTAPVQ STPLQSIQEP AGDGTTEIKP KAKKISLKMW KSRERGSTGH LDLTELIGVP
     LPPSVSFITG YEGFPAYGFG PDANIGRLTK TFMPDPFFRD FAIIVTIRPS SKQGGVLFAI
     TDAQQKVVQL GLALTPVEDE TQRIQLYYTE GGEESSHSQQ VASFKLPDMR NKWTRFTLSV
     QDQEVRLYMD CDDFQAETFH RSSRQLSFEP SSGIFVGNAG GTGLEKFVGS IQQLVIKPDP
     RAAEEQCEED DPYASGDGSG DDTLHDRETD DKLMKNMERK KETARPEDML SVPVRAPPTE
     SPELELDEYT VHLTPTNQAH QEMLLEGSHQ TEEPGERSGD GRPLSYGQKG EQGEPGPMGP
     AGPRGPPGPS TPSEDRSGHG QPGPRGPQGP SGAPGVPGKD GQPGSKGEDG DPGQRGPHGF
     PGLAGEVGVK GDKGDTGVGL PGPPGPPGPL KSHSVPYGED ALGSGFGDLD DTEFIRGSPG
     PPGPPGQPGP PGPTRFFEAS EGLFPGQPGS PGPPGRDGLV GKPGPPGPTG LDGDAGLTGP
     TGLKGEQGLA GPNGPMGVSG DPGLTGATGP RGPEGKTGDP GPRGLPGPPG PPGGRFFVED
     VEGSGKNDMV LGTELKGPQG PPGLPGPAGP KGEDGKDGAS GLSVKGEPGA SGPEGLQGLA
     GLPGARGLKG DKGDPGPKGE CGPDGHNVPG PPGPPGPPGP IINLQDSLLN NTESMFNITE
     IRGPPGPMGP EGEPGRAGFP GPRGPKGDSG LPGFQGPPGM KGAKGEAGVT IAADGTALTS
     VRGPRGPKGI KGERGFPGAY GVMGPIGPTG QKGEYGFPGR PGRPGMAGKK GDQGDAIGQP
     GLPGPPGPPG PPGPVTGLNG VNGSKGSSQG GRRRNGGAKG EKGEVGLPGM PGEPDDILPE
     GFVGEKGDMG YEGMKGDQGE PGLPGPPGLP GRSGLVGPKG ESVIGTVGHP GAPGEPGVLG
     IGRPGSRGPP GPAGPPGPPP VYGSAVSIPG PPGPPGPPGI TGYENPVSTY RNTNSLMRES
     HRAAEGSMAY VSDKGELYVR TRDGWRKVQL GELILVPAES PSSAVSQALS RPGDRTRPHR
     PHSQELVGTS YVPNYNVLPH TVHSVPALHL VALNTPFSGD MRGIRGADFQ CYQQARAMGL
     TATYRAFLSS HLQDLATIVK KGDRYNMPVI NLKGEVIYSS WMNIFSGNGG VFDPSIPIYS
     FEGRNVMTDP TWPQKLVWHG SSTVGIRMTT NYCEAWRAGD MAVTGQASLL QTGRLLGQHT
     RSCSNHFIVL CIENSYIDHR RSN
//
DBGET integrated database retrieval system