ID A0A674P5B4_TAKRU Unreviewed; 1232 AA.
AC A0A674P5B4;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 28-JAN-2026, entry version 23.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like {ECO:0000313|Ensembl:ENSTRUP00000081040.1};
GN Name=LOC101066353 {ECO:0000313|Ensembl:ENSTRUP00000081040.1};
OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Takifugu.
OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000081040.1, ECO:0000313|Proteomes:UP000005226};
RN [1] {ECO:0000313|Ensembl:ENSTRUP00000081040.1, ECO:0000313|Proteomes:UP000005226}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21551351;
RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., Hosoya S.,
RA Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT "Integration of the genetic map and genome assembly of fugu facilitates
RT insights into distinct features of genome evolution in teleosts and
RT mammals.";
RL Genome Biol. Evol. 3:424-442(2011).
RN [2] {ECO:0000313|Ensembl:ENSTRUP00000081040.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSTRUP00000081040.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_029699052.1; XM_029843192.1.
DR AlphaFoldDB; A0A674P5B4; -.
DR Ensembl; ENSTRUT00000075124.1; ENSTRUP00000081040.1; ENSTRUG00000028988.1.
DR GeneID; 101066353; -.
DR GeneTree; ENSGT00940000164061; -.
DR Proteomes; UP000005226; Chromosome 10.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000005226};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1232
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5025534146"
FT DOMAIN 45..234
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 233..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 282..666
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 766..796
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 839..982
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..335
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 357..369
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..410
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 424..433
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..473
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 487..496
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 503..520
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..545
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 550..562
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 786..795
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 939..951
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 963..975
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1232 AA; 126836 MW; 3A06F9629D70D93F CRC64;
MGRSRQRRNA PNLLLLIGTI CSSPTITCTS EFDPRQTKEN GPLGALDLTE LIGVPLPPSV
SFVTGFEGYP AYSFGPDANV GRLAKSFIPD PFYYDFAVTV TAKPTTHRGG VLFAITDAYQ
KIVHLGVALS EVEDGSQRVV LYYTDQATRG RTQEAASFKM GDLTGRWARF TLTVQGAEVR
LYMDCEEYHR VAFNRSPRPL TFEAGSGIFV GNAGGTGLTK FVGSIQQLVL KSDPTAPDDQ
CEEDDPYASG YGSGDSTYED MEGVGEVKKF VEKIVYTVPL PELDPTYSAP VQAPPTELSP
GYLETDDEDL QETSGQEMDS TTVTVPETSY QTDASASEED TLSPGQKGEP GEPGPRGPAG
RGPEEGPGGP EVHPGQPGAP GKDGQPGTKG EDGLPGPTGL PGLPGLMGEP GPKGEKGDLG
FGLPGPPGPP GLPGQPSKSS LSEGSGFDDF DTDDEIIRGA PGPRGPPGPP GPPGESSEVI
LPGAPGKDGE DGEKGEPGLP GVDGKDGEPG PTGDKGDKGE PGLTGQPGPK GDQGPPGIPG
LQGPDGQPGP RGPPGPPGPP VPSGRAFVMD MEDLEGSGMN EFGVSRGPQG PPGLPGLKGP
KGNDGVSGKP GLKGEPGVAG SPGLPGQEGL KGKEGPMGAK GEPGPKGEAG RDGLSVHGPP
GPPGPPGAII NLQELLLNDT AGALNFSGVF EAQGPKGDMG IRGVQGPPGL KGEKGEPGHL
LSEDGTVITG PAGPRGHKGD AGLPGTPGIQ GPVGPAGPKG ELGFPGRSGR LGLMGPKGEK
GDSGGPPGPP GPPGRPGMFN CPKGNCCRMN PKYTCLMHVQ AGKVKVHCDK NELNPNGTIA
YGTCQSGPKG EKGDHGPPGT PAERLSQGVD PRSGWGSRGE QGFKGQKGDK GEAGLPGRAG
TPGRPGLVGP KGESVLGPQG PPGMPGSPGS PGFGKTGPVG PPGPPGPPGP PSRYGSALTM
AGPPGPPGPT GPPGPSGNGA AFRTFSSRET MMQQSSRDPE GTLSYVTGTG SLYLKVSQGW
KEVQLGDLIH VSSNIIPQDE PKVAYQIRGD TMERIPSASQ RLNLVALNQP HSGDMMGLDV
ADRLCYEQAK AMGLPPNYRA FVSSHRQDLV HVVYPGFRET FPITNLRGDV MFRNWRSIFN
GNGGPISPRV PIYSFDGRDV LADPFWPQKS IWHGSTSTGT RVMDKHCETW RTDHVSVVGQ
SSSLAGGLLL GQQMRSCSNQ YIVLCIETHK NL
//