ID A0A455BP63_PHYMC Unreviewed; 1140 AA.
AC A0A455BP63;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Collagen alpha-1(XIX) chain {ECO:0000313|RefSeq:XP_028350527.1};
GN Name=COL19A1 {ECO:0000313|RefSeq:XP_028350527.1};
OS Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Physeteridae; Physeter.
OX NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_028350527.1};
RN [1] {ECO:0000313|RefSeq:XP_028350527.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_028350527.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_028350527.1; XM_028494726.1.
DR AlphaFoldDB; A0A455BP63; -.
DR STRING; 9755.ENSPCTP00005026512; -.
DR Ensembl; ENSPCTT00005029189; ENSPCTP00005026512; ENSPCTG00005018611.
DR KEGG; pcad:112067401; -.
DR InParanoid; A0A455BP63; -.
DR OrthoDB; 3809795at2759; -.
DR Proteomes; UP000248484; Chromosome 10.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR GO; GO:0007519; P:skeletal muscle tissue development; IEA:Ensembl.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR37456:SF5; -; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 11.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|RefSeq:XP_028350527.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000248484};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1140
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019817568"
FT DOMAIN 50..234
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 295..681
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 704..1009
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1048..1140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..430
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 479..495
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..852
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1102..1118
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1140 AA; 114467 MW; 9678B9B2609CF8E2 CRC64;
MRPAGSWKLW FWAAMLLLPA SASVVSRDKP EKQCPILRKE GHQFTPDYRD KVEVSGFDLG
ERFSLRRVFC EGDKTCIKLG SSLLIRDTIK VFPKGLPEEF AIAAVFRVRR STKKERWFLW
QVLNQQNMPQ VSIVVDGGKK MVEFMFRAVE GDVLNYIFKN RELRPLFDRQ WHKLGFSIQS
RAISVYMDCN LVASRHIDGK GTVDFHGRTV IATRASDGKP VDVEIHQLQI YCNSNFIAQE
TCCEISDPKC PEQHGFGSTA ASLVPAHASR MSAFLPAKQE LTDQCQCIPN KGEAGLLGVP
GSPGQKGDKG EQGENGLHGA PGLPGQKGEQ GFEGSKGEIG GKGAQGEKGD PGLAGVHGQD
GLKGDVGPHG PPGPKGEKGD LGPPGPPALT GSLGIQGPQG PPGKEGQRGR RGKPGPPGKP
GPPGPPGPPG IQGMQQTLDG YYHKGYLGEH GAGGPKGEKG EIGPPGFPGS IGPKGEKGES
GEPFTKGEKG DRGEPGIKGS QGIKGEPGDP SPPGVIGSSG LKGQQGPAGP MGPRGPPGDT
GLPGEHGIPG KQGIKGEKGD PGGIIGPPGL PGPKGEAGPP GKSLPGEPGL DGNPGAPGPR
GPKGERGLPG IHGSPGDIGP PGIGIPGRTG SQGPAGEPGI QGPRGLPGLP GAPGTPGNDG
APGRDGKPGL PGPPGDPIAL PLLGDTGALF KNFCGNCQAS VPGLKSSKGE EAGTGEPGQF
DSVAQKGDVG PRGPPGVPGR EGPKGSKGER GYPGIPGEKG NEGLQGVPGL PGAPGPTGPP
GLLGRTGHPG PSGTKGDKGS EGPPGRPGPP GPPGVPFNEG NGMSSLYKLQ GGVNAPSYPG
PPGPPGPKGD PGPVGEPGAM GLPGLEGFPG IKGDRGPAGP PGVAGISGKP GAPGLRGIPG
EPGERGPVGD IGFPGPEGPS GKPGTNGKDG LPGVQGIMGK PGERGPKGER GDQGIPGDRG
PQGERGKSGL PGIKGAIGPM GPPGNKGSPG SPGHQGLPGH PGLPGSPADV VSFEEIKKYI
NQEVLRIFEE RMAVFLSQLK LPAAMLAAQA HGRPGPPGKD GLPGPPGDPG PQGYRGQKGE
RGEPGMGLPG SPGLPGTSAP GLPGSPGAPG PQGPPGPSGR CNPGDCLYPV SHVRQQTGGK
//