ID G3SPL5_LOXAF Unreviewed; 1139 AA.
AC G3SPL5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Collagen type XIX alpha 1 chain {ECO:0000313|Ensembl:ENSLAFP00000001664.3};
GN Name=COL19A1 {ECO:0000313|Ensembl:ENSLAFP00000001664.3};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000001664.3, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000001664.3, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000001664.3,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000001664.3}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000001664.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3SPL5; -.
DR STRING; 9785.ENSLAFP00000001664; -.
DR Ensembl; ENSLAFT00000001990.3; ENSLAFP00000001664.3; ENSLAFG00000001988.3.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000158276; -.
DR HOGENOM; CLU_282267_0_0_1; -.
DR InParanoid; G3SPL5; -.
DR OMA; FMFQATE; -.
DR TreeFam; TF351778; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR GO; GO:0007519; P:skeletal muscle tissue development; IEA:Ensembl.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 11.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007646};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1139
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003454024"
FT DOMAIN 50..234
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 288..678
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 705..1006
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1047..1139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 334..348
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..430
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 479..495
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 836..857
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1101..1117
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1139 AA; 114246 MW; 86C5C8157866C46C CRC64;
MRHTSSWKLW LWVAVLLLPA SASMTVRDKP EKLCPILRTN GYQFIQGSGD KLEVSGFDLG
ESFSLRRAFC EGDKTCFKLG SALLIRDTIK IFPKGLPEEY SVAAMFRVRR TTKKERWFLW
QVLNQQNMPQ VSIVIDGGKK VVEFMFRAAE GDVLNYIFKN RELRPLFDRQ WHKLGIGIQS
RVISLYMDCN LIERRQTDEK DTVDFHGRTV IAARASDSKP VDIELHQLKI YCNSNFVAQE
ACCDVSDTKC PEQDGIGSTA SLSVTAHASK MSAYLPAKQE LKDQCQCIPN KGEAGLPGAP
GSPGEKGDKG EPGENGLHGA PGLPGQKGEQ GFEGGKGEIG EKGDPGEKGD PGLTGINGQD
GLKGDLGPHG PSGPKGEKGD MGPPGPPALT SSPGIQGPQG PPGKEGQRGR RGKPGPPGKP
GPPGPPGPPG IQGVQQTFGG YYNKGNLGEH GAGGPKGEKG KNGQPGFPGS VGSKEQKGEP
GEPFTKGEKG DRGEPGTKGS QGIKGEPGDP GPPGLIGSPG LKGQQGPTGS MGPRGPPGDV
GLPGEHGIPG KQGTKGEKGD PGGIIGPPGL PGPKGEAGPP GKSLPGEPGS DGNPGAPGPR
GPKGERGLPG VHGSPGDTGP PGVGIPGRTG SQGPAGEPGI QGPRGLPGLP GTPGSDVRTG
APGKDGKPGL PGPPGDPIAL PLLGDIGALL KNFCGNCQSG IPGLKSNKED GGAGEPGKYD
FTAQKGDTGP RGPPGIPGRE GPKGNKGERG YPGIPGDKGD EGLQGIPGIP GAPGPTGPPG
LLGRTGHPGP VGAKGDKGSE GPPGKPGPPG PPGVPFNEGN GMSSLYKIQG GVNVPSYPGP
PGPPGPKGDP GPVGEPGPMG LPGLEGFPGA KGDRGPAGPP GIAGISGKPG APGPPGIPGE
PGERGPVGDT GFPGPEGPSG KPGINGKDGL PGAQGIMGKP GDRGPKGERG DQGIPGDRGP
QGEQGKPGLP GMKGAIGPVG PPGNKGSTGS PGHQGPPGSP GIPGTSVDAV SLEEIKKYIN
QEVLRIFEER MAVFLSQLKL PAAMLAAQGH GRPGPPGKDG LPGPPGDPGP QGYRGQKGER
GEPGIGLPGS PGLPGTSAPG VPGSPGAPGP QGPPGPSGRC NPEDCLYPVS HARQRTGGK
//