ID A0A0D9R527_CHLSB Unreviewed; 1749 AA.
AC A0A0D9R527;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Collagen type V alpha 3 chain {ECO:0000313|Ensembl:ENSCSAP00000003716.1};
GN Name=COL5A3 {ECO:0000313|Ensembl:ENSCSAP00000003716.1};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000003716.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000003716.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000003716.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01153061; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153062; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153063; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153064; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153065; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153066; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01153067; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 60711.ENSCSAP00000003716; -.
DR Ensembl; ENSCSAT00000005496.1; ENSCSAP00000003716.1; ENSCSAG00000007588.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000162394; -.
DR OMA; NHGSQGI; -.
DR Proteomes; UP000029965; Chromosome 6.
DR Bgee; ENSCSAG00000007588; Expressed in fibroblast and 5 other cell types or tissues.
DR GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0008201; F:heparin binding; IEA:Ensembl.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 4.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1749
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002344911"
FT DOMAIN 1518..1748
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 230..304
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 320..366
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..440
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 477..720
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 789..1497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 247..267
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..293
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..351
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..426
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 667..681
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 789..828
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1056..1070
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1083..1097
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1303..1317
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1321..1335
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1435..1452
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1749 AA; 173132 MW; 0679E92A220B295F CRC64;
MGNRRGLGQP RAGLCLLLAA LQLLPGTQAD PVDVLKALGV RGGQAGVPEG PGFCPQRTPE
GDRAFRVGQA STLGIPTREL FPDGHFPENF SLLITLRGQP ANQSVLLSIY DERGARQLGL
ALGPALGLLG DPFRPLPQQV NLTDGRWHRV AVSIDGETVT LVADCKAQPP VLGHGPRFIS
IAGLTMLGTQ DLGEKTFEGD IQELLISPDP QAAFQACEWY LPDCDNLVPA ATGAPQGEPE
TPRPRPKGKG KGRKKGRGRK GKGRKKKNKE ILTSSPPPDS PENQTSTDIP KTETPAPNLL
PTPTPLVVLS TVTAGPNATI LEGSLDPDSG TELGTPETKA TREDEEGGDS TMGPDFRAAE
YPSRTQFQIF PGAGEKGAKG EPAVIEKGQQ FEGPPGAPGP RGVVGPSGPP GPPGFPGDPG
PQGPAGLPGI PGIDGIRGPP GTVIMMPFQF AGGSFKGPPV SFQQAQAQAV LQQTQLSMKG
PPGPVGLTGR PGPVGLPGYP GLKGEAGAEG PQGPRGLQGP HGPPGRVGKM GRPGADGARG
LPGDTGPKGD RGFDGLPGLP GEKGQRGDFG HVGQPGPPGE DGERGSEGPP GPTGQAGEPG
PRGLIGPRGS PGPTGRPGVT GIDGAPGAKG NVGPPGEPGP PGQQGNHGSQ GLPGPQGLIG
TPGEKGPPGN PGIPGLPGAD GPPGHPGHEG PTGEKGAQGP PGSAGPAGYP GPRGVKVGDT
GGLRGYATQL HRKLLNVWRG NQLVSRDRVS PCWPGWSRTP DLRKGFSVGQ SGLELLTSVP
PSQFTHCSLE LPGSISPPTS PSQVAGTTGC PGRSGTPGLK QSSSGKAGQP GLEGERGPPG
FRGERGQPGA TGQPGPKGDV GQDGAPGIPG EKGLPGLQGP PGFPGPKGPP GHQGKDGRPG
HPGQRGELGF QGQTGPPGPA GVLGPQGKTG EVGPLGERGP PGPPGPPGEQ GLPGLEGREG
AKGELGPPGP LGKEGPAGLR GFPGPKGGPG DPGPTGLKGD KGPPGPVGAN GSPGERGPVG
PAGGIGLPGQ SGSQGPVGPA GEKGSPGERG PPGPTGKDGI PGPLGPLGPP GAAGPSGEEG
DKGHVGAPGH KGSKGDKGDV GPPGQPGIRG PAGHPGPPGA DGAQGRRGPP GLFGQKGDDG
VRGFVGVIGP PGLQGLPGPP GEKGEVGDVG SMGPHGAPGP RGPQGPSGSE GTPGLPGGVG
QPGAVGEKGE PGEAGDPGPP GAPGIPGPKG DIGEKGDSGP SGAAGPPGKK GPPGEDGAKG
NVGPTGLPGD LGPPGDPGVS GIDGSPGEKG DPGDVGGPGP PGASGEPGAP GPPGKRGPSG
PMGREGREGE KGAKGEPGPD GPPGRTGPMG ARGPPGRVGP EGLRGIPGPV GEPGLLGAPG
QMGPPGPLGP SGLPGLKGDA GPKGEKGHIG LIGLIGPPGE AGEKGDQGLP GVQGPPGPKG
DPGPPGPIGS LGHPGPPGVA GPLGQKGSKG SPGSMGPRGD TGPAGPPGPP GPPAELHGLR
RRRRFVPVPL PVVEGGLEEV LASLTSLSLE LEQLQRPPGT AERPGLVCHE LHRNHPHLPD
GEYWIDPNQG CARDSFRVFC NFTAGGETCL YPDKKFEIVK LASWSKEKPG GWYSTFRRGK
KFSYVDADGS PVNVVQLNFL KLLSATARQS LTYSCQNAAA WLDEATGDHS RSVRFLGTNG
EELSFNQTTA AIVSVPQDGC RLRKGQTKTL FEFSSSRAGF LPLWDVAATD FGQTNQKFGF
ELGPVCFSS
//