ID H3BGZ3_LATCH Unreviewed; 1742 AA.
AC H3BGZ3;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Collagen type V alpha 1 chain {ECO:0000313|Ensembl:ENSLACP00000021164.1};
GN Name=COL5A1 {ECO:0000313|Ensembl:ENSLACP00000021164.1};
OS Latimeria chalumnae (Coelacanth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Coelacanthiformes; Coelacanthidae; Latimeria.
OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000021164.1, ECO:0000313|Proteomes:UP000008672};
RN [1] {ECO:0000313|Proteomes:UP000008672}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT "The draft genome of Latimeria chalumnae.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLACP00000021164.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFYH01008080; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008081; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008082; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008083; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008084; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008085; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008086; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008087; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008088; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01008089; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 7897.ENSLACP00000021164; -.
DR Ensembl; ENSLACT00000021304.1; ENSLACP00000021164.1; ENSLACG00000018593.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000159211; -.
DR HOGENOM; CLU_001074_2_1_1; -.
DR InParanoid; H3BGZ3; -.
DR OMA; VERPFET; -.
DR TreeFam; TF323987; -.
DR Proteomes; UP000008672; Unassembled WGS sequence.
DR Bgee; ENSLACG00000018593; Expressed in pelvic fin and 5 other cell types or tissues.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF387; COLLAGEN ALPHA-1(V) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000008672};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1513..1741
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 333..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 461..1503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..394
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..747
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 858..876
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1005..1019
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1059..1073
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1152..1170
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1224..1260
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1284..1298
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1357..1371
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1428..1443
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1462..1478
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1742 AA; 175068 MW; 2881449F293409F0 CRC64;
DSPFPEDFSI LTTLRPKKGS QSFLLSIYNE QGIQQVGMEI GRSPVFLYED HTGRPAPEDY
PLFRNINLSD GKWHRVAISV YKRNVTLILD CNRKSSMVLK RSKHPIIDTN GIIVFGTRIL
DEDVFEGDIQ QLLIVPDYRA AYDYCEHYSP DCNLPVPDAP QSQDPNQDEY YDPYYYEYPY
YEDMEVTEYS TVGPSQGAEV VRSITEVQEE LTAAPTKEAV ITAVVENKED DKVVDYTTDL
GEDYNYGDLD YSVYNEGDDF YKEYEEKNEE IDVPPTTTVI ISHNKMTHPG ASRGKGKESL
SVKLEEKNVQ NLLMPIDDYD IYDDEFYNYD LNDTDIGPGS PVERPFETID GLKGEKGQKG
EPAVIEPGML IEGPPGPEGS AGLPGPPGPP GPPGSFGDTG ERGPPGRAGL PGADGLPGPP
GTVLMLPFRF SGGDASQKGP MVSAQEAQAQ AILQQARLAM RGPTGPMGLT GRPGPLGPPG
VPGFKGESGD LGPQGPRGPQ GPPGPSGKSG RRGRAGSDGA RGMPGQTGAK GDRGFDGLAG
LPGEKGHRGD AGPSGPPGPS GDDGERGDDG EIGPRGLPGE PGPRGLLGPK GPPGSPGVTG
VAGMDGHPGP KGNLGPQGEP GPPGQQGNPG AQGAPGPQGA IGPPGAKGPL GRPGLPGMPG
SDGPPGHPGK EGPPGEKGGQ GPHGPQGPIG YPGPRGVKGA DGIRGLQGPK GEKGEDGFPG
FKGDIGLKGD RGEHGPAGPR GEDGPEGPKG RSGPNGDPGP LGPSGEKGKL GVPGLPGYPG
RQGPKGSIGF PGFPGANGEK GGRGTPGKPG PRGQRGPTGP RGERGPRGPT GKPGPKGTAG
SDGPPGGPGE RGLPGPQGPT GFPGPKGPPG PPGKDGLPGH PGQRGETGFQ GKTGPPGPPG
VVGPQGPTGE TGPMGERGHP GPPGPPGEQG LPGTAGKEGA KGDPGPAGPA GKDGPPGLRG
FPGERGLPGP LGAAGLKGNE GPPGPPGPAG SPGERGAAGP AGPIGLPGRP GPQGPPGPSG
EKGAPGEKGP QGPAGRDGIQ GPVGLPGPAG PAGPPGEDGD KGEIGEPGQK GSKGDKGEQG
PPGPSGPQGP IGQPGPAGAD GEPGPRGQQG LFGQKGDEGP RGFPGPPGPV GLQGLPGPAG
EKGETGDVGQ MGPPGPPGPR GPPGPSGADG PQGPPGGIGN PGSVGEKGEP GETGEPGPPG
EIGPVGPKGE RGEKGEAGPP GAAGPPGRKG PPGDDGPKGS PGPVGFPGDP GPPGEPGPAG
QDGPPGDKGE DGEAGQTGSP GPTGEPGPSG PPGKRGPPGA RGPEGRQGEK GAKGESGLEG
PPGKTGPVGP QGAPGKTGPE GLRGIPGPVG EQGLPGSPGP DGPPGPMGPP GLPGLKGDSG
PKGEKGHPGL IGLIGPPGEQ GEKGDRGLPG PQGTQGLKGE QGITGPSGPL GPPGPPGLPG
PPGPKGSKGS SGSSGPKGET GVPGPPGLPG PPGDVIQPLP YQSPKRTRRN IDASQLLDEG
NPDNYMDYAD GMEEIFGSLN SLKLEIEQMK HPMGTQNNPA RTCKDLQLCH PDFPDGEYWI
DPNQGCSRDS FKVFCNFTAG GETCIYPDKK SEGARLTSWQ KENPGSWFSE FKRGKLLSYV
DSESNPIGVV QMTFLRLLSA SAHQNITYNC YHSIAWHDAT TDTYDKAIRF LGSNDEEMSY
DNNPYIMAVS DGCALKKGYE KTILQINTPK VEQAPIIDIM FNDFGDATQK FGFEVGPVCF
IG
//