GenomeNet

Database: UniProt
Entry: H3BGZ3_LATCH
LinkDB: H3BGZ3_LATCH
Original site: H3BGZ3_LATCH 
ID   H3BGZ3_LATCH            Unreviewed;      1742 AA.
AC   H3BGZ3;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 70.
DE   SubName: Full=Collagen type V alpha 1 chain {ECO:0000313|Ensembl:ENSLACP00000021164.1};
GN   Name=COL5A1 {ECO:0000313|Ensembl:ENSLACP00000021164.1};
OS   Latimeria chalumnae (Coelacanth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Coelacanthiformes; Coelacanthidae; Latimeria.
OX   NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000021164.1, ECO:0000313|Proteomes:UP000008672};
RN   [1] {ECO:0000313|Proteomes:UP000008672}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT   "The draft genome of Latimeria chalumnae.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLACP00000021164.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFYH01008080; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008081; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008082; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008083; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008084; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008085; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008086; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008087; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008088; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01008089; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 7897.ENSLACP00000021164; -.
DR   Ensembl; ENSLACT00000021304.1; ENSLACP00000021164.1; ENSLACG00000018593.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000159211; -.
DR   HOGENOM; CLU_001074_2_1_1; -.
DR   InParanoid; H3BGZ3; -.
DR   OMA; VERPFET; -.
DR   TreeFam; TF323987; -.
DR   Proteomes; UP000008672; Unassembled WGS sequence.
DR   Bgee; ENSLACG00000018593; Expressed in pelvic fin and 5 other cell types or tissues.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF387; COLLAGEN ALPHA-1(V) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008672};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          1513..1741
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          333..448
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          461..1503
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        379..394
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        726..747
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        858..876
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1005..1019
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1059..1073
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1152..1170
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1224..1260
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1284..1298
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1357..1371
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1428..1443
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1462..1478
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1742 AA;  175068 MW;  2881449F293409F0 CRC64;
     DSPFPEDFSI LTTLRPKKGS QSFLLSIYNE QGIQQVGMEI GRSPVFLYED HTGRPAPEDY
     PLFRNINLSD GKWHRVAISV YKRNVTLILD CNRKSSMVLK RSKHPIIDTN GIIVFGTRIL
     DEDVFEGDIQ QLLIVPDYRA AYDYCEHYSP DCNLPVPDAP QSQDPNQDEY YDPYYYEYPY
     YEDMEVTEYS TVGPSQGAEV VRSITEVQEE LTAAPTKEAV ITAVVENKED DKVVDYTTDL
     GEDYNYGDLD YSVYNEGDDF YKEYEEKNEE IDVPPTTTVI ISHNKMTHPG ASRGKGKESL
     SVKLEEKNVQ NLLMPIDDYD IYDDEFYNYD LNDTDIGPGS PVERPFETID GLKGEKGQKG
     EPAVIEPGML IEGPPGPEGS AGLPGPPGPP GPPGSFGDTG ERGPPGRAGL PGADGLPGPP
     GTVLMLPFRF SGGDASQKGP MVSAQEAQAQ AILQQARLAM RGPTGPMGLT GRPGPLGPPG
     VPGFKGESGD LGPQGPRGPQ GPPGPSGKSG RRGRAGSDGA RGMPGQTGAK GDRGFDGLAG
     LPGEKGHRGD AGPSGPPGPS GDDGERGDDG EIGPRGLPGE PGPRGLLGPK GPPGSPGVTG
     VAGMDGHPGP KGNLGPQGEP GPPGQQGNPG AQGAPGPQGA IGPPGAKGPL GRPGLPGMPG
     SDGPPGHPGK EGPPGEKGGQ GPHGPQGPIG YPGPRGVKGA DGIRGLQGPK GEKGEDGFPG
     FKGDIGLKGD RGEHGPAGPR GEDGPEGPKG RSGPNGDPGP LGPSGEKGKL GVPGLPGYPG
     RQGPKGSIGF PGFPGANGEK GGRGTPGKPG PRGQRGPTGP RGERGPRGPT GKPGPKGTAG
     SDGPPGGPGE RGLPGPQGPT GFPGPKGPPG PPGKDGLPGH PGQRGETGFQ GKTGPPGPPG
     VVGPQGPTGE TGPMGERGHP GPPGPPGEQG LPGTAGKEGA KGDPGPAGPA GKDGPPGLRG
     FPGERGLPGP LGAAGLKGNE GPPGPPGPAG SPGERGAAGP AGPIGLPGRP GPQGPPGPSG
     EKGAPGEKGP QGPAGRDGIQ GPVGLPGPAG PAGPPGEDGD KGEIGEPGQK GSKGDKGEQG
     PPGPSGPQGP IGQPGPAGAD GEPGPRGQQG LFGQKGDEGP RGFPGPPGPV GLQGLPGPAG
     EKGETGDVGQ MGPPGPPGPR GPPGPSGADG PQGPPGGIGN PGSVGEKGEP GETGEPGPPG
     EIGPVGPKGE RGEKGEAGPP GAAGPPGRKG PPGDDGPKGS PGPVGFPGDP GPPGEPGPAG
     QDGPPGDKGE DGEAGQTGSP GPTGEPGPSG PPGKRGPPGA RGPEGRQGEK GAKGESGLEG
     PPGKTGPVGP QGAPGKTGPE GLRGIPGPVG EQGLPGSPGP DGPPGPMGPP GLPGLKGDSG
     PKGEKGHPGL IGLIGPPGEQ GEKGDRGLPG PQGTQGLKGE QGITGPSGPL GPPGPPGLPG
     PPGPKGSKGS SGSSGPKGET GVPGPPGLPG PPGDVIQPLP YQSPKRTRRN IDASQLLDEG
     NPDNYMDYAD GMEEIFGSLN SLKLEIEQMK HPMGTQNNPA RTCKDLQLCH PDFPDGEYWI
     DPNQGCSRDS FKVFCNFTAG GETCIYPDKK SEGARLTSWQ KENPGSWFSE FKRGKLLSYV
     DSESNPIGVV QMTFLRLLSA SAHQNITYNC YHSIAWHDAT TDTYDKAIRF LGSNDEEMSY
     DNNPYIMAVS DGCALKKGYE KTILQINTPK VEQAPIIDIM FNDFGDATQK FGFEVGPVCF
     IG
//
DBGET integrated database retrieval system