GenomeNet

Database: UniProt
Entry: A0A3Q2W6R7_HAPBU
LinkDB: A0A3Q2W6R7_HAPBU
Original site: A0A3Q2W6R7_HAPBU 
ID   A0A3Q2W6R7_HAPBU        Unreviewed;       839 AA.
AC   A0A3Q2W6R7;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   SubName: Full=Collagen type VI alpha 1 chain {ECO:0000313|Ensembl:ENSHBUP00000020385.1};
GN   Name=COL6A1 {ECO:0000313|Ensembl:ENSHBUP00000020385.1};
OS   Haplochromis burtoni (Burton's mouthbrooder) (Chromis burtoni).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Haplochromis.
OX   NCBI_TaxID=8153 {ECO:0000313|Ensembl:ENSHBUP00000020385.1, ECO:0000313|Proteomes:UP000264840};
RN   [1] {ECO:0000313|Ensembl:ENSHBUP00000020385.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3Q2W6R7; -.
DR   Ensembl; ENSHBUT00000029993.1; ENSHBUP00000020385.1; ENSHBUG00000022833.1.
DR   GeneTree; ENSGT00940000162889; -.
DR   Proteomes; UP000264840; Unplaced.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF84; COLLAGEN ALPHA-1(XXI) CHAIN-LIKE ISOFORM X1; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00092; VWA; 3.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 3.
DR   SUPFAM; SSF53300; vWA-like; 3.
DR   PROSITE; PS50234; VWFA; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000264840};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..839
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018581661"
FT   DOMAIN          30..141
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          439..622
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          646..832
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          174..418
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        205..227
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        292..315
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   839 AA;  89869 MW;  4EB20D09329BF550 CRC64;
     TAARSHALLV LLLCSGWRHQ CDRILTWNSG ALHYSDEVKL VKELSDLNTE RNALKTAIDN
     IAYIGKGTYT DCAIKRGLAE LLIGGSHYHE NKYLVVVTDG HPITGYKEPC GGVQEAANEA
     KLHGVKVFAV AISPDQEVQR IHTHTDHLLN FAQMKPCIAQ IHFEGVKVWR GEIGRPGMPG
     EKGDVGDVGR IGDPGPVGYS GMKGDRGHKG YKGDKGQRGK DGIDGRKGEP GFPGLAGCKG
     SPGQDGSQGE HGPKGDPGSY GIKGEKGDPG RDGEPGRPGN YGPLGPKGDP GPRGPDGDKG
     ERGEDVSDKG LPGEKGEQGS RGNRGPRGEP VSADKRFNPS PSRAFPSSGP RGPRGIKGAP
     GDRGPMGERG ADGAPGNGTE GCHGFQGYPG PRGDPGEPGG RGTPGPKGDD GEPGDPGPDV
     TFMKLFMCVS PECKCAPVDL AFIVDSSESI GSTNFALAKD FIITVIDRLI KDQQVKFAVD
     ESTVSVVQYS GSRAQEAVRL SSSLTEFKQA VRDMKWLAEA TYTGEALDFA LSNTISVMRK
     ENKVVLVLTD GRSDIDRDKT PLNILCGKGL QVGGLGVKDY SGREPNQEQL DDVVCKSDPK
     PGFSFVLDNF AELLDDNFLQ NLTDRICKEK KCPDYRCPIE FPQSTDILVM MDSSASVGQK
     NFEISKTFAQ HLADRFLNAN RSLGAQIRVG VGQYSRNARL DAPLNNNLTV LSEEIKAATF
     QNDGTSVTQA LEFAIRTLAS RGDGSAGSKK LVLFSDGRSQ GVTQPVLEKR VREVADAGIE
     LYVISAGTQV SEANLRTLVS RGRLADITYA QRHLFRLPDY PSLLRGVFYQ TVSRRVSMP
//
DBGET integrated database retrieval system