ID A0A3Q2W6R7_HAPBU Unreviewed; 839 AA.
AC A0A3Q2W6R7;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Collagen type VI alpha 1 chain {ECO:0000313|Ensembl:ENSHBUP00000020385.1};
GN Name=COL6A1 {ECO:0000313|Ensembl:ENSHBUP00000020385.1};
OS Haplochromis burtoni (Burton's mouthbrooder) (Chromis burtoni).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Haplochromis.
OX NCBI_TaxID=8153 {ECO:0000313|Ensembl:ENSHBUP00000020385.1, ECO:0000313|Proteomes:UP000264840};
RN [1] {ECO:0000313|Ensembl:ENSHBUP00000020385.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q2W6R7; -.
DR Ensembl; ENSHBUT00000029993.1; ENSHBUP00000020385.1; ENSHBUG00000022833.1.
DR GeneTree; ENSGT00940000162889; -.
DR Proteomes; UP000264840; Unplaced.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF84; COLLAGEN ALPHA-1(XXI) CHAIN-LIKE ISOFORM X1; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000264840};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..839
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018581661"
FT DOMAIN 30..141
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 439..622
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 646..832
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 174..418
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..227
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..315
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 839 AA; 89869 MW; 4EB20D09329BF550 CRC64;
TAARSHALLV LLLCSGWRHQ CDRILTWNSG ALHYSDEVKL VKELSDLNTE RNALKTAIDN
IAYIGKGTYT DCAIKRGLAE LLIGGSHYHE NKYLVVVTDG HPITGYKEPC GGVQEAANEA
KLHGVKVFAV AISPDQEVQR IHTHTDHLLN FAQMKPCIAQ IHFEGVKVWR GEIGRPGMPG
EKGDVGDVGR IGDPGPVGYS GMKGDRGHKG YKGDKGQRGK DGIDGRKGEP GFPGLAGCKG
SPGQDGSQGE HGPKGDPGSY GIKGEKGDPG RDGEPGRPGN YGPLGPKGDP GPRGPDGDKG
ERGEDVSDKG LPGEKGEQGS RGNRGPRGEP VSADKRFNPS PSRAFPSSGP RGPRGIKGAP
GDRGPMGERG ADGAPGNGTE GCHGFQGYPG PRGDPGEPGG RGTPGPKGDD GEPGDPGPDV
TFMKLFMCVS PECKCAPVDL AFIVDSSESI GSTNFALAKD FIITVIDRLI KDQQVKFAVD
ESTVSVVQYS GSRAQEAVRL SSSLTEFKQA VRDMKWLAEA TYTGEALDFA LSNTISVMRK
ENKVVLVLTD GRSDIDRDKT PLNILCGKGL QVGGLGVKDY SGREPNQEQL DDVVCKSDPK
PGFSFVLDNF AELLDDNFLQ NLTDRICKEK KCPDYRCPIE FPQSTDILVM MDSSASVGQK
NFEISKTFAQ HLADRFLNAN RSLGAQIRVG VGQYSRNARL DAPLNNNLTV LSEEIKAATF
QNDGTSVTQA LEFAIRTLAS RGDGSAGSKK LVLFSDGRSQ GVTQPVLEKR VREVADAGIE
LYVISAGTQV SEANLRTLVS RGRLADITYA QRHLFRLPDY PSLLRGVFYQ TVSRRVSMP
//