ID A0A452F637_CAPHI Unreviewed; 2253 AA.
AC A0A452F637;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Collagen type VI alpha 6 chain {ECO:0000313|Ensembl:ENSCHIP00000019730.1};
GN Name=COL6A6 {ECO:0000313|Ensembl:ENSCHIP00000019730.1};
OS Capra hircus (Goat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Capra.
OX NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000019730.1, ECO:0000313|Proteomes:UP000291000};
RN [1] {ECO:0000313|Ensembl:ENSCHIP00000019730.1, ECO:0000313|Proteomes:UP000291000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT "Polished mammalian reference genomes with single-molecule sequencing and
RT chromosome conformation capture applied to the Capra hircus genome.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCHIP00000019730.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Collagen VI acts as a cell-binding protein.
CC {ECO:0000256|ARBA:ARBA00043858}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the type VI collagen family.
CC {ECO:0000256|ARBA:ARBA00044000}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWLT01000001; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSCHIT00000027547.1; ENSCHIP00000019730.1; ENSCHIG00000018630.1.
DR GeneTree; ENSGT00940000155619; -.
DR OMA; NFIRNTS; -.
DR Proteomes; UP000291000; Chromosome 1.
DR Bgee; ENSCHIG00000018630; Expressed in longissimus thoracis muscle and 10 other cell types or tissues.
DR CDD; cd01472; vWA_collagen; 4.
DR CDD; cd01450; vWFA_subfamily_ECM; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 8.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR22588:SF5; COLLAGEN ALPHA-6(VI) CHAIN; 1.
DR PANTHER; PTHR22588; UNCHARACTERIZED; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00092; VWA; 8.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 9.
DR SUPFAM; SSF53300; vWA-like; 9.
DR PROSITE; PS50234; VWFA; 8.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000291000};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..2253
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019301248"
FT DOMAIN 28..200
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 227..405
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 434..604
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 620..789
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 807..980
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 998..1169
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1739..1920
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1947..2148
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1396..1714
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2253 AA; 245177 MW; 081829B87EEDE70C CRC64;
MKMLLILFLI IICSHVSVNQ HSGPEYADVV FLVDSSDHLG IKSFPFVKAF INKMISSLPI
EADKYHVGLA QYSDGLHREF QLSTFKSRGP MLNHLKKNFG FLGGSLRIGK ALQEVHRAYF
SSGRDRKQFP PILVVLASAE SEDAVEEAAA ALRKDGVRIV SVGMQGASEK TLKAMATGQF
HYSLRTVRDL STFSQNMTQI LKAAAQYKDG AVNDVLVEVC QGPSVADVVF LLDMSTNSSW
EDFDYLKEFL EESISALDIK EHCMRVGLVA YSNETKVVST LSGGVNKSEV LQNIQSLAPL
AGKAYTGAAL KKIRKEVFSA QHGSRKNQGV PQIAVLVTHS PSQDNVTKAA VNLRRQGVTV
FTIGVEGANN TQLEKIASHP AEQYVSQLKS FSDLAAHNQT FLKKLRNQIT HTVSVISERT
ETLKAGCVDT EEADIYLLID GSGSTQATDF QEMKTFLSEV AGMFNIAPQK VRVGAVQFAD
RWDLEFEISK YTNKHDVGKA IENIRQMGGN RNTGAALNFT LGLLQKAKQQ RGGRVPSHLV
VLTSGASRDS VLGPANRLRE ELVHVYAIGV REANQTQLRE IAGEEKRVYY VHDFDALKDI
RNQVVQEICA EEACKEMKAD IMFLVDSSGS IGLENFIKMK TFMKNLVSKS QIRADRVQIG
VVQFSDVNKE EFQLNRYTSQ GEISDAIDRM AHIGETTLTG SALTFVSQYF SPAKGARPNV
RKFLILITDG EAQDIVKDPA VALREEGIII YSVGVFGSNV TQLEEISGRP EMVFYVENFD
ILQHIEDDLV FGICSPREEC KRIEVLDVVF VIDSSGSIDH DEYNIMKDFM IDLVKKADVG
KNHVRFGALK YADDPEVLFY LDNLDTKWEV ISVLQNDQPM GGNTYTAEAL GFSDHMFTEA
RGSRLHKGVP QVLIVITDGE SHDADKLNAT AKALRDKGIL VLAVGIAGAN PVELLAMAGS
SDKYFFVETF GGLKGIFSDV SASVCNTSKV DCEIEKVDLV FLMDGSNSIH PDDFRKMKEF
LASVIQDFDI SNNRVRIGAA QFSHTYRPEF PLGMFISKKE ISFQIENIKQ IFGYTHIGAA
LRQVGHYFRP DMGSRIHTGT PQVLLVLTDG QSQDEVAQAA EELRHKGVDI YSVGIGDVDD
QQLVQITGTA DKKLTVHNFD ELKKVKKRIV RNICTSGGDS NCFVDVVVGF DISTQQNGQA
LLEGQSWMET YLQDILRVIS SLNGVSCEVG TEAQVSVAFQ VTNAAEKYSP KFEIYSENIL
NSLKDLTVKG PSLLNTNHLS SLWDAFQNKS AARGKVALLF SDGLDDDIEK LEQKSDELRK
EGLNALITIA LDGPTSSGDL ADLLYIEFGR GFEYRTQLTL GMRDLGSQLS KHLVNVAERT
CCCLFCKCIG GDGTRGDPGP AGKTGLPGFK GSEGYLGEEG TAGERGAGGP VGEQGTKGCY
GVKGPKGTRG LNGQEGEVGE SGIDGLNGEQ GDSGLPGAKG EKGNEGAQGI PGERGISGDH
GAKGLRGDPG VPGFDNSIEG PKGLKGEPGR QGRRGWPGPP GTPGSRRKTA AHGQRGHTGP
QGKPGIPGPD GLAGSLGLKG PQGPRGEAGM KGEKGSLGSK GPQGLPGPAG EAGSQGRLGS
QGNKGEPGDL GIKGAVGLRG PRGLLGDDGN PGYGSVGSKG AKGQEGFPGE IGPKGEVGDP
GGPGQTGPKG ARGKTVSAGL PGEPGSPGEL GPPGRKTCEL IQYVRDHSRA PQCPVYPTEL
VFALDQSRSV TEPEFVRMKA MLSSLLSGLR VREDHCPAGA RVAVLAYDSH ARLLIRFSDT
YRKDRLLREI EALPYERSTA SRDIGKAMRF VSRHVFKRML PGSHARRIST FFSGGPSVDP
QTITTAGLEF SALDIIPVVI AFNQVPAVRR SFAIDDTGRF QVIIIPSGAD SAPALEELQR
CTFCYDVCKP DASCDQARPP PVQSYVDAAF LLDSSRHVGS AEFEDIRRFL GALLDHFEVT
PEPETSVTGD RVALLSLSPP HFLPNTQRSP VRSEFNLTTY RSKRLMKRHV DESVQQLNGD
AFIGHALQWA LDNVFSRTPN LRRNKVIFVI SAGETSHLDR ETLKKESLRA KCQGYTLFVF
SLGPSWNDQE LEDLASYPLD HHLVQLGRIH KPDHRYGVKF VKAFISSVRR AINKYPPINI
KAKCNRLSSM EPQQPPLQFV RSFVPGPHRA TLKEDALQKA KFFQYKNYFS RAARGGRDGA
VQNFTRNIFR AFGNGKRVMR AAPKHDKGSA QGV
//