GenomeNet

Database: UniProt
Entry: A0A452F637_CAPHI
LinkDB: A0A452F637_CAPHI
Original site: A0A452F637_CAPHI 
ID   A0A452F637_CAPHI        Unreviewed;      2253 AA.
AC   A0A452F637;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   SubName: Full=Collagen type VI alpha 6 chain {ECO:0000313|Ensembl:ENSCHIP00000019730.1};
GN   Name=COL6A6 {ECO:0000313|Ensembl:ENSCHIP00000019730.1};
OS   Capra hircus (Goat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Capra.
OX   NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000019730.1, ECO:0000313|Proteomes:UP000291000};
RN   [1] {ECO:0000313|Ensembl:ENSCHIP00000019730.1, ECO:0000313|Proteomes:UP000291000}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA   Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA   Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA   Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA   Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT   "Polished mammalian reference genomes with single-molecule sequencing and
RT   chromosome conformation capture applied to the Capra hircus genome.";
RL   Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCHIP00000019730.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- FUNCTION: Collagen VI acts as a cell-binding protein.
CC       {ECO:0000256|ARBA:ARBA00043858}.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the type VI collagen family.
CC       {ECO:0000256|ARBA:ARBA00044000}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LWLT01000001; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   Ensembl; ENSCHIT00000027547.1; ENSCHIP00000019730.1; ENSCHIG00000018630.1.
DR   GeneTree; ENSGT00940000155619; -.
DR   OMA; NFIRNTS; -.
DR   Proteomes; UP000291000; Chromosome 1.
DR   Bgee; ENSCHIG00000018630; Expressed in longissimus thoracis muscle and 10 other cell types or tissues.
DR   CDD; cd01472; vWA_collagen; 4.
DR   CDD; cd01450; vWFA_subfamily_ECM; 3.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 8.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR22588:SF5; COLLAGEN ALPHA-6(VI) CHAIN; 1.
DR   PANTHER; PTHR22588; UNCHARACTERIZED; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00092; VWA; 8.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 9.
DR   SUPFAM; SSF53300; vWA-like; 9.
DR   PROSITE; PS50234; VWFA; 8.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000291000};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..2253
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019301248"
FT   DOMAIN          28..200
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          227..405
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          434..604
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          620..789
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          807..980
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          998..1169
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1739..1920
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1947..2148
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1396..1714
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2253 AA;  245177 MW;  081829B87EEDE70C CRC64;
     MKMLLILFLI IICSHVSVNQ HSGPEYADVV FLVDSSDHLG IKSFPFVKAF INKMISSLPI
     EADKYHVGLA QYSDGLHREF QLSTFKSRGP MLNHLKKNFG FLGGSLRIGK ALQEVHRAYF
     SSGRDRKQFP PILVVLASAE SEDAVEEAAA ALRKDGVRIV SVGMQGASEK TLKAMATGQF
     HYSLRTVRDL STFSQNMTQI LKAAAQYKDG AVNDVLVEVC QGPSVADVVF LLDMSTNSSW
     EDFDYLKEFL EESISALDIK EHCMRVGLVA YSNETKVVST LSGGVNKSEV LQNIQSLAPL
     AGKAYTGAAL KKIRKEVFSA QHGSRKNQGV PQIAVLVTHS PSQDNVTKAA VNLRRQGVTV
     FTIGVEGANN TQLEKIASHP AEQYVSQLKS FSDLAAHNQT FLKKLRNQIT HTVSVISERT
     ETLKAGCVDT EEADIYLLID GSGSTQATDF QEMKTFLSEV AGMFNIAPQK VRVGAVQFAD
     RWDLEFEISK YTNKHDVGKA IENIRQMGGN RNTGAALNFT LGLLQKAKQQ RGGRVPSHLV
     VLTSGASRDS VLGPANRLRE ELVHVYAIGV REANQTQLRE IAGEEKRVYY VHDFDALKDI
     RNQVVQEICA EEACKEMKAD IMFLVDSSGS IGLENFIKMK TFMKNLVSKS QIRADRVQIG
     VVQFSDVNKE EFQLNRYTSQ GEISDAIDRM AHIGETTLTG SALTFVSQYF SPAKGARPNV
     RKFLILITDG EAQDIVKDPA VALREEGIII YSVGVFGSNV TQLEEISGRP EMVFYVENFD
     ILQHIEDDLV FGICSPREEC KRIEVLDVVF VIDSSGSIDH DEYNIMKDFM IDLVKKADVG
     KNHVRFGALK YADDPEVLFY LDNLDTKWEV ISVLQNDQPM GGNTYTAEAL GFSDHMFTEA
     RGSRLHKGVP QVLIVITDGE SHDADKLNAT AKALRDKGIL VLAVGIAGAN PVELLAMAGS
     SDKYFFVETF GGLKGIFSDV SASVCNTSKV DCEIEKVDLV FLMDGSNSIH PDDFRKMKEF
     LASVIQDFDI SNNRVRIGAA QFSHTYRPEF PLGMFISKKE ISFQIENIKQ IFGYTHIGAA
     LRQVGHYFRP DMGSRIHTGT PQVLLVLTDG QSQDEVAQAA EELRHKGVDI YSVGIGDVDD
     QQLVQITGTA DKKLTVHNFD ELKKVKKRIV RNICTSGGDS NCFVDVVVGF DISTQQNGQA
     LLEGQSWMET YLQDILRVIS SLNGVSCEVG TEAQVSVAFQ VTNAAEKYSP KFEIYSENIL
     NSLKDLTVKG PSLLNTNHLS SLWDAFQNKS AARGKVALLF SDGLDDDIEK LEQKSDELRK
     EGLNALITIA LDGPTSSGDL ADLLYIEFGR GFEYRTQLTL GMRDLGSQLS KHLVNVAERT
     CCCLFCKCIG GDGTRGDPGP AGKTGLPGFK GSEGYLGEEG TAGERGAGGP VGEQGTKGCY
     GVKGPKGTRG LNGQEGEVGE SGIDGLNGEQ GDSGLPGAKG EKGNEGAQGI PGERGISGDH
     GAKGLRGDPG VPGFDNSIEG PKGLKGEPGR QGRRGWPGPP GTPGSRRKTA AHGQRGHTGP
     QGKPGIPGPD GLAGSLGLKG PQGPRGEAGM KGEKGSLGSK GPQGLPGPAG EAGSQGRLGS
     QGNKGEPGDL GIKGAVGLRG PRGLLGDDGN PGYGSVGSKG AKGQEGFPGE IGPKGEVGDP
     GGPGQTGPKG ARGKTVSAGL PGEPGSPGEL GPPGRKTCEL IQYVRDHSRA PQCPVYPTEL
     VFALDQSRSV TEPEFVRMKA MLSSLLSGLR VREDHCPAGA RVAVLAYDSH ARLLIRFSDT
     YRKDRLLREI EALPYERSTA SRDIGKAMRF VSRHVFKRML PGSHARRIST FFSGGPSVDP
     QTITTAGLEF SALDIIPVVI AFNQVPAVRR SFAIDDTGRF QVIIIPSGAD SAPALEELQR
     CTFCYDVCKP DASCDQARPP PVQSYVDAAF LLDSSRHVGS AEFEDIRRFL GALLDHFEVT
     PEPETSVTGD RVALLSLSPP HFLPNTQRSP VRSEFNLTTY RSKRLMKRHV DESVQQLNGD
     AFIGHALQWA LDNVFSRTPN LRRNKVIFVI SAGETSHLDR ETLKKESLRA KCQGYTLFVF
     SLGPSWNDQE LEDLASYPLD HHLVQLGRIH KPDHRYGVKF VKAFISSVRR AINKYPPINI
     KAKCNRLSSM EPQQPPLQFV RSFVPGPHRA TLKEDALQKA KFFQYKNYFS RAARGGRDGA
     VQNFTRNIFR AFGNGKRVMR AAPKHDKGSA QGV
//
DBGET integrated database retrieval system