ID G1QAL4_MYOLU Unreviewed; 2631 AA.
AC G1QAL4;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Collagen type VI alpha 5 chain {ECO:0000313|Ensembl:ENSMLUP00000020747.1};
GN Name=COL6A5 {ECO:0000313|Ensembl:ENSMLUP00000020747.1};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000020747.1, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000020747.1, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000020747.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Collagen VI acts as a cell-binding protein.
CC {ECO:0000256|ARBA:ARBA00043858}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the type VI collagen family.
CC {ECO:0000256|ARBA:ARBA00044000}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02006055; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 59463.ENSMLUP00000020747; -.
DR Ensembl; ENSMLUT00000022516.1; ENSMLUP00000020747.1; ENSMLUG00000010691.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000162990; -.
DR HOGENOM; CLU_000182_1_0_1; -.
DR InParanoid; G1QAL4; -.
DR OMA; PCWKEKC; -.
DR TreeFam; TF318242; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR CDD; cd01472; vWA_collagen; 2.
DR CDD; cd01450; vWFA_subfamily_ECM; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 9.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR22588; UNCHARACTERIZED; 1.
DR PANTHER; PTHR22588:SF8; VWFA DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00092; VWA; 9.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 9.
DR SUPFAM; SSF53300; vWA-like; 10.
DR PROSITE; PS50234; VWFA; 9.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..2631
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003419001"
FT DOMAIN 30..206
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 261..442
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 467..637
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 653..826
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 839..1016
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1030..1203
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1785..1930
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1991..2182
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2313..2507
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1429..1755
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2276..2304
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2607..2631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1441..1458
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1490..1510
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1540..1576
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2276..2299
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2631 AA; 289421 MW; 5D4457092E262FEE CRC64;
MKNVLIILVL TLWTETLADQ SPGPGPQYAD VVFLVDSSDH LGIKSFPLVK TFINKVITSL
PIEASKYRVA LAQYSNKLHS EFQLNTFKSR NPMLNHLKKN FVFMGGALRI GNALREAHRT
YFSGPANGRD KKLFPPVLVV LASAESEDDV EEASKALRED GVRIISVGLQ KASEENLKAM
ATAQFHVNLR TARDLSTFSK NMTQIIKEAT QYRERAVDPD SKGKEIRSSS VLFPSQSWHS
VRHNDMVVSF PVTCQKDSLA DLMFLVDESV GTRQNLRNLQ NFLNNITMSL DVKDNCMRLG
LMSYSNQAEA ISLLKSSTNQ SVFQEQIQKL SLRTGKSNAG AAIEKLRREG FLESSGSRRA
QGVPQIAVLV THRASDDEVR EAAMKLRQED VTVFAMGIEG ANSTQLEEIA SYPPRQTSSL
LKSYADLETY STNFLKKVQN EIWSQVSTRS EQVDLDKTGC ADTKEADFYF LIDGSGSIGV
EDFKQIKKFM LGVIDMFSIS PDRVRVGAVQ YASTQRVEFD IHTYTNEVAL RQAVSNIQQL
YGGTATGAAL DFMLPIIKEG RKHRSSEVPC HLIVLTDGAS GDDVLKPANR VRAEQVTIHA
VGIGNANKIQ LLQIAEKEER VYFGQNFDSL KSIKDEVVRS ICMEKGCEDM KADIMFLVDS
SGSIGAENFE TMKTFMRNVS ANIQIGPDKT QIGVVQFSGH NKEEFQLNKY FTQKEISDAI
DRMSLIGQNT LMGSALTFVD EYFTLPKGAR LGVKKFLILI TDGEAQDDVT KPAKALRDKG
VIIICVGVYG AKRTQLEEIS GDGSLVFHVE KFDHLKAIES KLLSQVCARY DCKSIRRLDV
VFVLDHSGSI FPHQQESMIN LTVHLVNKAD VGPDRVQFGA VKYSDQPEVL FYLNTYSNRS
GVVENLRKRR SIGGNTYTAK ALDHTNILFT EEHGSRIKQN VKQMLVIITD GESHDRNMLN
DTASKLRDKG IIIYAVGVDK ANQDELEIMA GNKNNTIHVQ DFDKLKDITL PIQESMCTNA
QEPCNTREAD VIFLCDGSNR VSDSEFVTLT TFLSDLIDNF DIQSQAVKIG MAQFGSRYQE
MIELGNSLTK PQWKTQIQNI TKSSGSPHIV SALKKVRFMF DPYVGGRRNA GVPQTLVVIT
SGDPQDNVAD AVKVLKDLGI CILVLGIGHV HKAQFLPITG NSEKIITFQD FNKLKNVEVK
KRIVREICQS CGKTNCVVDI VVGFDMSNHL PGQRLFHGHP RLESYLPSIL GDITSIKGAS
CGAGAETHVN VAFKVNNDQA FPAKFQIYQE AIFENLLQVT VNGPTHLNAQ FLQKLWDTFE
NKNASRGQVL LIFSDGLGSE SITMLENQSD SLREAGLDAL LVVSLNPVAH DEFSSFEFGK
GFDYRTHLTI GNRDLGKMLS QYLGNIVERT CCCAFCKCPG NPGPHGTRGL QGLKGSLGLK
GSRGHRGEDG DHGMRGDTGP PGDKGIAGCP GQWGQKGVRG LPGSKGELGE DAIDGLDGEE
GSRGFPGKKG ERGDPGSQGS PGARGPPGEY GERGFPGDPG NPGQNSNIKG QKGSEGQQGR
QGRTGQKGTQ GSPNSEGDRG REGHRGPQGV PGEPGDPGLP GALGAEGLKG PQQKGSSGIL
GSKGEKGSQG HKGPQGSPGP VGAEGSVGRP GPSGKKGESG IPGGPGLAGQ PGQQGKQGDY
GIPGYGNIGR KGIQGPRGFP GDMGQKGEIG NPGIPGEPGP KGFRGRRLTV GLKGVKGSQG
THGPPGRRGP KGTAGMPVYS QCDLIRFVRE RSPCWREKCP VYPTELVFAL DQSRGITEQR
FNEMRDVIIS VVNDLNIREN NCPMGARVVV VSYDSGTSYL IRWSDYHTKK QLLQLLSQIK
YQVPTEAQDI GNAMRFVARN VFKRTYAGAN LRRVAVFFSN GEASSKSSII TATMELSALD
ISPAVFAFNE RGFLDEAFGF DNTGTFQVIP APSNGDHEPL ERLQRCTLCY DKCFPNACTK
EVILPENSHM DVAFLLDNSQ NIASGEFKVV KDLVSSMLDN FDIASDPFIS GSGDRIALLS
YSPWDSSRRK KGAVKTEFAF TTYNSQVLMK NHIQDSLQQL NGEATIGHAL LWTMENLFPG
APNLRKHKVI FVVSAGENHE RKELLKKMAL RAQCQGYAIV VISLGSPHDD NMEELASYPL
DHHLIQLGRI HKPDLDYIVK FLKPFVYSVR RGFNQYPPPG LENACRLINS QAEDNQDSGL
LFTDVPHESP SEENSFINQE LNAGGDTSFV LDDHLVHILN QMFMPPKLMT KYEDQDSEEI
ASLTSGHENH GRKEEPSFNG KPGDASLEEY YMDVAFLIDA SHRVENDEFE EVKAFITSVL
DYFHIAPDPL TSPLGDRVAV LSYSPPGYMP NTEECPVYLE FDLVTYSSIY QMKRHLQDSL
QQLNGNVFIG HALQWTIDNV FEGTPYPRKN KVIFVISAGE TNPLDKEVLR NVSLRAKCQG
YTIFVFSFGP IYNDKELEEL ASHPLDHHLV QLGRTHKPDL HYIIKFFKPF VYSIRRAINS
YPPADLSPKC VNITSPNPEN SGIENILHFI PEVYEIKAEN SEPEGEFGSQ EHHVFVSGSS
DENGSGITTD LIQKLYSLFS AGELMMKDKE EAHSEEITAP ANDKQQDKKG N
//