KEGG   Mus musculus (mouse): 12824
Entry
12824             CDS       T01002                                 

Gene name
Col2a1, Co, Col2, Col2a, Col2a-1, Del, Del1, Dmm, L, Lpk, M100413, Rgsc4, Rgsc413, Rgsc8, Rgsc856
Definition
(RefSeq) collagen, type II, alpha 1
  KO
K19719  collagen type II alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12824 (Col2a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12824 (Col2a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12824 (Col2a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12824 (Col2a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12824 (Col2a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12824 (Col2a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12824 (Col2a1)
SSDB
Motif
Pfam: Collagen COLFI VWC Amnionless
Other DBs
NCBI-GeneID: 12824
NCBI-ProteinID: NP_112440
MGI: 88452
Ensembl: ENSMUSG00000022483
Vega: OTTMUSG00000021708
UniProt: P28481
Position
15 F1; 15 53.97 cM
AA seq 1487 aa
MIRLGAPQSLVLLTLLIAAVLRCQGQDAQEAGSCLQNGQRYKDKDVWKPSSCRICVCDTG
NVLCDDIICEDPDCLNPEIPFGECCPICPADLATASGKLGPKGQKGEPGDIRDIIGPRGP
PGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPAGPPGPPGPPGLSAGNFA
AQMAGGYDEKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMG
PRGPPGPAGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQAGEPGIAGFKGDQGPKGETGPAGPQG
APGPAGEEGKRGARGEPGGAGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLTG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG
VMGFPGPKGANGEPGKAGEKGLAGAPGLRGLPGKDGETGAAGPPGPSGPAGERGEQGAPG
PSGFQGLPGPPGPPGEGGKQGDQGIPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRG
LPGTPGTDGPKGAAGPDGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPG
KDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSGSTGARGAPGERGETGPPGPAGFAGPPG
ADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPG
AAGRVGPPGANGNPGPAGPPGPAGKDGPKGVRGDSGPPGRAGDPGLQGPAGAPGEKGEPG
DDGPSGLDGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPG
PVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALGAPGAPGPPGSPGPAG
PTGKQGDRGEAGAQGPMGPSGPAGARGIAGPQGPRGDKGESGEQGERGLKGHRGFTGLQG
LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGSNGIPGPIGPPGPRGRSGETGPVG
PPGSPGPPGPPGPPGPGIDMSAFAGLGQREKGPDPMQYMRADEADSTLRQHDVEVDATLK
SLNNQIESIRSPDGSRKNPARTCQDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG
ETCVYPNPATVPRKNWWSSKSKEKKHIWFGETMNGGFHFSYGDGNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEMRAEGNSRFTYTALKDGCTK
HTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGAEQEFGVDIGPVCFL
NT seq 4464 nt   +upstreamnt  +downstreamnt
atgatccgcctcggggctccccagtcgctggtgctgctgacgctgctcatcgccgcggtc
ctacggtgtcagggccaggatgcccaggaggctggcagctgtctgcagaatgggcagagg
tataaagataaggatgtatggaagccctcatcttgccgcatctgtgtgtgtgacactggg
aatgtcctctgcgatgacattatctgtgaagacccagactgcctcaaccccgagatcccc
ttcggagagtgctgtcccatctgcccagctgacctcgccactgccagtggaaaattaggg
ccaaaggggcagaaaggagaacctggagatatcagagatatcataggacccagaggacct
cctggccctcagggacctgcaggtgaacaaggacccagaggtgatcgtggtgacaaggga
gaaaagggtgcgcctggaccccgtggcagagatggagaacctggtacccctggaaatcct
ggccccgctggccctccaggtccccctggtccccctggccttagtgcaggaaacttcgcg
gctcagatggctggagggtatgacgagaaggctggtggtgcccagatgggagtcatgcaa
gggcccatgggccccatgggaccccgtggacccccaggccctgccggtgcccccggccct
caaggatttcaaggcaatcctggtgaacctggcgagcctggtgtctctggtcccatgggt
ccccgaggtcctcctggccctgctggaaaacctggtgacgacggtgaagctgggaagccc
ggaaagtctggggaaagaggcctccctggccctcagggtgctcgtggattcccaggaacc
ccgggtctccccggtgtcaagggtcacagaggttacccaggcctcgacggtgctaagggg
gaagctggtgctccgggtgtgaagggtgagagtggttcccctggtgagaacggatccccg
ggcccaatgggtccccgtggcctgcctggtgagagaggacggactggccctgctggtgct
gctggtgctcggggtaacgatggccagccaggccccgctggacctccgggtcctgtgggt
cccgcaggtggtcctggcttccctggtgctcctggtgccaagggcgaagctggtcccact
ggtgctcgcggtcctgaaggtgctcaaggttctcgtggcgagcctggcaatcctgggtcc
cctgggcctgcaggtgcttctggtaacccagggactgatggtattcctggagccaaagga
tccgctggtgctcctggaattgctggtgcccctggcttccctgggccccgtggccctccc
ggtcctcaaggtgcaactggtccccttggccccaaaggtcaggcgggtgaacctggcatt
gctggctttaaaggtgatcaaggccccaagggagagactggacctgctgggccccaagga
gcccctggccccgctggtgaagaaggcaaacgaggtgctcgaggagagccgggtggtgct
ggaccaatcggaccccctggagagagaggtgctcctggcaaccgtggattcccaggtcaa
gatggtctggcaggtcccaagggtgcccctggagagcgagggcccagtggcttgactggt
cccaagggagccaacggtgacccgggtcgtcctggagaacctggtcttcctggagccagg
ggtcttaccggtcgccctggtgacgctggtcctcaaggcaaagttggtccttctggagcc
cctggtgaagacggtcgccctggacctcctggtcctcagggagctcgtgggcagcctggc
gtcatgggtttccctggccccaaaggtgccaacggcgagcctggcaaagctggtgagaag
ggtctggctggcgctcctggtctgagaggtcttcctggaaaagacggtgagacgggagcc
gcaggaccccccggccccagtggacctgctggtgaacgaggcgagcagggcgctcctgga
ccatcagggttccagggacttcctggccctcccggtcccccaggtgaaggtggaaagcaa
ggtgaccagggtattcctggtgaagctggagctcctggccttgtgggtcctcggggcgag
cgaggtttcccaggtgaacgtggctctcctggtgctcagggccttcagggtccccgaggc
ctccctggcactcctggtactgatggtcccaaaggtgcagctggcccagatggcccccct
ggggctcaggggcctccaggtctacagggaatgcctggtgagagaggagccgctggcatt
gctgggcccaagggagacagaggcgatgttggcgagaaaggcccagagggagctcctggg
aaggatggcggccgaggtctgactgggcccatcggacccccaggcccagcaggggccaac
ggcgagaagggagaagtcggacctcctggcccgtcaggaagtaccggagctcgaggtgcc
ccgggtgaacgcggagagaccgggccacctggacctgctggattcgctggccctcctggt
gctgatggccagcctggtgccaagggtgatcaaggagaagccggacagaaaggagatgct
ggtgcccccggcccacaaggcccctcgggagcccctgggccacagggtcctactggagtg
actggtcctaagggagcccgaggtgcccaaggtcccccgggagccaccggattccctgga
gctgctggccgagttggacccccaggtgctaatggcaatcctggacccgccggtccccct
ggtcctgctggaaaagatggtcccaaaggtgttcgaggagacagtggcccccctggcaga
gctggtgaccccggtcttcaaggtcctgcaggagctcctggcgagaaaggagaacctgga
gatgatggtccctctggtcttgatggtcctccaggtccccaggggctggctggtcaaagg
ggcattgttggtctgcctggtcagcgtggtgagagaggattccccggccttcccggccca
tcgggtgagcccggcaagcagggtgcacctggcgcgtctggagacagaggtcctcctggt
cctgtggggcctcctggcctgacagggcctgcaggtgaacctggacgagagggcagccct
ggtgctgatggaccccctggaagagatggtgcagctggagtcaagggagatcgtggtgag
actggagcactgggtgcccctggagctcctgggcccccaggctctcctggtcctgctggc
ccaactggcaaacaaggagacagaggagaggctggtgcacaaggtcctatgggtccctca
ggacctgctggagcccgtgggattgcaggccctcaaggcccccgaggtgacaaaggagaa
tctggagagcagggcgagaggggactgaagggacaccgaggtttcactggactgcagggt
ctgcctggccctccgggtccttctggagatcagggtgcttctggccctgctggtccttct
ggccctagaggtccacctggccctgttggtccctctggcaaagatggctctaatggaatc
cctggccccatcgggcctccaggtccccgtggacgctcaggagaaacaggccctgttggt
ccccctggaagtcccggtcctcctggccctccaggtcctcctggtcctggcatcgacatg
tcagcctttgctggcttagggcagagagagaagggccccgaccccatgcagtacatgcgg
gccgacgaggcagacagtaccttgagacagcacgacgtggaggtggacgctacactcaag
tcactgaacaaccagattgagagcatccgcagccccgacggctcccgcaagaaccctgct
cgcacttgccaagacctgaaactctgccaccccgagtggaagagcggagactactggatt
gatcccaaccagggctgcaccttggacgccatgaaagttttctgcaacatggagaccggc
gagacttgcgtctaccccaacccagcgactgtccctcggaaaaactggtggagcagcaag
agcaaggaaaagaaacacatctggtttggagagaccatgaacggtggcttccacttcagc
tatggcgatggcaacctggctcccaacaccgctaacgtccagatgactttcctccgtcta
ctgtccactgagggctcccagaacatcacctaccactgtaagaacagcatcgcctacctg
gacgaagcggctggcaacctcaagaaggccttgctcatccagggctccaatgatgtagag
atgagggccgagggcaacagcaggttcacatacactgccctgaaggatggctgcacgaaa
cacactggtaagtggggcaagaccgtcatcgagtaccgatcacagaagacctcccgcctt
cccattattgacatcgcacccatggacattggaggggctgaacaggaatttggtgtggac
atagggcctgtctgcttcttgtaa

KEGG   Mus musculus (mouse): 12826
Entry
12826             CDS       T01002                                 

Gene name
Col4a1, Br, Bru, Col4, Col4a-1, Del(8)44, R, Raw, S, Svc
Definition
(RefSeq) collagen, type IV, alpha 1
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12826 (Col4a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12826 (Col4a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12826 (Col4a1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12826 (Col4a1)
  09154 Digestive system
   04974 Protein digestion and absorption
    12826 (Col4a1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12826 (Col4a1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12826 (Col4a1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12826 (Col4a1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12826 (Col4a1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12826 (Col4a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12826 (Col4a1)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12826 (Col4a1)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12826 (Col4a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12826 (Col4a1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12826
NCBI-ProteinID: NP_034061
MGI: 88454
Ensembl: ENSMUSG00000031502
Vega: OTTMUSG00000021316
UniProt: P02463
Position
8 A1.1; 8 5.53 cM
AA seq 1669 aa
MGPRLSVWLLLLFAALLLHEERSRAAAKGDCGGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPHGPPGQKGDAGEPGLPGTKGTRGPPGAAGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEILGHVPGTLLKGERGFPGI
PGMPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGSSFQGPKGDKGEQGVSGPP
GVPGQAQVKEKGDFAPTGEKGQKGEPGFPGVPGYGEKGEPGKQGPRGKPGKDGEKGERGS
PGIPGDSGYPGLPGRQGPQGEKGEAGLPGPPGTVIGTMPLGEKGDRGYPGAPGLRGEPGP
KGFPGTPGQPGPPGFPTPGQAGAPGFPGERGEKGDQGFPGVSLPGPSGRDGAPGPPGPPG
PPGQPGHTNGIVECQPGPPGDQGPPGTPGQPGLTGEVGQKGQKGESCLACDTEGLRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGSPGLIGQPGAKGEPGEIFFDMRLK
GDKGDPGFPGQPGMPGRAGTPGRDGHPGLPGPKGSPGSIGLKGERGPPGGVGFPGSRGDI
GPPGPPGVGPIGPVGEKGQAGFPGGPGSPGLPGPKGEAGKVVPLPGPPGAAGLPGSPGFP
GPQGDRGFPGTPGRPGIPGEKGAVGQPGIGFPGLPGPKGVDGLPGEIGRPGSPGRPGFNG
LPGNPGPQGQKGEPGIGLPGLKGQPGLPGIPGTPGEKGSIGGPGVPGEQGLTGPPGLQGI
RGDPGPPGVQGPAGPPGVPGIGPPGAMGPPGGQGPPGSSGPPGIKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGTPGVPGFPGSKGEMGVMGTPGQPGSPGPAGTPG
LPGEKGDHGLPGSSGPRGDPGFKGDKGDVGLPGMPGSMEHVDMGSMKGQKGDQGEKGQIG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGLSGTPGSPGLPGPKGSVGGMGLPG
SPGEKGVPGIPGSQGVPGSPGEKGAKGEKGQSGLPGIGIPGRPGDKGDQGLAGFPGSPGE
KGEKGSAGTPGMPGSPGPRGSPGNIGHPGSPGLPGEKGDKGLPGLDGVPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEQGVPGRGFPGFPGSKGDKGSKGEVGFPGLAGSP
GIPGVKGEQGFMGPPGPQGQPGLPGTPGHPVEGPKGDRGPQGQPGLPGHPGPMGPPGFPG
INGPKGDKGNQGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGLPGVPGFQGQKG
LPGLQGVKGDQGDQGVPGPKGLQGPPGPPGPYDVIKGEPGLPGPEGPPGLKGLQGPPGPK
GQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPLCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPNGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggcttctgctgctcttcgccgcccttctgctccacgag
gagcgcagccgagcagctgcgaagggcgattgtggtggctctggctgtggaaaatgtgac
tgtcatggcgtgaagggacaaaagggagaaagaggcttgccagggttgcaaggtgtcatt
ggatttcccgggatgcaaggacctgaggggccacatggaccaccaggacaaaagggtgat
gctggagaacctggacttcctggcacaaaagggacgaggggaccccctggagcagctggc
taccctgggaatcccggacttcctggtattcctggccaagatggccctccaggtccccca
ggtatcccaggatgcaacggtacaaagggagagagagggccgctcggtcctcctggcttg
cctggattcagcggcaatcctggaccaccagggttaccaggaatgaagggagatcctggt
gaaattctcggccacgttcccggaacactgctgaaaggggagagaggatttcctggtatc
cccgggatgccgggctcaccagggttgccaggactccagggacccgtgggtcctccggga
tttactggaccaccgggtcctccaggccctcctggacctcctggagaaaagggccagatg
ggctccagcttccaaggaccaaaaggtgacaagggagagcaaggggtcagcggtcccccg
ggagttcctggacaggcacaagttaaggaaaaaggagactttgccccaacaggagaaaag
ggtcagaaaggtgaacctggatttccgggagtaccaggatatggagagaaaggtgaacct
ggcaagcaagggcctcggggaaaacctgggaaagacggtgaaaaaggagaaagggggagc
ccgggcattcctggcgattccgggtacccaggtctcccaggccggcagggcccgcaggga
gagaagggtgaagctggacttccaggcccccctggaactgtgataggcaccatgcctttg
ggagagaaaggagatcgaggctacccaggagcaccagggttgagaggagagcccggccct
aaaggtttccctggaacaccaggccaaccaggccctccaggtttccctactccaggccag
gctggtgctccaggcttcccaggtgaaaggggagaaaaaggtgaccagggatttccaggc
gtgtcattgccaggaccaagtggaagagatggagccccggggcctcccggccctcccggc
ccccctgggcagccaggccacacaaatggcattgtggagtgtcaacctggaccacccggg
gaccagggccctcctgggactccaggacagccaggtttgacaggtgaagttggacagaaa
ggtcagaaaggagagagctgccttgcctgtgacacagaaggacttcgtggccctccaggg
ccacaggggcctccgggagagattggtttccctggacaacctggggctaaaggtgaccga
ggcctgcccggcagagatggtcttgaaggattgccgggtccacaaggttcaccagggctt
ataggccagcctggagctaagggagagcctggagagatattttttgacatgcgactcaaa
ggtgacaaaggagacccaggttttccgggacagcctggcatgccaggaagagcaggaact
cccggaagagatggccacccaggactccctggacccaaaggctctccgggttcaatagga
ttgaaaggagagcgaggtcctcccggaggagttggcttccctggtagtcgtggagacatc
ggccctcctggacccccaggagttggccccattggtcctgttggtgaaaaaggacaagca
ggctttcctggaggccctgggtccccaggtcttccaggcccaaagggtgaggcaggaaag
gttgtcccactccctggccccccaggagctgcaggacttccaggatcccctggcttccca
gggccacaaggtgaccgaggcttcccaggaaccccaggacgtccaggcatcccgggagag
aaaggtgctgtgggccagccaggaattggatttcctgggcttcctggccccaaaggtgtt
gatggcttgcctggagagataggacggcctgggagtccaggtcgccctggatttaacggc
ttacctggcaacccaggacctcaaggtcaaaagggagaacctggcattgggcttccagga
ctcaaagggcaaccaggccttccaggcattcccggtacacctggagagaagggtagcatc
gggggacccggcgttccaggagaacaggggttgacaggccccccaggactccaggggatc
agaggtgacccagggcctcctggagttcaaggcccagcaggtccaccaggggtcccagga
atagggccacccggagctatgggccctcccggagggcaaggaccaccagggtcatcaggt
ccacctggaattaaaggagagaaagggttccctggattccccggactggatatgcctggc
cccaaaggcgataaaggctctcaaggacttcctggcctcacaggacagtcaggcctccct
ggccttcctggacagcaggggacacctggagttccagggttcccaggttctaaaggtgaa
atgggtgtcatgggaaccccgggacaaccaggctcgccaggaccagcaggcaccccaggg
ttacctggagaaaaaggggaccatggccttccgggctcctcaggacccaggggcgaccct
ggcttcaaaggtgataaaggtgacgttgggcttcctggcatgccaggatccatggagcat
gtggacatggggagcatgaagggacagaaaggagaccagggagagaaaggacaaatcgga
cccactggtgataaaggttcccgaggagaccctggaacaccaggagtacctgggaaggat
gggcaggcagggcatcccggacagccagggcctaaaggtgacccaggccttagtgggaca
ccaggatcccctggactccctggacccaaaggatcagttggaggaatgggcttgccaggt
tcgcctggagaaaaaggtgtgcctggcatccctggctcacagggtgtccctggctcacct
ggagagaagggagccaaaggagagaaagggcagtcaggtctacctggcattgggattccg
ggacggcctggtgacaagggagaccagggccttgctggcttcccaggcagccccggtgag
aagggagagaaaggcagtgccggaaccccagggatgccagggtccccaggcccgagaggt
tctccggggaacatcggccatccaggaagcccaggcctgcctggagagaaaggggataaa
ggcctcccaggactggatggcgttcctggtgtcaaaggagaagcaggtctccctgggact
cctggccccacaggcccagctggccagaagggagagccgggcagcgatggaatcccgggg
tcggcaggagagaagggtgaacaaggtgttccaggaagaggcttccccggcttccctggc
tccaaaggagacaaaggctccaagggtgaagtgggtttccctggcctagctggaagtcct
ggaattcctggagtcaaaggcgagcaagggttcatgggtcctcctggccctcaaggacaa
cctggcttacctggcactcctggtcaccctgtggaggggcccaaaggagaccgaggacct
cagggtcaacctggcctgccagggcatccgggacctatggggccgccagggttcccagga
atcaatgggccaaaaggtgacaagggaaatcaaggttggccaggagctccgggggttcca
ggccctaagggagacccaggattccaaggcatgccgggcattggcggctctccagggatc
acaggttcaaagggagatatgggactgcccggagttccaggatttcaaggtcagaaaggg
cttcctggtctgcagggagtgaaaggagaccagggagaccaaggtgtacccggccctaaa
ggtctccaaggtccccctgggcctccaggtccctacgatgtcatcaaaggagaaccaggg
ctccctggtcctgagggtccccctggtcttaaaggactccagggaccaccaggtccaaaa
ggacagcaaggtgtgacaggctcagtgggcttgcctggacctccaggtgtccctgggttc
gatggtgctcctggccagaaaggagagacaggaccctttggaccacctggtccaagaggg
tttcctggcccaccaggccccgatgggctgccaggatccatgggtcccccaggtacccca
tctgtggaccatggcttccttgtgaccaggcatagtcagacaacagatgacccactgtgt
cccccagggaccaaaattctttaccatggatactctctgctctatgtccaaggcaacgag
cgtgcccacgggcaggacttgggtacggctggcagctgcctgcgtaagttcagcaccatg
ccctttctcttctgcaacatcaacaacgtctgcaacttcgcctccaggaacgactactct
tactggctgtccacgccagagcccatgcccatgtccatggcacccatctctggggacaac
atccggcccttcattagcaggtgtgcggtttgtgaagcaccggccatggtgatggcggta
cacagtcagaccattcagattccgcagtgccctaacggttggtcctcactgtggatcggc
tattccttcgtgatgcacaccagcgctggtgctgaaggttccggccaagccctcgcatcc
cccgggtcctgtctggaagagtttagaagcgccccattcatcgagtgccacggcagagga
acgtgcaattactacgcaaatgcttacagcttttggctcgccaccatagagagaagcgag
atgttcaagaagcccacgccatccaccttgaaggcaggggagctgcgaacacacgtcagc
cgctgccaagtgtgcatgagaagaacataa

KEGG   Mus musculus (mouse): 12827
Entry
12827             CDS       T01002                                 

Gene name
Col4a2, Col4, Col4a-2
Definition
(RefSeq) collagen, type IV, alpha 2
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12827 (Col4a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12827 (Col4a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12827 (Col4a2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12827 (Col4a2)
  09154 Digestive system
   04974 Protein digestion and absorption
    12827 (Col4a2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12827 (Col4a2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12827 (Col4a2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12827 (Col4a2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12827 (Col4a2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12827 (Col4a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12827 (Col4a2)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12827 (Col4a2)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12827 (Col4a2)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12827 (Col4a2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12827
NCBI-ProteinID: NP_034062
MGI: 88455
Ensembl: ENSMUSG00000031503
Vega: OTTMUSG00000021317
UniProt: P08122 B2RQQ8
Position
8 A1.1; 8 5.62 cM
AA seq 1707 aa
MDRVRFKASGPPLRGWLLLATVTVGLLAQSVLGGVKKLDVPCGGRDCSGGCQCYPEKGAR
GQPGAVGPQGYNGPPGLQGFPGLQGRKGDKGERGVPGPTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTRGDAGPQGPSGSGGFPGLPGPQGPKGQKGEPYALSKEDRD
KYRGEPGEPGLVGYQGPPGRPGPIGQMGPMGAPGRPGPPGPPGPKGQPGNRGLGFYGQKG
EKGDIGQPGPNGIPSDITLVGPTTSTIHPDLYKGEKGDEGEQGIPGVISKGEEGIMGFPG
IRGFPGLDGEKGVVGQKGSRGLDGFQGPSGPRGPKGERGEQGPPGPSVYSPHPSLAKGAR
GDPGFQGAHGEPGSRGEPGEPGTAGPPGPSVGDEDSMRGLPGEMGPKGFSGEPGSPARYL
GPPGADGRPGPQGVPGPAGPPGPDGFLFGLKGSEGRVGYPGPSGFPGTRGQKGWKGEAGD
CQCGQVIGGLPGLPGPKGFPGVNGELGKKGDQGDPGLHGIPGFPGFKGAPGVAGAPGPKG
IKGDSRTITTKGERGQPGIPGVHGMKGDDGVPGRDGLDGFPGLPGPPGDGIKGPPGDAGL
PGVPGTKGFPGDIGPPGQGLPGPKGERGFPGDAGLPGPPGFPGPPGPPGTPGQRDCDTGV
KRPIGGGQQVVVQPGCIEGPTGSPGQPGPPGPTGAKGVRGMPGFPGASGEQGLKGFPGDP
GREGFPGPPGFMGPRGSKGTTGLPGPDGPPGPIGLPGPAGPPGDRGIPGEVLGAQPGTRG
DAGLPGQPGLKGLPGETGAPGFRGSQGMPGMPGLKGQPGFPGPSGQPGQSGPPGQHGFPG
TPGREGPLGQPGSPGLGGLPGDRGEPGDPGVPGPVGMKGLSGDRGDAGMSGERGHPGSPG
FKGMAGMPGIPGQKGDRGSPGMDGFQGMLGLKGRQGFPGTKGEAGFFGVPGLKGLPGEPG
VKGNRGDRGPPGPPPLILPGMKDIKGEKGDEGPMGLKGYLGLKGIQGMPGVPGVSGFPGL
PGRPGFIKGVKGDIGVPGTPGLPGFPGVSGPPGITGFPGFTGSRGEKGTPGVAGVFGETG
PTGDFGDIGDTVDLPGSPGLKGERGITGIPGLKGFFGEKGAAGDIGFPGITGMAGAQGSP
GLKGQTGFPGLTGLQGPQGEPGRIGIPGDKGDFGWPGVPGLPGFPGIRGISGLHGLPGTK
GFPGSPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGVPGQKGERGTPGERGPAGSPGLQ
GFPGISPPSNISGSPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGQK
GWVGDPGPQGQPGVLGLPGEKGPKGEQGFMGNTGPSGAVGDRGPKGPKGDQGFPGAPGSM
GSPGIPGIPQKIAVQPGTLGPQGRRGLPGALGEIGPQGPPGDPGFRGAPGKAGPQGRGGV
SAVPGFRGDQGPMGHQGPVGQEGEPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCP
VGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSY
WLSTTAPLPMMPVAEEEIKPYISRCSVCEAPAVAIAVHSQDTSIPHCPAGWRSLWIGYSF
LMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYFANKYSFWLTTIPEQNFQ
STPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5124 nt   +upstreamnt  +downstreamnt
atggaccgagtgcggttcaaagcgtctggccctcccctgcgagggtggctgctgctagca
actgtgacagtgggactcctggctcagagcgtcttggggggtgtgaagaagttggatgtg
ccctgcggagggagagactgcagtgggggttgccagtgctacccggagaaaggagcaagg
ggtcagccaggggcagtgggcccccaggggtacaatggccccccagggttgcaaggattc
ccaggactacagggccgcaaaggagacaagggcgaacggggagttcctggaccaactgga
ccaaaaggagatgtgggagcgagaggcgtctctggattccccggtgcagatggaattcct
gggcatccagggcaaggcggaccccgaggaaggccaggctacgatggttgcaacggaacc
aggggagacgcgggtccacaggggccctctggatctgggggcttccctggcctccctggg
ccccaaggacccaaggggcagaaaggcgagccttacgcactctctaaagaagaccgtgac
aaataccggggtgaacctggagagcctggcctggttggctaccagggacctcctggccgc
ccagggcccataggacagatgggtcccatgggagcacctggaagaccgggaccaccggga
ccccctggacccaagggacaaccaggcaaccgagggctcggtttttacggacagaagggt
gaaaagggtgacataggacagccaggacccaacgggatcccatctgacatcacacttgtt
gggcccacaacatcaacgatccacccggatctgtacaagggtgaaaaaggagacgaagga
gagcaagggatacccggcgtaatctcaaaaggcgaggaaggcatcatggggttcccagga
atacggggttttcctggccttgatggagaaaaaggagtcgtaggacagaaaggaagcaga
ggcctggatggtttccaaggccccagtggacctcgaggacccaagggagaacggggggaa
caaggacccccaggaccctctgtctactcgccccatccatccctggcaaaaggtgcccga
ggtgacccaggattccaaggtgctcatggggagccaggaagccgaggcgaaccgggagag
cctggaaccgcgggtcctccaggcccatccgtcggagatgaagattcaatgagaggcctt
ccaggggagatgggacccaaaggcttctcaggagaaccagggtcccctgcacgctatctt
ggcccaccaggagcagatggaagaccaggtccccaaggagtccctggacctgcaggccca
cctggaccagatggcttcctgtttggcctgaaaggatcagaagggagagtgggctacccg
ggaccttcaggtttcccaggaacacgagggcagaaaggatggaaaggtgaagctggagac
tgccaatgtggccaggtcatcgggggtcttccgggactgccaggacccaaaggttttccc
ggtgtcaatggggagctcggaaagaaaggcgaccaaggagacccaggcctgcacggtatc
cctgggttcccaggattcaagggagccccgggcgttgctggagctcctggacccaaagga
ataaagggagactctagaacaatcaccaccaaaggggagcgaggacagccaggtatccct
ggtgtacacggaatgaaaggtgatgacggcgtcccaggacgggatgggctggatggattc
cctggcctgcctggccccccgggcgatggcatcaaaggccctccaggggatgcaggtctc
cctggtgtacccggaaccaagggctttccaggagacatcggtcctccaggacagggctta
ccaggtccgaaaggcgagcgtggtttccctggagacgctgggttgcctggacctccgggc
ttccctggtcctccaggtcccccaggaaccccaggacagagagattgtgacacgggtgtg
aaaagacctatcggaggcggccagcaagtggtcgttcagccaggttgcattgaaggcccc
acaggatcacctggccagccaggacccccaggccccacaggtgccaaaggcgtcagagga
atgccaggattcccaggagcttctggagaacaagggctcaagggctttccaggagaccca
ggtcgagaaggcttcccagggcccccagggttcatgggaccccgaggatccaaaggtacc
acaggcctccctggcccagatggacccccaggccccattggacttccaggaccagctgga
ccccctggggacagagggattccaggagaagtgctaggggcccagcctgggacacgagga
gatgctggacttccaggacagcctgggctgaaagggctcccaggagaaacaggagcaccg
ggattcagaggaagccaagggatgcccgggatgccagggcttaaaggtcagccaggcttc
ccaggaccttcgggacagccaggacagtctgggcccccaggacaacatggattcccagga
actcctggccgagaggggcctttggggcagccaggctcccctggccttggaggtttgcct
ggagacaggggcgagcctggtgaccctggtgtccctggccctgtgggcatgaaaggtctg
tctggggacagaggtgacgccggtatgtcaggcgagcgtggtcaccctggaagccctgga
tttaaaggaatggcagggatgcctggtatccctgggcagaaaggtgatcgagggtcgcct
ggaatggacggtttccagggcatgttgggacttaagggaagacagggcttcccagggacc
aaaggagaagctggattttttggagttcctggtttgaagggcctgcctggcgagccaggg
gttaaaggaaaccgaggagaccgggggcccccaggaccacctcctctcatcctgccgggg
atgaaggacatcaagggagaaaagggagatgaaggaccaatgggcctgaaagggtatctg
ggcttaaaaggcatccaaggaatgcccggagtccccggagtgtctggattccctgggcta
cctggaaggcctggcttcatcaaaggagtcaagggagacatcggagtccctggcacacca
ggcttgccgggattccctggggtgtctggccctcctggaattaccgggtttccaggattc
acaggcagccggggcgagaagggtactccaggagtagcaggagtttttggcgagaccggc
cctactggggactttggtgacattggggacactgtggacttaccagggagcccaggcctg
aagggggaacggggcatcacgggaattccaggtctcaagggattctttggagagaaagga
gctgcgggtgacattggatttcctgggataacgggcatggctggagctcagggctctcct
ggacttaaaggacagacaggctttccagggctcacagggctgcaagggccacagggagag
cctggacggatcggaatacccggtgacaagggtgatttcggctggccaggcgtaccaggt
ctaccaggttttcctgggatccggggcatcagcggattgcacggcctgccaggcaccaaa
ggcttccctggatcaccaggtgttgatgctcatggagacccagggttcccaggccccact
ggagacaggggtgaccggggagaggccaacacacttccaggccccgtgggagttccaggg
cagaaaggggagcggggaaccccaggggaacgtggcccagctggaagcccaggacttcag
ggtttccctggtatctctccaccatccaacatctctgggtcacctggtgatgtaggcgca
ccaggaatatttggcttgcaaggctaccaaggccctccaggaccacctgggccaaacgca
cttcctggaatcaaaggagatgaggggagctctggagctgcaggattccctggacagaaa
ggatgggttggggacccagggccccagggccagcctggtgtactcggtcttccgggggag
aaagggcccaagggtgaacaaggattcatgggcaacaccgggccctctggggccgtgggt
gacagaggccccaaaggacccaaaggcgaccaaggattcccaggtgctcctggctctatg
gggtccccaggaattcctggcatcccccagaagattgctgtccagcctggaacgctgggt
ccccagggcaggagaggccttcctggggccctgggagagatagggccgcagggccctcct
ggagatccaggcttccgtggggccccaggcaaggctgggcctcagggcagaggtggcgtg
tctgctgttccagggttccggggagaccaagggcccatgggacaccagggtccagttggc
caggaaggggagccaggccgtccagggagcccaggcctgccagggatgccaggccgcagt
gtgagcatcggctacctcctggtgaagcacagccaaacggaccaggaacccatgtgccct
gtgggcatgaacaagctctggagtgggtacagcctgctatattttgagggccaggagaaa
gcgcacaaccaggacctaggactggcaggctcctgcctggcacgcttcagcaccatgcct
ttcctgtactgcaatccgggtgacgtctgctactatgccagccgcaacgacaagtcctac
tggctctccaccacggcccctctgcccatgatgcccgtggctgaggaggaaatcaagccc
tacatcagccgctgctctgtgtgcgaggctccggccgtggccattgccgtgcacagccag
gatacctctataccccactgcccggctgggtggcggagtttgtggatcggatattcattc
ctcatgcacactgcagccggggatgaaggcggtggccagtcactggtgtcgccgggcagc
tgtctagaggacttccgtgcaacgccatttatcgagtgtaacgggggccgtggtacctgc
cactacttcgctaacaagtacagcttctggctgaccacgatcccagagcagaacttccag
agcacaccatccgctgacacgctcaaggctggcctcatccgcacgcacatcagccgctgc
caagtgtgcatgaagaatctgtga

KEGG   Mus musculus (mouse): 12828
Entry
12828             CDS       T01002                                 

Gene name
Col4a3, [a]3(IV), alph, alpha3(IV), tumst
Definition
(RefSeq) collagen, type IV, alpha 3
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12828 (Col4a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12828 (Col4a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12828 (Col4a3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12828 (Col4a3)
  09154 Digestive system
   04974 Protein digestion and absorption
    12828 (Col4a3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12828 (Col4a3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12828 (Col4a3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12828 (Col4a3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12828 (Col4a3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12828 (Col4a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12828 (Col4a3)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12828 (Col4a3)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12828 (Col4a3)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12828 (Col4a3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12828
NCBI-ProteinID: NP_031760
MGI: 104688
Ensembl: ENSMUSG00000079465
Vega: OTTMUSG00000021696
UniProt: Q9QZS0
Position
1; 1 C5
AA seq 1669 aa
MHSKTAPRFLVFLLLTLLLLLAASPVASKGCVCKGKGQCLCAGTKGEKGEKGVPGSPGFP
GQKGFPGPEGLPGPQGPKGSPGLPGLTGPKGIRGITGLPGFAGPPGLPGLPGHPGPRGLA
GLPGCNGSKGEQGFPGFPGTPGYAGLPGPDGLKGQKGEPAQGEDRGFNGKGDPGPPGVPG
FQGFPGLPGFPGPAGPPGPPGFFGLPGAMGPRGPKGHMGDSVIGQKGERGMKGLTGPPGP
PGTVIFTLTQPYNKSDFKGEKGDEGERGEPGPPGPSGPPGDSYGSEKGAPGEPGPRGKPG
KDGAPGFPGTEGAKGNRGFPGLRGEAGIKGRKGDIGPPGFPGPTEYYDAYLEKGERGMPG
LPGPKGARGPQGPSGPPGVPGSPGLSRPGLRGPIGWPGLKGSKGERGPPGKDTVGPPGPL
GCPGSPGPPGPPGPPGCPGDIVFKCSPGEHGMPGDTGPPGVPGLDGPKGEPGSPCTECHC
FPGPPGVPGFPGLDGIKGIPGGRGVPGLKGNPGSPGSAGLPGFAGFPGDQGHPGLKGDKG
DTPLPWGQVGNPGDPGLRGLPGRKGFDGTPGGPGAKGPPGPQGEPALSGRKGDQGPPGPP
GFPGPPGPAGPAGPPGYGPQGEPGPKGAQGVPGVLGPPGEAGLKGEPSTSTPDLGPPGPP
GPPGQAGPRGLPGLPGPVGKCDPGLPGPDGEPGIPEAGCPGPPGPKGNQGFPGTKGSPGC
PGEMGKPGRPGEPGIPGAKGEPSVGRPGKPGKPGFPGERGNAGENGDIGLPGLPGLPGTP
GRGGLDGPPGDPGQPGSPGAKGSPGRCIPGPRGTQGLPGLNGLKGQPGRRGDTGPKGDPG
IPGMDRSGVPGDPGPPGTPGCPGEMGPPGQKGYPGAPGFPGPPGEKGEVGMMGYPGTTGP
PGLPGKPGSQGQRGSLGIPGMKGEKGRPGAKGERGEKGKPGPSQTTLLKGDKGEPGLKGF
VGNPGEKGNRGNPGLPGPKGLEGLPGLPGPPGPRGDTGSRGNPGRPGPHGMPGSMGIMGV
PGPKGRKGTSGLPGLAGRPGLTGIHGPQGDKGEPGYSEGARPGPPGPKGDPGLPGDKGKK
GERGVPGPPGQSGPAGPDGAPGSPGSPGHPGKPGPAGDLGLKGQKGFPGPPGSTGPPGPP
GLPGLPGPMGMRGDQGRDGIPGPPGEKGETGLLGAYPGPKGSPGVPGAKGDRGVPGLSGL
PGRKGVMGDVGPQGPPGTAGLPGPPGLPGAIIPGPKGDRGLPGLRGNPGEPGPPGPPGPI
GKGIKGDKGFMGPPGPKGLPGTVGDMGPPGFPGAPGTPGLPGVRGDPGFPGFPGIKGEKG
NPGFLGPIGHPGPVGPKGPPGPRGKPGTLKVISLPGSPGPPGVPGQPGMKGDPGPLGLPG
IPGPCGPRGKPGKDGKPGTPGPAGTKGNKGLKGQQGPPGLDGLPGLKGNPGDRGTPATGT
RMRGFIFTRHSQTTAIPSCPEGTQPLYSGFSLLFVQGNKRAHGQDLGTLGSCLQRFTTMP
FLFCNINNVCNFASRNDYSYWLSTPALMPMDMAPISGRALEPYISRCTVCEGPAMAIAVH
SQTTAIPPCPQDWVSLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRGT
CNYYSNSYSFWLASLNPERMFRKPIPSTVKAGDLEKIISRCQVCMKKRH
NT seq 5010 nt   +upstreamnt  +downstreamnt
atgcactccaagactgctccaaggttcctggtgttcctgctgcttaccctgctgctactc
ctggctgcctctcctgtggctagcaagggctgtgtctgcaaaggcaaaggacaatgcctc
tgcgcggggaccaaaggggagaagggggagaaaggggttcctggttcccctggatttcct
ggccagaaaggatttccaggtcctgaaggcttgcctggaccacagggacccaagggctcc
ccaggacttccgggactcactggccccaaaggcatcaggggaataactggattaccaggg
tttgcaggtcctcctggacttccaggcctcccaggccaccctgggcctcgtgggctggct
ggtttaccaggatgcaacggatctaagggtgaacaaggattcccgggctttcccggcaca
ccaggctatgcagggctcccaggtcctgatggcttgaaaggacaaaagggtgagcctgct
caaggagaagacaggggattcaatggaaaaggtgaccctgggcctccaggggttccaggc
ttccagggttttccgggactcccaggttttccagggcctgccggtccaccaggacctccg
ggattttttggtttaccaggagcaatgggacccagaggacccaaaggccacatgggcgat
agcgtgataggacaaaaaggagaaaggggtatgaaaggattaacaggacctcctgggcca
ccaggaacagtgatttttacactcacccagccatacaacaaatcggacttcaagggagag
aaaggagatgagggagagaggggtgaacccggacctcctggaccctcggggccacctgga
gactcctatggatcagaaaagggtgcgcctggagagcctggtcctcggggcaaacctgga
aaagatggtgcccctggcttccctggcactgagggagccaaaggcaacaggggctttcct
gggttgaggggtgaagctggcattaagggaaggaaaggagacattggccccccaggattt
cctggtccaacagaatattatgacgcatacctggaaaagggagagagaggaatgccaggc
cttcctgggcccaaaggagcccgtggcccacagggtccaagtggtccccctggagtccct
ggaagtcctggactgtcaaggcctggcctcagaggacccattggatggccaggcttgaaa
gggagtaaaggagagagaggaccccctggaaaagacactgtgggccctcctggacccctg
ggatgtcctggctcaccaggtccaccaggccctccaggacctccaggatgtccaggtgac
attgtttttaaatgttcccctggtgagcatggaatgccaggtgacacaggacctccagga
gttccaggacttgatggacccaaaggagaaccaggcagtccatgcaccgagtgtcactgc
ttcccagggccccctggagtcccaggatttccaggattagatggtatcaaaggaatccca
ggaggacgaggtgtgcctggtctgaagggaaacccagggtccccaggaagtgcaggtctc
ccaggatttgcaggattcccaggtgaccaagggcatccaggacttaaaggggacaaaggg
gatacacctctaccctgggggcaagtgggtaatccaggtgatcctggactcagaggcctg
cccgggagaaagggcttcgatggaactcctggaggtccaggagcgaaaggaccaccagga
cctcagggcgaaccggccctgagtggaaggaaaggggaccaaggacctccaggacctcct
ggattccctggacccccaggacctgcaggaccagctggaccaccaggctatggacctcaa
ggagagccaggtccaaagggagcccaaggagtccccggcgtcctgggaccacctggagaa
gccggtcttaaaggagaacccagtacatcaactccagatctaggtcccccaggccctcca
gggccccctggtcaggctggccccagaggtctacctggtttgcctggaccggtgggaaaa
tgtgatcctggtcttcctggacctgatggtgagccaggaattccagaagctggatgccca
gggcctccaggacctaagggaaatcaaggtttcccagggacaaaaggatcacctggttgt
cctggggaaatgggaaagcctggccgtcctggagaaccaggcatcccaggagccaaggga
gaaccatccgtaggcaggcctgggaaaccagggaaaccaggctttccaggagaaagaggc
aatgctggggaaaatggagatattggactccctgggcttccaggcctccctggaactcca
ggaagaggcgggcttgacgggcctccaggagacccaggacagcctggatcacctggagca
aaaggatccccaggaaggtgcataccaggacccaggggcacccaaggacttcccggcttg
aatggattgaaagggcaaccaggtcgaagaggtgataccgggccaaagggagaccctggc
atcccaggcatggacaggtctggggtccccggagatcctggaccaccaggaacaccaggt
tgtcctggtgagatgggaccacctggacaaaaaggatatccgggagctccaggattccca
ggaccaccaggcgagaaaggagaggttggaatgatgggctatcctggaaccactggccct
ccggggcttcctgggaaaccaggctcacaggggcagcgaggtagcctcggaatcccagga
atgaaaggggaaaaaggacgcccaggagccaaaggcgaacgaggagagaaaggaaaacca
gggccttctcaaacaacactcttaaagggagacaaaggagagcctggacttaaaggattt
gttgggaatccaggtgagaaaggaaacagaggcaacccagggttaccaggtccgaaaggc
ctcgagggattgcctgggctaccaggtcctccaggccctagaggagatacgggaagcaga
ggaaatcctggaagaccaggaccacacggcatgccaggaagcatggggatcatgggggtg
ccaggccctaaagggagaaaaggaacttcagggcttccaggtctagctggaagaccaggc
ctgacagggatccatggtccccaaggagataagggggagccgggttattcagaaggtgca
agaccaggaccaccaggaccaaagggcgatccaggattgccaggtgacaaaggaaagaaa
ggagaaagaggcgtacctgggccacctggacaatcggggcctgccggacctgatggagcc
cctgggagtcctgggagtcctggtcacccaggaaaaccaggtcctgctggtgatttgggt
ctgaaaggacagaaaggcttcccaggccctccaggaagcactggccctccaggccctcca
ggactcccaggacttcctgggccaatgggtatgagaggtgaccaaggacgtgatggaatt
cctggtcctccaggagaaaaaggagaaacaggcttgctgggggcctacccaggcccaaaa
gggagccctggtgtaccaggtgccaaaggagacaggggagttccaggcttgtctggcctt
cccggcaggaagggggttatgggggatgttggaccccaaggacccccaggcactgctgga
ctcccagggccaccaggtctacctggggcaattatccctggccccaaaggagacagaggt
cttcctggcttaagaggaaatccaggtgagccaggtccccctggacctccaggacctatc
ggaaaaggcataaaaggtgacaaaggatttatgggcccacctggccccaaaggcctgcct
ggaactgtaggggacatgggcccaccaggtttcccgggagcaccaggtacccccggtctt
cccggtgtcagaggtgatccaggattccctggatttccaggcataaaaggagaaaagggt
aaccctggatttcttgggccaattggacatccaggaccagttgggcccaaaggaccacca
ggtccacgtggaaaacctggcactcttaaggtcatctctcttccgggaagcccagggcca
cctggtgtacctggacagccaggaatgaaaggagaccctggacccctgggactgccagga
atcccaggaccctgtgggccaagaggtaaaccaggcaaggatgggaaaccaggaactcca
ggaccagctggtacaaagggcaacaagggcttgaaaggacagcaaggcccacctggtctg
gatggattgccaggcttaaagggaaatcctggtgacaggggaacgccggccactggtaca
agaatgcgaggcttcatcttcacccgacacagtcaaaccacggccattccttcatgccct
gaaggaacacagccactctatagtgggttttctcttctttttgtacaaggaaacaaacgt
gcacatggacaagacctaggtactctgggcagctgcctgcagcgattcaccacaatgccg
ttcttattctgtaacatcaataatgtatgtaactttgcatcacgaaatgattattcatac
tggctgtcaacaccagctctgatgccaatggacatggctccaattagtggcagagctctc
gaaccctatattagcagatgcaccgtctgtgaaggtccagcaatggccatagctgttcac
agtcaaactactgctatccctccgtgtccccaggactgggtttctctctggaaaggtttt
tctttcattatgttcacaagtgcaggctctgagggtgctggacaagcacttgcctcgcct
ggctcctgcctggaagaattccgagccagtccatttatagaatgccatggacgagggaca
tgtaactactactcaaactcctacagtttctggctggcttcgctgaacccagaaagaatg
ttcagaaaacctattccatcaactgtgaaagctggagacttagagaaaatcataagccgc
tgtcaggtgtgcatgaagaaaagacattga

KEGG   Mus musculus (mouse): 12829
Entry
12829             CDS       T01002                                 

Gene name
Col4a4, E130010M05Rik, [a]4(, [a]4(IV)
Definition
(RefSeq) collagen, type IV, alpha 4
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12829 (Col4a4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12829 (Col4a4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12829 (Col4a4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12829 (Col4a4)
  09154 Digestive system
   04974 Protein digestion and absorption
    12829 (Col4a4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12829 (Col4a4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12829 (Col4a4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12829 (Col4a4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12829 (Col4a4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12829 (Col4a4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12829 (Col4a4)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12829 (Col4a4)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12829 (Col4a4)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12829 (Col4a4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12829
NCBI-ProteinID: NP_031761
MGI: 104687
Ensembl: ENSMUSG00000067158
Vega: OTTMUSG00000021695
UniProt: Q9QZR9
Position
1; 1 C5
AA seq 1682 aa
MRCFFRWTKSFVTAPWSLIFILFTIQYEYGSGKKYGGPCGGRNCSVCQCFPEKGSRGHPG
PLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEKGDKGPTGVPGFPGVDGVPGHPG
PPGPRGKPGVDGYNGSRGDPGYPGERGAPGPGGPPGQPGENGEKGRSVYITGGVKGIQGD
RGDPGPPGLPGSRGAQGSPGPMGHAGAPGLAGPIGHPGSPGLKGNPATGLKGQRGEPGEV
GQRGPPGPTLLVQPPDLSIYKGEKGVKGMPGMIGPPGPPGRKGAPGVGIKGEKGIPGFPG
PRGEPGSHGPPGFPGFKGIQGAAGEPGLFGFLGPKGDLGDRGYPGPPGILLTPAPPLKGV
PGDPGPPGYYGEIGDVGLPGPPGPPGRPGETCPGMMGPPGPPGVPGPPGFPGEAGVPGRL
DCAPGKPGKPGLPGLPGAPGPEGPPGSDVIYCRPGCPGPMGEKGKVGPPGRRGAKGAKGN
KGLCTCPPGPMGPPGPPGPPGRQGSKGDLGLPGWHGEKGDPGQPGAEGPPGPPGRPGAMG
PPGHKGEKGDMVISRVKGQKGERGLDGPPGFPGPHGQDGGDGRPGERGDPGPRGDHKDAA
PGERGLPGLPGPPGRTGPEGPPGLGFPGPPGQRGLPGEPGRPGTRGFDGTKGQKGDSILC
NVSYPGKPGLPGLDGPPGLKGFPGPPGAPGMRCPDGQKGQRGKPGMSGIPGPPGFRGDMG
DPGIKGEKGTSPIGPPGPPGSPGKDGQKGIPGDPAFGDPGPPGERGLPGAPGMKGQKGHP
GCPGAGGPPGIPGSPGLKGPKGREGSRGFPGIPGSPGHSCERGAPGIPGQPGLPGTPGDP
GAPGWKGQPGDMGPSGPAGMKGLPGLPGLPGADGLRGPPGIPGPNGEDGLPGLPGLKGLP
GLPGFPGFPGERGKPGPDGEPGRKGEVGEKGWPGLKGDLGERGAKGDRGLPGDAGEAVTS
RKGEPGDAGPPGDGGFSGERGDKGSSGMRGGRGDPGRDGLPGLHRGQPGIDGPPGPPGPP
GPPGSPGLRGVIGFPGFPGDQGDPGSPGPPGFPGDDGARGPKGYKGDPASQCGPPGPKGE
PGSPGYQGRTGVPGEKGFPGDEGPRGPPGRPGQPGSFGPPGCPGDPGMPGLKGHPGEVGD
PGPRGDAGDFGRPGPAGVKGPLGSPGLNGLHGLKGEKGTKGASGLLEMGPPGPMGMPGQK
GEKGDPGSPGISPPGLPGEKGFPGPPGRPGPPGPAGAPGRAAKGDIPDPGPPGDRGPPGP
DGPRGVPGPPGSPGNVDLLKGDPGDCGLPGPPGSRGPPGPPGCQGPPGCDGKDGQKGPMG
LPGLPGPPGLPGAPGEKGLPGPPGRKGPVGPPGCRGEPGPPADVDSCPRIPGLPGVPGPR
GPEGAMGEPGRRGLPGPGCKGEPGPDGRRGQDGIPGSPGPPGRKGDTGEAGCPGAPGPPG
PTGDPGPKGFGPGSLSGFLLVLHSQTDQEPACPVGMPRLWTGYSLLYMEGQEKAHNQDLG
LAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLSSAAPLPMMPLSEEEIRSYISRCAV
CEAPAQAVAVHSQDQSIPPCPRTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRA
APFVECQGRQGTCHFFANEYSFWLTTVNPDLQFASGPSPDTLKEVQAQRRKISRCQVCMK
HS
NT seq 5049 nt   +upstreamnt  +downstreamnt
atgaggtgctttttcagatggaccaagtcctttgtcacagccccctggtcactaatattt
atcctctttactatacaatatgaatatgggagtggcaagaagtatggcggcccctgcgga
ggaagaaactgctctgtctgccagtgttttcctgagaagggatctcggggtcacccagga
ccactgggaccacagggaccaatagggcccttgggacccctgggacctattgggattcca
ggagaaaaaggagagagaggagacagtggctcacctggaccaccaggagagaaaggagac
aagggtccaactggtgtccctggatttccaggtgtggatggtgtacctggtcacccaggg
cctcctggacccagaggcaaacctggtgtggacggctacaatggctcacgtggtgatccg
ggctatccaggagaaagaggagctccaggcccaggcggcccccctggtcagcctggggaa
aatggagaaaaaggaagatctgtgtacattactggtggcgttaaaggtattcagggagac
cgtggggacccaggaccacccggcttaccaggatctaggggtgcacaagggtcaccaggg
cccatggggcatgcaggcgcaccagggctagcaggacctataggtcatcctggaagccct
gggttgaagggaaatcctgccactggactgaagggacaaaggggagagccgggtgaagtt
ggccagcgtggtcctcctgggcccacgctgttggtacagccgcctgatttgagtatctat
aaaggagaaaagggtgtaaaaggaatgcctggaatgattggtcccccaggacctccaggt
cgcaagggagcgcctggtgtcgggatcaaaggagagaaaggtatccctgggttcccagga
ccccggggtgagcctggttcgcatgggcctccaggttttccaggattcaaggggatacaa
ggagcagctggagagcctgggctgttcggatttcttggtccaaagggggatctgggagat
cgcgggtacccgggaccaccaggcattttgttgactccagctccaccactcaaaggtgtt
ccaggggacccaggaccccctggctactatggggagattggagatgttggattaccaggt
cctccagggcccccaggcagaccaggggaaacttgtccaggcatgatgggacctcctgga
ccaccaggggttcctggtcctccaggatttccaggagaagctggggttcctggaagactc
gactgtgccccaggaaaacctgggaagccagggctacccggcctgcctggcgcaccagga
ccagaggggccccctggatcagatgtgatatactgtcgccctgggtgccctgggccaatg
ggtgaaaagggaaaagtgggtcctccaggaagaagaggagccaaaggagcaaaaggaaac
aagggactctgtacctgtccacctggtcccatgggaccccctggtcccccaggacctcct
ggaagacaaggaagtaaaggagacttagggctccctggttggcatggagaaaagggcgat
ccaggtcagcctggtgctgaaggaccgccagggccaccaggaagacctggtgccatgggg
ccccctggtcacaaaggggaaaagggtgacatggtcatatcaagagtgaaagggcaaaaa
ggagaaagaggactggatgggccaccaggatttccggggccacatggacaagatggtggg
gatggacgtcctggagaaagaggggatcctgggccacgaggggaccataaggatgcagcc
ccgggtgagagagggcttcccggactgccaggccctccaggaagaacaggacctgagggg
cctccaggcctgggatttccaggcccaccaggacagcgaggcctacccggagagcccggg
cgtccgggcaccagaggctttgatggcacgaaggggcagaaaggtgactccattctgtgt
aatgtaagctacccgggaaaaccaggtctgccaggtcttgatggacctccaggtctgaag
gggtttccaggccccccaggggctcctggtatgagatgtccagatgggcagaaaggccag
cggggcaaaccgggaatgtctgggattccaggtccacctggttttcgtggtgacatggga
gatccaggcatcaaaggtgaaaaggggacctcccctattggacccccaggcccacctggg
tctcctggaaaggatggccagaaaggaatcccaggagaccctgcatttggtgacccagga
ccccctggagagaggggtcttccaggagcaccaggcatgaaaggacagaaaggtcaccca
gggtgcccaggggctggaggccctcctggcatccctggatctccaggtctcaaaggtccc
aaaggtagagagggaagtcgtggctttccagggatcccaggatcccctggccattcctgt
gaaagaggtgctcccgggatcccaggacaaccgggacttcctggaactccaggtgatcca
ggggccccaggttggaaaggccagccaggagacatgggtccctctggaccagctgggatg
aagggcctccctggtctcccaggcctaccaggagctgatggtctccgagggcctcctggg
attccaggccccaatggggaagatggtctacctggtcttccaggcctgaagggactccca
gggctgcctggcttccctggttttccaggagagagaggaaagccagggccagatggagaa
cctggaagaaagggagaggttggagagaaaggctggcctggcttgaagggagacctggga
gagcgaggcgccaaaggagacagaggacttcctggtgatgcaggagaagcagtcacttcc
agaaagggtgaacctggggatgctgggcctccaggagatggtgggttctcaggagagaga
ggtgataaaggaagctcagggatgcgagggggaagaggggatcccggaagagatggacta
ccgggactccacagagggcagcctgggatagatgggcctcctgggcctcctgggccccct
ggccctccaggatcacctggactgagaggagtcattggtttcccaggatttccaggtgac
cagggtgatccaggttctccaggaccccctggattcccaggagatgatggagcaagagga
cctaaaggatacaaaggtgaccctgccagtcagtgtggtccacctggtccaaagggtgag
ccaggtagtcctggataccaaggacgtactggagtccccggagagaaaggctttcctgga
gatgaaggaccccgaggaccaccaggcagacctggacagcctggatcttttggaccacca
gggtgtccaggtgacccagggatgcccggactcaagggacatccaggagaagtgggagac
cctgggccaagaggtgatgcgggagatttcgggaggccaggtcctgctggagtgaaaggg
cctctcgggtctcctggtctgaatggcttacatggtctgaagggtgagaaaggaacaaaa
ggcgcctcaggtttgctcgaaatgggtccacctgggccaatggggatgcctgggcaaaaa
ggagagaaaggagaccctgggagtccagggatttctcctccaggtcttcctggagaaaag
ggcttcccaggacccccagggagaccaggaccacctggtcccgcaggtgccccaggaaga
gctgctaaaggtgacattcctgatccaggtccacctggagaccggggacctcctggcccc
gatggcccaagaggagtacctgggcctccgggttcccctgggaatgttgaccttctgaaa
ggggacccaggtgactgtggtttgccaggaccacctggctcccgaggcccaccaggccct
ccagggtgtcagggtcccccaggatgtgatggcaaagatggccagaaaggaccaatggga
cttccagggttgccagggccacctggtcttcctggggcacctggcgagaagggattacct
ggccctccaggcagaaaagggccagtaggtcctccaggctgcagaggtgaacctgggcct
cctgcagacgtggattcctgtcctcgaatcccaggacttcctggagtgccaggtcccaga
ggaccagaaggagctatgggggaacctggacggagaggcctgcctggaccagggtgcaaa
ggagagcctggaccggatggcagacggggccaggatggcatccctggatctcccgggcct
cctggacgcaaaggtgacactggagaggcagggtgtcctggagcaccaggccctcctggt
ccaactggtgatcctgggcccaaggggtttggccctggatcgctcagtggcttcctcctg
gttctccacagtcagactgaccaggaaccggcctgccccgtgggcatgcctcggctttgg
acagggtacagcttgttgtacatggaaggacaggagaaggcgcacaatcaagatcttggt
ttggcaggctcttgtcttcctgtgtttagcactctgccctttgcctactgcaacatccac
caagtgtgccactacgcccagagaaatgacaggtcctactggctgtccagtgcggctcct
cttcctatgatgccactctcagaggaagagatccgctcttacatcagccgctgtgcggtg
tgtgaggccccagcacaggccgtggcggtccacagtcaggaccaatccatcccaccgtgt
ccacggacatggaggagtctctggattgggtactcatttctgatgcacactggagctggg
gaccaaggaggtgggcaggcactgatgtcacctggcagctgcctagaggacttcagagct
gcaccgttcgttgaatgccagggccgacagggaacctgtcacttttttgcaaacgagtac
agcttctggctgaccaccgtgaacccagacctgcagtttgcctctggcccgtcaccagac
accttgaaagaagttcaggcccagcgccggaaaatcagcaggtgccaggtttgcatgaag
cacagctaa

KEGG   Mus musculus (mouse): 12830
Entry
12830             CDS       T01002                                 

Gene name
Col4a5
Definition
(RefSeq) collagen, type IV, alpha 5
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12830 (Col4a5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12830 (Col4a5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12830 (Col4a5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12830 (Col4a5)
  09154 Digestive system
   04974 Protein digestion and absorption
    12830 (Col4a5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12830 (Col4a5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12830 (Col4a5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12830 (Col4a5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12830 (Col4a5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12830 (Col4a5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12830 (Col4a5)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12830 (Col4a5)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12830 (Col4a5)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12830 (Col4a5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12830
NCBI-ProteinID: NP_001156627
MGI: 88456
Ensembl: ENSMUSG00000031274
Vega: OTTMUSG00000018954
UniProt: Q63ZW6
Position
X F2; X 62.16 cM
AA seq 1691 aa
MQVRGVCLAAGWFLLALCLWGQPAEAAACHGCSSGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSPLPGPKGNQGYPGPPG
IQGPPGPKGISGPSGPPGPPGLMGPPGPPGPPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDIEFQKGDQGVPGDRGPPGPPGIPGPPGPPGGVKGEKGEPGEAGKRGKP
GKDGETGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPPGPPGLVIPAPGTGVTMGEKGN
IGLPGLPGEKGERGFPGIQGPPGFPGPPGTAVVGPPGPPGYPGERGQKGDEGPPGICIPG
SPGLDGQPGAPGLPGPPGPPGPQLPSRDEICKAGPPGPPGPPGDKGLQGERGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGIPGEKGDKGQAGITGPKGLPGIPGPPGAPGF
PGSKGDPGDVLTLPGMKGDKGELGFPGAPGLPGLPGSPGKDGLPGLPGPKGEPGGITFKG
ERGPPGSPGLPGLPGNMGPTGPVGFGPPGPIGEKGIQGVAGNPGQPGLPGPKGDPGQTIT
QPGKPGLPGSPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGPPGPPGPKGFPGI
PGPPGAPGAPGRMGPDGPPGPPGFPGPKGEPGFGLPGPPGPPGFPGFKGTPGPKGDRGFP
GPSGPPGRTGLDGLPGPKGDVGPNGQPGPVGPPGLPGIGLQGPPGPPGTPGPIGQPGLHG
IPGEKGDPGPPGFDVPGLPGERGSPGLPGAPGLIGPPGSPGVPGKAGVPGFPGVKGEMGM
MGPPGPPGPLGIPGRSGAPGLKGDDGMQGQPGLPGPAGEKGGKGEPGLPGPPGPMHPDFL
GSKGEKGEPGLPGIPGLSGPKGYQGLPGDPGQPGLSGQPGSPGLPGPKGNPGLPGPPGLT
GPPGPKGNIGDMGFPGPQGVDGPPGPPGFPGQPGSPGLPGQKGSKGEPGVSGIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGETGLPGLPGNPGAKGQPGLPGFPGTPGLPGPKGMNGPPG
NPGFPGEPGPVGAGGRPGPPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGL
SGQKGDGGSPGLPGNPGLPGSKGEPGFQGFPGMPGPPGPPGSPGQALEGPKGNPGPQGPP
GRPGPPGFQGPPGTEGPRGPPGNGGIKGERGNPGPPGQPGLPGLKGDQGPPGLPGNPGRP
GLNGMKGDPGLPGVPGFPGMKGPIGVPGSTGPDGEPGLTGPPGPPGLPGPSGQSIVIKGD
PGPPGIPGQPGLKGLPGLPGPQGLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGA
RGLDGPPGPDGLQGPPGPPGTTSVAHGFLITRHSQTTEAPQCPRGTVHIYEGFSLLYVQG
NKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMNMEPLKG
QSIQPFISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQAL
ASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDMSDMFNKPQSETLKAGDLRTR
ISRCQVCMKRT
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgcaagtgcgtggagtgtgcctggctgccggctggttcttactggccctgtgtctgtgg
gggcagcccgcagaggctgcggcctgccatgggtgttcttctggatcaaaatgtgactgc
agtggtataaaaggagaaaagggagaacgggggtttccaggtttagaaggtcatccaggt
ttaccaggatttccaggtccagaaggccctccaggacctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctgggatgccaggccatgatggggctccaggacctcaaggt
atccctggatgcaatggaaccaagggagaacgtggatttccaggcagtccgggctttcct
ggtttacaaggtcctccaggacctcctgggatcccaggtatgaagggggaaccaggcagt
ataatcatgtcaccactgccaggaccaaagggtaatcaaggatacccaggtcctcctgga
atacaaggcccacctggtcccaagggtatatcagggccaagtggtcctccaggaccacca
ggtttaatgggacctcctggcccaccaggcccaccgggaccaaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggcgaacaaggtcttcagggcccacctggtcca
cctgggcagatcagcgaacagaaaagaccaattgacatagaatttcagaaaggagatcag
ggagttcctggtgatagagggcctcctggacctccaggaatacctggtcctcccggtcct
cctggtggtgtgaaaggtgaaaagggtgagccaggagaggcaggcaaaagaggtaaacca
ggcaaagatggagagactggccaaccaggaatcccaggtttgcctggtgatcctggttac
cctggagaaccaggaagagatggtgaaaagggtcaaaagggtgatattggcccacctgga
cctcccggacttgtaattcctgcacctggaactggtgtgactatgggagaaaaaggaaat
atcgggttacctggtttgcctggagaaaagggagagcgaggatttcctggaatacaaggt
ccacctggctttcctggacctccaggaacagcagttgtgggtccccctggtcctcctgga
tatcctggtgaaaggggccagaaaggtgatgaaggtccccctggaatttgtattcctgga
tctcctggacttgatggacagcctggggctcctggccttccaggacctcctggcccccct
ggcccccagttgccatccagagatgaaatctgtaaagcaggccctcctgggcctccagga
cctccaggtgataaaggactccaaggagagcgaggagtaaaaggtgacaaaggtgatact
tgcttcaactgtattggaactggcatttcagggcctccaggccaacctggtttaccaggt
ctcccaggtcctccaggatctcttggaatccctggagagaagggggacaaaggacaagct
gggataactggtccgaaaggattgccaggcatacctggacctccaggtgctccaggcttt
ccagggtctaagggtgaccctggggatgtcctcactcttccaggaatgaaaggtgacaaa
ggagagttgggtttccctggagctccagggcttcctggtctacctggtagtcctggaaag
gatggtttgccaggactccctggccccaaaggagagcctggtggaattacttttaagggt
gaaagaggtcctcctgggagcccaggtttaccaggtctcccaggaaatatgggacctaca
ggcccagttggttttgggcctccaggccccataggcgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccgggattaccaggtcctaaaggagatccaggccagaccataacc
cagccaggaaagcctggtttgccaggtagtccaggcagagatggtgaagtgggtcttcca
ggtgatcctggacttccaggccaaccaggcctgccaggaatacctggtagcaaaggagaa
ccaggtatccctgggattgggcctcctggaccacctggtcctaaaggttttccaggaatt
ccgggacctccaggagctcctggggcccctggaagaatgggtccagatgggcctcctggg
ccaccaggcttcccaggaccaaagggagagccaggatttgggttacctgggccacctggg
ccaccaggattcccaggtttcaaaggaacacctggtccaaaaggtgatcgtggtttccca
ggaccttcaggtcctccaggacgcactggcttggatgggctacctggaccaaaaggtgat
gttggaccaaatggacaacctggcccagtagggcctccagggctgccaggaattggtctt
cagggaccaccaggaccaccagggactccagggccaataggccaacctggcttacatggg
attccaggagagaagggggacccaggacctcctgggtttgatgttccaggactccctgga
gagagaggtagtccagggctccctggagcccctggtcttataggacctccaggatcacct
ggtgtccctggaaaagcaggtgtccctggatttccaggtgtcaaaggcgaaatgggtatg
atgggacctccaggcccacctggacctttgggaattcctggcaggagtggtgctcctggt
cttaaaggtgatgatggcatgcaaggtcagccaggtctccctggccctgcaggagaaaag
ggcggtaagggagagcctggccttccaggtcctcctggaccaatgcatccagatttcctg
ggatcgaaaggagaaaagggtgagcccggcttaccaggtattcctggactttcaggacca
aaaggctaccagggattacctggagacccagggcaacctggactgagtggacaaccagga
tcaccaggactcccaggtcccaagggtaatcctggtctccctggtccaccaggacttaca
ggacctcctggaccaaaaggaaacattggtgacatgggttttcctggccctcaaggtgta
gatgggccccctggacctcctggatttcctggacaacctggctctcctgggttacctggc
cagaaaggaagtaaaggagagcctggggtttcaggcattggacttccaggtcttcctggc
ccaaagggtgaacctggtctgcctggatatccaggaaacccaggtatcaaagggtcggtg
ggagagactggtcttcctggattaccaggaaaccctggagcaaaaggtcaaccgggcctg
cctggattcccaggaacaccaggacttcctggaccaaaaggtatgaatggtcctcctggt
aaccctggctttcccggagaacctggtcctgtaggtgccggagggcgtcctggaccacca
ggacctcctggtgaaaaaggcaagccaggtcaagatggcattcctgggccagcaggacaa
aagggagaaccaggtcaaccaggctttggaatcccaggacctcctggactcccaggactt
tctggacaaaagggtgatggaggatcacctggacttccaggaaatcctggccttccaggt
tcaaagggtgagccaggctttcagggattccctggtatgccaggtcccccaggtcctcct
ggttctcctggtcaagctttggaaggtcccaaaggaaaccctgggcctcaaggccctcct
gggagaccaggtcctccaggttttcaaggtccaccaggtacagaaggtcctcgtggaccc
cctggaaatggaggtatcaaaggagagaggggaaatccaggcccacctgggcagccaggc
ttacctggtttgaaaggagatcaaggaccaccaggactcccgggtaaccctggccggcca
ggtcttaatggaatgaaaggagatcctggtctccctggtgttccaggattcccaggaatg
aaaggacccattggagtaccaggctccactggccctgatggggaaccaggtcttactggg
cccccaggtcctcctgggttgcctggtccttcaggacagagtattgtaatcaaaggagat
cctggacctccagggattcctggacaacctggattaaaaggtctaccaggacttccaggc
cctcaaggcctaccaggtccaattggccctccaggagatccaggacgcaatggactccca
ggctttgatggtgcaggagggcgcaaaggagacccagggctgccaggacagccaggtgcc
cgtggattggatggtcccccaggaccagatggattgcaaggacctccaggaccccctgga
accacctctgttgcccatggattcctcatcacacgtcacagtcaaacaacagaagcacca
caatgcccacggggaacagttcatatctatgaaggcttttctctcctgtatgtacaagga
aataaaagagcccatggtcaagacttgggtacggctggcagctgccttcgtcgcttcagt
accatgcctttcatgttctgcaacattaacaacgtttgtaactttgcttcaagaaatgac
tattcttactggctttccaccccagagcccatgccaatgaacatggaacccctgaaggga
cagagcatccagccattcattagtcgatgtgcagtgtgtgaggcgccagctgtggtgatc
gctgttcacagccagactattcagattccacactgtcctcaggggtgggactctctgtgg
attggctattccttcatgatgcatacaagtgcaggagcagaaggctctggccaagcccta
gcctctcctggctcctgtttggaagaatttcgttccgctcccttcatcgaatgtcatggg
cgagggacatgcaattactatgccaattcgtacagcttttggctggccactgttgatatg
tcagacatgttcaacaaacctcagtcagaaaccctgaaagcaggagacttgaggacgcgt
attagccgatgtcaagtatgcatgaagaggacataa

KEGG   Mus musculus (mouse): 12833
Entry
12833             CDS       T01002                                 

Gene name
Col6a1, AI747156, Col6, Col6a-1
Definition
(RefSeq) collagen, type VI, alpha 1
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12833 (Col6a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12833 (Col6a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12833 (Col6a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12833 (Col6a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12833 (Col6a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12833 (Col6a1)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12833 (Col6a1)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12833 (Col6a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12833 (Col6a1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12833 (Col6a1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 YcaO Toprim_3 ETF
Other DBs
NCBI-GeneID: 12833
NCBI-ProteinID: NP_034063
MGI: 88459
Ensembl: ENSMUSG00000001119
Vega: OTTMUSG00000020489
UniProt: Q04857 B0LAD9
Position
10 C1; 10 39.71 cM
AA seq 1025 aa
MRLAHALLPLLLQACWVATQDIQGSKAIAFQDCPVDLFFVLDTSESVALRLKPYGALVDK
VKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVK
YFGKGTYTDCAIKKGLEELLIGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKH
LGIKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMIKN
NVEQVCCSFECQAARGPPGPRGDPGYEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGMK
GEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGMKGETGYPGLPGCKGSPGFDGIQGPPGPK
GDAGAFGMKGEKGEAGADGEAGRPGNSGSPGDEGDPGEPGPPGEKGEAGDEGNAGPDGAP
GERGGPGERGPRGTPGVRGPRGDPGEAGPQGDQGREGPVGIPGDSGEAGPIGPKGYRGDE
GPPGPEGLRGAPGPVGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRGPPGLNGTKG
YPGLKGDEGEVGDPGEDNNDISPRGVKGAKGYRGPEGPQGPPGHVGPPGPDECEILDIIM
KMCSCCECTCGPIDILFVLDSSESIGLQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAG
VVQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLLPPTQNN
RIALVITDGRSDTQRDTTPLSVLCGADIQVVSVGIKDVFGFVAGSDQLNVISCQGLSQGR
PGISLVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPADITILLDSSASVGSH
NFETTKVFAKRLAERFLSAGRADPSQDVRVAVVQYSGQGQQQPGRAALQFLQNYTVLASS
VDSMDFINDATDVNDALSYVTRFYREASSGATKKRVLLFSDGNSQGATAEAIEKAVQEAQ
RAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFRVPNYQALLRGVLYQTVSR
KVALG
NT seq 3078 nt   +upstreamnt  +downstreamnt
atgaggctggcccacgctctgctgcccctgctgctacaagcctgctgggtggccacacag
gacatccagggctccaaagcgattgccttccaagactgccctgtggatctattcttcgtg
ctcgacacctcggagagtgtggccttgaggctgaaaccttatggggccttggtggacaag
gtgaagtccttcactaagcgcttcattgacaacctgagagacaggtactaccggtgtgac
cgcaacctggtttggaatgcgggtgcgctgcactacagtgacgaggtggagatcatccga
gggctcacgcgcatgcccagtggccgcgatgagctcaaggccagcgtggatgcggtcaag
tacttcgggaaaggcacctacaccgactgcgccattaagaaggggctggaggagctgctc
atagggggctcccacctgaaggagaacaagtacttgatcgtggtgaccgacgggcatcct
ctagagggctacaaggaaccatgcgggggtctggaagatgcagtaaatgaggccaaacac
ctgggcatcaaggtcttttctgtggccatcacacctgaccacctggagccacgtctaagt
atcattgccacagaccacacataccggcgcaatttcacggcagctgactgggggcatagc
cgcgatgcagaagaggtcatcagccagaccattgacaccattgtggacatgattaaaaat
aacgtggaacaagtgtgttgttcttttgagtgccaggctgccagaggacctccagggccc
cgaggcgaccctgggtatgagggggagcgaggaaagccaggtcttccgggagagaaggga
gaagctggagaccctggacgacctggggatcttggaccagtcgggtaccagggtatgaag
ggagaaaaggggagccgtggagagaagggttccagaggaccgaaaggttacaagggcgag
aaaggcaagcgcggaatcgacggggtcgacggcatgaagggagagacggggtacccagga
ctaccgggctgcaagggctccccaggatttgatggcattcaaggacccccgggtcccaag
ggtgatgctggtgcctttgggatgaagggagaaaagggtgaagctggagcagacggtgag
gctgggagaccagggaactcagggtcacctggagatgagggtgatcctggagagcctggt
ccccccggagaaaaaggagaggccggtgatgaaggaaatgctggcccagacggtgcccct
ggagagaggggtggccctggtgaaagaggacctcgggggacccctggtgtgagaggacca
aggggagacccgggtgaagctggaccacagggtgaccaaggaagagaggggcccgtcggc
atccctggagactcgggtgaggctggccccattggacctaaaggataccgaggtgatgag
ggtcctccaggtcctgagggcctcagaggagccccaggacctgttggtcctcctggagac
cccggactgatgggtgagagaggtgaggatggaccaccaggaaacggcacggaaggtttc
cccggcttccctgggtatccaggcaacagaggccctcctgggctaaatggcacaaaaggc
taccctggcctcaagggggatgagggtgaagtgggagacccaggagaggataacaacgac
atttcaccccgtggggtcaaaggggcaaagggataccgaggcccagaaggaccccaggga
cctccaggacatgtgggaccacctgggccagatgagtgtgagatcctggatatcatcatg
aaaatgtgctcctgctgtgagtgcacatgtggacccattgacatcctcttcgtgctggac
agctcggagagcattggcctacagaactttgagattgccaaggacttcatcatcaaggtc
attgaccggttgagcaaggatgagctggtcaaatttgagccagggcagtctcacgcgggc
gtggtacagtacagccacaaccagatgcaagagcacgtggacatgcggagccccaacgtc
cgcaacgcccaggacttcaaagaagctgtcaagaagctacaatggatggctggtggcaca
ttcaccggagaagcgctgcagtacacccgggaccggctactcccacccacacagaacaac
cgaattgccctggtcattacggatggacgttctgacactcaacgggacacgacacctctc
agtgtgctctgtggtgcagacattcaggtagtttctgtgggaatcaaggatgtgtttggc
tttgtggcgggctccgaccagctcaatgtcatttcctgccaaggcttatcgcaaggtcgg
ccaggtatctccctggtgaaggagaactatgcagagcttctcgatgacggctttctgaag
aacataacagcccagatctgtatagataagaagtgtccggattatacctgtccaatcaca
ttctcctccccggctgacatcaccatcctgctagacagctcagccagtgtcggcagccac
aacttcgaaaccaccaaggtcttcgccaagcgcctagctgagcgattcctgtcagcaggc
agggcggatccttcccaggatgtgcgggtggccgtggtacagtatagtggccaggggcag
caacagccaggtcgggcggctcttcagttcttacagaattacacagtgctggccagctct
gtggacagcatggatttcatcaacgacgccacagacgtcaacgatgctctgagctacgtg
actcgtttctaccgggaagcctcgtcaggtgccaccaagaagagagtgctgttgttttca
gacggcaactctcagggggccacagcagaggccattgagaaggctgtgcaggaggcccag
cgtgcaggcattgagatctttgtggtggtggtgggaccccaggtgaacgagccccacatc
cgtgtgcttgtcactggcaagactgcagagtacgacgtggcctttggcgagcgccaccta
ttccgtgtaccaaactaccaggccctgctacgtggcgtactctaccagacagtctccagg
aaggtggcactgggctag

KEGG   Mus musculus (mouse): 12834
Entry
12834             CDS       T01002                                 

Gene name
Col6a2, Col6, Col6a-2
Definition
(RefSeq) collagen, type VI, alpha 2
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12834 (Col6a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12834 (Col6a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12834 (Col6a2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12834 (Col6a2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12834 (Col6a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12834 (Col6a2)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12834 (Col6a2)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12834 (Col6a2)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12834 (Col6a2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12834 (Col6a2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 Ins134_P3_kin_N
Other DBs
NCBI-GeneID: 12834
NCBI-ProteinID: NP_666119
MGI: 88460
Ensembl: ENSMUSG00000020241
Vega: OTTMUSG00000020488
UniProt: Q02788
Position
10 C1; 10 39.32 cM
AA seq 1034 aa
MTTIKMLQGPLSVLLIGGLLGVLHAQQQEAISPQEQEAVSPDISTTERNNNCPEKADCPV
NVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQV
EVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQIRQHVGKGVVNFAVVITDGHV
TGSPCGGIKMQAERAREEGIRLFAVAPNRNLNEQGLRDIANSPHELYRNNYATMRPDSTE
IDQDTINRIIKVMKHEAYGECYKVSCLEIPGPHGPKGYRGQKGAKGNMGEPGEPGQKGRQ
GDPGIEGPIGFPGPKGVPGFKGEKGEFGSDGRKGAPGLAGKNGTDGQKGKLGRIGPPGCK
GDPGSRGPDGYPGEAGSPGERGDQGAKGDSGRPGRRGPPGDPGDKGSKGYQGNNGAPGSP
GVKGGKGGPGPRGPKGEPGRRGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEVGSKGAK
GDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGTPG
EKGEPGPPGPEGGRGDFGLKGTPGRKGDKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG
DPGLTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRL
GAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPS
ALKFAYNQLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHE
THESENLYSIACDKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPELPCQTELYVA
QCTQRPVDIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQ
NQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFV
FLTDGVTGNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSL
AQPSFFDRFIRWIC
NT seq 3105 nt   +upstreamnt  +downstreamnt
atgactactatcaagatgctccagggtcctctttctgtgctcctgattgggggactcttg
ggggtcctccatgcccagcagcaggaagccatctcaccccaggagcaggaagctgtctca
ccagacatctccaccactgaaaggaacaacaattgtccagagaaggccgactgcccagtc
aacgtgtatttcgtgttggacacctcagagagcgtggccatgcagtccccgacagacagc
ctgctctatcatatgcagcagttcgtaccgcagtttatcagccagctgcagaacgagttc
tacctggaccaggtggccctgagctggcgctacggtggtctacacttctcggaccaagtg
gaggtgttcagcccaccgggcagtgaccgggcctccttcactaagagcctacaaggcatc
cgctccttccgcaggggcaccttcactgactgtgcattggctaacatgacgcagcagatc
cggcagcacgtaggcaagggggtggtcaacttcgccgtggtcatcactgacggccacgtc
acgggcagtccgtgtgggggcatcaagatgcaggctgagcgtgcccgtgaagagggcatc
cggctcttcgctgtggcccctaacaggaacctaaacgaacaaggcctgagggacatcgct
aactctccacatgagctctaccgtaacaactacgccaccatgcgacccgactctaccgag
attgaccaggacaccatcaaccgcatcatcaaggtcatgaaacatgaagcctatggagag
tgctacaaggtgagctgcctggagattcctggaccccacggacccaagggttaccgagga
cagaagggtgccaagggcaacatgggtgaaccaggagagcctggacagaaaggacgacag
ggagaccccggcatcgaaggccccattggattcccgggaccgaagggtgtgcctggcttc
aagggagagaagggtgaatttggatcggatggtcggaagggagcgcctggcctagctggc
aagaatggaacagatggacagaagggcaaactgggccgcattgggcctcctggttgcaag
ggagaccccggaagtcggggccccgatggataccctggagaagctggaagcccaggcgag
cgaggagaccagggtgccaagggggactctggccgcccaggacgcaggggaccaccagga
gatcctggagacaaaggaagcaagggatatcaaggcaacaacggagcccctggaagcccg
ggagtgaaaggaggcaagggagggcctggcccccgtggaccaaaaggagagcctggacgc
agaggagaccccgggaccaagggcggccccggcagcgatggtccaaagggagagaaggga
gaccctggtcctgaggggcctcgaggcctggctggagaagttggcagtaaaggagccaag
ggagacagaggtttgcctggacccagaggcccccagggggctcttggagagccaggaaag
cagggatctcgaggagaccctggtgacgccggacctcgaggggattcaggacagccgggc
cccaagggcgatcctggaaggcctggattcagctacccgggacctcgagggacacccggt
gaaaaaggcgagcccggtccaccaggccctgagggaggccgaggagactttggtctgaaa
ggaacacccggacggaagggagataaaggggagccagctgatcctggtccccctggtgaa
cctggccctcgggggccaagaggaatcccaggacctgagggagaacccggccctccagga
gaccctggtctcacggaatgtgatgtcatgacctatgtgagggagacctgtggatgctgc
gactgtgagaagcgctgtggtgccctggatgtggtcttcgtcatcgacagttctgagagt
attggctacaccaacttcaccttggagaagaactttgtcatcaatgtggtcaacaggcta
ggtgccattgccaaggaccccaagtcagaaacaggcacacgtgtgggtgtggtgcagtac
agccacgagggcacctttgaggccatccggctggacgacgagcgagtcaactccctgtct
agtttcaaggaggctgtcaaaaaccttgaatggatcgccggtggcacttggacgccctct
gccctcaagtttgcctataatcagctcatcaaagaaagccggcgccagaagacccgggtg
ttcgcagtggtcatcacggatgggcgccatgacccccgagatgatgacctcaatcttcgg
gcactgtgtgaccgagatgtcactgtgacagccattggcatcggtgacatgttccacgag
actcatgagagtgagaacctctactccattgcctgtgacaagccacagcaagtgcgcaac
atgacgctgttctctgacctggtggccgagaagttcatcgatgacatggaagacgtcctt
tgtccagacccccagatcgtgtgtccagaacttccctgccaaacagagctctatgtggcc
cagtgcacacaacggcccgtggacattgtcttcctgctggatggctcggagcggctgggc
gagcagaacttccacaaggtgcggcgcttcgtggaggacgtgtcccggcgcctgactctg
gcccggagggatgatgacccactcaacgcccgcatggctctgttgcaatatggcagccag
aatcagcaacaggtggccttcccactgacctacaacgtgaccaccatccacgaggccctg
gagagggccacctacctcaattccttttctcacgtgggcacgggcatcgtacacgccatc
aacaacgtggtgcggggggcacggggtggggcgcggcgccacgcagagctctccttcgtc
ttcctcacggacggtgtcaccggcaatgacagcctggaggagtcagtgcactctatgcgt
aagcagaacgtggtgcccactgtggtcgctgtgggcggcgacgtggacatggatgtgctt
actaagatcagcctgggtgacagggcggccatcttccgggagaaagactttgacagtctg
gcccagcccagcttctttgacaggttcatccgctggatctgttag

KEGG   Mus musculus (mouse): 12835
Entry
12835             CDS       T01002                                 

Gene name
Col6a3, AI507288, Col6, Col6a-3
Definition
(RefSeq) collagen, type VI, alpha 3
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12835 (Col6a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12835 (Col6a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12835 (Col6a3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12835 (Col6a3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12835 (Col6a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12835 (Col6a3)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12835 (Col6a3)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12835 (Col6a3)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12835 (Col6a3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12835 (Col6a3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3
Other DBs
NCBI-GeneID: 12835
NCBI-ProteinID: NP_001229937
MGI: 88461
Ensembl: ENSMUSG00000048126
Vega: OTTMUSG00000022359
UniProt: Q9Z0I9
Structure
PDB: 
4IHK 4IGI

Position
1 D; 1 45.53 cM
AA seq 3284 aa
MRKHRHLPLVAVFSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSSWSAGKDRFLLVQEF
LSDVVESLAVGDNDFHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKG
LEYVIHSHLTEASGSRAADGVPQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGAD
ERALGEVASEPLSMHVFNLENVTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSA
DIIFLIDGSQNTGNANFDVIRDFLVNVLERLSVGNQQVQVGVVQYSEEPITMFSLNSYPS
KAAVLDAVKGLSLVGGESANIGQALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDE
IRDSVVALKQASVFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQILPYLVG
VAQRHIVLQPPAIVTQVMEVNKRDIVFLVDGSSSLGPSNFNAIRDFVTRVIQRLEIGQDL
VQVSVAQYADTVKPEFYLNSYTNKRDAITAVRKMRALNGSALYTGSSLDFVRNNLFTSSA
GHRAAEGVPKLLVLITGGKSLDEVSQPAQELKRGSIMALAVGSKAADEDELKEIAFDSSL
VFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVHVNKRDIIFLLDGSDNVGKNNFPYVRDF
VTNLVNSLDVGSDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGS
ALSYIHANHFTEAGGSRTREHVPQLLLLLMAGPSEDAYLQAANALVRSGVLTFCVGTNRA
DKAELEHIAFNPSLVYLMDDFRSLPSLPQQLIQPLTTYVSGGVQEVPLSQPESKRDILFL
FDGSVNVLGQFPAVRDFLYRIIEELDVKPDGTRVAIAQFSDDVRLESRFSEHQTKAEILN
LVKKMKLKTGKALNLGYALDYALRNIFVRSAGSRIEDNVQQFLVLLVAGRSSDAVAGPAS
SLKQRGVVPFIFQAKNANPSELEQIVLSPAFILAAESLPKIGDLQSQIVSLLKAEQGSGP
VSGEKDVVFLIDGSEGVRSGFPLLKDFVQRVVESLDVGPDRVRVALVQYSDRTRPEFYLN
SHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEGVPQLLIVLTAEP
SGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISE
RVIQLNREELSSLKPILTPSTGAGVGSKKDVVFLIDGSRNAGPEFQYIRTLIERIVEYLD
IGFDTTRVAVIQFSEDSKMEFPLNAHFSKDEVQNAVRRLRPKGGSQVYIGNALEYVLKNI
FQRPLGSRIEEGVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISL
SPEYVYSVSTFRELPRLEQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDS
SDAVKPDGIAHIRDFVSRIVRRLNIGPSKVRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI
RRLRFKGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVI
SSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPP
GVDLPSPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQY
NSDPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEHLLRNHFVPEAGSRLDERV
PQIAFVITGGKSVEDAQDVSLALTQKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQ
ELSELSETVLETLHDAMHETLCPGVTDVSKACNLEVILGFDGSRDQNVFVSQKGLESKVD
IILNRISQIQRISCSGNQLPTVRVSVMANTPSGPVEAFDFAEYQPELFEKFRNMRSQRPY
VLTADTLKLYQNKFRQSSPDTVKVVIHFTDGADGDMADLYRASEELRQAGAQALILVGLE
RVANLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGERGDR
GPIGSIGPKGISGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFPGEK
GELGEIGLDGLDGEEGDKGLPGSSGEKGSPGRRGDKGPKGDKGERGDVGIRGDPGDSGRD
SQQRGPKGETGDIGPMGLPGRDGIPGSPGDPGKDGGSGRRGPAGAKGNRGGPGQPGFEGE
QGTRGSQGPPGPIGPPGLIGEQGIPGPRGGGGTAGAPGERGRTGPLGRKGEPGEPGPKGS
IGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGTPGEPGADGPPGPKGIRGRR
GNSGPPGATGQKGDPGYPGPSGHKGNRGDSVDQCALIQSIRDKCPCCYGPLECPVFPTEL
AFALDTSEGVTQDTFSRMREVLLGIVGDLTIAESNCPRGARVAVVTYNNEVTTEIRFADS
KKKSALLDSIQNLQVALTSKQQSLETAMSFVARNTFKRVRSGFLMRKVAVFFSNKPTRAS
PQLREAVLKLSDAGITPLFLTSQEDRQLINALQINNTAVGHALVLPARRDLTDFLKNVLT
CHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDLAFILDSSEATTLFQFNEMKKYIG
YVIRQLDLSPDPKASQHFARVAVVQQSTYESVDNASVPPVKVEFSLTDYGAKEKLLDFLS
RRMTQLQGTMGLGNAIEYTIENIFESAPNPRDLKIMVLMLTGDMQRQQLEEAQRAILQAK
CKGYFFVVLGIGRKVNIKEVYSFASEPNDVFFKFVDKSTELNEEPLMRFGRLLPSFVSSE
NAFYLPPDLRKQCDWFQGDQPVKNGVKFGHKQINPPHTANSSLTSKVVTTMKPVTTTKPT
AIVNLPPAKPAPARPAPAQPVLAKPDPAKPAQARPAPAKPASAKLVPPQPVHVQPAPAQT
ASVRPAPAKPAPPQPAAAKPVPAKPAVPAQPAPPQPAAAKPVPAKPAVPAQPAPPQPAAA
KPVPAKPAVPAQPAAAQPMPAQPVLTKSAAVKPASANKPVAAKPVATNTATATARPALAA
KPAAAKPAATRPLAAAIRPVATKPEAPRQQAKPAATKPATTKPLARVSREVQVSEVTENS
ARLHWERPEPSSSFFYDLTVTSAHDQSLVLRQNLTVTDRVIGGLLAGQLYHVVVVSYLQS
QVRAIYQGSFNTKKTQPPPLQAAHRASSSTINLMVNTEPLFLTKTDICKLSRDAGTCVDF
KLLWHYDLESKSCKRFWYGGCGGNENRFHSQEECEKMCSPELTV
NT seq 9855 nt   +upstreamnt  +downstreamnt
atgaggaaacaccggcatctgcccctagtggctgtctttagccttctcctctcaggcatt
gccacgactcatgcccaacagcatggagatgtcaaaaacggtgcggcggctgacatagtg
tttctagtggattcctcttggagtgctgggaaggaccgtttcctgcttgttcaagagttt
ctgtctgatgttgtagagtctttagctgtgggagacaatgattttcactttgctctggtc
cggctcaacggaaacccacataccgagttcctgttaaatacataccactctaaacaagaa
gtcctttctcatattgtgaacatgtcttacattgggggaagcaaccaaactggaaaagga
ttagaatacgtaattcacagccacctcactgaggcttccggaagccgggctgctgacgga
gtccctcaggttatcatagtattaactgatggacaatcggaggatggcttcgctctgccc
tcagcggaacttaagtctgcggacgttaatgtgtttgcagtcggagttgagggtgcagat
gaaagagccctaggagaagtagcaagcgaaccactcagtatgcatgttttcaacctagag
aacgttacctcactgcatggcctagtagggaacttagtgtcctgtatacactcctctgtg
aatccagaaagggctggggacaaggaaagccttaaagacatcacagcacaagactcagct
gacattattttccttattgatggatcacagaacaccggaaatgccaatttcgatgtcatt
cgagacttccttgtaaatgtccttgagagactctcagttggaaaccagcaggtccaagtg
ggggtggtacagtatagcgaggagcccataaccatgttctccttgaatagctacccctcc
aaggctgcagttctggatgccgtgaaagggctcagcttggttgggggggagtcggccaat
ataggccaggctcttgactttgtagtggagaatcacttcactcgggcaggtggcagccgc
gtggaggaaggggtcccccaggtgctggtcctcataagtgctgggccttctagtgatgag
attcgagattcggtggtagctctgaagcaggctagtgtgttctcgtttggcctcggagcc
caggcagcctccagggcagagcttcagcacatagctaccgatgacagcttggtgtttact
gtcccggaattccgtagctttggggatctccaggagcaaatactgccgtaccttgttggt
gtggcccaaaggcacattgtcttgcaaccaccagctattgtcacacaagtcatggaagtc
aacaagagggacatagtcttcctggtggatggttcctccagcctggggccatccaacttc
aatgccatccgagacttcgttaccagagtcatccagaggttggaaattggacaggacctg
gttcaggtatcagtggctcagtatgcagacactgtcaagcctgagttctatttaaattcc
tataccaacaaaagggatgccataactgctgtacggaagatgagggccctgaatggctca
gccctgtacacgggatcttctctggactttgttcgaaacaacctcttcactagctcagcc
ggccatagggccgctgagggggttcctaagctcctggtgctgatcacaggcggtaagtcc
ctcgatgaagtcagccagcctgcccaggagctgaagagaggcagcatcatggccttggcc
gttggcagcaaggccgccgatgaggatgagctgaaagagatcgctttcgactcctccctg
gtcttcattcccgctgagttccgccctgctcccttgcaaaacatgctgcccagcctgatg
gcacctctcaggactctcactggaaccacggaagttcacgtaaacaaacgggatatcatc
ttccttttggatggctccgacaacgttgggaaaaacaatttcccttatgtgcgtgacttt
gtaaccaacctagttaacagcctcgatgttggaagtgacaatatccgtgttggcttggtg
cagtttagtgacactccggtcacggagttctctctagacacttaccagaccaagtccgag
ctgctcgctcacctgaggcgcttgcagctcaaggggggctccggcttgaacgcaggctct
gccttgagctatatccatgccaatcacttcacggaggcaggaggcagcaggactcgtgaa
cacgtgcctcagctgctgctcctgctcatggccgggccctctgaggatgcctatttgcaa
gctgccaatgccctggtgcgctcaggcgtgctgaccttctgtgtggggaccaatcgggcg
gataaggctgagttagagcatattgcttttaatcccagcctggtatacctcatggacgat
ttccgctccctgccatctttgcctcaacagctgattcagcccctaaccacatatgttagt
ggaggtgtacaggaagttccactatcccagccagagagcaagcgggacattcttttcctc
tttgacggctcagtcaatgtcttgggtcagttccctgctgtcagagactttctctacagg
atcattgaggaacttgatgtgaagccggatgggacccgggtagcaatagctcagttcagt
gatgacgtcaggctggagtcccgcttcagtgagcaccagacgaaagctgaaatccttaat
ctcgtgaagaagatgaagctaaaaacaggcaaagccctcaacctgggctacgccctggac
tacgcgcttaggaacatctttgtgaggtcggctggcagccgcatcgaggataacgtgcag
cagttcctggtgctgctggttgctggaaggtcgtcagatgctgtggccgggcctgcaagc
tccctgaagcagagaggagtggtacctttcatctttcaagccaagaacgccaaccccagt
gagctggagcagattgtgctgtcgcctgcattcatcctggctgcagagtcactccctaag
atcggagacctccaatcacagattgtgagccttctgaaagcagaacagggttcaggccct
gtatcaggtgaaaaggatgtggtgtttctcattgatggctctgagggtgtccgcagtggc
ttccccctgctaaaggactttgtgcagagggttgtggagagcttggatgtgggtcccgac
cgtgtgcgtgtggcactggtgcagtacagtgaccggaccaggccggagttctacctgaat
tcccacatggaccagcagggtgtcatcagcgccatccgcaggctgacactgctgggcggc
ccaacccccaacacaggggcggcactggagttcgtgttaaggaacattctgaccagttct
actgggagcaggatagcagaaggtgttcctcagctcctgattgtcctcacagcagagcca
tcaggggatgacgtgcgaggcccttcagtagtcctgaagcagggcggggctgtgcccatt
ggcattggcattgggaatgctgacatctctgaaatgcagaccatctccttcatccctgac
tttgctgtggccatccccaccttccgggagcttgggacgatacagcaggtcatctctgag
agggtgatccagcttaatcgtgaggagctgagctcgttgaaacccattttgacgccctca
acaggtgcaggtgttggcagcaagaaagatgtggtctttctcatcgatggatcccgaaat
gctggccctgagttccagtacatccgtaccctgattgagaggatagtcgagtacctggac
ataggctttgacaccacccgggtagccgtcatccagttcagtgaggactccaagatggag
tttcctctgaatgcccacttcagcaaggatgaagtacagaacgctgtgcgcaggctgcgg
cccaagggtgggagtcaggtctacatcggaaatgccctggagtacgtgctgaagaacatc
ttccagaggccactggggagccggatagaagagggcgttccgcagttcctagtcctcatt
tcgtctggaaagtctgatgatgaggtggacgactcagccgtggaactcaagcagttcggt
gtggcccctctcaccatagccagacacacagaccaagaggagctggtgaagatctcccta
agtcctgagtacgtgtactccgtgagcaccttcagggagctgcccagactggagcagaag
ctgctgacgcccatcaccactctaacctcacagcagatccatcagatcctggccagtacc
cgatatcccccttcagtggttgaaagcgatgcggctgacatcgtctttctgattgacagc
tctgatgccgtcaagcccgatggcatcgctcatatccgagactttgtcagcaggatcgtt
cgcagactcaacattggtcccagtaaagtgaggattggggtcgtgcagtttagcaatgat
gtcttccctgagttctacctgaagacccacaagtcccagagcagtgtgcttgaagccata
cggcgtctgaggttcaaaggagggtctcccctaaacactggcagagccctggagtttgtg
gcaagaaacctcttcgtgaagtctgctgggagccggatagaagatggggtccctcaacac
ctggtcttattcctgggtgggaagtctcaggacgatgtggccaggcatgcccaagtcata
agctcatcggggattgtgagcttaggaataggagatcggaacattgataggacggacttg
caaaccatcaccaatgaccccaggctggtcttcacagttcgggaattcagagagctgccc
aacatagaagagagggtcatgctttcctttggaccctctggggctactcctcagcctcct
ggggtagatttaccttctccttcccggccagagaagaagaaggccgacattgtgttcctg
ctggatggttccatcaattttaggagggacagtttccaggaagtgctccgttttgcatct
gaaatcgtggacacggtctatgaagacggagattccatcagagtgggactggtccagtac
aactcagaccccacggatgaattcttcctgagggacttttctaccaaaaggcagattatc
gatgccatcaacaaagtcgtctataaaggagggaggcatgccaacaccagggtgggcatc
gagcacctgctgcggaatcactttgtgccggaggctggtagccgcctggatgagcgggtt
cctcagatcgccttcgtgatcaccggagggaagtcagtggaggatgctcaggatgtgagc
ctggctctcacccagaaaggtgtcaaggtgtttgccgttggcgtgaggaacatcgactct
gaggaggtgggcaagatagcctccaacagtgccacagccttccgggtgggcagcgtgcag
gaactgtccgaactgagtgagacagtgctcgagactctgcatgatgcaatgcatgagacc
ctctgccctggtgtgaccgatgtctccaaagcctgcaatctggaagtcattctggggttt
gatggatctcgagatcagaatgtgtttgtgagtcagaagggccttgagtccaaggtggac
atcatcttgaacagaatcagccagatacaaaggatcagctgcagtggcaaccagctgccc
actgtgagggtatccgtgatggcgaacacgccctctgggccagtggaggcctttgacttt
gccgagtatcagccggagctgtttgagaagtttcgcaacatgcgcagtcagcgcccatac
gtcctcacagctgacacgctgaagttgtaccagaacaagttcaggcagtcgtcgcctgat
actgtgaaggtagtcattcacttcacggatggagcagatggagacatggctgacttatac
agagcatctgaggaactccggcaagcaggtgcccaagccctgatcctggtgggtcttgaa
cgtgtggctaacctggagcggctgatgcatctggagttcgggcggggcttcatgtatgac
aggcccctgaggctcaacctgcttgacttagactatgaactggcagagcagcttgacaat
attgctgagaaagcttgctgtggggttccatgcaagtgctctggagagaggggagacaga
gggcccattggcagcattgggccaaagggtatctccggggaagatggctaccgaggctac
ccgggagacgaaggcggacctggtgagcgaggtccacctggtgtgaatggcacacaaggt
ttccagggctgtccaggccagaggggagtgaagggttctcgaggattcccaggagagaag
ggtgaattgggagaaatcggcttggatggtctggatggtgaagagggagacaaggggctg
cctggttcttctggagaaaaggggagtcctggaagaaggggtgacaaaggacccaaagga
gacaaaggcgagagaggagatgtcgggattcgaggtgacccgggtgactcaggacgggac
agccagcagagaggacccaaaggagagacgggagacatcggccccatgggtctcccaggc
agagatggcatccctggcagcccaggagaccctgggaaggacggtggctctggccgaagg
ggccctgcaggagctaagggcaacagaggtggtcctggccagccaggcttcgaaggcgag
cagggtaccagaggctcacagggcccacctggtcccattggtcccccaggcctgattggg
gaacaaggaattcccggacctcgaggaggcggaggtacggctggggctcctggagaacgt
ggcagaactggtcctctggggaggaagggtgagccaggagagccagggccgaagggaagc
atcgggaaccgtggaccccgaggagagacgggagatgacgggagagatggggttggcagt
gaaggacgcagaggcaaaaaaggagaaagaggcttccctgggtatccgggaccaaagggc
acccctggtgagccaggggcagatggaccaccaggacccaaaggcatcagaggtcgaagg
ggaaattcaggacctccaggggcaactggacaaaagggagaccctggttacccaggacca
tctggtcacaagggcaaccgaggcgactccgttgatcaatgcgccctcatccagagcatc
agagacaaatgcccttgctgctacgggcccctggagtgccctgtattcccaacagaactc
gcctttgccctggatacctctgagggggtgactcaggacaccttcagccggatgcgagag
gtgcttctgggcattgtgggcgatctgaccattgcagagagcaactgtccgcggggtgcc
cgcgtggctgtggtcacctataacaatgaggtgactacagagatccggtttgctgactcc
aagaagaagtctgccctcttagacagtattcagaacctccaagtggccttgacctctaag
cagcagagcctggagaccgcaatgtcattcgtggccaggaatacattcaaacgtgtgagg
agtggcttcctgatgaggaaagtggctgttttcttcagcaacaagcccaccagggcatcc
ccccagctccgagaggccgtactcaagctttccgatgcggggatcacacccttgttcctt
acaagccaggaggaccgacagctcatcaacgctttgcagatcaataacacagcagtggga
catgccctggttctgcctgcaaggagggatctcacagacttcctgaagaatgtccttacc
tgccatgtttgcttggacatttgcaatatcgacccatcctgtggcttcggcagctggagg
ccttccttcagggaccggagggcagcaggcagtgatgtggacatagacctggctttcatc
ttggacagctcagaggctaccacactgttccagttcaatgagatgaagaaatacataggc
tatgtgatcagacagctggacctgagcccagacccgaaggcttcccaacactttgccagg
gtggctgttgtgcagcagtcaacctacgagtctgtggacaatgccagtgtgccgcctgtg
aaggtggaattctcactgacagactatggtgccaaggaaaagctgttggatttccttagc
cggaggatgacccaacttcagggaaccatgggcctgggcaatgccattgaatacactata
gagaacatctttgaaagtgctccaaacccacgggacctcaaaattatggtgctgatgttg
actggtgacatgcagaggcagcagctggaggaggcccagagagccatcctccaggccaag
tgcaagggttacttctttgtggttctgggcattggcaggaaggtgaatatcaaggaggtc
tacagctttgccagtgagcccaatgatgtcttcttcaaatttgtggacaaatcaactgag
ctcaacgaggagcctctgatgcgctttgggaggctgctcccatcctttgtcagcagtgaa
aatgcattttacctgcctccagatctcaggaaacagtgtgattggttccaaggggaccaa
ccagtgaagaatggcgtgaagtttggtcacaaacagataaaccctccacatactgctaat
tcaagtcttacatccaaagttgtgaccacaatgaagccagtgacaacgaccaaaccaaca
gcaattgtaaacctgccacctgctaagccagccccagcgaggccagcccctgctcagcca
gttcttgctaagccagaccccgcaaaaccagcccaagcaaggccagcccctgctaagcca
gcttctgctaagttggtgcctcctcagccagtccatgttcagccagcccctgcccagaca
gcttctgtaagaccagcccccgctaagccagcccctcctcagccagctgctgctaagcca
gtccctgctaagccagcagtccctgctcagccagcccctcctcagccagctgctgctaag
ccagtccctgctaagccagcagtccctgctcagccagcccctcctcagccagctgctgct
aagccagtccctgctaagccagcagtccctgctcagccagccgctgctcaaccaatgcct
gctcagccagtccttactaaatcagccgctgtgaaaccagcctctgccaacaagcctgtg
gcagccaaacctgtggccacgaacacagccacagccacagccagacctgcattagcagcg
aagcctgctgcagcgaagcctgcagcaacgagaccccttgctgctgccataaggccagtt
gctaccaagccagaggcccccaggcaacaggccaaaccagctgccaccaagccagccact
actaagcccttggcgagagtgtcccgagaagtccaggtctctgaagtcacggagaacagt
gccagactccactgggagcggcccgagccttccagctcctttttttatgacctcactgta
acctcggcccacgaccagtccttggttctgagacaaaacctcactgtcactgaccgtgtc
atcggaggcctactggctggacagctgtaccatgtcgtagtggtctcctacctgcagtct
caggtcagggccatctaccaaggaagtttcaatacgaagaaaacccagcctccacctctt
caggcagcacaccgagcatccagttcaaccatcaacctcatggtgaacacagaaccattg
tttctcactaaaacagatatatgtaagctgtccagagatgctgggacttgtgtggacttc
aagttactatggcactatgacctagagagcaaaagttgcaagagattctggtatggaggt
tgtggaggcaacgagaacagattccactcccaggaagaatgtgaaaagatgtgtagtcct
gagttaacagtttga

KEGG   Mus musculus (mouse): 12839
Entry
12839             CDS       T01002                                 

Gene name
Col9a1, Col9, Col9a-1
Definition
(RefSeq) collagen, type IX, alpha 1
  KO
K08131  collagen type IX alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12839 (Col9a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12839 (Col9a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12839 (Col9a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12839 (Col9a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12839 (Col9a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mmu00535]
    12839 (Col9a1)
Proteoglycans [BR:mmu00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   12839 (Col9a1)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 12839
NCBI-ProteinID: NP_031766
MGI: 88465
Ensembl: ENSMUSG00000026147
Vega: OTTMUSG00000022796
UniProt: Q05722 G3X995 Q8BSQ4 Q9CT83 Q8CEP0
Position
1 A5; 1 9.95 cM
AA seq 921 aa
MKNFWKISVFFCVCSCLGPWVSATLKRRARFPANSISNGGSELCPKIRIGQDDLPGFDLI
SQFQIEKAASRRTIQRVVGSTALQVAYKLGSNVDFRIPTRHLYPSGLPEEYSFLTTFRMT
GSTLEKHWNIWQIQDSAGREQVGVKINGQTKSVAFSYKGLDGSLQTAAFLNLPSLFDSRW
HKLMIGVERTSATLFIDCIRIESLPIKPRGQIDADGFAVLGKLVDNPQVSVPFELQWMLI
HCDPLRPRRETCHELPIRITTSQTTDERGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGP
PGPPGPPGDPGKPGAPGKPGTPGADGLTGPDGSPGSVGPRGQKGEPGVPGSRGFPGRGIP
GPPGPPGTTGLPGELGRVGPIGDPGKRGPPGPPGPPGPSGTIGFHDGDPLCPNSCPPGRS
GYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGDQGPPGPQGLRGITGIVGDK
GEKGARGFDGEPGPQGIPGAAGDQGQRGPPGETGPEGDRGIQGSRGIPGSPGPKGDTGLP
GVDGRDGIPGMPGTKGEAGKPGPPGDVGLQGLPGVPGIPGAKGVAGEKGNTGAPGKPGQL
GSSGKPGQQGPPGEVGPRGPRGLPGSRGPVGPEGSPGIPGKLGSVGSPGLPGLPGPPGLP
GMKGDRGVFGEPGPKGEQGASGEEGEAGARGDLGDMGQPGPKGSVGNPGEPGLRGPEGIR
GLPGVEGPRGPPGPRGMQGDQGATGLPGIQGPPGRAPTDQHIKQVCMRVVQEHFAEMAAS
LKRPDTGASGLPGRPGPPGPPGPPGENGFPGQMGIRGLPGIKGPPGALGLRGPKGDLGEK
GERGPPGRGPKGLPGAIGLPGDPGPASYGKNGRDGEQGPPGVAGIPGVPGPPGPPGPPGF
CEPASCTLQSGQRAFSKGPDK
NT seq 2766 nt   +upstreamnt  +downstreamnt
atgaagaacttctggaaaatctcagttttcttctgtgtgtgcagttgtctgggaccctgg
gtatccgcaactcttaagcgtcgtgcaagatttcctgccaattccatttctaatggtgga
agtgaactgtgtcccaagatcaggattggccaagatgacttaccagggtttgacctgatt
tctcagttccaaattgagaaggctgcatctcgaaggactatccagagggtggtgggatcc
acagccttacaagtggcttacaagctgggaagtaatgtagacttcaggattccaacaagg
catttgtatcccagtggactgcctgaagaatattcctttttaactactttccggatgact
ggaagcacacttgaaaagcactggaacatttggcagatccaggattccgcagggagagag
caagttggcgtgaagataaatggccaaacaaagtctgtcgcattttcatacaagggactg
gatgggagtctccaaacggctgccttcttgaatttgccgtccttgtttgactcccggtgg
cataagctcatgattggcgtggaaagaacaagcgccactctttttattgactgcatcagg
atcgaatctttacctataaagccaagaggccagattgatgcggatggctttgcagtgctg
ggaaaacttgtggacaatcctcaggtttctgttccttttgaactccagtggatgctgatt
cattgtgaccccctgagacccaggagagaaacctgtcatgagctgccaatcagaatcaca
accagccagaccactgatgagagaggtcctccgggtgagcaggggcctccagggcctcct
gggcctcctggagttccgggcatagatggcattgatggtgaccgaggtccaaagggtccc
ccaggacctccgggtcctcctggagacccaggcaagccaggagcaccaggcaagccaggc
acaccaggagctgatggattaacaggacctgatggatcccctggctctgttggaccaagg
ggacaaaaaggagaacctggtgtacctgggtctcgtggatttccaggccgtggtattcca
ggaccccctggtcctcctgggaccacaggacttcctggagaacttggccgagtaggccct
attggagaccctgggaaaagaggaccacctggccctcctggacctccaggacccagtgga
acaattggatttcatgatggagacccattgtgccccaattcctgcccaccaggtcgctct
ggatatccaggcctaccaggcatgaggggccataaaggggcgaaaggagaaatcggcgag
ccaggaagacaaggacacaagggtgaagagggtgaccagggggaactgggagaagttggc
gatcaaggacctccaggacctcagggtctgagaggcatcactggcatagttggagacaaa
ggagaaaagggtgctcggggatttgatggagagcctggacctcagggcattccaggtgca
gctggtgatcaaggacagcgaggccctccaggagaaacaggtcctgagggagacagaggc
attcaaggttcccgaggaattcctggatccccagggccaaaaggagacacgggcttgcca
ggtgtagatggccgagatggaataccaggaatgcccgggacaaagggtgaagcagggaag
cctggaccccccggggacgtgggattgcagggcttaccaggtgtccctggaatccctggt
gcaaaaggtgttgccggtgaaaagggtaacacgggtgctccaggaaagcccggtcagttg
ggaagttcagggaaaccaggccaacaagggcccccaggagaggttggacctcggggaccc
aggggccttccaggcagtagaggcccggtaggaccagaagggtctccaggcataccaggg
aaactgggatctgttggcagccctggccttcctggcttgcctgggccccctggacttcct
ggaatgaaaggagacaggggtgtatttggtgaaccgggtcccaagggtgaacagggtgcc
tctggtgaagaaggtgaagcaggagcaaggggtgaccttggagatatgggacaacctggc
ccaaagggatctgtgggtaaccccggggagccgggtctgagggggcctgaaggaatcaga
gggcttcctggagtggaaggaccaagaggaccacctggaccccggggcatgcagggagat
cagggtgccactgggctgcctggtatccagggccctccgggcagagcgccgaccgaccag
cacatcaagcaggtttgcatgagagtcgtgcaagagcattttgcggaaatggcagccagc
ctcaagagaccagacacaggagcctctggtcttcctgggaggcctggcccccctgggcct
ccgggcccccctggagagaatggtttccctggtcagatgggaatccgtggtctcccaggc
attaagggtccccctggtgctcttggcttaagaggacctaaaggagacttgggagaaaaa
ggagaacgtggtcctccaggaagaggtcctaagggtttgcctggagcgataggtctccca
ggtgacccaggccctgccagctatgggaaaaatggccgtgacggagagcaaggtcccccg
ggagtggcaggaattcctggtgtgcctggacccccaggtcctccaggccctcctgggttc
tgtgagccagcctcttgcaccctgcagtctggtcaaagagcatttagcaaagggccggac
aagtga

KEGG   Mus musculus (mouse): 12840
Entry
12840             CDS       T01002                                 

Gene name
Col9a2, AI427499, Col9, Col9a-2
Definition
(RefSeq) collagen, type IX, alpha 2
  KO
K08131  collagen type IX alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12840 (Col9a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12840 (Col9a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12840 (Col9a2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12840 (Col9a2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12840 (Col9a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mmu00535]
    12840 (Col9a2)
Proteoglycans [BR:mmu00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   12840 (Col9a2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 12840
NCBI-ProteinID: NP_031767
MGI: 88466
Ensembl: ENSMUSG00000028626
Vega: OTTMUSG00000008902
UniProt: I7HJR1
Position
4 D2.2; 4 56.78 cM
AA seq 688 aa
MTAVPAPRSLFVLLQVLWLALAQIRGPPGEPGPPGPPGPPGVPGSDGIDGDKGPPGKVGP
PGSKGEPGKPGPDGPDGKPGIDGLMGAKGEPGPVGTPGVKGQPGLPGPPGLPGPGFAGPP
GPPGPVGLPGEIGTPGPKGDPGPEGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPAG
VKGPQGLQGVKGHPGKRGILGDPGRQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAG
PKGEMGPRGYKGMVGSIGAAGPPGEEGPRGPPGRAGEKGDVGSQGARGPQGITGPKGTTG
PPGIDGKDGTPGIPGMKGSAGQVGRPGSPGHQGLAGVPGQPGTKGGPGDKGEPGQQGLPG
VSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQG
LPGIKGDKGSPGKTGPRGGVGDPGVAGLPGEKGEKGQSGEPGLKGQQGVRGETGYPGPSG
DAGAPGVQGYPGLPGPRGLVGDRGVPGQPGRQGVVGRAASDQHIVDVVLKMIQEQLAEVA
VSAKREALGAAGMVGLPGPPGPPGYPGKQGPNGHPGPRGIPGIVGAVGQIGNTGPKGKRG
EKGDRGEMGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGRPGPVGLP
GFCEPAACLGASAYTSARLTEPGSIKGP
NT seq 2067 nt   +upstreamnt  +downstreamnt
atgaccgccgtgcccgcgccccgcagcctgttcgttctcctccaggtgctgtggctcgcc
ctggctcagatcagaggtccaccaggggagccgggtcctccagggccaccagggccacca
ggagtgcctggttcagatggcatcgacggtgacaaggggcctccaggtaaagttggtcct
ccgggatccaaaggagagcctggcaaacctggcccagatggtccagatgggaaacctggc
attgatggtttaatgggagccaagggagagcctggtcccgtggggacccctggagtcaag
ggccagcctgggctcccagggccccctggcctgccgggccctggttttgctggacctccc
ggaccccctggacctgttggcctccctggtgagattggaaccccaggccccaagggggat
ccaggaccagagggaccatcagggcccccagggccccctgggaaaccaggccgaccagga
accatccagggcttggaagggagtgcggatttcttgtgtccaaccaactgtccagctggt
gtgaagggtcctcagggtttgcaaggagtgaagggtcatccaggcaaacgggggattctg
ggtgatcctggtcgccaggggaagccaggtcccaagggagatgtgggtgcctctggagag
caaggcatccctggaccaccgggcccccaaggcatcaggggctacccgggcatggcagga
cccaagggagagatgggtcctcgtggctataaaggcatggtgggctccatcggggctgct
gggccaccgggtgaagaaggcccaagagggccaccaggccgagccggtgagaagggggat
gtggggagccaaggtgccagaggaccccaggggataacaggcccaaagggaacaaccggt
ccaccaggcattgatggcaaggatgggaccccaggcattcctggcatgaagggcagtgca
ggacaagtgggacggccaggaagcccaggccaccagggcttagcgggtgtgccgggtcag
cctggaacaaaaggaggtcctggagacaagggtgaaccaggccagcagggcttgccggga
gtctctggtccccctgggaaagaaggggagccagggcctcgaggagaaattggtccacag
ggcatcatgggacagaagggtgaccagggcgagagggggccagtgggacagccaggccct
caaggacgacagggccccaaaggagagcagggccctccaggaattccaggaccccaaggc
ttgccaggcatcaaaggagataagggttccccagggaagaccgggccccgaggcggagtg
ggtgacccgggggtggccggcctcccgggagagaaaggagagaagggccaatcaggcgag
ccagggcttaagggacagcaaggagtccgtggagagaccggctaccccggccccagcgga
gatgccggtgccccaggagtgcagggctaccccgggcttcccgggccccgaggactggtg
ggagatcgaggcgtgccaggacaacccgggagacagggtgtggtgggccgagccgccagt
gaccagcacatcgtggatgtggtgctgaagatgattcaagagcaacttgctgaggtagct
gtgagtgccaagcgagaagccctgggtgcagcagggatggtgggccttccaggacctcct
gggccccctggatatccaggcaaacaggggcccaatgggcatcccggcccccgaggcatt
cctggcatcgtgggagcagtgggtcagattggcaacactggacccaagggaaagcgtgga
gagaagggggatcgaggagaaatgggtcgtggccatcccgggatgcctgggcccccaggg
atcccaggtcttcctggccggcctggccaggcaatcaatggcaaggatggggaccgagga
tccccaggggctccaggagaggctgggagacctggccggccaggcccagtggggctacca
ggtttctgtgagcctgccgcatgcctgggggcttcagcctatacctctgctcgcctcaca
gagcctgggtccatcaaggggccatga

KEGG   Mus musculus (mouse): 12841
Entry
12841             CDS       T01002                                 

Gene name
Col9a3, AV006866
Definition
(RefSeq) collagen, type IX, alpha 3
  KO
K08131  collagen type IX alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12841 (Col9a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12841 (Col9a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12841 (Col9a3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12841 (Col9a3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12841 (Col9a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mmu00535]
    12841 (Col9a3)
Proteoglycans [BR:mmu00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   12841 (Col9a3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 12841
NCBI-ProteinID: NP_001365706
MGI: 894686
Ensembl: ENSMUSG00000027570
Vega: OTTMUSG00000016344
Position
2 H4; 2 103.14 cM
AA seq 719 aa
MTGAPALALLLLGQLLTATSAQKVGPRGPPGPQGPPGKPGKDGIDGEAGPPGLPGLPGPK
GTSGKPGKPGEAGLPGLPGVDGLTGRDGPAGPKGAPGERGSLGPPGPPGLGGKGLPGPPG
EAGVSGLPGGIGLRGPPGPSGLPGLPGLPGPPGPPGNPGVLPEGATDLQCPAICPPGPPG
PPGMPGFKGPTGYKGEQGEVGKDGEKGSPGPPGPPGIPGTVGLQGPRGLRGLPGPLGPPG
DRGPIGFRGPPGTPGAPGKVGDRGERGPEGFRGPKGDLGRPGPKGIPGMAGPGGEPGMPG
KDGKDGVPGLDGEKGEAGRNGGQGEKGPNGLPGLPGRAGSKGEKGEPGRTGELGEAGPSG
EPGIPGDVGVPGERGEAGHRGSVGALGPQGPPGAPGIRGFQGQKGSTGDPGLPGPQGLRG
DVGDRVRVVLLHLPALTSLHRLKPAAAHSFPRGWQGWQHQLCGQGPGGATGPKGDQGIAG
SDGLPGDKGELGPNGPVGQKGESGSRGELGPKGIQGPNGTSGVQGVPGPPGPLGLQGVQG
VPGITGKPGVPGKEASEQRIRELCGGMISEQIAQLAAHLRKPLAPGSIGRPGPAGPPGPP
GPPGSIGHPGARGPPGYRGPTGELGDPGPRGSQGDRGDKGATGAGLDGPAGDQGYQGPQG
VPGISKDGRDGAHGEPGLPGDPGLPGAAGAQGTPGICDTSACQGAVLGGGGEKSGPRSS
NT seq 2160 nt   +upstreamnt  +downstreamnt
atgaccggagcgcctgccctagccctgctgctgctggggcagctcctgacggccacctcc
gcgcagaaagtgggaccccgaggcccccctggtccccaagggcctcctggaaaacccggc
aaggatggcattgatggagaagctggccctccaggtctgcctggccttccaggacccaaa
gggacctcagggaagccagggaagccgggagaggcaggactgccaggactgcctggtgta
gatggtctgacagggagagatggacccgcaggacccaaaggtgcccctggagaacgggga
agtctaggacccccagggccaccaggacttgggggcaaaggcctccctggacctcctgga
gaggcaggagtgagtggcctcccaggtgggattggtctacgtggccccccgggaccctct
ggacttccaggactgcctggcctcccaggacctcctggacctcctggaaaccctggagtc
ctccctgaaggtgctactgatctgcagtgtcccgccatctgcccgccaggccctccggga
cccccaggaatgccggggttcaaggggcctactggctacaaaggggaacaaggagaagtt
ggcaaagatggtgagaagggtagtcctggcccccccgggcctcctggaatcccaggcacc
gtggggctacagggcccacgaggattaagaggacttccagggccactcgggccccctggg
gaccggggtcccattgggtttcggggcccccctgggaccccaggagcacctgggaaagtg
ggtgacaggggtgaaaggggaccagaagggttccgtggccctaagggtgacctgggcagg
cctggtcccaaaggaatccctggaatggctgggccaggcggagaaccaggcatgccaggc
aaggatggcaaagatggtgtgccgggacttgatggtgagaagggagaggctggtcgcaat
ggtggccaaggagagaaaggccccaatgggctgccggggctccctggacgagcagggtcc
aaaggcgagaagggagaaccgggtagaactggcgagctgggtgaggctggcccctctgga
gagccaggtatccctggagatgttggtgttccaggggagcgtggtgaggctggtcacagg
ggctccgtgggagctcttggcccacaaggtcctcctggtgctcctggcatccgtggcttt
cagggacaaaaaggcagcacaggagaccctggccttccgggcccccaaggcctacgggga
gatgtgggtgaccgggtaagggttgttctcttgcacctccctgctctaaccagtctacat
agactcaagccagcagcagcccactccttcccacgtgggtggcagggatggcagcaccaa
ctatgtggacagggtccaggaggtgccacaggccctaagggagaccagggcattgcaggt
tccgatggtcttccaggggacaaaggcgagctgggtcctaatggccccgttggacaaaaa
ggagagtctggcagccgaggggaactgggccccaaaggcatccagggccccaatggcacc
agtggagtccagggtgtacctggtcccccaggtccactgggcctccaaggtgtgcagggt
gtcccaggcatcaccgggaagcctggagttccgggcaaggaagccagtgaacagcgcatc
agggagctatgcgggggtatgatcagtgagcaaattgcacagttggctgcacacctgagg
aagcccttagcaccaggctccatcggcaggcctggtccagctggccccccaggccctcca
ggccctccaggctctattggccaccctggcgctcggggtccccctggataccgtggtccc
actggggaactaggagatcctggaccaagagggagccagggtgacagaggggacaaggga
gcaacaggtgcagggctggatgggcctgctggggaccagggctaccaagggcctcaaggt
gtacctggcatcagcaaagatggtcgagatggtgctcatggtgagcctggccttcccggt
gatcctggccttcctggagctgctggtgctcagggtaccccagggatctgcgacacttca
gcctgccaaggagctgtgttagggggaggtggggaaaagtcgggtcctcggagctcataa

KEGG   Mus musculus (mouse): 12842
Entry
12842             CDS       T01002                                 

Gene name
Col1a1, Co, Col, Col1, Col1a-1, Cola-1, Cola1, Mov-, Mov-13, Mov13
Definition
(RefSeq) collagen, type I, alpha 1
  KO
K06236  collagen type I alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04611  Platelet activation
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05205  Proteoglycans in cancer
mmu05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12842 (Col1a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12842 (Col1a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12842 (Col1a1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    12842 (Col1a1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12842 (Col1a1)
  09154 Digestive system
   04974 Protein digestion and absorption
    12842 (Col1a1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    12842 (Col1a1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12842 (Col1a1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12842 (Col1a1)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    12842 (Col1a1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12842 (Col1a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12842 (Col1a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12842 (Col1a1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 12842
NCBI-ProteinID: NP_031768
MGI: 88467
Ensembl: ENSMUSG00000001506
Vega: OTTMUSG00000002059
UniProt: P11087
Position
11 59.01 cM; 11 D
AA seq 1453 aa
MFSFVDLRLLLLLGATALLTHGQEDIPEVSCIHNGLRVPNGETWKPEVCLICICHNGTAV
CDDVQCNEELDCPNPQRREGECCAFCPEEYVSPNSEDVGVEGPKGDPGPQGPRGPVGPPG
RDGIPGQPGLPGPPGPPGPPGPPGLGGNFASQMSYGYDEKSAGVSVPGPMGPSGPRGLPG
PPGAPGPQGFQGPPGEPGEPGGSGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQG
ARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERG
RPGPPGTAGARGNDGAVGAAGPPGPTGPTGPPGFPGAVGAKGEAGPQGARGSEGPQGVRG
EPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPSGPPGPKG
NSGEPGAPGNKGDTGAKGEPGATGVQGPPGPAGEEGKRGARGEPGPSGLPGPPGERGGPG
SRGFPGADGVAGPKGPSGERGAPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDG
KTGPPGPAGQDGRPGPAGPPGARGQAGVMGFPGPKGTAGEPGKAGERGLPGPPGAVGPAG
KDGEAGAQGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPG
PSGARGERGFPGERGVQGPPGPAGPRGNNGAPGNDGAKGDTGAPGAPGSQGAPGLQGMPG
ERGAAGLPGPKGDRGDAGPKGADGSPGKDGARGLTGPIGPPGPAGAPGDKGEAGPSGPPG
PTGARGAPGDRGEAGPPGPAGFAGPPGADGQPGAKGEPGDTGVKGDAGPPGPAGPAGPPG
PIGNVGAPGPKGPRGAAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRG
ETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERG
FPGLPGPSGEPGKQGPSGSSGERGPPGPMGPPGLAGPPGESGREGSPGAEGSPGRDGAPG
AKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAGPQG
PRGDKGETGEQGDRGIKGHRGFSGLQGPPGSPGSPGEQGPSGASGPAGPRGPPGSAGSPG
KDGLNGLPGPIGPPGPRGRTGDSGPAGPPGPPGPPGPPGPPSGGYDFSFLPQPPQEKSQD
GGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKS
GEYWIDPNQGCNLDAIKVYCNMETGQTCVFPTQPSVPQKNWYISPNPKEKKHVWFGESMT
DGFPFEYGSEGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKALLLQ
GSNEIELRGEGNSRFTYSTLVDGCTSHTGTWGKTVIEYKTTKTSRLPIIDVAPLDIGAPD
QEFGLDIGPACFV
NT seq 4362 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttaggggccactgccctcctgacg
catggccaagaagacatccctgaagtcagctgcatacacaatggcctaagggtccccaat
ggtgagacgtggaaacccgaggtatgcttgatctgtatctgccacaatggcacggctgtg
tgcgatgacgtgcaatgcaatgaagaactggactgtcccaacccccaaagacgggagggc
gagtgctgtgctttctgcccggaagaatacgtatcaccaaactcagaagatgtaggagtc
gagggacccaagggagaccctggcccccaaggcccaaggggacccgttggcccccctgga
cgagatggcatccctggacagcctggacttcctggtcctcctggtccccctgggcccccc
ggaccccctggccttggaggaaactttgcttcccagatgtcctatggctatgatgaaaaa
tcagctggagtttccgtgcctggccccatgggtccttctggtcctcgtggtctccctggc
ccccctggtgcacctggtccacaaggtttccaaggcccccctggtgaacctggcgagcct
ggcggttcaggtccaatgggtccccgaggtccccctggccctcctggcaagaatggagat
gatggggaagctggcaagcccggccgtcctggtgagcgtggacctcctggacctcagggt
gctcgtggattgcctggaacagctggcctccctggaatgaagggacaccgaggcttcagt
ggtttggatggtgccaaaggagatgctggtcctgctggtcctaagggagagcccggcagt
cctggtgaaaacggagctcctggccagatgggtccccgaggtctgcccggtgagagaggt
cgccctggacctcctggcactgctggtgctcgcggtaacgatggtgctgttggtgctgct
ggaccccctggtcccaccggccccactggccctcctggcttccctggtgcagttggtgct
aagggtgaagctggtccccaaggagctagaggctctgaaggtccccagggtgtgcgtggt
gagcccggaccccctggccctgctggtgctgccggccctgctggaaaccctggtgctgat
ggacaacctggcgctaaaggtgccaatggtgctcctggtattgctggtgctcctggcttc
cctggtgcccgaggcccctctggaccccagggccccagcggccctccaggtcccaagggt
aacagtggtgaacctggtgctcctggcaacaaaggagacactggtgccaaaggagaaccc
ggtgctactggagttcaaggtcccccaggccctgccggagaagaaggaaaacgaggagcc
cgtggtgagcctggaccttccggactgcctggacctcctggcgagcgtggtggacctggt
agccgtggtttccctggtgctgatggtgttgctggccccaagggtccttccggtgaacgt
ggtgctcccggacctgctggtcccaaaggttctcctggtgaagctggtcgccccggtgaa
gctggtctccctggtgccaagggtctcactggcagtcctggcagccctggtcctgatggc
aaaaccggcccccctggtcccgctggtcaagatggtcgccctggacccgcaggtcctcct
ggagcccgtggccaggctggtgtgatgggattccctggacctaagggtaccgctggagaa
cctggaaaggctggagagcgaggccttcccggaccccctggcgctgttggtcctgctggc
aaagatggagaagctggagctcagggagcccctggccctgctggtcctgctggtgagaga
ggtgaacaaggtcccgctggctcccctggattccagggtcttcctggtcctgccggtcct
cctggtgaagcaggcaagcctggtgaacagggtgttcctggagaccttggtgcccctgga
ccctctggcgcaagaggcgagagaggtttccctggtgaacgtggtgtacaaggtccccca
ggtcctgctggtccccgaggaaacaatggtgcccccggcaacgatggtgccaagggtgat
actggtgcccccggagctcccggtagccagggtgcccccggtcttcagggaatgcctggt
gaacgtggtgcagctggtcttccaggtcctaagggtgacagaggtgatgctggtcccaaa
ggtgctgatggttctcctggtaaagatggtgcccgtggtctgactggtcccattggtcct
cctggccctgctggtgcccctggtgacaagggtgaagctggtcccagtggtcctcccggt
cccaccggagcccgtggtgctcccggagaccgtggtgaggctggtccccctggtcctgct
ggctttgccggcccccctggtgctgatggccaacctggtgcgaaaggtgaacctggtgat
actggtgttaaaggtgatgctggtcctcctggccctgctggtcctgctggaccccccggc
cccattggtaacgttggtgctcctggacccaaaggtcctcgtggtgctgctggtccccct
ggtgctactggcttccctggtgctgctggccgtgtcggtccccctggtccctctggaaat
gctggaccccctggccctcccggtcccgttggcaaagaagggggcaaaggtccccgtggt
gagactggccctgctggacgtcctggtgaagttggtcccccaggtccccccggtcctgct
ggtgagaaaggatctcctggtgctgatggacctgctggctctcctggtacccctggacct
cagggtattgctggacaacgtggtgtggtcggtcttcccggtcagagaggagaaagaggc
ttccctggtcttcctggcccctctggtgaacctggcaaacaaggtccttctggatcaagt
ggtgaacgcggtccccctggccccatggggccccctggattggctggtccccctggtgaa
tctggacgtgagggatcccctggtgctgaaggctcccctggaagggatggtgctcccggg
gccaagggtgaccgtggtgagactggccccgctggcccccctggtgcccctggtgctccc
ggtgctcccggccctgttggtcccgctggcaagaatggcgatcgtggtgagactggtcct
gctggtcctgctggtcccattggccctgctggtgcccgtggccctgctggaccccaaggc
ccccgtggtgacaagggtgagacaggcgaacaaggtgacagaggcataaagggtcatcgt
ggcttctctggtctccagggtcctcctggttctcctggttctcctggtgaacaaggcccc
tctggagcttcaggtcctgcaggcccccggggtccccctggctctgctggttctcctggc
aaagacggactcaacggtctccctggccccattggtccccctggtcctcgaggtcgcact
ggtgacagcggccctgctggtccccccggccctcctggaccccctggccctcctggacct
cccagtggcggttatgacttcagcttcctgcctcagccacctcaagagaagtctcaagat
ggtggccgctactaccgggccgatgatgctaacgtggttcgtgaccgtgaccttgaggtg
gacaccaccctcaagagcctgagtcagcagattgagaacatccgcagccccgaaggcagc
cgcaagaaccctgcccgcacatgccgcgacctcaagatgtgccactctgactggaagagc
ggagagtactggatcgaccctaaccaaggctgcaacctggacgccatcaaggtctactgc
aacatggagacaggtcagacctgtgtgttccctactcagccgtctgtgcctcagaagaac
tggtacatcagcccgaaccccaaggaaaagaagcacgtctggtttggagagagcatgacc
gatggattcccgttcgagtacggaagcgagggctccgaccccgccgatgtcgctatccag
ctgaccttcctgcgcctaatgtccaccgaggcctcccagaacatcacctatcactgcaag
aacagcgtagcctacatggaccagcagactggcaacctcaagaaggccctgctcctccag
ggatccaacgagatcgagctcagaggcgaaggcaacagtcgcttcacctacagcaccctt
gtggacggctgcacgagtcacaccggaacttggggcaagacagtcatcgaatacaaaacc
accaagacctcccgcctgcccatcatcgatgtggctcccttggacattggtgccccagac
caggaattcggactagacattggccctgcctgcttcgtgtaa

KEGG   Mus musculus (mouse): 12843
Entry
12843             CDS       T01002                                 

Gene name
Col1a2, AA960264, AI325291, Co, Col, Col1, Col1a-2, Cola-2, Cola2, oim
Definition
(RefSeq) collagen, type I, alpha 2
  KO
K06236  collagen type I alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04611  Platelet activation
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05205  Proteoglycans in cancer
mmu05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12843 (Col1a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12843 (Col1a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12843 (Col1a2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    12843 (Col1a2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12843 (Col1a2)
  09154 Digestive system
   04974 Protein digestion and absorption
    12843 (Col1a2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    12843 (Col1a2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12843 (Col1a2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12843 (Col1a2)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    12843 (Col1a2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12843 (Col1a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12843 (Col1a2)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12843 (Col1a2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 12843
NCBI-ProteinID: NP_031769
MGI: 88468
Ensembl: ENSMUSG00000029661
Vega: OTTMUSG00000021300
UniProt: Q01149 Q3TX57
Position
6 A1; 6 1.81 cM
AA seq 1372 aa
MLSFVDTRTLLLLAVTSCLATCQYLQSGSVRKGPTGDRGPRGQRGPAGPRGRDGVDGPMG
PPGPPGSPGPPGSPAPPGLTGNFAAQYSDKGVSSGPGPMGLMGPRGPPGAVGAPGPQGFQ
GPAGEPGEPGQTGPAGPRGPAGSPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLP
GFKGVKGHSGMDGLKGQPGAQGVKGEPGAPGENGTPGQAGARGLPGERGRVGAPGPAGAR
GSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGEVGLPGLSGPV
GPPGNPGTNGLTGAKGATGLPGVAGAPGLPGPRGIPGPAGAAGATGARGLVGEPGPAGSK
GESGNKGEPGSVGAQGPPGPSGEEGKRGSPGEAGSAGPAGPPGLRGSPGSRGLPGADGRA
GVMGPPGNRGSTGPAGIRGPNGDAGRPGEPGLMGPRGLPGSPGNVGPSGKEGPVGLPGID
GRPGPIGPAGPRGEAGNIGFPGPKGPSGDPGKPGERGHPGLAGARGAPGPDGNNGAQGPP
GPQGVQGGKGEQGPAGPPGFQGLPGPSGTTGEVGKPGERGLPGEFGLPGPAGPRGERGTP
GESGAAGPSGPIGSRGPSGAPGPDGNKGEAGAVGAPGSAGASGPGGLPGERGAAGIPGGK
GEKGETGLRGDTGNTGRDGARGIPGAVGAPGPAGASGDRGEAGAAGPSGPAGPRGSPGER
GEVGPAGPNGFAGPAGAAGQPGAKGEKGTKGPKGENGIVGPTGSVGAAGPSGPNGPPGPV
GSRGDGGPPGMTGFPGAAGRTGPPGPSGIAGPPGPPGAAGKEGIRGPRGDQGPVGRTGET
GASGPPGFVGEKGPSGEPGTAGAPGTAGPQGLLGAPGILGLPGSRGERGLPGIAGALGEP
GPLGISGPPGARGPPGAVGSPGVNGAPGEAGRDGNPGSDGPPGRDGQPGHKGERGYPGSI
GPTGAAGAPGPHGSVGPAGKHGNRGEPGPAGSVGPVGAVGPRGPSGPQGIRGDKGEPGDK
GHRGLPGLKGYSGLQGLPGLAGLHGDQGAPGPVGPAGPRGPAGPSGPVGKDGRSGQPGPV
GPAGVRGSQGSQGPAGPPGPPGPPGPPGVSGGGYDFGFEGDFYRADQPRSQPSLRPKDYE
VDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWNSDYYWIDPNQGCTMDAIKVY
CDFSTGETCIQAQPVNTPAKNSYSRAQANKHVWLGETINGGSQFEYNVEGVSSKEMATQL
AFMRLLANRASQNITYHCKNSIAYLDEETGSLNKAVLLQGSNDVELVAEGNSRFTYSVLV
DGCSKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFRVEVGPVCFK
NT seq 4119 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactctgttgctgcttgcagtaacttcgtgcctagca
acatgccaatatttgcaatcgggatcagtacgaaagggccccactggagacagaggacca
cgtggacaaaggggtcccgcaggtccccgaggcagagatggtgttgatggtcccatgggc
cctcctggtccccctggctcccctggtcctcccggctcccctgccccccctggtcttact
gggaactttgctgctcagtattctgacaaaggagtttcatctggccctggaccaatgggt
ttaatgggacccagaggccctcctggtgctgttggagcccctggccctcaaggtttccaa
ggacctgctggtgaacctggtgagcctggtcaaacgggtcctgcaggtccccgtggtcca
gctggctctcctggcaaggctggtgaggatggtcaccctggaaaacccggaagacctggg
gagagaggagtcgttggaccacagggtgctcgtggtttccccgggactcctggacttcct
ggcttcaaaggcgtgaaaggacacagtggtatggatggattaaagggacagcctggtgca
cagggtgtgaagggtgaacccggtgcccctggagagaatggaactccaggtcaagcagga
gcccgagggcttcctggtgagagagggcgtgtcggagctcctggtccagctggtgcccga
ggcagtgatggaagcgttggccccgtcggccctgctggtcccattgggtctgctggaccc
cctggtttcccaggtgctcctggtcccaagggtgaacttgggcccgttggtaaccctggc
cctgctggtcctgctggtccccgtggtgaagtgggtcttccaggtctttctggccctgtt
ggacctcctggcaaccctggaacaaatgggctcactggcgccaagggtgctactggactc
cctggtgtcgctggggctcccggtctccctggcccccgtggtattcctggccccgctggt
gctgctggtgctacaggtgccagaggacttgttggtgagcctggtcctgctggctccaaa
ggagaatccggtaacaagggtgagcctggctctgttggagcccaaggtcctcctggtccc
agcggtgaagaaggaaagagagggtctcccggagaagctggatctgctggccctgcaggg
cctccagggcttagaggcagccctggttctcgaggtcttcctggagctgatggcagagct
ggtgtaatgggtcctcctggcaatcgtggttcaaccggccctgctggaatccgaggtcct
aatggagatgccggtcgacctggggaacctggtctcatgggacccagaggtcttcctggg
tctcctggaaatgttggcccatctggtaaagaaggccctgtgggtctccctggcattgat
ggcagacctggcccaatcggcccagctggaccaagaggtgaagctggcaacatcggattc
cctggacccaaaggcccctctggtgatcctggcaaacctggtgagagaggacaccccggt
cttgctggtgctcggggagctccaggacccgatggcaacaatggagctcagggcccccct
ggaccccagggtgttcaaggtggcaaaggtgaacaaggccctgctggtcctcctggcttc
cagggtctccctggtccctcaggtactactggagaagttggcaagcccggagaaaggggt
cttcctggtgaattcggtctccctggtcctgctggtccaagaggagaacgtggtaccccg
ggtgagagtggagctgctggcccttctggtcctattggaagccgaggtcccagtggagcc
ccagggcctgatggaaacaagggtgaagctggtgcagtcggtgctccaggcagtgctggt
gcctctgggcctggtgggcttccaggagagaggggtgctgctggcatacctgggggcaaa
ggagaaaagggtgaaactggtctccgaggtgacactggcaacactggtagagatggtgct
cgtggcattcctggtgctgtaggtgcccctggtcctgctggggcctcaggtgaccggggt
gaagctggtgctgccggtccttctggcccagctggtcctcggggtagccctggtgaacgt
ggtgaagttggccctgctggccccaatggatttgctggtcctgctggtgctgctggccaa
cccggtgctaaaggagagaagggaaccaaagggcctaagggtgaaaatggcattgttggt
ccaaccgggtctgttggagctgctggcccatctggtcctaatggcccacctggtcctgtt
ggaagtcgtggtgatggagggccccctggtatgactggcttccctggtgctgctggacgg
actggtccccccggacccagtggtattgctggcccccctggcccccctggtgctgctgga
aaggaagggattcgtggtccccgtggcgaccaaggtccagttggccgaactggagaaaca
ggtgcatctggaccccctggatttgttggtgaaaagggtccctctggagaacctggtacc
gcaggagcccctggtaccgcaggtcctcagggtcttcttggtgctcccggtattctgggt
ctccctggctctagaggtgaacgtggtctgccaggtattgctggtgctctgggtgaacct
ggtcctctgggcatctcaggccctcctggtgcccgtggtccccctggtgctgtgggcagc
cctggagttaatggtgcccctggtgaagctggtcgtgatggtaaccctggcagcgatggt
cccccaggtcgtgatggtcagcctggacacaagggagagcgtggttaccctggcagcatt
ggtccgactggtgctgcaggtgcccctggtcctcacggttctgtgggtcctgctgggaaa
catggaaaccgaggtgaacctggtcctgccggttctgttggtcctgttggtgctgttggt
ccaagaggtcctagtggtcctcaaggcatccgaggtgacaaaggagaacctggtgataaa
gggcacaggggtcttcctggcttaaaaggatacagtggattgcagggtcttcctggtctt
gctggcctacatggtgaccaaggagctcctggtcctgtgggtcctgctggtcccaggggt
cctgctgggccttctggacccgttggcaaagatggtagatctgggcaacctggcccagtc
gggcctgctggtgttcgtggttctcagggtagccaaggtcccgctggtccccctggtccc
cctggacctcctggtcctcctggtgtcagcggaggtggctatgactttggttttgaagga
gacttctacagggctgaccagcctcgctcccagccttcactcagacccaaggactatgaa
gttgatgcaactctgaaatctctcaataaccagattgagacccttctcactcctgaaggc
tctagaaagaaccctgctcgcacgtgccgggacttaagactcagccacccagagtggaac
agcgattactactggattgaccctaaccaaggatgtactatggatgccatcaaagtgtac
tgtgatttctctactggtgaaacctgcatccaggcccaacctgtaaacaccccagcgaag
aactcatacagccgcgcccaggccaacaagcatgtctggttaggagagactatcaatggt
ggcagccagtttgaatacaacgtagaaggggtgtcctccaaggaaatggcaactcagctc
gccttcatgcgcctgctagccaaccgtgcttctcagaacatcacctaccactgcaagaac
agcattgcgtacctggatgaggagacgggcagcttgaacaaggctgtgcttctgcagggt
tccaacgatgttgaacttgttgctgagggcaacagcaggttcacctactctgtcctagtc
gatggctgctccaaaaagacaaatgaatggggcaagacaatcattgaatacaaaacaaat
aagccatctcgcctgccgttccttgacattgcacctctggacattggtggtgctgaccaa
gaattccgtgtggaggttggccccgtctgtttcaaataa

KEGG   Mus musculus (mouse): 245026
Entry
245026            CDS       T01002                                 

Gene name
Col6a6, E330019B14, E330026B02Rik
Definition
(RefSeq) collagen, type VI, alpha 6
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    245026 (Col6a6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    245026 (Col6a6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    245026 (Col6a6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    245026 (Col6a6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    245026 (Col6a6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    245026 (Col6a6)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    245026 (Col6a6)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   245026 (Col6a6)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   245026 (Col6a6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   245026 (Col6a6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Integrin_beta
Other DBs
NCBI-GeneID: 245026
NCBI-ProteinID: NP_001096077
MGI: 2444259
Ensembl: ENSMUSG00000043719
Vega: OTTMUSG00000049418
UniProt: Q8C6K9
Position
9; 9 F1
AA seq 2265 aa
MLLVLCLTMICFHVCVNQDSGPEYADVVFLVDSSDHLGLKSFPLVKTFIHKMISSLPIEA
NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA
PTNGRDKKQFPPILVVLASAESEDDVEEAAKALREDGVKIISVGVQKASEENLKAMATSQ
FHFNLRTARDLSVFAPNMTEIIKDVTQYREGMADDIIVEACQGPSVADVVFLLDMAINGS
QEDLDHLKAFLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSP
QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT
IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER
TETLKSACVDTEEADIYLLIDGSGSTQPTDFHEMKTFLSEVVGMFNIAPHKVRVGAVQYA
DTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKERGSKVPCHL
VVLTNGMSRDSVLGPAHKLREENIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDALRN
IRNQVVQEICAEEACRDMKADIMFLVDSSGSIGPENFSKMKMFMKNLVSKSQIGADRVQI
GVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN
VRKFLILITDGEAQDIVRDPAIALRKEGVIIYSVGVFGSNVTQLEEISGKPEMVFYVENF
DILQHIEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDFMIGLVKKADV
GKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTE
ARGSRLHKGVPQVLIVITDGESHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAG
SSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKG
FLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGD
ALRKVKYYFQPDMGSRINAGTPQVLLVLTDGRSQDEVAQAAEELRHKGVDIYSVGIGDVD
DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTSGGESNCFVDVVVGFDISSLQRGQ
TLLEGQPWMGSYLQDLLRAISSLNGVSCEVGTETQVSIAFQVTNAMERYPSKFEIYSENI
LSSLQGVTVNGPSRLNANLLSSLWDTFQNKSAARGKVVLLFSDGLDDGIEKLEQKSDELR
KEGLNALITIAVDGAADSSDLADLLYIEFGKGFEYRTQFTIGMRNLGSQLSRQLINVAER
TCCCLLCKCTGGDGAMGDPGSAGKKGPPGFKGSDGYLGEEGIAGERGASGPMGEQGTKGC
FGAKGPKGTRGLSGEEGEVGEDGLDGLDGEQGDHGIPGRRGEKGDEGSQGNPGRRGAAGD
RGAKGLRGDPGTPGRDSSIQGPKGLKGDLGRQGRRGWPGSPGTPGSRRKMVVHGRRGHIG
PQGNPGTPGPDGLAGSPGLRGPQGPRGEVGEKGEKGSLGMKGPQGPPGPGGQAGSQGHLG
SQGNKGEPGDLGEKGAAGFPGPRGLQGDDGSPGYGSIGRKGTKGQEGFPGESGLKGDIGD
PGDPGEAGPKGARGKTVSAGIPGEPGSPGEPGPPGRKGVKGARGLASFSTCDLIQYVRDH
SPGRHGKPECPVHPTELVFVLDQSRDVTEQDFERMKGMMVSLVRDVKVREANCPVGARVA
ILAYNSHTRHLIRFSDAYRKDQLLTAIKALPYERSSDSREIGKAMRFISRNVFKRTLPGA
HVRRIATFFSSGPSADAQTITTAAMEFSALDIVPVVIAFSNVPSVKRAFSIDDTGTFQVI
VVPSGSDEGPALERLQRCTFCYDLCKPDASCDQAKPPPIQSYLDTAFLLDGSRHVGSAEF
EDMRDFLEALLDHFEITSEPETSVTGDRVALLSHAPLDFLPNTQRSPVRTEFNLTSYSSK
RLMKRHVDQAVQQLHGDAFLGHALGWALDNVFLNTPNLRRNKVIFVISAGETSHLDAETL
KKESLRAKCHGYALFVFSLGPDWDDKELEDLASHPVDQHLIQLGRIHKPDHGYSVKFVKS
FINSIRHGINKYPPVNLKAKCNRLGSRDLKPPPRQFRSFVPGPQKANLKDHTAEAAKLFQ
DKKRLSSMLKGGRATISSLSRSTRYAFKQGKEAIKATSKLGKRSA
NT seq 6798 nt   +upstreamnt  +downstreamnt
atgctgctggttttgtgcctgacaatgatttgtttccacgtgtgtgtgaaccaagattct
ggccccgagtacgcagacgtggtgtttctggtggacagctccgatcacctagggcttaag
tcctttcctcttgtgaaaactttcatccacaagatgatcagcagcctccccatagaggcc
aacaagtaccgcgtggccctggcccagtacagcgatgctctccacaatgagttccagctg
ggcaccttcaagaacaggaaccccatgctgaaccacctgaagaagaacttcgggttcatc
ggtggctccctgaagatagggaacgccctgcaggaggctcacaggacctatttctctgct
cccacaaatggaagagacaagaaacagttccccccaatcctggtggtgctggcttcagca
gagtctgaggatgatgtggaagaggctgcgaaggccctgcgggaagatggggtgaaaatc
atctctgtgggggtgcagaaggcttctgaggaaaacctgaaggcgatggccacctctcag
ttccatttcaatctcaggactgccagagacctcagcgtgtttgccccaaacatgacagag
atcatcaaggatgtgactcagtacagggaaggaatggcagatgacattattgtagaagcc
tgccaaggcccttctgtggctgatgtggtgttcctgttggatatggccatcaacggcagc
caggaggacctagatcatcttaaagcattcctgggcgaaagcatctctgccctggacata
aaggaaaattgcatgagggttggcctggtgacctatagcaatgagacaagggtgatcagc
tctctgagcacgggtaacaacaagacagaagtcttgcagcgcatacaggatctgtcccct
caagtagggcaggcctacactggagctgccctcagaaagactaggaaggaaatcttcagt
gcacagaggggcagtcggaagaaccaaggggtccctcagatcgctgtgctggtgacccac
agagcatcagaagacaacgtgaccaaggcagctgtcaacctccggcgggagggagtgacc
atctttaccatgggcatagagggggctaacccagacgagctggagaagatcgcatcccac
cctgcggagcagttcacctccaaactgggcaacttctctgagctggccacccacaaccag
acgttcctgaagaaactgcggaaccaaatcacacacacggtctctgtcttctcagaacgg
actgagaccctcaaatctgcctgtgtggacacagaggaagccgatatctatctactcatt
gatggttcagggagcacccagcccacagacttccatgaaatgaagaccttcctgtcagag
gtggtaggcatgttcaacattgctccccacaaggtgcgagtaggggccgtgcagtacgcc
gacacctgggacttggaatttgagatctctaagtatagtaacaagcctgacttgggaaag
gccatcgagaatatcaggcagatgggtgggaataccaacacaggggcggctttgaacttc
acactgaagctgttgcaaagagcaaagaaggaacgaggaagcaaagtgccgtgtcacctg
gtggttctgaccaatggcatgtctcgggacagcgtcctggggcctgcccataagctgaga
gaggaaaacatcagagtgcatgcgatcggtgtcaaggaagccaaccaaacgcagcttcgg
gagatagcgggagaggaaaagcgagtttactacgtccatgagttcgatgccttgaggaac
ataaggaaccaagtggttcaggagatctgtgctgaagaagcctgcagagacatgaaagcg
gacatcatgtttctggtggacagctctggcagcatcggacctgaaaacttcagcaagatg
aagatgtttatgaagaacctggtgagcaaatcccagatcggggctgaccgggtgcaaatt
ggcgtggtccagttcagccacgaaaacaaggaggagtttcagctcaacacgttcatgtct
caaagtgacatcgccaacgccattgaccgaatgactcacattggagaaacaaccttgacg
ggcagtgccctgacctttgtgtctcagtacttcagtcccgataagggggccaggcccaat
gtcaggaagttcctcattcttatcacggatggtgaggctcaggacatagtaagggaccca
gcgatcgcccttcgaaaagaaggtgtgattatctattctgtgggagtattcggctccaat
gtcacccagcttgaggagatcagtggaaagccagagatggttttctatgttgagaatttt
gacattctgcagcatatcgaagatgacctcgttctggggatctgcagtccccgtgaagaa
tgcaagcggattgaagttttggatgtggtgtttgtcatcgatagctccggcagcattgac
tatcaagaatataacatcatgaaggacttcatgattggcttggtgaaaaaagctgacgtg
ggcaagaatcaggtccggtttggagccctgaagtatgctgatgaccccgaagtgctgttt
tacctggatgaactaggcacgaagctggaggtagtttcagtgctccagaatgaccatccc
atgggtggaaatacttacaccgctgaggccctcgccttctccgatcacatgttcaccgaa
gcccggggcagccgtctgcacaagggagtcccccaagtcctcattgtgattaccgacggg
gaatctcatgacgcagagaagctcaacaccaccgccaaggccctgagagacaaaggcatt
ctcgtcctggctgtggggattgccggtgccaacagctgggagctcttggccatggcaggg
tcaagcgacaagtactactttgtagagaccttcggaggcctgaagggaatattttccgat
gtgtcagccagtgtctgtaactcttcaaaagttgattgtgaaattgaaaaggttgacctt
gtattcctcatggatggttcaaacagcatccatccggatgacttccagaagatgaagggg
tttttggtgtcggtcgttcaagacttcgatgtcagcctcaacagagtccgcataggcgtg
gcacagttcagcgacagctacaggtcagagtttctgctggggacgttcaccggggagagg
gagatatccacccagattgagggcatccagcagatctttggatacacccacatcggagat
gctctcaggaaggtgaagtattactttcagccagacatgggcagcaggatcaacgcaggt
accccccaggtgctgctggtcctcacagatggccggtcccaagacgaggtagctcaggcc
gccgaggagctgagacacaaaggtgtggacatctactcggtgggcatcggggatgtggat
gaccaggaattggtccagatcacggggaccgcggagaaaaaactgaccgtgcataacttc
gacgagctaaaaaaggtgaagaaaaggatcgttcggaacatctgtacctcaggtggtgag
agcaattgctttgtggatgtcgtggttggatttgacatctcaagcctgcagagagggcag
actctgctcgaaggtcagccttggatgggatcctacctccaagacctcttacgtgccatc
agctccctcaatggggtaagctgtgaagtgggcacagagactcaggtgagcatagctttt
caagtgacgaacgctatggaaagatacccttccaagtttgagatctacagtgagaacata
ctaagcagcctgcagggtgtgaccgtgaatggcccgtctcgcctcaacgccaacctgctg
agttctctgtgggatacgtttcagaataagtcagctgctcgtgggaaggtggtccttcta
ttttcagatggactggatgatggcattgagaaacttgaacagaagtctgatgaactcagg
aaggaaggcctgaatgccctcataaccattgctgtggatggagctgccgattccagtgac
ctggcggacctcctctacattgaatttgggaaaggatttgaatacaggacacagttcaca
attggaatgaggaaccttgggagtcagctgtcaaggcaactaatcaatgttgcagagagg
acctgctgctgtctgctctgcaagtgcacaggaggggacggcgccatgggggatccaggc
tcagcaggaaaaaagggacccccaggattcaaaggcagtgatggctacctcggagaggag
ggcattgctggagaaagaggagcctctgggccaatgggagagcaaggcacgaagggatgc
ttcggtgccaaagggcctaagggaaccagaggactcagtggagaagagggtgaagtcggg
gaagatgggcttgatggattagatggagaacagggtgaccatgggattcctggaagaaga
ggagaaaagggtgacgagggatctcagggaaacccaggcaggagaggggctgccggtgac
cgtggagcaaagggactgcgaggagatccgggaactcctggacgtgacagtagcatacaa
ggacccaagggcttgaaaggagaccttggaagacaaggcagaagaggctggccaggctct
cctggaacaccaggctcaagaagaaagatggtagttcatggccgaagaggacacataggc
ccacagggaaatccaggaaccccgggcccagatggacttgcaggttcgccaggacttaga
ggcccgcagggtccaagaggagaggtcggtgagaaaggagaaaaaggaagtctgggaatg
aaaggtcctcaggggcctccaggacccggaggacaagctgggagccaaggccatttggga
agccaaggaaacaaaggagaacctggagacctgggagaaaagggagctgctggctttcca
ggtcctcggggcctgcagggtgacgacggcagcccgggatatggtagcattggccgcaaa
gggacaaaggggcaagaaggattccccggagagagtggactgaagggtgatattggggac
cctggtgatccaggagaagctggtcccaagggagccagaggcaaaacggtatctgctggg
attccaggagagccggggtcccccggggagccaggaccccctggacggaagggtgtgaaa
ggggcccgaggactggcttcgttttccacatgtgacctcattcagtacgtgcgggaccac
agtcctggcagacatggaaaacctgagtgcccagtgcatccaactgagttggtgtttgtc
ctggaccagtcccgggatgtcacagagcaggactttgagcgaatgaaggggatgatggtc
tccctagttagggatgtcaaggttagggaggccaactgccccgtgggtgcccgcgttgcc
atcctagcctacaactcccacaccaggcacctcatccgcttctcagatgcctacaggaag
gaccagctgctcacggcgattaaagctcttccttatgagagatcctccgacagcagggag
attggcaaagccatgaggttcatctccaggaatgtcttcaagagaacccttccaggggct
catgtgagaagaatcgccacgttctttagcagtggtccatcggccgacgcccagaccatt
accacagccgccatggagttcagtgccctggacattgtacctgtggtgattgcgttcagc
aacgtgccctcggtcaagcgcgccttttcaattgatgacactggcacattccaagtcatt
gtggttccctccgggtctgatgaagggccagcgttagagagactccagcggtgcactttc
tgctatgatctatgcaagccggatgcttcgtgtgaccaagccaaaccaccccccattcag
tcctacctggacactgcattcctactggatggctctcggcacgtgggaagtgcagagttt
gaagacatgagagactttctagaggcactgctagatcacttcgaaatcacttcggagcca
gagacatctgtcactggagacagggtggccctgctgagccatgcccccctcgacttccta
cccaacacccagaggagtccggttagaactgagttcaacctaaccagctatagcagtaag
cgcctcatgaagagacacgtggatcaggcagtccagcagctgcatggagatgcttttctt
ggccatgccctcgggtgggcactggacaacgtctttttaaacacccccaatttgagacga
aacaaagtcatatttgtgatatctgctggggaaaccagccacctggatgcggaaacctta
aagaaagagtccctacgagccaaatgtcatgggtatgccctctttgtcttttccttgggc
cctgactgggatgacaaggaactggaagacctggccagccaccccgtagaccagcacttg
atccagcttggccgaattcataaacctgaccacggatacagtgtgaagtttgtgaagtct
tttataaactccatcaggcatgggataaacaagtatccaccggtcaacctcaaagccaag
tgcaacagactcggctctagggatctgaagccgcccccacgccagtttcgaagctttgtt
ccaggaccacagaaagccaacctcaaagaccacacagcagaggcagcaaagctctttcaa
gataaaaaacgcctttcaagtatgctgaaagggggcagagctactatttcaagtctttcc
agaagcacacgctatgcctttaaacaggggaaagaagcaataaaagccacttctaaactc
ggtaaacgaagtgcctga

KEGG   Mus musculus (mouse): 665033
Entry
665033            CDS       T01002                                 

Gene name
Col6a5, Col, Col29a1, EG665033, Gm7455
Definition
(RefSeq) collagen, type VI, alpha 5
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    665033 (Col6a5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    665033 (Col6a5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    665033 (Col6a5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    665033 (Col6a5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    665033 (Col6a5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    665033 (Col6a5)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    665033 (Col6a5)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   665033 (Col6a5)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   665033 (Col6a5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   665033 (Col6a5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Radical_SAM VWA_CoxE Copine
Other DBs
NCBI-GeneID: 665033
NCBI-ProteinID: NP_001161395
MGI: 3648134
Ensembl: ENSMUSG00000091345
Vega: OTTMUSG00000049419
UniProt: A6H584 A0A140T8W1
Position
9 F1; 9
AA seq 2640 aa
MKLRLIAFVLILWTETLADQSPGPGPEYADVVFLVDSSNYLGIKSFPFVRTFLNRMISSL
PIEANKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRT
YFSAPTNGRDKKQFPPILVVLASAESEDDVEEAAKALREDGVKIISVGVQKASEENLKAM
ATSQFHFNLRTARDLGMFAPNMTRIIKDVTQYREGTTVDLITAVAPTTPAAPATPAAPTI
PAALTTAANHVDKTVPFPTSCQKDSLADLIFLVDESVGTTQNLRDLQNFLENVTSSVDVK
DNCMRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSE
SSGSRKAQGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYP
SRQSISTHSSYSHLESYSGNFLKKIRNEIWTQVSTRAEQMELDKTGCVDTKEADIYFLID
GSSSIRKKEFEQIQIFMSSVIDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKA
VFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPCYLIVLTDGKSNDSVLEPANRLRA
EQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEKGCEDMKAD
IMFLVDSSGSIGPTNFETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTH
EEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVRKFLILLTDGKAQDEVGGPA
TALRSKSVTIFSVGVYGANRAQLEEISGDGSLVFHVENFDHLKAIESKLIFRVCALHDCK
RIELLDIVFVLDHSGSIGPREQESMMNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYL
NTYSSGSAIAEHLRRPRDTGGETYTAKALQHSNVLFTEEHGSRLTQNVRQLMIVITDGVS
HDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVDNFDKLRDIYLPLQ
ETLCNNSQETCNLPEADVIFLCDGSDMVSDSEFVTMTTFLSDLIDNFDIESQRMKIGMAQ
YGSRYQEIIELESSLNKTQWKSQVHSVAQSKGLPRLDFALKHVSDMFDPSVGGRRNAGVP
QTLVVITSSSPRYDVTDAVKVLKDLGICVLALGIGDVYKEQLLPITGNSEKIITFRDFNK
LKNVDVKKRMVREICQSCGKANCFVDVVVGFDISTHRQGQPLFQGHPRLESYLPGILEDI
TSIRGVSCGAGAEAQVSLAFKVNSDQEFPAKFQIYQKAAFDSLLHVTVRGPTHLDAPFLQ
SLWDMFEERSASRGQVLLIFSDGLQGESITLLERQSDRLREAGLDALLVVSLNTFGHDEF
SSFEFGKGFDYRTQLTIGMLDLGKTLSQYLGNIAERACCCTFCKCPGIPGPHGTRGLQAS
KGSSGPKGSRGHRGEDGDPGRRGEIGLQGDRGVVGCPGTRGQKGVKGFSGAQGEHGEDGL
DGLDGEEGFYGFRGGKGQKGDPGNQGYPGIRGAAGEDGEKGFPGDPGDPGKDSNIKGQKG
EKGERGRQGITGQKGTHGRPSSKGSRGMEGQRGPQGPSGQAGNPGPQGTQGPEGLQGSQG
SSGNRGGKGDKGSQGYQGPQGSPGPAGPRGDIGRPGFGGRKGEPGVPGGPGPVGPPGQRG
KQGDYGIPGYGQTGRKGVKGPTGFPGDPGQKGDAGNPGIPGGPGPKGFKGLTLSQGLKGR
SGLQGSQGPPGRRGPKGTAGQPIYSPCELIQFLRDHSPCWKDKCPVYPTELVFALDQSSG
ITERRFNETRDTITSIVSDLNIRENNCPVGARVAVVSYDSDTSYLIRGSDYHNKKHLLQL
LSQIKYQVPRKARDIGNAMRFVARNVFKRMSAGTNTRRVAVFFSNGQAASRASILTATME
LSALDISLAVFAYNERVFLDEAFGFDDTGTFQVIPVPPVGDYEPLEKLRRCTLCYDKCFP
NTCAEEPFFPENSYMDVAFLLDNSKNIASDDFQAVKALVSSVIDSFHITSNPSASESGDR
VALLSYSPSESSRRKGRVKTEFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAMEG
LFLGTPNPRKHKVIIVISAGENHEEKEFVKTVALRAKCQGYVVFVISLGSTQRDEMEELA
SYPLDHHLIQLGRMYKPDLNYIVKFLKPFIYSVRRGFNQYPPPTLKDDCRLVELERGDTL
PHGLRLTAKLREVPESTISLADQELNAGKDSSFVLEDHRGDHLVYVPSQMLEPHKLVSHY
GNDRESVAMASLTSEHESHGREELGLAHEPGDASLQEYYMDVAFLIDASQRVGGRNEFKE
VRTLITSVLDYFHIAPAPLTSVLGDRVAVLTYSPPGYLPNTEECPVYLEFDLVTYNTVHQ
MKHHLQESLQQLNGDVFIGHALQWTVDNVFVGTPNLRKNKVIFIVTAGETNPLDKEVLRN
ASLRAKCQGYSIFVFSFGPIHNDMELEELASHPLDHHLVRLGRVHRPDLDYVIKFIKPFV
HSIRRAINKYPGRDLQAKCDNLTFPGPENAGTEDSALLIPEVYRIEAGENELSGDSGSQE
QHFFLLGNSHGNHSESTADLMRQLYLLLSSGELMVNDKEEPCSAETPAPVNSKQDGEDAR
NT seq 7923 nt   +upstreamnt  +downstreamnt
atgaagctccggctaattgcgtttgtcttaatcctctggactgaaaccctggcagaccag
agcccagggccaggccccgagtacgcagacgtggtgtttctggtggacagctccaattac
ctgggaattaagtccttcccatttgtgagaacttttctcaacaggatgatcagcagcctc
cccatagaggccaacaagtaccgcgtggccctggcccagtacagcgatgctctccacaat
gagttccagctgggcaccttcaagaacaggaaccccatgctgaaccacctgaagaagaac
ttcgggttcatcggtggctccctgaagatagggaacgccctgcaggaggctcacaggacc
tatttctctgctcccacaaatgggagagacaagaaacagttccccccaatcctggtggtg
ctggcttcagcagagtctgaggatgatgtggaagaggctgcgaaggccctgcgggaagat
ggggtgaaaatcatctctgtgggggtgcagaaggcttctgaggaaaacctgaaggcgatg
gccacctctcagttccatttcaatctcaggactgccagagacctcggcatgtttgcccca
aacatgacacggatcatcaaggatgtgactcagtacagggaagggacaactgtggactta
atcacagctgtagccccaaccacacctgcagccccagccacacctgcagccccaaccata
cctgcagctctaaccacagctgccaaccacgttgataagacagttccctttcccacctct
tgccagaaggactcccttgccgaccttatattcttggtggatgagtcagtggggaccaca
cagaatctgagggacctgcagaacttcctggagaatgtcaccagctctgtggatgtgaag
gacaactgcatgcggcttgggctgatgagcttcagcgacagggctcagactatttcttcc
ctgaggtcaagcgccaaccagtctgaattccagcagcaaatccagaaactctctctccag
actggggcatctaatgttggggccgccattgagcagatgaggaaagaaggcttctcagag
tccagtggcagcaggaaggcccagggcgtcccccagattgctgtgctggtcacccacaga
gcatcagatgacatggtgcgggaggctgcactggatctgcgcttggagggtgtaactatg
tttgcaatgggcatcgaaggggccaacaacacgcaattagaagacatagtgtcataccct
tccaggcagagcatttccacacacagctcctacagtcacttggaatcatacagtgggaac
ttcctgaagaagatccgcaatgaaatctggacccaagtttctactcgtgctgaacaaatg
gagctggataaaaccggctgtgtggatacaaaggaggccgacatctattttctcattgat
gggtcgtccagcatccgtaagaaagagtttgagcaaatccagatatttatgtcatctgtg
atagacatgtttcctatcggccccaacaaagttcgggtcggagttgtgcagtattcacat
aagaatgaagtggaatttcctgtctctcgctacacggatggcattgacttgaaaaaggct
gtttttaacatcaagcaactcaaaggtcttacattcaccgggaaagccctggatttcata
ttgcctctcataaagaagggaaaaacagagaggaccgacagagccccctgttacctcatc
gtgctgactgatggaaagtcaaatgacagcgttctcgaacctgccaacagactgcgggct
gagcagatcaccattcacgccatcgggattggcgaggcaaacaaaacacagctgcggcag
attgctgggaaggacgaaagggttaacttcgggcagaacttcgattctctgaagagcata
aaaaatgaaatcgttcaccgcatctgctctgaaaaaggatgtgaagacatgaaggcagac
atcatgtttttggtggacagttctggaagcatagggcctacaaattttgagacaatgaaa
actttcatgaaaaatctggtggggaagattcagattggcgccgacagatcccaagtcggt
gttgttcaattcagtgactataacagggaagaattccagcttaataaatactccacgcat
gaggaaatttatgcagccatagataggatgtctcccatcaacagaaacactctgacagga
ggagcactaacctttgtaaatgaatactttgacctttccaagggggggcgcccccaggtc
agaaagttccttatcctcctcacagatggaaaagctcaggacgaagtggggggcccagcc
acggctctcaggagcaaaagtgtgaccatcttctccgtgggtgtgtatggggccaatagg
gctcagctggaggagatcagtggagatggcagtctggttttccatgttgagaactttgac
catctgaaggccatagagagtaaactcatctttcgagtgtgtgctctgcatgattgcaag
aggatcgaactcttagacatcgtgtttgtgctggaccactcaggcagcataggtccacgt
gagcaagaaagcatgatgaacctcaccatccacttggtgaagaaggcagacgttggtagg
gaccgagtgcagattggagcgctcacatactccaaccatcccgagatcctcttctatctt
aacacttactcaagtggatcagccattgctgagcatctgaggaggcccagggacaccgga
ggggagacctacactgccaaggctcttcagcattccaacgtcctgttcacagaagagcat
ggcagccgcctgacgcagaacgtgaggcagctgatgattgtcatcacggatggcgtgtcc
cacgaccgagataaactcgacgaggcagcacgggaactcagagacaaaggcatcaccatc
tttgcagtgggtgtaggaaatgccaaccaagatgaactggagaccatggcagggaagaaa
gaaaacactgtccatgtagacaactttgacaaactgagagatatttacctgcctctgcaa
gagactctgtgtaataattcacaagagacttgtaaccttccggaagctgatgtgattttc
ctttgtgatggctctgacatggtgtctgattcagagttcgttaccatgacaactttcttg
tcagacttaattgataattttgacattgagtctcaaagaatgaagattgggatggctcaa
tacgggagccggtatcaggaaattattgagttggaaagctctctgaacaagactcagtgg
aagtctcaggtccacagtgtggcccagagcaaggggttgccacgccttgactttgctctc
aagcatgtgagtgacatgtttgatccatccgttggcgggagaaggaacgctggtgtccct
cagactctggttgttataacatctagttctccacgctatgatgtgacagacgcggtaaag
gtcctgaaagaccttggcatctgtgtcctggctttgggcataggggatgtttataaggaa
cagcttctgcccatcacaggcaattcagaaaagataattactttccgagactttaataag
ctaaagaatgtggatgtgaagaagagaatggttcgtgaaatttgccagagctgtgggaaa
gccaattgctttgtggatgtagtggtcgggttcgacatttccactcaccgtcagggccag
cctttgttccagggccatccccgcttagaatcctacctcccgggcatcttggaggacatc
acctctatcagaggggtgagctgtggtgcaggtgcagaggcgcaggtgagcctggccttt
aaggtgaacagtgaccaagagttcccagccaagttccagatctatcagaaagcagcgttt
gacagcttgttgcacgtcacagtccgtggaccaactcatctggatgcaccgttcttgcag
tcactttgggacatgtttgaggaaagatctgcatccaggggccaggttctgctcatcttc
tcagacggtctccagggtgaaagcatcaccttgctggagcggcagtcagacaggctcaga
gaagcaggacttgatgctctgttggtggtgtcccttaacacattcggtcacgatgagttt
tccagctttgaatttggaaaaggatttgactacaggactcagctgaccattgggatgcta
gacctgggcaagacgctctcacagtacctgggaaacatcgcagaaagggcttgctgctgt
acattttgtaaatgtccggggattccaggacctcatggaacccggggactccaagcctca
aagggttcttcaggtccaaaaggtagcagaggacacaggggagaggacggagaccctgga
agacgaggagaaattgggcttcaaggagacagaggagtcgtgggatgcccggggactcga
ggtcaaaagggagtcaaaggattttcaggagctcagggggaacatggagaggacgggctg
gacggactggatggagaagaaggattttacggatttcgtgggggaaagggacaaaaaggt
gaccctggcaatcagggttacccaggcatcagaggcgctgctggggaagatggcgagaag
ggcttcccaggggatcccggtgacccaggaaaagacagcaacatcaaaggccagaagggc
gagaaaggagaacgaggaagacaagggataactgggcagaaggggacacatggccgtcct
agctccaaaggaagcaggggaatggagggccagagggggccccagggtccctcagggcag
gcaggaaacccaggacctcaaggtacacaaggacctgaaggactacaaggctcacaggga
tcaagtggaaatcgtggagggaaaggagacaaaggaagtcaaggttatcagggacctcag
ggttctcctgggccagcaggacccagaggggatattggaagacctggatttggggggaga
aagggggaacctggagttcctggaggcccaggaccagtggggccccctggacagcgagga
aagcagggagattatggcatcccgggctatggtcagacaggaagaaaaggagttaagggc
ccaacaggattccctggggacccagggcagaagggtgacgctggcaatcctggaattcct
gggggacctggacccaaaggattcaagggactgactctctctcaaggcttgaaaggcagg
agtggacttcaggggtctcaaggccctcctggtcggagaggtcctaaaggcacagcaggg
caacctatatattccccctgtgaactgatccagttcctgagggaccacagcccctgttgg
aaagacaagtgccctgtgtatccaacagagctggtctttgctctggaccagtcctctggt
atcacagaaaggagatttaatgaaacaagggacaccatcacttcaattgtcagtgacctc
aacatcagggagaacaactgtcccgtgggagcacgggttgccgtggtctcctatgactca
gacaccagctatctcatccgtggctctgactaccataacaagaagcatctcctacagctt
ctttcccagataaaataccaagtgccccggaaggccagggacattggcaatgcaatgagg
tttgtggcccgcaacgttttcaagcggatgtctgcgggaaccaacaccaggagagtggct
gtgtttttcagcaatggccaggcagccagtagagcatccatcctcacggccaccatggag
ttgagtgccctcgatatcagccttgccgtctttgcttataacgaaagagtgttccttgat
gaggcttttgggtttgatgacactgggacatttcaggtgattccagtccccccagttgga
gattatgagccactggagaagcttcggcgctgcacactttgctatgacaagtgttttcca
aacacctgtgcagaagagcctttctttcctgagaattcctacatggatgttgcattcctc
ttagacaattccaaaaacatcgccagcgatgacttccaagccgtgaaagcgctggtgagc
tcagtgattgacagcttccacatcacttcaaacccttcagcctcagagtctggtgacagg
gttgctttgctgagctattctccctcagagagctccagaaggaagggcagggtgaagaca
gagtttgcctttacaacctacgacaaccaatcaatcatgaagaactacatctacacttcc
cttcaacagctcaatggagatgctactattggccttgccctacagtgggccatggaaggt
ctcttcctgggaacccccaatccaagaaaacacaaggtcatcattgtgatttcagctgga
gaaaaccatgaggagaaggaatttgtgaagacggtggctttgagggccaaatgtcaaggc
tatgtcgtatttgtgatatctctgggctccacacaaagagatgagatggaagaattagcc
agctacccactcgatcaccatctgatacagcttgggagaatgtacaagccagatctgaat
tatattgtgaaatttctgaagccctttatatactcggtcagacgcggatttaatcaatac
ccaccacccacgctgaaggacgactgcagacttgttgagttagagaggggagacacactg
ccccatggcctcagattgactgctaagctgcgtgaggttcctgagagcacaatcagcctg
gctgatcaagagctgaacgctgggaaagactcatcttttgtgttggaggatcacagaggc
gaccatttggtttacgttccaagccagatgctagagccacataagttagtgagccactat
ggaaatgaccgggaatctgtagcaatggcgagtctcacttctgaacatgaaagtcatggc
agagaagaactgggtcttgctcatgagcctggggatgcttctcttcaagaatattacatg
gacgtggctttcctcatagatgcttctcaacgagtaggaggacgtaacgagtttaaggaa
gtgagaactttgataacgtcagtactggattactttcacattgccccagctccactgacc
tctgtcttaggagacagagtggctgtcctgacctactctcccccaggctatctgccaaac
actgaagagtgcccagtctacctggaattcgatttggtcacttacaacactgtacatcaa
atgaagcatcatctccaagagtctcttcagcagctcaatggtgatgtttttattgggcat
gcccttcagtggacagttgacaatgtctttgtaggaacccccaatctgaggaaaaacaaa
gttatctttatagtaactgcaggtgaaaccaaccctttagacaaagaagttttaaggaat
gcatctctgagagccaagtgccaaggctactccatatttgtattttcctttggtccgata
cacaatgacatggagttagaagaattagcaagccaccctctggatcatcacttagtccgg
cttggccgggtacacaggccagatttggactatgtcatcaagttcatcaagccatttgtt
cattcgatcagacgtgctatcaacaaatatcccggcagagacctgcaagccaagtgtgat
aatctcaccttccctggcccagagaatgctggcacagaagacagtgcattactcatcccc
gaggtgtacagaatcgaggcaggagagaatgagctgtctggagactctggatctcaggag
cagcatttcttcctcctagggaacagtcatggcaaccattcagaaagcactgctgatctg
atgcggcagttgtacctgctcctctcctccggggaactgatggtgaatgataaggaagag
ccttgttccgcagagaccccagctccagtgaacagcaaacaagatggcgaagatgcaaga
tga

KEGG   Mus musculus (mouse): 68553
Entry
68553             CDS       T01002                                 

Gene name
Col6a4, 1110001D15Rik, AI413310, AU023415, Dv, Dvwa, EG235580, Vw, Vwa6
Definition
(RefSeq) collagen, type VI, alpha 4
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    68553 (Col6a4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    68553 (Col6a4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    68553 (Col6a4)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    68553 (Col6a4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    68553 (Col6a4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    68553 (Col6a4)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    68553 (Col6a4)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   68553 (Col6a4)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   68553 (Col6a4)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   68553 (Col6a4)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE
Other DBs
NCBI-GeneID: 68553
NCBI-ProteinID: NP_081039
MGI: 1915803
Ensembl: ENSMUSG00000032572
Vega: OTTMUSG00000032575
UniProt: A2AX52 Q9CTL0
Position
9; 9 F1
AA seq 2309 aa
MGTWKTFWLIISLAAGLGFVKSQRIVCREASVGDIVFLVHNSINPQHAHSVRNFLYILAN
SLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGNRMGQALQFILE
HHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGILVYAIGVKDASQAELRE
ISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLGKAAQYTERESPACSEASPADIVFLV
DSSTSIGLQNFQKVKHFLHSVVSGLDVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLD
RIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRAKDGVPQIVVLVTDGESSDEVQDVADQ
LKRDGVFVFVVGINIQDVQELQKIANEPFEEFLFTTENFSILQALSGTLLQALCSTVERQ
MKKSTKTYADVVFLIDTSQGTSQASFQWMQNFISRIIGILEVGQDKYQIGLAQYSDQGHT
EFLFNTHKTRNEMVAHIHELLVFQGGSRKTGQGLRFLHRTFFQEAAGSRLLQGVPQYVVV
ITSGKSEDEVGEVAQILRKRGVDIVSVGLQDFDRAELEGIGPVVLVSDLQGEDRIRQLML
DVNMFIQGSPKPPRVMTDVAKDAVEECLVPVPADLVFLVEDFSSARQPNFQRVVHFLTTT
VHSLNIHPDTTRVSLVFYSEKPRLEFSLDMYQSAAQVLRHLDRLTFRARRGRAKAGAALD
FLRKEVFLPEKGSRPHRGVQQIAVVIIESPSLDNVSTPASYLRRAGVTIYAAGTQPASES
KDLEKIVTYPPWKHAIRLESFLQLSVVGNKLKKKLCPEMLSGMPPLMSFIPESTRQSTQE
GCESVEKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQ
FFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARIDVARYLIVITDGQST
DPVAEAAQGLRDIGVNIYAIGVRDANTTELEEIASKKMFFIYEFDSLKSIHQEVIRDICS
SENCKSQKADIIFLIDGSESIAPKDFEKMKDFMERMVNQSNIGADEIQIGLLQFSSNPQE
EFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFTLPFFDSSRGGRPRVHQYLIVITDG
VSQDNVAPPAKALRDRNIIIFAIGVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEIL
SEVCSSQGCNIDLSVGVDTSTSSERAQQELRRLLPELMQQLAFLSNISCEAPGQMEPRFR
YVVPGSSDQPVFDSGFEKYSDETIQKFLVHQGSVNNRMDVDFLQSLGETAIHLSLAKVKV
LLVFTDGLDEDLERLRRTSEFLRSRGLSGLLLIGLGGAHKLEELQELEFGRGFAYRQPLS
SSLPSLPSVLLKQLDTIVERTCCNMYAKCYGDDGIRGEPGSRGEQGERGLDGLPGHPGEE
GDHGQRGPRGLPGLRGEEGCPGVRGPKGARGFSGEKGNPGEEGVGGLDGEQGDRGAAGPS
GEKGSSGSRGLTGLPGPAGPRGEPGLRGDPGDPGIDNLIQGPKGEKGRRGHQGSPGFHGP
LGEAGSVGPRGSLGRHGLPGLKGVLGETGELGSRGEPGHPGPQGPRGRQGPPGFFGQKGD
PGTQGNPGLPGPSGSKGPDGPRGLKGEVGPAGERGPRGQQGPRGQPGLFGPDGHGYPGRK
GRKGEPGFPGYPGVQGEDGNPGRGGEKGAKGIRGKRGNSGFPGLAGTPGDQGPPGKMGTK
GSKGLADRTPCEIVDFVRGNCPCSTGISRCPAFPTEVVFTLDMSNDVAPSDFERMRNILL
SLLMKLEMCESNCPTGARVAIVSYNTRTDYLVRLSDHRGKAALLQAVRKIPLERSSGSRN
LGATMRFVARHVFKRVRSGLLVRKVAVFFQAGRNYDTASVSTATLELHAADIATAVVTFT
EEHNLPEAGLVDGPNEFHLFTWETEGQQDVERLASCTLCYDKCRPALGCQLRAPGPQKLD
MDLVFLVDSSQGVSRDIYLGALRLVDSVLKDLEVAAQPGTSWHGARAALLTHTTPGFWPG
VDQAPVLEYFHLTSHGHRTEMQRQIREAASGLLQGGPALGHALEWTLENVLLTAVLPRRS
RVLYAIVASETSIWDREKLRTLSQEAKCKGIALFVLAVGPGVGAQELAELAKVASAPWEQ
HLLRLEGVSEAEVAYASRFTEAFLNLLNSGINQYPPPELVKECGGPNRGDTLLHFFTSAK
RFSRSQSGTSAAFANDSEALKSQGIFLGERKSRVASVALQEALGSHGKDRADTEDIDQET
PAKGRHLGPTHGPCPMGPEEGECLNYVLK
NT seq 6930 nt   +upstreamnt  +downstreamnt
atgggaacttggaagacattttggttgatcatctcccttgcagctggtttgggcttcgtc
aagtcacagaggattgtctgcagggaggcgtctgtgggagacattgtgtttctggtccac
aacagcatcaatccccagcatgcccacagtgtgcggaacttcttgtacatattggcaaac
agtttgcaagttggcagagacaacatccgtgtgggtctggcccagtacagcgatacaccc
acttctgagttcctgctttcagtttatcaccgcaaaggcgatgtgttgaaacacatccgt
ggacttcagtttaagcctggaggcaacaggatgggccaggccctgcagttcatcttagag
catcacttccgggaaggagccgggagccgggcaagccaaggggtgcctcaggtggccgtg
gtggtgagcagcggtctaacagaggaccacattcgcgaacctgcggaggctctcaggaga
gcgggaatcttggtatacgctattggggtcaaagacgcatctcaggctgaactcagggag
atttcgagtagtcccaaggataacttcaccttttttgttcccaacttccctggcttgcct
ggccttgcccagaagctgcggccagagctctgcagtactctagggaaggcagctcaatat
actgagcgggagtctccagcgtgctcagaagcatccccggctgacattgtgtttttggtg
gacagttcaactagcatcggactccagaacttccagaaagtgaagcatttcctccactct
gttgtctcggggctcgacgtcagaagcgatcaggtgcaagtggggctggtccagtacagt
gacaacatttacccagcctttccgctgaagcagtcctctctgaagagcgccgtcctggat
cggatccggaatctgccctacagcatggggggcacaagtaccggaagcgccctggagttc
atcagggccaactccttgactgagatgagtggcagccgtgccaaggatggggttcctcag
atagttgtcctggtgacagatggggagtccagtgatgaggtccaggatgtggctgaccag
ctgaaaagagacggggtctttgtgtttgtggtagggatcaatattcaagatgtccaagag
ttgcagaaaatagccaacgagccgtttgaggagtttctcttcaccacagaaaacttcagc
atcttgcaggcgctctcaggaaccctccttcaggccctgtgttccacggtggagaggcag
atgaaaaagtctaccaagacctacgcagacgtggtctttcttattgacacctcccagggg
acatcacaggctagtttccagtggatgcagaatttcatctctagaattattggcatcctg
gaggtcggccaggacaaataccagattgggctggctcagtatagcgatcaaggtcacact
gagtttttgttcaatacccacaagacccggaatgagatggtcgcacacatccatgaactc
ttggtgttccaaggtggctccaggaagactggccaaggccttcggtttcttcatcggact
ttcttccaggaggcagcaggaagccggctcttacagggtgtccctcagtatgtggtagtc
attacttcagggaaatctgaggacgaggtcggagaggtcgcacagatcctgaggaagaga
ggcgtggacatcgtgtctgtgggcttgcaggactttgacagggcagaactggaggggata
gggcctgtagtgcttgtgtctgacctgcagggagaagatagaatcagacagctaatgctg
gatgtgaatatgttcatccaggggagtccaaagcctccacgggtcatgactgatgttgcg
aaggacgctgtagaagaatgtctagttcccgttccagccgacctggtgttccttgttgag
gactttagcagtgccaggcaacccaatttccagcgagttgtccatttcttgacgaccact
gtccactctctcaacatccatcctgataccacaagagtcagcttggtcttttacagtgag
aaaccaaggctggagttttccctggatatgtatcaaagcgctgcccaagtcttacgccat
ctggacaggttaactttccgagcaagaagaggaagggcgaaggcaggtgctgctttagac
ttcctgaggaaagaagttttccttccagagaagggcagccggccccatcggggcgtacag
cagatagctgtggtcatcatagaaagtccttctctagacaatgtgtctacacctgcttcc
taccttcgcagggcgggggtcaccatctatgcagcgggcacccagcctgcctctgagagt
aaggacctggagaagattgtcacataccctccctggaagcacgccatccgcctagagtcc
tttctgcaactcagtgttgtaggaaacaaacttaaaaagaagctctgccctgagatgttg
agcggaatgcctcctctgatgagctttattccagaaagcacacgacaaagcacacaggaa
ggctgcgagagtgtggagaaggcagacatttacttccttattgatggatctggcagcatc
aaacccaacgacttcattgaaatgaaggacttcatgaaggaggtgataaagatgttccac
attgggcccgacagagtacgatttggagtcgttcagtactcagacaaaattataagccag
tttttcctcacccagtatgccagcatggcaggactgagcgcagccattgacaacatccaa
caggtgggaggtggtaccaccactggcaaggccttaagcaaaatggtaccagtcttccaa
aacactgccagaatcgacgttgcccgatacctcattgtcatcactgatggtcagtctacc
gacccggtggctgaggctgcgcagggattgagggacatcggagtcaatatttatgccatt
ggagtaagggatgctaatacgactgagcttgaggaaatagcaagcaaaaagatgtttttc
atatatgagtttgattccttgaagtccatccaccaggaagtcattcgggacatctgctcc
tcagagaactgtaagagccagaaagctgacatcatcttcttgattgatggctcagaatcc
atcgctcctaaagactttgaaaagatgaaggacttcatggagaggatggtgaaccagtct
aatatcggtgctgatgaaattcagattggccttctgcagttcagctccaatccccaggaa
gaattcaggcttaaccggtactcctcaaaggtggacatgtgcagagccatcttgagtgtc
caacagatgagtgatggtacccatactgggaaagccttgaatttcactctgccctttttt
gacagttcaagaggagggagacccagagttcatcagtatttgattgtgatcaccgacggg
gtctcccaggataatgtagctcccccagccaaagccctcagggacagaaacataattatt
tttgctatcggggtgggaaatgttcaaagagcccagcttttggagatcaccaatgaccag
gacaaagtattccaggaggaaaattttgagtccctgcagagcctggaaaaggaaattctt
tctgaggtctgtagttcccaaggctgcaacatagatctgtctgtaggagttgatacctca
acctcttcagagcgagctcagcaggagcttcgaaggttgctgccggagctgatgcaacag
ctggcttttctctctaacatcagctgtgaggctcctggccagatggagccgaggttccga
tatgtggttccaggctccagcgaccagcctgtctttgattccggctttgagaaatacagc
gatgagaccatccagaagttcttggtgcatcagggttctgtaaacaaccgtatggatgta
gactttctgcagtccctgggagagactgccattcacctgtctcttgctaaagtgaaggtc
cttttagtgtttacagacggacttgatgaagacctagagaggctgaggagaacctcagag
ttccttcgcagcagagggctctccggactgctgctcatcggcctgggaggtgcacataag
ttagaagagctgcaggagctggagtttggcagagggtttgcctacaggcagcctctgagc
tccagtctgccatccctcccaagcgtcttactgaagcaactcgacacaattgtggagagg
acatgctgtaacatgtatgccaaatgttacggagacgatgggatcagaggtgaacctggg
agtcgaggggagcagggagagaggggtttggacgggctccctggccatcctggtgaagag
ggagaccacggacaaagaggcccccgaggtcttcctggacttcgaggtgaggaaggatgt
ccaggtgtgagaggacctaagggagcaagaggattttcaggagagaagggcaaccctggt
gaggaaggtgttgggggcttggatggagaacagggcgatcgtggagccgcagggccatct
ggagagaaaggaagcagcggcagtcggggcttgacaggtctgcctggaccagctggacct
cgtggggagcctgggctgaggggagatccgggggatcctggaattgataatctcatccaa
ggccccaagggagaaaagggaaggcgtggacatcagggaagccctggtttccatggacct
ctgggggaagccggcagtgtgggacctcgggggtcacttggaagacatggcttgccaggg
ttgaagggtgtgcttggagaaaccggtgagctgggttcccgaggagagcctgggcatcca
ggcccacagggacccagaggaaggcaaggaccaccaggatttttcggtcaaaaaggggat
ccgggtacccaggggaatcctgggcttccagggccaagtggctcaaaaggaccagatgga
cccagaggactgaagggagaagtcggtcctgcaggagaaagaggtccacggggtcagcaa
ggcccaagagggcagcctggtcttttcggtccagacggccatggttatccgggaagaaaa
ggcaggaagggtgaacctgggttccctggctatcctggtgtgcaaggagaagatggtaac
ccaggccgtggaggagagaagggggccaagggaatccgagggaagaggggcaattctgga
tttcctgggcttgctggaactccaggtgaccaaggcccaccgggaaaaatgggcaccaaa
ggctcgaaaggcttggcagataggacgccttgtgaaattgttgacttcgtgcgaggaaac
tgcccttgctcaacaggtatttccaggtgcccagcattcccaacagaagtggtcttcacc
ttggacatgtccaacgatgtcgccccatcagactttgagaggatgagaaacattttatta
tctctgctgatgaagctggaaatgtgcgagagcaactgtcccactggtgcccgagtggcc
attgtttcctacaacacgaggacagattacctagtgcgcttgtcagaccatcggggaaaa
gctgcccttctgcaggctgtcaggaaaatccccctggaacgctcatctgggtctcggaac
cttggggccactatgaggtttgtggctagacacgtattcaaacgtgtacgctcaggcctt
ctggtgaggaaggtggccgtgttcttccaggcaggtaggaactatgacacagcctctgtc
agcacagccacgttggagctccacgcagcggacattgctacagcagttgtgaccttcaca
gaggagcacaacctcccagaggcagggctggtggatggacccaacgaatttcacctgttt
acgtgggagaccgagggccagcaggatgtggaacgcctggccagctgtactctctgctat
gacaagtgccgaccagccctggggtgccagctgcgtgctcctgggcctcagaagttggac
atggacttggtattcctggtggacagctcccagggtgtcagcagggacatataccttggg
gctttgcggctagtggattcagtgctcaaggacctggaggtggctgcgcagcctggcaca
tcctggcacggggcgcgcgcagccctgttgacacacacaaccccaggcttttggcctggt
gtggatcaagcccctgtgcttgagtacttccacctgacctcccacggccaccggacagaa
atgcaaaggcagattcgggaggctgcgagtggcctgctacagggaggccccgccctgggc
cacgccctggagtggacgctggaaaacgtgcttctgacagccgtactgcctcggaggtca
cgggtcctctatgccatcgtggccagcgagacaagcatctgggacagagagaagctaagg
actctgtcccaggaggccaagtgcaagggcatcgcattattcgtgctcgccgtgggccct
ggagtgggcgcccaggagctggctgagctggccaaagtggccagtgcgccctgggagcag
cacctgctacgcctagagggtgtctcagaggcagaggtggcctatgccagcagattcacg
gaggccttcctgaaccttctaaatagtggaataaaccagtatccacccccagagcttgtt
aaggagtgtggaggcccaaaccgtggggacaccttgctgcatttcttcacttctgccaag
aggttttccaggagccagtctggcacgtctgctgcctttgcgaatgactcagaagcactc
aagtcacagggcatctttctaggggagaggaaaagcagggtggcgtccgtagctctgcaa
gaagcgttgggaagtcatgggaaggacagggcggacactgaagacattgatcaggaaacg
ccagcaaaaggaagacatttgggccccacccacggtccttgtcccatgggtcctgaggaa
ggcgagtgcctgaattatgtcttgaagtag

KEGG   Mus musculus (mouse): 94216
Entry
94216             CDS       T01002                                 

Gene name
Col4a6, BB116301
Definition
(RefSeq) collagen, type IV, alpha 6
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    94216 (Col4a6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    94216 (Col4a6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    94216 (Col4a6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    94216 (Col4a6)
  09154 Digestive system
   04974 Protein digestion and absorption
    94216 (Col4a6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    94216 (Col4a6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    94216 (Col4a6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    94216 (Col4a6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    94216 (Col4a6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    94216 (Col4a6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    94216 (Col4a6)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    94216 (Col4a6)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   94216 (Col4a6)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   94216 (Col4a6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 94216
NCBI-ProteinID: NP_444415
MGI: 2152695
Ensembl: ENSMUSG00000031273
Vega: OTTMUSG00000018922
UniProt: Q9ESQ1 B1AVK5
Position
X; X F1- F2
AA seq 1691 aa
MHPGLWLFLVSLCLTKELAESGQKMSYGKPCGSRDCHGVCTCFPEKGARGHPGPIGTQGP
AGPEGFTGPTGLSGLKGERGSPGPLGPYGSKGDKGPIGVPGFVGISGIPGHPGQPGPRGP
PGPDGCNGTQGAVGFPGPDGYPGILGPPGLPGHKGAKGEPVPFQGSITGIRGDPGLPGLD
GIPGPSGSSGSPGDAGPIGPPGLQGPPGLPGLPGPEGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPPGFPGMSGPPGVPEFGSTGEKGEKGILGLPGPRG
PMGSEGIQGHPGRQGKKGTSGFPGINGFPGIKGEKGDIGVRGPDSFTDADGAVISGYPGD
PGVLGLPGLRGDEGVQGQRGPSGTPGLPSLTGLPGALGPQGFPGLKGDQGNSGRTTFGEA
GLPGRVGLPGLPGLPGPSGPPGRTFETGHLSNIEPGFPGLQGEQGPKGHQGLKGVKGDSG
FCACEGGAPNIGPHGEPGLPGIQGPIGLQGFKGTKGDPGSRGASGPAGTPGLFGPRGQTG
LKGKKGEPTVSRGSKMSGDKGDPGPQGTPGLAGTPGKDGRPGLPGLPGIQGDGGSGFPGE
RGLPGLPGEKGHDGPIGPPGIGLPGPPGPRGLPGDKGVDGLPGQQGLRGAQGVTLPCIIP
GSYGPSGFPGAPGFPGSKGARGLPGIPGKPGTHGSKGGPGSPGLMHLPGLPGFPGVRGEK
GLPGFPGLPGKDGYPGKAGSPGLPGFKGAAGDIFGAENGASGEQGLQGLPGDKGFPGDSG
LPGPKGLNGKPGMLGPKGERGNPGTSGPPGQPGPSGSTDPFGIKGTSGFPGAPGLPGISG
HPGKKGQRGDIGQPGSTGKRGLPGIKGLPGPQGSPGFLGSPGLSGVTGLPGIEGQKGEKG
SSGPVGFPGLPGLPGLPGADGLKGFSGSFGKVGQPGQTGTSGEKGDRGDPGPVGISSPRP
PMLNLWFKGEKGSRGSAGSDGFPGPRGDKGEPGIPGLPGPPGATGQTNTIKGLNGRPGSP
GSMGRRGLPGLKGSLGIAGFPGMPGKSGSQGLTGTSGLLGATGIPGLKGDQGQTLGISGS
PGPKGQPGELGFKGVKGKDGLVGDRGYPGNKGDGGKVGITGDPGFPGSPGLQGISGMNGD
PGLPGSSGHLGSIGRPGPSGLIGPKGFPGAPGLHGLNGLPGTKGTHGTPGASITGVPGPA
GLPGPKGERGVPGIVIGDPGKQGSKGQKGDQGSPGLQGPAGTPGAPGISLPSVIAGQPGD
PGRPGLDGERGRPGPPGPPGPPGPSSDQGDPGDSGFPGIPGLQGFKGNQGLPGFSGLSGE
LGLKGMRGEPGLMGTPGKIGQPGDPGFPGMKGKAGPRGFSGPQGAPGHTPIAEAMQVPPG
PLGLPGIDGIPGLIGDPGSQGSVGLQGSKGLPGIPGKDGPSGLPGPSGILGDPGLPGLQG
PPGFEGAPGNQGPIGQPGMPGHGVRVGYTLVKHSQSEQVPLCPTGMSRLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVGQTQ
IPQYISRCSVCEAPSQAIAVHSQDIIVPQCPLGWHSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERGQFREEPVSETLKTGQLHTR
VSRCQVCMKTP
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgcaccctggattgtggctgttcctggtttccttatgcttgaccaaagaactggcagaa
tcgggacagaagatgtcctatggaaagccatgtgggagccgagattgccatggagtctgt
acatgctttcctgagaaaggagcaagagggcaccctggaccaattggaactcaaggtcca
gccggtccggaagggtttactggtccaactggtttatcaggtttgaaaggagaaagaggc
tccccaggccctctgggaccatatggatcaaaaggagataagggtcccattggagttcct
ggctttgttggcatcagtggtattccgggccaccctggacagccaggtccaagaggtcca
cctggcccagatggctgtaatggaactcaaggagctgttggatttccaggccctgatggc
taccctgggattttaggaccacctgggctgcctggtcacaaaggtgccaaaggagagcct
gttccttttcaaggcagcattacaggaataaggggagatcctgggctgcctggactagat
ggaatccctggtccatcaggatcctccggctctcctggagatgcaggacccataggacca
ccaggcttacaaggtcccccaggcctccccggacttcccggtcctgagggaaatatgggg
ttaggtttccaaggagaaaaaggagtcaagggggatgttggcctccctggccctgcagga
ccacctccatctactggggaactagaattcatgggattccccaaaggaaagaaaggttcc
aagggtgagccagggcctccaggtttcccaggaatgagtggccctccaggtgtcccggaa
tttggatctactggagaaaaaggagaaaaggggattcttggtttgccaggacctaggggt
cccatgggttcagaaggaatacaaggccatccagggagacagggaaagaaggggacttca
ggttttcctgggattaacgggttcccaggaattaagggtgaaaagggagacattggtgtt
cgaggcccagacagtttcactgatgcagatggcgctgtgatctcaggttatcctggagac
cctggtgtgctgggccttccaggccttagaggagatgaaggcgtccaaggccaacgtggc
ccttctggaacccctggcctcccatcattaacaggtcttccaggtgccctagggcctcag
ggatttcctggcctgaaaggagaccaaggaaactcaggacgtaccacctttggagaagct
ggcctacctggcagggttggtttaccaggtttaccaggcctgccaggcccatcaggccca
cctggtcgcacatttgagactggacatctgtccaacatagagcctgggttccctggtctc
caaggagaacaaggtccaaaaggacatcaaggcctcaaaggagtaaaaggagactctggt
ttttgtgcttgtgaaggtggtgcccccaacattggaccacatggggaaccaggtctgcct
gggatacaaggtcccattggtctacagggttttaaggggactaaaggagatccaggctca
aggggagcatctggtcctgcagggacgccagggctatttggacctagaggtcagactggc
ctcaaaggaaagaaaggagaaccaactgtcagtagaggatcaaaaatgtcaggggacaaa
ggtgaccctggtcctcagggtaccccaggtttggcaggaactccgggcaaggatggaaga
ccaggtttaccaggcctcccaggcattcagggagatggtgggtctggcttcccaggtgaa
agagggttaccaggacttcctggtgaaaaaggccatgatggtccaattggaccaccagga
attgggctgccaggacctcctgggccccgtggacttcctggagataaaggagtagatggg
ttaccagggcaacaaggcctccgtggagctcaaggagtcaccttgccttgtatcattcct
gggtcatatggtccatcaggatttcctggagctcctggattcccaggctctaagggagct
cggggcctccctggaattccaggcaagcctggcactcacggaagcaaaggagggcctggg
agtccagggttaatgcatctcccaggattgccagggtttcctggagttcgtggggagaag
ggattacctgggtttcctgggcttcctggaaaagatggctaccctgggaaagctggcagt
ccagggttaccaggtttcaagggagctgctggtgacatatttggtgctgaaaatggtgct
tcgggggagcaaggcctacagggattgccaggggacaaaggatttcctggagactctggc
cttccaggacccaagggtttgaatgggaaacctggtatgctaggccccaagggtgagcgg
ggcaaccctggaacatcaggaccaccaggacagccaggaccctcagggtctactgaccca
tttggcatcaagggcacatctggattcccaggagcaccaggccttccaggcatctcaggg
caccctggaaagaaaggtcaaagaggagacataggtcagcctggatcaactggaaaaaga
ggtctacctgggataaaaggccttcctggtcctcaagggtcacctggcttcctggggagc
ccaggtttgtcaggcgtcactgggttgccgggcatcgaaggccaaaagggtgaaaagggg
tcctctggaccagtaggatttcctgggttacctggtcttcctggtcttcctggagctgat
ggattaaagggattttctgggtcatttggaaaggtgggacaacctggacagactggtact
tccggagaaaaaggagacagaggggatccagggccagttggcatatctagtccaagacct
ccaatgctaaacctttggttcaaaggagaaaaaggctctcgaggttctgctggatcggat
ggatttcctgggcccagaggtgacaaaggagagcctggcatcccagggttaccaggtcca
cctggagctactggacaaaccaataccatcaaagggctcaatgggaggccaggttctcct
ggatctatgggaagaaggggattacctggcctgaaggggtctcttgggattgcaggtttc
ccaggaatgccagggaaaagcggttcacaggggctcactgggacttctggcctcctggga
gcaactggtatcccaggtttaaaaggagatcaaggccaaacacttggcatttctggaagc
ccaggacccaagggacagcctggagaattaggttttaaaggtgtcaaaggaaaagatgga
ctagttggtgatagaggttacccaggaaacaaaggtgatggtggaaaagttggaattact
ggagatcctggcttccctggctccccaggacttcaagggatttcaggtatgaatggagac
ccaggcctcccaggttcctctggccatctggggtcaattggccgccctggaccctcaggc
ttaataggacctaaaggattccctggagctcctggtttacatggactgaatggacttcca
ggcaccaagggcacccatggcactccaggagccagtatcactggtgttcctgggccggct
ggcctgcctggtcccaaaggggaaaggggagtgccaggaattgtcatcggagatccaggg
aaacaaggttcaaagggacaaaaaggtgaccaaggttccccaggtcttcaaggtcctgct
ggtacccctggagccccaggcatctccttgccctcagtcatagcaggacagcctggtgac
cctgggcggccaggcctagatggagaacgaggccgcccaggccctccaggtcctccaggt
ccccctgggccatcctcagatcaaggcgatcctggagactctggcttccctgggattcca
ggccttcaagggttcaagggaaaccaaggacttccaggtttctctggcctctctggagag
ctagggctaaagggcatgagaggggaacctggcctcatggggactcctggcaagatcggg
caaccaggagacccaggatttcctgggatgaaggggaaggcggggccaagaggcttttcc
ggcccacagggagctccaggacacacaccaattgctgaagctatgcaggttcctccagga
cctttgggtctgccaggcattgatggtatcccaggcctcataggagaccctgggtctcaa
ggctctgtgggtctgcaaggttccaaaggcctacctggcatccctggtaaggatggcccc
agtgggcttccaggcccatctggaatccttggtgatcctggtctccctggactacaagga
cctccaggatttgaaggagctccgggaaatcagggccccattgggcagcctgggatgcct
gggcacggtgtgagagtgggctacacactagtaaaacacagtcagtcagaacaggtgcca
ctatgccccactgggatgagccggctctgggtgggttacagcttgctcttcgtggagggg
caggagaaagcccataaccaggatctgggatttgctggctcctgcctgccccgattcagc
accatgcccttcatatactgcaacatcaatgaagtgtgccactatgccagacgcaatgat
aaatcttactggctctccactactgcccccattcccatgatgcctgtgggccagacccag
attccccagtatatcagccgctgctctgtatgtgaggcaccctcacaagccattgctgta
cacagtcaggacatcattgtcccacagtgccctctgggctggcacagcctctggatcgga
tactccttcctcatgcacactgctgctggggctgagggtggaggccagtccctagtctcc
cctggctcgtgtcttgaggacttccgggccactcctttcattgagtgcagcggcgcccgt
ggtacatgccactactttgccaacaagtacagtttctggctgacaacggtggaggagaga
gggcagtttcgggaagagcccgtgtctgagacgctgaaaactgggcagcttcacaccagg
gtgagccgctgccaggtgtgtatgaaaaccccgtag

DBGET integrated database retrieval system