KEGG   Homo sapiens (human): 10319
Entry
10319             CDS       T01001                                 
Symbol
LAMC3, OCCM
Name
(RefSeq) laminin subunit gamma 3
  KO
K06247  laminin, gamma 3
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa05145  Toxoplasmosis
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H02501  Occipital cortical malformation
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    10319 (LAMC3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    10319 (LAMC3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    10319 (LAMC3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    10319 (LAMC3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    10319 (LAMC3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    10319 (LAMC3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    10319 (LAMC3)
   05145 Toxoplasmosis
    10319 (LAMC3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 10319
NCBI-ProteinID: NP_006050
OMIM: 604349
HGNC: 6494
Ensembl: ENSG00000050555
Pharos: Q9Y6N6(Tbio)
UniProt: Q9Y6N6 Q8N2D6
Position
9:131009174..131094473
AA seq 1575 aa
MAAAALLLGLALLAPRAAGAGMGACYDGAGRPQRCLPVFENAAFGRLAQASHTCGSPPED
FCPHVGAAGAGAHCQRCDAADPQRHHNASYLTDFHSQDESTWWQSPSMAFGVQYPTSVNI
TLRLGKAYEITYVRLKFHTSRPESFAIYKRSRADGPWEPYQFYSASCQKTYGRPEGQYLR
PGEDERVAFCTSEFSDISPLSGGNVAFSTLEGRPSAYNFEESPGLQEWVTSTELLISLDR
LNTFGDDIFKDPKVLQSYYYAVSDFSVGGRCKCNGHASECGPDVAGQLACRCQHNTTGTD
CERCLPFFQDRPWARGTAEAAHECLPCNCSGRSEECTFDRELFRSTGHGGRCHHCRDHTA
GPHCERCQENFYHWDPRMPCQPCDCQSAGSLHLQCDDTGTCACKPTVTGWKCDRCLPGFH
SLSEGGCRPCTCNPAGSLDTCDPRSGRCPCKENVEGNLCDRCRPGTFNLQPHNPAGCSSC
FCYGHSKVCASTAQFQVHHILSDFHQGAEGWWARSVGGSEHPPQWSPNGVLLSPEDEEEL
TAPEKFLGDQRFSYGQPLILTFRVPPGDSPLPVQLRLEGTGLALSLRHSSLSGPQDAGHP
REVELRFHLQETSEDVAPPLPPFHFQRLLANLTSLRLRVSPGPSPAGPVFLTEVRLTSAR
PGLSPPASWVEICSCPTGYTGQFCESCAPGYKREMPQGGPYASCVPCTCNQHGTCDPNTG
ICVCSHHTEGPSCERCLPGFYGNPFAGQADDCQPCPCPGQSACTTIPESREVVCTHCPPG
QRGRRCEVCDDGFFGDPLGLFGHPQPCHQCQCSGNVDPNAVGNCDPLSGHCLRCLHNTTG
DHCEHCQEGFYGSALAPRPADKCMPCSCHPQGSVSEQMPCDPVTGQCSCLPHVTARDCSR
CYPGFFDLQPGRGCRSCKCHPLGSQEDQCHPKTGQCTCRPGVTGQACDRCQLGFFGFSIK
GCRACRCSPLGAASAQCHENGTCVCRPGFEGYKCDRCHDNFFLTADGTHCQQCPSCYALV
KEEAAKLKARLTLTEGWLQGSDCGSPWGPLDILLGEAPRGDVYQGHHLLPGAREAFLEQM
MSLEGAVKAAREQLQRLNKGARCAQAGSQKTCTQLADLEAVLESSEEEILHAAAILASLE
IPQEGPSQPTKWSHLATEARALARSHRDTATKIAATAWRALLASNTSYALLWNLLEGRVA
LETQRDLEDRYQEVQAAQKALRTAVAEVLPEAESVLATVQQVGADTAPYLALLASPGALP
QKSRAEDLGLKAKALEKTVASWQHMATEAARTLQTAAQATLRQTEPLTKLHQEARAALTQ
ASSSVQAATVTVMGARTLLADLEGMKLQFPRPKDQAALQRKADSVSDRLLADTRKKTKQA
ERMLGNAAPLSSSAKKKGREAEVLAKDSAKLAKALLRERKQAHRRASRLTSQTQATLQQA
SQQVLASEARRQELEEAERVGAGLSEMEQQIRESRISLEKDIETLSELLARLGSLDTHQA
PAQALNETQWALERLRLQLGSPGSLQRKLSLLEQESQQQELQIQGFESDLAEIRADKQNL
EAILHSLPENCASWQ
NT seq 4728 nt   +upstreamnt  +downstreamnt
atggcggcggctgcgcttctgctggggctggcgctgctggcaccgcgggcggccggcgcg
ggcatgggcgcgtgctatgacggcgcagggcgcccgcagcgctgcctgccggtgttcgag
aacgcggcgtttgggcggctcgcccaggcctcgcacacgtgcggcagcccgcccgaggac
ttctgtccccacgtgggcgccgcgggcgcgggggctcattgccagcgctgcgacgccgcc
gacccccagcgccaccacaacgcctcctacctcaccgacttccacagccaggacgagagc
acctggtggcagagcccgtccatggccttcggcgtgcagtaccccacctcggtcaacatc
accctccgcctagggaaggcttatgagatcacgtatgtgaggctgaagttccacaccagt
cgccctgagagctttgccatctacaagcgcagccgcgccgacggcccatgggagccctac
cagttctacagcgcctcctgccagaagacctacggccggcccgagggccagtacctgcgc
cccggcgaggacgagcgcgtggccttctgcacctctgagttcagcgacatctccccgctg
agtggcggcaacgtggccttctccaccctggagggccggcccagcgcctacaacttcgag
gagagccctgggctgcaggagtgggtcaccagcaccgaactcctcatctctctagaccgg
ctcaacacgtttggggacgacatcttcaaggaccccaaggtgctccagtcctactattat
gccgtgtccgacttctctgtgggcggcaggtgcaagtgcaacgggcatgccagcgagtgc
ggccccgacgtggcaggccagttggcctgccggtgccagcacaacaccaccggcacagac
tgtgagcgctgcctgcccttcttccaggaccgcccgtgggcccggggcaccgccgaggct
gcccacgagtgtctgccctgcaactgcagtggccgctccgaggaatgcacgtttgatcgg
gagctcttccgcagcacaggccacggcgggcgctgtcaccactgccgtgaccacacagct
gggccacactgtgagcgctgtcaggagaatttctatcactgggacccgcggatgccatgc
cagccctgtgactgccagtcggcaggctccctacacctccagtgcgatgacacaggcacc
tgcgcctgcaagcccacggtgactggctggaagtgtgaccgctgtctgcccgggttccac
tcgctcagtgagggaggctgcagaccctgcacttgcaatcccgctggcagcctggacacc
tgtgacccccgcagtgggcgctgcccctgcaaagagaatgtggaaggcaacctatgtgac
agatgtcgcccggggacctttaacctgcagccccacaatccagctggctgcagcagctgt
ttctgctatggccactccaaggtgtgcgcgtccactgcccagttccaggtgcatcacatc
ctcagcgatttccaccagggagccgaaggctggtgggccagaagtgtggggggctctgag
caccccccacaatggagcccaaatggggtcctcctgagcccagaagacgaggaggagctc
acagcaccagagaagttcctgggagaccagcggttcagctatgggcagcccctcatactg
accttccgggtgccccccggggactccccactccctgtacagctgaggctggaagggaca
ggcttggccctgtccctgaggcactctagcctgtctggcccccaggatgccgggcatccc
agggaggtagagctcaggttccacctgcaggagacctccgaggacgtggcccctccactg
ccccccttccacttccagcggctcctcgccaacctgaccagcctccgcctccgcgtcagt
cccggccccagccctgccggtccagtgttcctgactgaggtccggctcacatccgcccgg
ccagggctttccccgccagcctcctgggtggagatttgttcatgtcccactggctacacg
ggccagttctgtgaatcctgtgctccgggatacaagagggagatgccacaggggggtccc
tatgccagctgtgtcccctgcacctgtaaccagcatggcacctgtgaccccaacacaggg
atctgtgtctgcagccaccataccgagggcccatcctgtgaacgctgtttgccaggtttc
tatggcaaccctttcgcgggccaagccgacgactgccagccctgtccctgccctggccag
tcggcctgtacgaccatcccagagagccgggaggtggtgtgtacccactgccccccgggc
cagagagggcggcgctgtgaggtctgtgatgatggcttttttggggacccgctggggctc
tttgggcacccccagccctgccaccagtgccagtgtagcgggaacgtggaccccaatgcc
gtgggcaactgtgaccccctgtctggccactgcctgcgctgcctgcacaacaccacgggt
gaccactgtgagcactgtcaggaaggcttctacgggagcgccctggcccctcgacccgca
gacaaatgcatgccttgcagctgtcacccacagggctcggtcagtgagcagatgccctgc
gacccagtgacaggccaatgctcctgcctgcctcatgtgactgcacgggactgcagccgc
tgctaccctggcttcttcgacctccagcctgggaggggctgccggagctgcaagtgtcac
ccactgggctcccaggaggaccagtgccatcccaagactggacagtgcacctgccgccca
ggtgtcacaggccaggcctgtgacaggtgccagctgggtttcttcggcttctccatcaag
ggctgccgggcctgcaggtgctccccactgggcgctgcctcggcccagtgccacgagaac
ggcacatgcgtgtgcaggcctggcttcgagggctacaaatgtgaccgctgccacgacaac
ttcttcctcacggcagacggcacacactgccagcaatgtccgtcctgctacgccctggtg
aaggaggaggcagccaagctgaaggccagactgactttgacggaggggtggctccaaggg
tccgactgtggcagtccctggggaccactagacattctgctgggagaggccccaaggggg
gacgtctaccagggccatcacctgcttccaggggctcgggaagccttcctggagcagatg
atgagcctcgagggtgctgtcaaggccgcccgggagcagctgcagaggctgaacaagggt
gcccgctgtgcccaggccggatcccagaagacctgcacccagctggcagacctggaggca
gtgctggagtcctcggaagaggagattctgcatgcagctgccattctcgcgtctctggag
attcctcaggaaggtcccagtcagccgaccaaatggagccacctggccacagaggcccgt
gccctcgccaggagccacagagacaccgccaccaagatcgcagccactgcttggagggcc
ctgctcgcctccaacaccagctacgcgcttctctggaatctgctggagggaagggtggcc
ctagagacccagcgggacctggaggacaggtaccaggaggtccaggcggcccagaaagca
ctgaggacggctgtggcagaggtgctgcctgaagcggaaagcgtgttggccaccgtgcag
caagttggcgcagatacagccccgtacctggccttgctggcttccccgggagctctgcct
cagaagtcccgggctgaagacctgggcctgaaggcgaaggccctggagaagacagttgca
tcatggcagcacatggccactgaggctgcccgaaccctccagactgctgcccaggcgacg
ctacggcaaacagaacccctcacaaagctgcaccaggaggccagagccgccctgacccag
gcttcctcatctgtccaggctgcgacagtgactgtcatgggagccaggactctgctggct
gatctggaaggaatgaagctgcagtttccccggcccaaggaccaggcggcattgcagagg
aaggcagactccgtcagtgacagactccttgcagacacgagaaagaagaccaagcaggcg
gagaggatgctgggaaacgcggcccctctttcctccagtgccaagaagaagggcagagaa
gcagaggtgttggccaaggacagtgccaagcttgccaaggccttgctgagggagcggaaa
caggcgcaccgccgtgccagcaggctcaccagccagacgcaagccacgctccaacaggcg
tcccagcaggtgctggcgtctgaagcacgcagacaggagctggaggaagctgagcgggtg
ggtgctgggctgagcgagatggagcagcagatccgggaatcgcgtatctcactggagaag
gacatcgagaccttgtcagagctgcttgccaggctggggtcgctggacacccatcaagcc
ccagcccaggccctgaacgagactcagtgggcactagaacgcctgaggctgcagctgggc
tccccggggtccttgcagaggaaactcagtctgctggagcaggaatcccagcagcaggag
ctgcagatccagggcttcgagagtgacctcgccgagatccgcgccgacaaacagaacctg
gaggccattctgcacagcctgcccgagaactgtgccagctggcagtga

KEGG   Homo sapiens (human): 1282
Entry
1282              CDS       T01001                                 
Symbol
COL4A1, BSVD, BSVD1, COL4A1s, PADMAL, RATOR
Name
(RefSeq) collagen type IV alpha 1 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00579  Hereditary angiopathy with nephropathy, aneurysms, and muscle cramps (HANAC)
H00839  Porencephaly
H00877  Brain small vessel disease
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1282 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1282 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1282 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1282 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1282 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1282 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1282 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1282 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1282 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1282 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1282 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1282 (COL4A1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1282 (COL4A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1282 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1282
NCBI-ProteinID: NP_001836
OMIM: 120130
HGNC: 2202
Ensembl: ENSG00000187498
Pharos: P02462(Tbio)
UniProt: P02462
Structure
Position
13:complement(110148963..110307157)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggggagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccaggcttaccagggatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggactccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccaaaaggtgacaagggtgaccaaggggtcagtgggcctcca
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggcaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcctggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctccctgtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccgggtcctcctggttcccctggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccgggagaaataggtttcccagggcagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcaggagtgccaggccctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagttttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctgctggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
attgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggcgctgtgggccagccaggcattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggccaccggggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacatggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggactccctggcataacgggacagtcggggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctcctgga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgaggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagtcctgggctacctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgatggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccggggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggcgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcatagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccacgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactactcg
tactggctgtccacccctgagcccatgcccatgtcaatggcacccatcacgggggaaaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtgatggccgtg
cacagccagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgcctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgcaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcctacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Homo sapiens (human): 1284
Entry
1284              CDS       T01001                                 
Symbol
COL4A2, BSVD2, ICH, POREN2
Name
(RefSeq) collagen type IV alpha 2 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00839  Porencephaly
H00877  Brain small vessel disease
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1284 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1284 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1284 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1284 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    1284 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1284 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1284 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1284 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1284 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1284 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1284 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1284 (COL4A2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1284 (COL4A2)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1284 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1284
NCBI-ProteinID: NP_001837
OMIM: 120090
HGNC: 2203
Ensembl: ENSG00000134871
Pharos: P08572(Tbio)
UniProt: P08572 A0A024RDW8
Structure
Position
13:110307284..110513209
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttcctcgcccagagcgtcttggcgggtgtgaagaagtttgatgtg
ccgtgtggaggaagagattgcagtgggggctgccagtgctaccctgagaaaggtggacgt
ggtcagcctgggccagtgggcccccaggggtacaatgggccaccaggattacaaggattc
ccgggactgcagggacgtaaaggagacaagggtgaaaggggagcccccggagtaacggga
cccaagggcgacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggccgggctacgatggctgcaacggaacc
cagggagactcaggtccacaggggccccccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctaaagaggagcgcgac
agatatcggggtgaacctggagagcctggattggtcggtttccagggacctcccggccgc
cctgggcatgtgggacagatgggtccagttggagctccagggagaccaggaccacctgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgacgtagggcagccgggacccaacgggattccatcagacaccctccacccc
atcatcgcgcccacaggagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggactgaggggttaccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctatcaagggcctgatggaccccggggacccaagggagaagcc
ggagacccagggccccctggactacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcctcccaggtccccctggcctctccatcggagatggagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacgggggcccacctggacctgatggaaagcgagggcctccaggaccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggctgaaaggagcaaaaggaagagcaggc
ttccctgggcttcccggctcccctggagcccgcggaccaaaggggtggaaaggtgacgct
ggggaatgcagatgtacagaaggcgacgaagctatcaaaggtcttccgggactgccagga
cccaagggcttcgcaggcatcaacggggagccggggaggaaaggggacagaggagacccc
ggccaacacggcctccctgggttcccagggctcaagggagtgcctggcaacattggtgct
cccggacccaaaggagcaaaaggagattccagaacaatcacaaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgat
gggctcgatggattccccggcctcccaggccctcccggtgatggcatcaagggccctcca
ggggacccaggctatccaggaatacctggaacgaagggtactccaggagaaatgggcccc
ccaggactgggccttcccggcctcaaaggccaacgtggtttccctggagacgccggctta
cctggaccaccaggcttcctgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaagggccgttggaggtgacagacaggaggccatccagccaggt
tgcataggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtgcc
aaaggcctccgaggaatcccaggcttcgcaggagctgatggaggaccagggcccaggggc
ttgccaggagacgcaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcagtgggcctccctggcccagatggatccccaggtcccatcggcctg
ccagggccagatgggccccctggggaaaggggcctccctggagaagtcctgggagctcag
cccgggccacggggagatgctggtgtgcctggacagcctgggcttaaaggccttcccgga
gacagaggcccccctggattcagaggaagccaagggatgcctgggatgccagggctgaag
ggccagccaggcctcccaggaccttccggccagccaggcctgtatgggcctccaggactg
catggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggc
cgtgaaggtctgcctggtgatagaggggaccctggggacacaggcgctcctggccctgtg
ggcatgaaaggtctctctggtgacagaggagatgctggcttcacaggggagcaaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gatagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gctggtgagccaggttttaaaggcagccgaggggaccctgggcccccaggaccacctcct
gtcatcctgccaggaatgaaagacattaaaggagagaaaggagatgaagggcctatgggg
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccagggctgtca
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gtccccggcatccccggtttgccaggattccctggggtggctggcccccctggaattacg
ggattcccaggattcataggaagccggggtgacaaaggtgccccagggagagcaggcctg
tatggcgagattggcgcgactggtgatttcggtgacatcggggacactataaatttacca
ggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctggacttaaaggacaaacaggctttccagggctgactgggcctcca
gggtcgcagggagagctggggcggattggactgcctggtggcaaaggagatgatggctgg
ccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaagggctttccaggatccccaggttctgacatccacggagacccaggc
ttcccaggccctcctggggaaagaggtgacccaggagaggccaacacccttccaggccct
gtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggcccacctggg
agcccaggacttcaggggttccctggtatcacacccccttccaacatctctggggcacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccaccagggcca
ccaggttctgctgctcttcctggaagcaaaggtgacacagggaacccaggagctccagga
accccagggaccaaaggatgggccggggactccgggccccagggcaggcctggtgtgttt
ggtctcccaggagaaaaagggcccaggggtgaacaaggcttcatggggaacactggaccc
actggggcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggactgtgggagcccccgggattgcaggaatcccccagaagattgccgtccaa
ccagggacagtgggtccccaggggaggcgaggcccccctggggcaccgggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagctgggccccaa
ggaagaggtggtgtgtctgctgttcccggcttccggggagatgaaggacccataggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaccag
gagcccatgtgcccagtgggcatgaacaaactctggagtggatacagcctgctgtacttc
gagggccaggagaaggcgcacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctctctaccactgcgccgctgcccatgatgcccgtggccgag
gacgagatcaagccctacatcagccgctgttctgtgtgtgaggccccggccatcgccatc
gcggtccacagtcaggatgtctccatcccacactgcccagctgggtggcggagtttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggtggccaatcactg
gtgtcaccgggcagctgtctagaggacttccgcgccacaccattcatcgaatgcaatgga
ggccgcggcacctgccactactacgccaacaagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctccgccgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Homo sapiens (human): 1285
Entry
1285              CDS       T01001                                 
Symbol
COL4A3, ATS2, ATS3
Name
(RefSeq) collagen type IV alpha 3 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1285 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1285 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1285 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1285 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    1285 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1285 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1285 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1285 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1285 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1285 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1285 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1285 (COL4A3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1285 (COL4A3)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1285 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1285
NCBI-ProteinID: NP_000082
OMIM: 120070
HGNC: 2204
Ensembl: ENSG00000169031
Pharos: Q01955(Tbio)
UniProt: Q01955
Structure
Position
2:227164624..227314792
AA seq 1670 aa
MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPG
KDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGD
PGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLK
GFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcgcccggaccgcccccaggccgcaggtgctcctgctgccgctcctgctggtgctc
ctggcggcggcgcccgcagccagcaagggttgtgtctgtaaagacaaaggccagtgcttc
tgtgacggggccaaaggggagaagggggagaagggctttcctggaccccccggttctcct
ggccagaaaggattcacaggtcctgaaggcttgcctggaccgcagggacccaagggcttt
ccaggacttccaggactcacgggttccaaaggtgtaaggggaataagtggattgccagga
ttttctggttctcctggacttccaggcaccccaggcaataccgggccttacggacttgtc
ggtgtaccaggatgcagtggttctaagggtgagcaggggtttccaggactcccagggaca
ctgggctacccagggatcccgggtgctgctggtttgaaaggacaaaagggtgctcctgct
aaagaagaagatatagaacttgatgcaaaaggcgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacataaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttattgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagcaatgggcgagcctggacctcctggaccctcaggactgcctgga
gaatcatatggatctgaaaagggtgctcctggagaccctggcctgcagggaaaacccgga
aaagatggtgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggttaatgggtgaagatggcattaagggacagaaaggggacattggccctccaggattt
cgtggtccaacagaatattatgacacataccaggaaaagggagatgaaggcactccaggc
ccaccagggcccagaggagctcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcatcaaggcctggcctcagaggagcccctggatggccaggcctgaaa
ggaagtaaaggggaacgaggccgcccaggaaaggatgccatggggactcctgggtcccca
ggttgtgctggttcaccaggtcttccaggatcaccgggacctccaggaccgccaggtgac
atcgtttttcgcaagggtccacctggagatcacggactgccaggctatctagggtctcca
ggaatcccaggagttgatgggcccaaaggagaaccaggcctcctgtgtacacagtgccct
tatatcccagggcctcccggtctcccaggattgccagggttacatggtgtaaaaggaatc
ccaggaagacaaggcgcagctggcttgaaaggaagcccagggtccccaggaaatacaggt
cttccaggatttccaggtttcccaggtgcccagggtgacccaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgtcccaggtgacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccgggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaaaggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctgcaggaccagctggaccacctggctacggaccc
caaggagaacctggtctccagggcacgcaaggagttcctggagcccccggaccacccgga
gaagccggccctaggggagagctcagtgtttcaacaccagttccaggcccaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatccctggatccctgggg
aaatgtggagatcctggtcttccagggcctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaagggagaccaaggttttccaggtacaaaaggatcactg
ggttgtcctggaaaaatgggagagcctgggttacctggaaagccaggcctcccaggagcc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccaggtctccctgga
actccaggaaatgaagggcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcccaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaagaggtaaaacggggccaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccaggaatt
ccaggtcatcaaggtgaaatgggaccactgggtcaaagaggatatccaggaaatccggga
attttagggccaccaggtgaagatggagtgattgggatgatgggctttcctggagccatt
ggccctccagggccccctgggaacccaggcacaccagggcagagggggagccctggaatt
ccaggagtaaagggccagagaggaaccccaggagccaagggggaacaaggagataaagga
aatcccgggccttcagagatatcccacgtaataggggacaaaggagaaccaggtctcaaa
ggattcgcaggaaatccaggtgagaaaggaaacagaggcgttccagggatgccaggttta
aagggcctcaaaggactacccggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatcctggagaaccaggactgcgtggtataccaggaagcatggggaacatg
ggcatgccaggttctaaaggaaaaaggggaactttgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcatggtctccagggagataagggagagccaggttattcagaa
ggtacaaggccaggaccaccgggaccaacgggggatccaggactgccgggtgatatggga
aagaaaggagaaatggggcaacctggcccacctggacatttggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgat
ttgggttttaaaggaatcaaaggcctcctgggccctccaggaatcagaggccctccaggt
cttccaggatttccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgagggcccctccaggc
ccaagagggaaccctggtgctcaaggagccaaaggagacaggggagccccaggttttcct
ggcctcccgggcagaaaaggggccatgggagatgctggacctcgaggacccacaggcata
gaaggattcccagggccaccaggtctgcccggtgcaattatccctggccagacaggaaat
cgtggtccaccaggctcaagaggaagcccaggtgcgcctggtccccctggacctccaggg
agtcatgtaataggcataaaaggagacaaagggtctatgggccaccctggcccaaaaggt
ccacctggaactgcaggagacatgggaccaccaggtcgtctgggagcaccaggtactcca
ggtcttccaggacccagaggtgatcctggattccaggggtttccaggcgtgaaaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaaaagga
ccacctggtgtacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccagct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacgagaggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgt
ccagaggggacagtgccactctacagtgggttttcttttctttttgtacaaggaaatcaa
cgagcccacggacaagaccttggaactcttggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaattttgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatgaacatggctcccattactggcagagcc
cttgagccttatataagcagatgcactgtttgtgaaggtcctgcgatcgccatagccgtt
cacagccaaaccactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcattcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cctggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggggaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Homo sapiens (human): 1286
Entry
1286              CDS       T01001                                 
Symbol
COL4A4, ATS2, BFH, CA44
Name
(RefSeq) collagen type IV alpha 4 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1286 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1286 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1286 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1286 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    1286 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1286 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1286 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1286 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1286 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1286 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1286 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1286 (COL4A4)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1286 (COL4A4)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1286 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1286
NCBI-ProteinID: NP_000083
OMIM: 120131
HGNC: 2206
Ensembl: ENSG00000081052
Pharos: P53420(Tbio)
UniProt: P53420
Structure
Position
2:complement(226967360..227164488)
AA seq 1690 aa
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH
FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
RGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVKYS
NT seq 5073 nt   +upstreamnt  +downstreamnt
atgtggtctctgcacatagtactaatgaggtgctccttcagattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacattggtccttgtggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggtatgagtggc
cacaatggctcaagaggtgacccagggtttccaggaggaagaggagctcttggcccagga
ggccccctaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccgttaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggatct
tggggtgcaggaggaccggcaggtcccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggagtaaagggg
caaatgggagacccgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcctggaatg
gttggactgccaggaccaccaggacgcaagggagaatctggtattggggcaaaaggagaa
aaaggtattcctggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtgtgatatactgt
agtgttgggaaccccggaccacaaggaataaaaggcaaagttggtcccccaggaggaaga
ggcccaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggacccatgggc
ccccctggccctccaggacttcctgggaggcaggggagtaagggagacttggggctccct
ggctggcttggaacaaaaggtgacccaggacctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctctggaccacctggcaacaaaggggcgaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggacct
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccaggcaaagcaggacctgtggggcccccaggactgggatttcctggtccaccaggagag
cgaggccacccaggagttccaggccacccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcatggccctccaggt
tttgatggacctccaggtccgaagggatttccaggtccccaaggtgcccctgggctgagt
ggttcagatgggcataaaggcagacctggcacaccaggaacagcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccggctcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggacccccgggaaagaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcctaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccacctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtagcccaggtgctccaggtgggaaaggacagccgggagatgtg
gggcctcccgggccagctggaatgaaaggcctccccggactcccaggacggcctggggca
catggtcccccaggcctcccaggaatcccaggtccctttggagatgatgggctacctggt
cctccaggtccaaagggaccccgggggctgcctggtttcccaggttttcccggagaaaga
ggaaagcctggtgcagagggatgtcctggcgcaaagggagaacctggagagaagggcatg
tctggccttcctggagaccggggactgagaggggccaaaggagccataggacctcccgga
gatgaaggagaaatggctatcatttcacaaaagggaacacctggggaacctggacctcct
ggagatgatggattcccaggagaaagaggtgataaaggaactcccgggatgcaagggaga
agaggagagccgggaagatacggaccacctggatttcacagaggggaacctggtgagaaa
ggtcagccagggcctcctggacccccaggccctccaggctcaactggtctaagagggttc
attggttttccaggacttccaggtgaccagggtgagccaggttctccaggtccccctgga
ttttcaggaattgatggagcaagaggacctaaaggaaacaaaggtgaccctgccagtcac
tttggtccacctggtccaaagggtgagccaggtagccctggatgtccagggcattttgga
gcatccggagagcagggcttgcctggtattcaagggcccagaggatcacctggaaggcca
gggccacctggctcctctggaccaccagggtgcccaggtgatcacgggatgcctgggctg
aggggacagccaggagaaatgggagaccctgggccaagaggcctccagggggatccaggg
ataccaggtcctccgggaataaaaggtccctccggatcacctggcctgaacggcttgcat
ggattgaaaggtcagaaaggaactaaaggtgcttcaggtttgcatgatgtggggccacct
ggtccagtgggaatacctgggctaaaaggggagagaggagaccctgggagcccaggaatc
tctcctccaggtcctcgtggaaagaaaggtcccccaggacccccagggagttcaggacca
cctggtcctgcaggtgccacaggaagagctcctaaggacattcctgacccgggtccacct
ggagatcagggacctcctggtcctgatggcccaagaggagcacctgggcctccaggcctc
cctgggagtgttgaccttctgagaggggagccaggtgactgtggtctaccagggccacca
ggtccccctggcccaccaggccctccaggatacaaaggctttccaggatgtgatggaaaa
gatggccagaaaggaccagtgggattcccgggaccgcagggaccacatggatttcctggg
ccacctggagagaagggtttacctggacctccagggagaaaagggcccactggtcttccg
ggtcccagaggtgaaccggggccacctgcagatgtggatgactgtccccgaatcccaggc
cttcctggggcgccaggcatgagaggaccagaaggagccatggggctccctggaatgaga
ggcccctcaggaccagggtgcaaaggagagcctgggctggatggcaggaggggtgtggat
ggcgtccctgggtctcctgggcctcccggacgtaaaggtgacacaggagaagacggctac
cctggaggaccagggcctcctggtcccattggggatcctgggcccaaagggtttggccct
ggatacctcggtggcttcctcctggttctccacagtcagacggaccaggagcccacctgc
cccctgggcatgcccaggctctggactgggtatagtctgttatacctggaagggcaagag
aaagctcacaatcaagaccttggtctggcagggtcttgccttcccgtatttagcacgctg
ccctttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgacagatcc
tactggctggccagcgctgcgcccctccccatgatgccactctctgaagaggcgatccgc
ccctatgtcagccgctgtgcggtatgcgaggccccggcccaggcggtggcggtgcacagc
caggaccagtccatccccccatgtccgcagacctggaggagcctctggatcgggtattca
ttcctgatgcacacaggagctggggaccaaggaggagggcaggcccttatgtcacctggc
agctgcctggaagatttcagagcagcaccattccttgaatgccagggccggcagggaact
tgccactttttcgcaaataagtatagcttctggctcacaacggtgaaagcagacttgcag
ttttcctctgctccagcaccagacaccttaaaagaaagccaggcccaacgccagaaaatc
agccggtgccaggtctgcgtgaagtatagctag

KEGG   Homo sapiens (human): 1287
Entry
1287              CDS       T01001                                 
Symbol
COL4A5, ASLN, ATS, ATS1, CA54
Name
(RefSeq) collagen type IV alpha 5 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1287 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1287 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1287 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1287 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    1287 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1287 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1287 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1287 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1287 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1287 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1287 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1287 (COL4A5)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1287 (COL4A5)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1287 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1287
NCBI-ProteinID: NP_000486
OMIM: 303630
HGNC: 2207
Ensembl: ENSG00000188153
Pharos: P29400(Tbio)
UniProt: P29400 Q49AM6 A7MBN3
Structure
Position
X:108439838..108697545
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttgcctggatttccaggtccagaagggcctccggggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccacgatggggccccaggacctcaaggt
attcccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggacctaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccaaaaaggtgacactggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaaaggaaac
attgggttgcctgggttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttatgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgacggacagcctggggctcctgggcttccagggcctcctggccctgct
ggccctcacattcctcctagtgatgagatatgtgaaccaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggtttccctggacagaaaggggaaaaaggacaagct
ggtgcaactggtcccaaaggattaccaggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggcactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtccccctggtttcggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccggggaagcctggcttgcctggtaacccaggcagagatggtgatgtaggtcttcca
ggtgaccctggacttccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccctcctggg
ccacccggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggagcacttggtccaaaaggtgatcgtggtttccca
ggacctccgggtcctccaggacgcactggcttagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaagggggatccaggacctcctggacttgatgttccaggacccccaggt
gaaagaggcagtccagggatccccggagcacctggtcctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctacaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttata
ggacctcctggacttaaaggaaccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggaccttctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcccggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccccggccttccaggagaacctggtcctgtaggtggtggaggtcatcctgggcaacca
gggcctccaggcgaaaaaggcaaacccggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcacggtttccctggtgtgcagggtcccccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtctaccaggtccagaaggtcctccaggtctccctggaaatggaggtatt
aaaggagagaagggaaatccaggccaacctgggctacctggcttgcctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccgggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggattcccaggcatgaaaggacccagtggagta
cctggatcagctggccctgagggggaaccgggacttattggtcctccaggtcctcctgga
ttacctggtccttcaggacagagtatcataattaaaggagatgctggtcctccaggaatc
cctggccagcctgggctaaagggtctaccaggaccccaaggacctcaaggcttaccaggt
ccaactggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcgcaaaggagacccaggtctgccaggacagccaggtacccgtggtttggatggtccc
cctggtccagatggattgcaaggtcccccaggtccccctggaacctcctctgttgcacat
ggatttcttattacacgccacagccagacaacggatgcaccacaatgcccacagggaaca
cttcaggtctatgaaggcttttctctcctgtatgtacaaggaaataaaagagcccacggt
caagacttggggacggctggcagctgccttcgtcgctttagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctctct
accccagagcccatgccaatgagcatgcaacccctaaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccagctgtggtgatcgcagttcacagtcagacg
atccagattccccattgtcctcagggatgggattctctgtggattggttattccttcatg
atgcatacaagtgcaggggcagaaggctcaggtcaagccctagcctcccctggttcctgc
ttggaagagtttcgttcagctcccttcatcgaatgtcatgggaggggtacctgtaactac
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagtaaa
cctcagtcagaaacgctgaaagcaggagacttgaggacacgaattagccgatgtcaagtg
tgcatgaagaggacataa

KEGG   Homo sapiens (human): 1288
Entry
1288              CDS       T01001                                 
Symbol
COL4A6, CXDELq22.3, DELXq22.3, DFNX6
Name
(RefSeq) collagen type IV alpha 6 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H01209  Deafness, X-linked
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1288 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1288 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1288 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1288 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    1288 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1288 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1288 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1288 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1288 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1288 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1288 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1288 (COL4A6)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1288 (COL4A6)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1288 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1288
NCBI-ProteinID: NP_001838
OMIM: 303631
HGNC: 2208
Ensembl: ENSG00000197565
Pharos: Q14031(Tbio)
UniProt: Q14031
Position
X:complement(108155614..108439458)
AA seq 1691 aa
MLINKLWLLLVTLCLTEELAAAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGP
TGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLD
GITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPG
PRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGN
PGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTI
GAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSG
FCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSG
PKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE
KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIP
GSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEK
GLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSG
LPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISG
HPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKG
SVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRR
PMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPP
GFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGS
PGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDGKVGVSGDVGLPGAPGFPGVAGMRGE
PGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPA
GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGD
PGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGE
LGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPG
PLGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG
PPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQ
IPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTR
VSRCQVCMKSL
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgcttataaacaagttgtggctgctcctggttacgttgtgcctgaccgaggaactggca
gcagcgggagagaagtcttatggaaagccatgtgggggccaggactgcagtgggagctgt
cagtgttttcctgagaaaggagcgagaggacgacctggaccaattggaattcaaggccca
acaggtcctcaaggattcactggctctactggtttatcgggattgaaaggagaaaggggt
ttcccaggccttctgggaccttatggaccaaaaggagataagggtcccatgggagttcct
ggctttcttggcatcaatgggattccgggccaccctggacaaccaggccccagaggccca
cctggtctggatggctgtaatggaactcaaggagctgttggatttccaggccctgatggc
tatcctgggcttctcggaccacccgggcttcctggtcagaaaggatcaaaaggtgaccct
gtccttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggat
ggaatcactggcccacaaggagcacccggatttcctggagctgtaggacctgcaggacca
ccaggattacaaggtcctccagggcctcctggtcctcttggtcctgatgggaatatgggg
ctaggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcagga
cctccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatcc
aagggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggc
cttggaactactggagaaaagggagaaaagggagaaaagggaatccctggtttgccagga
cctaggggtcccatgggttcagaaggagtccaaggccctccagggcaacagggcaagaaa
gggaccctgggatttcctgggcttaatggattccaaggaattgagggtcaaaagggtgac
attggcctgccaggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaat
cctggagatcctggtgtacctggcctcccaggccttaaaggagatgaaggcatccaaggc
ctacgtggcccttctggtgtccctggattgccagcattatcaggtgtcccaggagcccta
gggcctcagggatttccagggctgaagggggaccaaggaaacccaggccgtaccacaatt
ggagcagctggcctccctggcagagatggtttgccaggcccaccaggtccaccaggccca
cctagtccagaatttgagactgaaactctacacaacaaagagtcagggttccctggtctc
cgaggagaacaaggtccaaaaggaaacctaggcctcaaaggaataaaaggagactcaggt
ttctgtgcttgtgacggtggtgttcccaacactggaccacccggggaaccaggcccacct
ggtccatggggtctcataggccttccaggccttaaaggagccagaggagatcgaggctct
gggggtgcacagggcccagcaggggctccaggcttagttgggcctctgggtccttcagga
cccaaaggaaagaagggggaaccaattctcagtacaatccaaggaatgccaggagatcgg
ggtgattctggctcccagggcttccgtggtgtaataggagaaccaggcaaggacggagta
ccaggtttaccaggtctgccaggccttccgggtgatggtggacagggcttcccaggtgaa
aaggggttacctggacttcctggtgaaaaaggccatcctggtccacctggcctcccagga
aatgggttaccaggacttcctggaccccgtgggcttcctggagataaaggcaaggatgga
ttaccgggacaacaaggccttcccggatctaagggaatcaccctgccctgtattattcct
gggtcatacggtccatcaggatttccaggcactcccggattcccaggccctaaagggtct
cgaggcctccctgggaccccaggccagcctgggtcaagtggaagtaaaggagagccaggg
agtccaggattggttcatcttcctgaattaccaggatttcctggacctcgtggggagaag
ggcttgcctgggtttcctgggctccctggaaaagatggcttgcctgggatgattggcagt
ccaggcttacctggttccaagggagccactggtgacatctttggtgctgaaaatggtgct
ccgggggaacaaggcctacaaggattaacagggcacaaaggatttcttggagactctggc
cttccaggactcaagggtgtgcacgggaagcctggcttactaggccccaaaggtgagcgg
ggcagccctgggacaccaggacaggtgggacagccaggcaccccaggatctagtggtcca
tatggcatcaagggcaaatctgggctcccaggagcaccaggcttcccaggcatctcagga
catcctggaaagaaaggaacaagaggcaagaaaggtcctcctggatcaattgtaaagaaa
gggctgccagggctaaaaggccttcctggaaatccaggcctagtaggactgaaaggaagc
ccaggctctccaggggtcgctgggttgccagccctctctggacccaagggagagaagggg
tctgttggattcgtaggttttccaggaataccaggtctgcctggtattcctggaacaaga
ggattaaaaggaattccaggatcaactggaaaaatgggaccatctggacgtgctggtact
cctggtgaaaagggagacagaggcaatccggggccagtcggaatacctagtccaagacgt
ccaatgtcaaacctttggctcaaaggagacaaaggctctcaaggctcagccggatccaat
ggatttcctgggccaagaggtgacaaaggagaggctggtcgacctggaccaccaggccta
cctggagctcctggcctcccaggcattatcaaaggagttagtggaaagccagggccccct
ggcttcatgggaatccggggcttacctggcctgaaggggtcctctgggatcacaggtttc
ccaggaatgccaggagaaagtggttcacaaggtatcagagggtcgcctggactcccagga
gcatctggtctcccaggcctgaaaggagacaacggccagacagttgaaatttccggtagc
ccaggacccaagggacagcctggcgaatctggttttaaaggcacaaaaggaagagatgga
ctaataggcaatataggcttccctggaaacaaaggtgaagatggaaaagttggtgtttct
ggagatgttggccttcctggagctccaggatttccaggagttgccggcatgagaggagaa
ccaggacttccaggttcttctggtcaccaaggggcaattgggcctctaggatcccccgga
ttaataggacccaaaggcttccctggatttcctggtttacatggactgaatgggcttccg
ggcaccaagggtacccatggcactccaggacctagtatcaccggtgtgcctgggcctgct
ggtctccctggacccaaaggagaaaaaggatatccaggaattggcatcggagctccaggg
aagccgggcctgagagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgct
ggtctccccggtgccccaggcatctccttgccctcactcatagcaggacagcctggtgac
cccgggcgaccaggcctagatggagaacgaggccgcccaggccccgctggacccccaggt
ccccctgggccatcctcgaatcaaggcgacaccggagaccctggcttccctggaattcct
ggacctaaagggcctaagggagaccaaggaattccaggtttttctggcctccctggagag
ctaggactgaaaggcatgagaggtgagcctggcttcatggggactccaggcaaggttggg
ccacctggagacccaggatttcccggaatgaaggggaaggcagggccaagaggctcttct
ggcctccaaggtgatcctggacaaacaccaactgcagaagctgtccaggttcctcctgga
cccttgggtctaccagggatcgatggcatccctggcctcactggggaccctggggctcaa
ggccctgtaggcctacaaggctccaaaggtttacctggcatccccggtaaagatggcccc
agtgggctcccaggcccacctggggctcttggtgatcctggtctgcctggactgcaaggc
cctccaggatttgaaggagctccagggcagcaaggccccttcgggatgcctggaatgcct
ggccagagcatgagagtgggctacacgttggtaaagcacagccagtcggaacaggtgccc
ccgtgtcccatcgggatgagccagctgtgggtggggtacagcttactgtttgtggagggg
caagagaaagcccacaaccaggacctgggctttgctggctcctgtctgccccgcttcagc
accatgcccttcatctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgat
aaatcttactggctctccactaccgcccctatccccatgatgcccgtcagccagacccag
attccccagtacatcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtg
cacagccaggacatcaccatcccgcagtgccccctgggctggcgcagcctctggattggg
tactctttcctcatgcacactgccgctggtgccgagggtggaggccagtccctggtctca
cctggctcctgcctagaggactttcgggccactcctttcatcgaatgcagtggtgcccga
ggcacctgccactactttgcaaacaagtacagtttctggttgaccacagtggaggagagg
cagcagtttggggagttgcctgtgtctgaaacgctgaaagctgggcagctccacactcga
gtcagtcgctgccaggtgtgtatgaaaagcctgtag

KEGG   Homo sapiens (human): 22798
Entry
22798             CDS       T01001                                 
Symbol
LAMB4
Name
(RefSeq) laminin subunit beta 4
  KO
K06245  laminin, beta 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa05145  Toxoplasmosis
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    22798 (LAMB4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    22798 (LAMB4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    22798 (LAMB4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    22798 (LAMB4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    22798 (LAMB4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    22798 (LAMB4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    22798 (LAMB4)
   05145 Toxoplasmosis
    22798 (LAMB4)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N
Other DBs
NCBI-GeneID: 22798
NCBI-ProteinID: NP_001304975
OMIM: 616380
HGNC: 6491
Ensembl: ENSG00000091128
Pharos: A4D0S4(Tdark)
UniProt: A4D0S4 B4DTV7 B7ZMJ6
Position
7:complement(108011662..108130361)
AA seq 1761 aa
MQFQLTLFLHLGWLSYSKAQDDCNRGACHPTTGDLLVGRNTQLMASSTCGLSRAQKYCIL
SYLEGEQKCFICDSRFPYDPYDQPNSHTIENVIVSFEPDREKKWWQSENGLDHVSIRLDL
EALFRFSHLILTFKTFRPAAMLVERSTDYGHNWKVFKYFAKDCATSFPNITSGQAQGVGD
IVCDSKYSDIEPSTGGEVVLKVLDPSFEIENPYSPYIQDLVTLTNLRINFTKLHTLGDAL
LGRRQNDSLDKYYYALYEMIVRGSCFCNGHASECRPMQKMRGDVFSPPGMVHGQCVCQHN
TDGPNCERCKDFFQDAPWRPAADLQDNACRSCSCNSHSSRCHFDMTTYLASGGLSGGVCE
DCQHNTEGQHCDRCRPLFYRDPLKTISDPYACIPCECDPDGTISGGICVSHSDPALGSVA
GQCLCKENVEGAKCDQCKPNHYGLSATDPLGCQPCDCNPLGSLPFLTCDVDTGQCLCLSY
VTGAHCEECTVGYWGLGNHLHGCSPCDCDIGGAYSNVCSPKNGQCECRPHVTGRSCSEPA
PGYFFAPLNFYLYEAEEATTLQGLAPLGSETFGQSPAVHVVLGEPVPGNPVTWTGPGFAR
VLPGAGLRFAVNNIPFPVDFTIAIHYETQSAADWTVQIVVNPPGGSEHCIPKTLQSKPQS
FALPAATRIMLLPTPICLEPDVQYSIDVYFSQPLQGESHAHSHVLVDSLGLIPQINSLEN
FCSKQDLDEYQLHNCVEIASAMGPQVLPGACERLIISMSAKLHDGAVACKCHPQGSVGSS
CSRLGGQCQCKPLVVGRCCDRCSTGSYDLGHHGCHPCHCHPQGSKDTVCDQVTGQCPCHG
EVSGRRCDRCLAGYFGFPSCHPCPCNRFAELCDPETGSCFNCGGFTTGRNCERCIDGYYG
NPSSGQPCRPCLCPDDPSSNQYFAHSCYQNLWSSDVICNCLQGYTGTQCGECSTGFYGNP
RISGAPCQPCACNNNIDVTDPESCSRVTGECLRCLHNTQGANCQLCKPGHYGSALNQTCR
RCSCHASGVSPMECPPGGGACLCDPVTGACPCLPNVTGLACDRCADGYWNLVPGRGCQSC
DCDPRTSQSSHCDQLTGQCPCKLGYGGKRCSECQENYYGDPPGRCIPCDCNRAGTQKPIC
DPDTGMCRCREGVSGQRCDRCARGHSQEFPTCLQCHLCFDQWDHTISSLSKAVQGLMRLA
ANMEDKRETLPVCEADFKDLRGNVSEIERILKHPVFPSGKFLKVKDYHDSVRRQIMQLNE
QLKAVYEFQDLKDTIERAKNEADLLLEDLQEEIDLQSSVLNASIADSSENIKKYYHISSS
AEKKINETSSTINTSANTRNDLLTILDTLTSKGNLSLERLKQIKIPDIQILNEKVCGDPG
NVPCVPLPCGGALCTGRKGHRKCRGPGCHGSLTLSTNALQKAQEAKSIIRNLDKQVRGLK
NQIESISEQAEVSKNNALQLREKLGNIRNQSDSEEENINLFIKKVKNFLLEENVPPEDIE
KVANGVLDIHLPIPSQNLTDELVKIQKHMQLCEDYRTDENRLNEEADGAQKLLVKAKAAE
KAANILLNLDKTLNQLQQAQITQGRANSTITQLTANITKIKKNVLQAENQTREMKSELEL
AKQRSGLEDGLSLLQTKLQRHQDHAVNAKVQAESAQHQAGSLEKEFVELKKQYAILQRKT
STTGLTKETLGKVKQLKDAAEKLAGDTEAKIRRITDLERKIQDLNLSRQAKADQLRILED
QVVAIKNEIVEQEKKYARCYS
NT seq 5286 nt   +upstreamnt  +downstreamnt
atgcaatttcaactgaccctttttttgcaccttgggtggctcagttactcaaaagctcaa
gatgactgcaacaggggtgcctgtcatcccaccactggtgatctcctggtgggcaggaac
acgcagcttatggcttcttctacctgtgggctgagcagagcccagaaatactgcatcctc
agttacctggagggggaacaaaaatgcttcatctgtgactctagatttccatatgatccg
tatgaccaacccaacagccacaccattgagaatgtcattgtaagttttgaaccagacaga
gaaaagaaatggtggcaatctgaaaatggtcttgatcatgtcagcatcagactggactta
gaggcattatttcggttcagccaccttatcctgacctttaagacttttcggcctgctgca
atgttagttgaacgttccacagactatggacacaactggaaagtgttcaaatattttgca
aaagactgtgccacttcctttcctaacatcacatctggccaggcccagggagtgggagac
attgtttgtgactccaaatactcggatattgaaccctcaacaggtggagaggttgtttta
aaagttttggatcccagttttgaaattgaaaacccttatagcccctacatccaagacctt
gtgacattgacaaacctgaggataaactttaccaagctccacacccttggggatgctttg
cttggaaggaggcaaaatgattcccttgataaatactactatgctctgtacgagatgatt
gttcggggaagctgcttttgcaatggccatgctagcgaatgtcgccctatgcagaagatg
cggggagatgttttcagccctcctggaatggttcacggtcagtgtgtgtgtcagcacaat
acagatggtccgaactgtgagagatgcaaggacttcttccaggatgctccttggaggcca
gctgcagacctccaggacaacgcttgcagatcgtgcagctgtaatagccactccagccgc
tgtcactttgacatgactacgtacctggcaagcggtggcctcagcgggggcgtgtgtgaa
gactgccagcacaacactgaggggcagcactgcgaccgctgcagacccctcttctacagg
gacccgctcaagaccatctcagatccctacgcgtgcattccttgtgaatgtgaccccgat
gggaccatatctggtggcatttgtgtgagccactctgatcctgccttagggtctgtggcc
ggccagtgcctttgtaaagagaacgtggaaggagccaaatgcgaccagtgcaaacccaac
cactacggactaagcgccaccgaccccctgggctgccagccctgcgactgtaaccccctt
gggagtctgccattcttgacctgtgatgtggatacaggccaatgcttgtgcctgtcatat
gtcaccggagcacactgcgaagaatgcactgttggatactggggcctgggaaatcatctc
catgggtgttctccctgtgactgtgatattggaggtgcttattctaacgtgtgctcaccc
aagaatgggcagtgtgaatgccgcccacatgtcactggccgtagctgctctgaaccagcc
cctggctacttctttgctcctttgaatttctatctctacgaggcagaggaagccacaaca
ctccaaggactggcgcctttgggctcggagacgtttggccagagtcctgctgttcacgtt
gttttaggagagccagttcctgggaaccctgttacatggactggacctggatttgccagg
gttctccctggggctggcttgagatttgctgtcaacaacattccctttcctgtggacttc
accattgccattcactatgaaacccagtctgcagctgactggactgtccagattgtggtg
aacccccctggagggagtgagcactgcatacccaagactctacagtcaaagcctcagtct
tttgccttaccagcggctacgagaatcatgctgcttcccacacccatctgtttagaacca
gatgtacaatattccatagatgtctatttttctcagcctttgcaaggagagtcccacgct
cattcacatgtcctggtggactctcttggccttattccccaaatcaattcattggagaat
ttctgcagcaagcaggacttagatgagtatcagcttcacaactgtgttgaaattgcctca
gcaatgggacctcaagtgctcccgggtgcctgtgaaaggctgatcatcagcatgtctgcc
aagctgcatgatggggctgtggcctgcaagtgtcacccccagggctcagtcggatccagc
tgcagccgacttggaggccagtgccagtgtaaacctcttgtggtcgggcgctgctgtgac
aggtgctcaactggaagctatgatttggggcatcacggctgtcacccatgtcactgccat
cctcaaggatcaaaggacactgtatgtgaccaagtaacaggacagtgcccctgccatgga
gaggtgtctggccgccgctgtgatcgctgcctggcaggctactttggatttcccagctgc
cacccttgcccttgtaataggtttgctgaactttgtgatcctgagacagggtcatgcttc
aattgtggaggctttacaactggcagaaactgtgaaaggtgtattgatggttactatgga
aatccttcttcaggacagccctgtcgtccttgcctgtgtccagatgatccctcaagcaat
cagtattttgcccattcctgttatcagaatctgtggagctcagatgtaatctgcaattgt
cttcaaggttatacgggtactcagtgtggagaatgctctactggtttctatggaaatcca
agaatttcaggagcaccttgccaaccatgtgcctgcaacaacaacatagatgtaaccgat
ccagagtcctgcagccgggtaacaggggagtgccttcgatgtttgcacaacactcagggc
gcaaactgccagctctgcaaaccaggtcactatggatcagccctcaatcagacctgcaga
agatgctcctgccatgcttccggcgtgagtcccatggagtgtccccctggtgggggagct
tgcctctgtgaccctgtcactggtgcatgtccttgtctgccgaatgtcacaggcctggcc
tgtgaccgttgtgctgatggatactggaatctggtccctggcagaggatgtcagtcatgt
gactgtgaccctaggacctctcaaagtagccactgtgaccagcttacaggccagtgtccg
tgtaaattaggttacggcgggaaacgttgcagtgagtgccaggaaaattattatggtgat
ccacctgggcgatgcattccatgtgattgtaacagggcaggtacccagaagcccatctgt
gatccagacacaggcatgtgccgctgccgggagggtgtcagcggccagagatgtgatcgc
tgtgcccggggacacagccaggaattccctacttgtcttcaatgtcacttgtgctttgat
cagtgggaccacaccatttcttccctctccaaagcggtgcaagggttaatgagactggct
gctaacatggaagataaaagagagaccctgcctgtctgtgaggcagacttcaaagacctc
agagggaacgtgtctgaaatagaaaggattttgaaacatcctgttttcccatctgggaaa
ttcttaaaagtcaaggattatcatgactctgttagaagacaaatcatgcagctaaatgaa
caactgaaagcagtgtatgaatttcaagatctgaaagatacaatagaaagagcaaagaat
gaagcagacctcttacttgaagaccttcaggaagaaattgatttgcaatccagtgtcctt
aatgcaagcattgcggactcctcagaaaacatcaagaaatattatcacatatcatcatct
gctgaaaagaaaattaatgaaactagttccaccattaatacctctgcaaatacaaggaat
gacttacttaccatcttagatacactaacctcaaaaggaaacttgtcattggaaagatta
aagcagattaagataccagatatccaaatattgaatgaaaaggtgtgcggagatccagga
aatgtgccatgtgtgcccttgccctgtggcggtgctctctgcacgggccggaaggggcac
aggaagtgtaggggtcccggctgtcacggctccctgaccctctcaacgaatgccctccaa
aaagcccaggaagcaaaatccattattcgtaatttggacaaacaggttcgtgggttgaaa
aatcagatcgaaagtataagtgaacaggcagaagtctccaaaaacaatgccttacagctg
agggaaaaactgggaaatataagaaaccaaagtgactctgaagaagaaaacatcaatctt
ttcatcaaaaaagtgaaaaactttttgttagaggaaaacgtgcctccagaagacatcgag
aaggttgcgaatggtgtgcttgacattcacctaccaattccatcccaaaatctaaccgat
gaacttgtcaaaatacagaaacatatgcaactctgtgaggattacaggacagatgaaaac
aggttaaatgaagaagcagatggagcccaaaagcttttggtgaaggccaaagcagctgag
aaagcagcaaatattctattaaatcttgacaaaacattgaaccagttacaacaagctcaa
atcactcaaggacgggcaaactctaccattacacagctgactgccaatataacaaaaata
aaaaagaatgtgctgcaggctgaaaatcaaaccagggaaatgaagagtgagctggagtta
gcaaagcagcgatcagggctggaggatggactttccctgctgcagaccaagttgcaaagg
catcaagaccacgctgtcaatgcgaaagttcaggctgaatctgcccaacaccaggctggg
agtcttgagaaggaatttgttgagctgaaaaaacaatatgctattctccaacgtaagaca
agcactacaggactaacaaaggagacattaggaaaagttaaacagctaaaagatgcggca
gaaaaattggctggagatacagaggccaagataagaagaataacagatttagaaaggaaa
atccaagatttgaatctaagtagacaagcaaaagctgatcaactgagaatattggaagat
caagttgttgccattaaaaatgaaattgttgaacaagaaaaaaaatatgctaggtgctat
agctag

KEGG   Homo sapiens (human): 2335
Entry
2335              CDS       T01001                                 
Symbol
FN1, CIG, ED-B, FINC, FN, FNZ, GFND, GFND2, LETS, MSF, SMDCF
Name
(RefSeq) fibronectin 1
  KO
K05717  fibronectin 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04810  Regulation of actin cytoskeleton
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa05100  Bacterial invasion of epithelial cells
hsa05135  Yersinia infection
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05205  Proteoglycans in cancer
hsa05222  Small cell lung cancer
Network
nt06135  Cytoskeletal regulation (viruses and bacteria)
nt06167  Human cytomegalovirus (HCMV)
  Element
N00393  ITGA/B-RhoGAP-RhoA signaling pathway
N00951  ITGA/B-RHOG-RAC signaling pathway
N01068  ITGA/B-FAK-RAC signaling pathway
N01070  ITGA/B-FAK-CDC42 signaling pathway
N01072  ITGA/B-RhoGEF-RhoA signaling pathway
N01080  ITGA/B-TALIN/VINCULIN signaling pathway
Disease
H01260  Glomerulopathy with fibronectin deposits
H02185  Spondylometaphyseal dysplasia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    2335 (FN1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    2335 (FN1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    2335 (FN1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    2335 (FN1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    2335 (FN1)
   05205 Proteoglycans in cancer
    2335 (FN1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    2335 (FN1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    2335 (FN1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    2335 (FN1)
   05100 Bacterial invasion of epithelial cells
    2335 (FN1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    2335 (FN1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    2335 (FN1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    2335 (FN1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2335 (FN1)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    2335 (FN1)
   04990 Domain-containing proteins not elsewhere classified [BR:hsa04990]
    2335 (FN1)
Membrane trafficking [BR:hsa04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    2335 (FN1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   2335 (FN1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   2335 (FN1)
Domain-containing proteins not elsewhere classified [BR:hsa04990]
 Fibronectin (FN) domain-containing proteins
  Fibronectin type I domain-containing proteins
   2335 (FN1)
SSDB
Motif
Pfam: fn3 fn1 fn2 Pur_ac_phosph_N NDNF
Other DBs
NCBI-GeneID: 2335
NCBI-ProteinID: NP_997647
OMIM: 135600
HGNC: 3778
Ensembl: ENSG00000115414
Pharos: P02751(Tchem)
UniProt: P02751 Q6MZM7 Q9UQS6
Structure
Position
2:complement(215360865..215436068)
AA seq 2477 aa
MLRGPGPGLLLLAVQCLGTAVPSTGASKSKRQAQQMVQPQSPVAVSQSKPGCYDNGKHYQ
INQQWERTYLGNALVCTCYGGSRGFNCESKPEAEETCFDKYTGNTYRVGDTYERPKDSMI
WDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCK
PIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSY
RIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHP
QPPPYGHCVTDSGVVYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAVTQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALC
HFPFLYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRI
GDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQ
PLQTYPSSSGPVEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP
GHLNSYTIKGLKPGVVYEGQLISIQQYGHQEVTRFDFTTTSTSTPVTSNTVTGETTPFSP
LVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDL
LPGRKYIVNVYQISEDGEQSLILSTSQTTAPDAPPDTTVDQVDDTSIVVRWSRPQAPITG
YRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVVIQQETTG
TPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPISRNT
FAEVTGLSPGVTYYFKVFAVSHGRESKPLTAQQTTKLDAPTNLQFVNETDSTVLVRWTPP
RAQITGYRLTVGLTRRGQPRQYNVGPSVSKYPLRNLQPASEYTVSLVAIKGNQESPKATG
VFTTLQPGSSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVV
SGLTPGVEYVYTIQVLRDGQERDAPIVNKVVTPLSPPTNLHLEANPDTGVLTVSWERSTT
PDITGYRITTTPTNGQQGNSLEEVVHADQSSCTFDNLSPGLEYNVSVYTVKDDKESVPIS
DTIIPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSV
GYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTWAP
PPSIDLTNFLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHESTP
LRGRQKTGLDSPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHS
RNSITLTNLTPGTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAATPTSLLISWD
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTGRGDSPA
SSKPISINYRTEIDKPSQMQVTDVQDNSISVKWLPSSSPVTGYRVTTTPKNGPGPTKTKT
AGPDQTEMTIEGLQPTVEYVVSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDVDVDSIK
IAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTAELQGLRPGSEYTVSVVALHDDM
ESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEIN
LAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETT
ITISWRTKTETITGFQVDAVPANGQTPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDN
ARSSPVVIDASTAIDAPSNLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREVVP
RPRPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPE
ILDVPSTVQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRRTTPPTTATPI
RHRPRPYPPNVGEEIQIGHIPREDVDYHLYPHGPGLNPNASTGQEALSQTTISWAPFQDT
SEYIISCHPVGTDEEPLQFRVPGTSTSATLTGLTRGATYNVIVEALKDQQRHKVREEVVT
VGNSVNEGLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCDSSR
WCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQ
KEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPI
ECFMPLDVQADREDSRE
NT seq 7434 nt   +upstreamnt  +downstreamnt
atgcttaggggtccggggcccgggctgctgctgctggccgtccagtgcctggggacagcg
gtgccctccacgggagcctcgaagagcaagaggcaggctcagcaaatggttcagccccag
tccccggtggctgtcagtcaaagcaagcccggttgttatgacaatggaaaacactatcag
ataaatcaacagtgggagcggacctacctaggcaatgcgttggtttgtacttgttatgga
ggaagccgaggttttaactgcgagagtaaacctgaagctgaagagacttgctttgacaag
tacactgggaacacttaccgagtgggtgacacttatgagcgtcctaaagactccatgatc
tgggactgtacctgcatcggggctgggcgagggagaataagctgtaccatcgcaaaccgc
tgccatgaagggggtcagtcctacaagattggtgacacctggaggagaccacatgagact
ggtggttacatgttagagtgtgtgtgtcttggtaatggaaaaggagaatggacctgcaag
cccatagctgagaagtgttttgatcatgctgctgggacttcctatgtggtcggagaaacg
tgggagaagccctaccaaggctggatgatggtagattgtacttgcctgggagaaggcagc
ggacgcatcacttgcacttctagaaatagatgcaacgatcaggacacaaggacatcctat
agaattggagacacctggagcaagaaggataatcgaggaaacctgctccagtgcatctgc
acaggcaacggccgaggagagtggaagtgtgagaggcacacctctgtgcagaccacatcg
agcggatctggccccttcaccgatgttcgtgcagctgtttaccaaccgcagcctcacccc
cagcctcctccctatggccactgtgtcacagacagtggtgtggtctactctgtggggatg
cagtggctgaagacacaaggaaataagcaaatgctttgcacgtgcctgggcaacggagtc
agctgccaagagacagctgtaacccagacttacggtggcaactcaaatggagagccatgt
gtcttaccattcacctacaatggcaggacgttctactcctgcaccacagaagggcgacag
gacggacatctttggtgcagcacaacttcgaattatgagcaggaccagaaatactctttc
tgcacagaccacactgttttggttcagactcgaggaggaaattccaatggtgccttgtgc
cacttccccttcctatacaacaaccacaattacactgattgcacttctgagggcagaaga
gacaacatgaagtggtgtgggaccacacagaactatgatgccgaccagaagtttgggttc
tgccccatggctgcccacgaggaaatctgcacaaccaatgaaggggtcatgtaccgcatt
ggagatcagtgggataagcagcatgacatgggtcacatgatgaggtgcacgtgtgttggg
aatggtcgtggggaatggacatgcattgcctactcgcagcttcgagatcagtgcattgtt
gatgacatcacttacaatgtgaacgacacattccacaagcgtcatgaagaggggcacatg
ctgaactgtacatgcttcggtcagggtcggggcaggtggaagtgtgatcccgtcgaccaa
tgccaggattcagagactgggacgttttatcaaattggagattcatgggagaagtatgtg
catggtgtcagataccagtgctactgctatggccgtggcattggggagtggcattgccaa
cctttacagacctatccaagctcaagtggtcctgtcgaagtatttatcactgagactccg
agtcagcccaactcccaccccatccagtggaatgcaccacagccatctcacatttccaag
tacattctcaggtggagacctaaaaattctgtaggccgttggaaggaagctaccatacca
ggccacttaaactcctacaccatcaaaggcctgaagcctggtgtggtatacgagggccag
ctcatcagcatccagcagtacggccaccaagaagtgactcgctttgacttcaccaccacc
agcaccagcacacctgtgaccagcaacaccgtgacaggagagacgactcccttttctcct
cttgtggccacttctgaatctgtgaccgaaatcacagccagtagctttgtggtctcctgg
gtctcagcttccgacaccgtgtcgggattccgggtggaatatgagctgagtgaggaggga
gatgagccacagtacctggatcttccaagcacagccacttctgtgaacatccctgacctg
cttcctggccgaaaatacattgtaaatgtctatcagatatctgaggatggggagcagagt
ttgatcctgtctacttcacaaacaacagcgcctgatgcccctcctgacacgactgtggac
caagttgatgacacctcaattgttgttcgctggagcagaccccaggctcccatcacaggg
tacagaatagtctattcgccatcagtagaaggtagcagcacagaactcaaccttcctgaa
actgcaaactccgtcaccctcagtgacttgcaacctggtgttcagtataacatcactatc
tatgctgtggaagaaaatcaagaaagtacacctgttgtcattcaacaagaaaccactggc
accccacgctcagatacagtgccctctcccagggacctgcagtttgtggaagtgacagac
gtgaaggtcaccatcatgtggacaccgcctgagagtgcagtgaccggctaccgtgtggat
gtgatccccgtcaacctgcctggcgagcacgggcagaggctgcccatcagcaggaacacc
tttgcagaagtcaccgggctgtcccctggggtcacctattacttcaaagtctttgcagtg
agccatgggagggagagcaagcctctgactgctcaacagacaaccaaactggatgctccc
actaacctccagtttgtcaatgaaactgattctactgtcctggtgagatggactccacct
cgggcccagataacaggataccgactgaccgtgggccttacccgaagaggacagcccagg
cagtacaatgtgggtccctctgtctccaagtacccactgaggaatctgcagcctgcatct
gagtacaccgtatccctcgtggccataaagggcaaccaagagagccccaaagccactgga
gtctttaccacactgcagcctgggagctctattccaccttacaacaccgaggtgactgag
accaccattgtgatcacatggacgcctgctccaagaattggttttaagctgggtgtacga
ccaagccagggaggagaggcaccacgagaagtgacttcagactcaggaagcatcgttgtg
tccggcttgactccaggagtagaatacgtctacaccatccaagtcctgagagatggacag
gaaagagatgcgccaattgtaaacaaagtggtgacaccattgtctccaccaacaaacttg
catctggaggcaaaccctgacactggagtgctcacagtctcctgggagaggagcaccacc
ccagacattactggttatagaattaccacaacccctacaaacggccagcagggaaattct
ttggaagaagtggtccatgctgatcagagctcctgcacttttgataacctgagtcccggc
ctggagtacaatgtcagtgtttacactgtcaaggatgacaaggaaagtgtccctatctct
gataccatcatcccagaggtgccccaactcactgacctaagctttgttgatataaccgat
tcaagcatcggcctgaggtggaccccgctaaactcttccaccattattgggtaccgcatc
acagtagttgcggcaggagaaggtatccctatttttgaagattttgtggactcctcagta
ggatactacacagtcacagggctggagccgggcattgactatgatatcagcgttatcact
ctcattaatggcggcgagagtgcccctactacactgacacaacaaacggctgttcctcct
cccactgacctgcgattcaccaacattggtccagacaccatgcgtgtcacctgggctcca
cccccatccattgatttaaccaacttcctggtgcgttactcacctgtgaaaaatgaggaa
gatgttgcagagttgtcaatttctccttcagacaatgcagtggtcttaacaaatctcctg
cctggtacagaatatgtagtgagtgtctccagtgtctacgaacaacatgagagcacacct
cttagaggaagacagaaaacaggtcttgattccccaactggcattgacttttctgatatt
actgccaactcttttactgtgcactggattgctcctcgagccaccatcactggctacagg
atccgccatcatcccgagcacttcagtgggagacctcgagaagatcgggtgccccactct
cggaattccatcaccctcaccaacctcactccaggcacagagtatgtggtcagcatcgtt
gctcttaatggcagagaggaaagtcccttattgattggccaacaatcaacagtttctgat
gttccgagggacctggaagttgttgctgcgacccccaccagcctactgatcagctgggat
gctcctgctgtcacagtgagatattacaggatcacttacggagagacaggaggaaatagc
cctgtccaggagttcactgtgcctgggagcaagtctacagctaccatcagcggccttaaa
cctggagttgattataccatcactgtgtatgctgtcactggccgtggagacagccccgca
agcagcaagccaatttccattaattaccgaacagaaattgacaaaccatcccagatgcaa
gtgaccgatgttcaggacaacagcattagtgtcaagtggctgccttcaagttcccctgtt
actggttacagagtaaccaccactcccaaaaatggaccaggaccaacaaaaactaaaact
gcaggtccagatcaaacagaaatgactattgaaggcttgcagcccacagtggagtatgtg
gttagtgtctatgctcagaatccaagcggagagagtcagcctctggttcagactgcagta
accaacattgatcgccctaaaggactggcattcactgatgtggatgtcgattccatcaaa
attgcttgggaaagcccacaggggcaagtttccaggtacagggtgacctactcgagccct
gaggatggaatccatgagctattccctgcacctgatggtgaagaagacactgcagagctg
caaggcctcagaccgggttctgagtacacagtcagtgtggttgccttgcacgatgatatg
gagagccagcccctgattggaacccagtccacagctattcctgcaccaactgacctgaag
ttcactcaggtcacacccacaagcctgagcgcccagtggacaccacccaatgttcagctc
actggatatcgagtgcgggtgacccccaaggagaagaccggaccaatgaaagaaatcaac
cttgctcctgacagctcatccgtggttgtatcaggacttatggtggccaccaaatatgaa
gtgagtgtctatgctcttaaggacactttgacaagcagaccagctcagggagttgtcacc
actctggagaatgtcagcccaccaagaagggctcgtgtgacagatgctactgagaccacc
atcaccattagctggagaaccaagactgagacgatcactggcttccaagttgatgccgtt
ccagccaatggccagactccaatccagagaaccatcaagccagatgtcagaagctacacc
atcacaggtttacaaccaggcactgactacaagatctacctgtacaccttgaatgacaat
gctcggagctcccctgtggtcatcgacgcctccactgccattgatgcaccatccaacctg
cgtttcctggccaccacacccaattccttgctggtatcatggcagccgccacgtgccagg
attaccggctacatcatcaagtatgagaagcctgggtctcctcccagagaagtggtccct
cggccccgccctggtgtcacagaggctactattactggcctggaaccgggaaccgaatat
acaatttatgtcattgccctgaagaataatcagaagagcgagcccctgattggaaggaaa
aagacagacgagcttccccaactggtaacccttccacaccccaatcttcatggaccagag
atcttggatgttccttccacagttcaaaagacccctttcgtcacccaccctgggtatgac
actggaaatggtattcagcttcctggcacttctggtcagcaacccagtgttgggcaacaa
atgatctttgaggaacatggttttaggcggaccacaccgcccacaacggccacccccata
aggcataggccaagaccatacccgccgaatgtaggtgaggaaatccaaattggtcacatc
cccagggaagatgtagactatcacctgtacccacacggtccgggactcaatccaaatgcc
tctacaggacaagaagctctctctcagacaaccatctcatgggccccattccaggacact
tctgagtacatcatttcatgtcatcctgttggcactgatgaagaacccttacagttcagg
gttcctggaacttctaccagtgccactctgacaggcctcaccagaggtgccacctacaac
gtcatagtggaggcactgaaagaccagcagaggcataaggttcgggaagaggttgttacc
gtgggcaactctgtcaacgaaggcttgaaccaacctacggatgactcgtgctttgacccc
tacacagtttcccattatgccgttggagatgagtgggaacgaatgtctgaatcaggcttt
aaactgttgtgccagtgcttaggctttggaagtggtcatttcagatgtgattcatctaga
tggtgccatgacaatggtgtgaactacaagattggagagaagtgggaccgtcagggagaa
aatggccagatgatgagctgcacatgtcttgggaacggaaaaggagaattcaagtgtgac
cctcatgaggcaacgtgttatgatgatgggaagacataccacgtaggagaacagtggcag
aaggaatatctcggtgccatttgctcctgcacatgctttggaggccagcggggctggcgc
tgtgacaactgccgcagacctgggggtgaacccagtcccgaaggcactactggccagtcc
tacaaccagtattctcagagataccatcagagaacaaacactaatgttaattgcccaatt
gagtgcttcatgcctttagatgtacaggctgacagagaagattcccgagagtaa

KEGG   Homo sapiens (human): 284217
Entry
284217            CDS       T01001                                 
Symbol
LAMA1, LAMA, PTBHS, S-LAM-alpha
Name
(RefSeq) laminin subunit alpha 1
  KO
K05637  laminin, alpha 1/2
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa05145  Toxoplasmosis
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
hsa05410  Hypertrophic cardiomyopathy
hsa05412  Arrhythmogenic right ventricular cardiomyopathy
hsa05414  Dilated cardiomyopathy
hsa05416  Viral myocarditis
Disease
H02464  Poretti-Boltshauser syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    284217 (LAMA1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    284217 (LAMA1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    284217 (LAMA1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    284217 (LAMA1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    284217 (LAMA1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    284217 (LAMA1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    284217 (LAMA1)
   05145 Toxoplasmosis
    284217 (LAMA1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    284217 (LAMA1)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    284217 (LAMA1)
   05414 Dilated cardiomyopathy
    284217 (LAMA1)
   05416 Viral myocarditis
    284217 (LAMA1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_B Laminin_N Laminin_II Laminin_G_3 3keto-disac_hyd
Other DBs
NCBI-GeneID: 284217
NCBI-ProteinID: NP_005550
OMIM: 150320
HGNC: 6481
Ensembl: ENSG00000101680
Pharos: P25391(Tbio)
UniProt: P25391
Position
18:complement(6941742..7117797)
AA seq 3075 aa
MRGGVLLVLLLCVAAQCRQRGLFPAILNLASNAHISTNATCGEKGPEMFCKLVEHVPGRP
VRNPQCRICDGNSANPRERHPISHAIDGTNNWWQSPSIQNGREYHWVTITLDLRQVFQVA
YVIIKAANAPRPGNWILERSLDGTTFSPWQYYAVSDSECLSRYNITPRRGPPTYRADDEV
ICTSYYSRLVPLEHGEIHTSLINGRPSADDLSPKLLEFTSARYIRLRLQRIRTLNADLMT
LSHREPKELDPIVTRRYYYSIKDISVGGMCICYGHASSCPWDETTKKLQCQCEHNTCGES
CNRCCPGYHQQPWRPGTVSSGNTCEACNCHNKAKDCYYDESVAKQKKSLNTAGQFRGGGV
CINCLQNTMGINCETCIDGYYRPHKVSPYEDEPCRPCNCDPVGSLSSVCIKDDLHSDLHN
GKQPGQCPCKEGYTGEKCDRCQLGYKDYPTCVSCGCNPVGSASDEPCTGPCVCKENVEGK
ACDRCKPGFYNLKEKNPRGCSECFCFGVSDVCSSLSWPVGQVNSMSGWLVTDLISPRKIP
SQQDALGGRHQVSINNTAVMQRLAPKYYWAAPEAYLGNKLTAFGGFLKYTVSYDIPVETV
DSNLMSHADVIIKGNGLTLSTQAEGLSLQPYEEYLNVVRLVPENFQDFHSKRQIDRDQLM
TVLANVTHLLIRANYNSAKMALYRLESVSLDIASSNAIDLVVAADVEHCECPQGYTGTSC
ESCLSGYYRVDGILFGGICQPCECHGHAAECNVHGVCIACAHNTTGVHCEQCLPGFYGEP
SRGTPGDCQPCACPLTIASNNFSPTCHLNDGDEVVCDWCAPGYSGAWCERCADGYYGNPT
VPGESCVPCDCSGNVDPSEAGHCDSVTGECLKCLGNTDGAHCERCADGFYGDAVTAKNCR
ACECHVKGSHSAVCHLETGLCDCKPNVTGQQCDQCLHGYYGLDSGHGCRPCNCSVAGSVS
DGCTDEGQCHCVPGVAGKRCDRCAHGFYAYQDGSCTPCDCPHTQNTCDPETGECVCPPHT
QGVKCEECEDGHWGYDAEVGCQACNCSLVGSTHHRCDVVTGHCQCKSKFGGRACDQCSLG
YRDFPDCVPCDCDLRGTSGDACNLEQGLCGCVEETGACPCKENVFGPQCNECREGTFALR
ADNPLGCSPCFCSGLSHLCSELEDYVRTPVTLGSDQPLLRVVSQSNLRGTTEGVYYQAPD
FLLDAATVRQHIRAEPFYWRLPQQFQGDQLMAYGGKLKYSVAFYSLDGVGTSNFEPQVLI
KGGRIRKQVIYMDAPAPENGVRQEQEVAMRENFWKYFNSVSEKPVTREDFMSVLSDIEYI
LIKASYGQGLQQSRISDISMEVGRKAEKLHPEEEVASLLENCVCPPGTVGFSCQDCAPGY
HRGKLPAGSDRGPRPLVAPCVPCSCNNHSDTCDPNTGKCLNCGDNTAGDHCDVCTSGYYG
KVTGSASDCALCACPHSPPASFSPTCVLEGDHDFRCDACLLGYEGKHCERCSSSYYGNPQ
TPGGSCQKCDCNPHGSVHGDCDRTSGQCVCRLGASGLRCDECEPRHILMETDCVSCDDEC
VGVLLNDLDEIGDAVLSLNLTGIIPVPYGILSNLENTTKYLQESLLKENMQKDLGKIKLE
GVAEETDNLQKKLTRMLASTQKVNRATERIFKESQDLAIAIERLQMSITEIMEKTTLNQT
LDEDFLLPNSTLQNMQQNGTSLLEIMQIRDFTQLHQNATLELKAAEDLLSQIQENYQKPL
EELEVLKEAASHVLSKHNNELKAAEALVREAEAKMQESNHLLLMVNANLREFSDKKLHVQ
EEQNLTSELIVQGRGLIDAAAAQTDAVQDALEHLEDHQDKLLLWSAKIRHHIDDLVMHMS
QRNAVDLVYRAEDHAAEFQRLADVLYSGLENIRNVSLNATSAAYVHYNIQSLIEESEELA
RDAHRTVTETSLLSESLVSNGKAAVQRSSRFLKEGNNLSRKLPGIALELSELRNKTNRFQ
ENAVEITRQTNESLLILRAIPKGIRDKGAKTKELATSASQSAVSTLRDVAGLSQELLNTS
ASLSRVNTTLRETHQLLQDSTMATLLAGRKVKDVEIQANLLFDRLKPLKMLEENLSRNLS
EIKLLISQARKQAASIKVAVSADRDCIRAYQPQISSTNYNTLTLNVKTQEPDNLLFYLGS
STASDFLAVEMRRGRVAFLWDLGSGSTRLEFPDFPIDDNRWHSIHVARFGNIGSLSVKEM
SSNQKSPTKTSKSPGTANVLDVNNSTLMFVGGLGGQIKKSPAVKVTHFKGCLGEAFLNGK
SIGLWNYIEREGKCRGCFGSSQNEDPSFHFDGSGYSVVEKSLPATVTQIIMLFNTFSPNG
LLLYLGSYGTKDFLSIELFRGRVKVMTDLGSGPITLLTDRRYNNGTWYKIAFQRNRKQGV
LAVIDAYNTSNKETKQGETPGASSDLNRLDKDPIYVGGLPRSRVVRRGVTTKSFVGCIKN
LEISRSTFDLLRNSYGVRKGCLLEPIRSVSFLKGGYIELPPKSLSPESEWLVTFATTNSS
GIILAALGGDVEKRGDREEAHVPFFSVMLIGGNIEVHVNPGDGTGLRKALLHAPTGTCSD
GQAHSISLVRNRRIITVQLDENNPVEMKLGTLVESRTINVSNLYVGGIPEGEGTSLLTMR
RSFHGCIKNLIFNLELLDFNSAVGHEQVDLDTCWLSERPKLAPDAEDSKLLPEPRAFPEQ
CVVDAALEYVPGAHQFGLTQNSHFILPFNQSAVRKKLSVELSIRTFASSGLIYYMAHQNQ
ADYAVLQLHGGRLHFMFDLGKGRTKVSHPALLSDGKWHTVKTDYVKRKGFITVDGRESPM
VTVVGDGTMLDVEGLFYLGGLPSQYQARKIGNITHSIPACIGDVTVNSKQLDKDSPVSAF
TVNRCYAVAQEGTYFDGSGYAALVKEGYKVQSDVNITLEFRTSSQNGVLLGISTAKVDAI
GLELVDGKVLFHVNNGAGRITAAYEPKTATVLCDGKWHTLQANKSKHRITLIVDGNAVGA
ESPHTQSTSVDTNNPIYVGGYPAGVKQKCLRSQTSFRGCLRKLALIKSPQVQSFDFSRAF
ELHGVFLHSCPGTES
NT seq 9228 nt   +upstreamnt  +downstreamnt
atgcgcgggggcgtgctcctggtcttgctgctgtgtgtcgccgcgcagtgccggcagaga
ggcctgtttcctgccattctcaatcttgccagcaatgctcacatcagcaccaatgccacc
tgtggcgagaaggggccggagatgttctgcaaacttgtggagcatgtgccaggtcggccc
gtccgaaacccacagtgccggatctgtgatggcaacagcgcaaaccccagagaacgccat
ccaatatcacatgccatagatggcaccaataactggtggcaaagtcccagcattcagaat
gggagagaatatcactgggtcacaatcactctggacttaagacaggtctttcaagttgca
tatgtcatcattaaagctgccaatgcccctcgacctggaaactggattttggagcgttct
ctggatggcaccacgttcagcccctggcagtattatgcagtcagcgactcagagtgtttg
tctcgttacaatataactccaagacgagggccacccacctacagggctgatgatgaagtg
atctgcacctcctattattccagattggtgccacttgagcatggagagattcatacatca
ctcatcaatggcagaccaagcgctgacgatctttcacccaagttgttggaattcacttct
gcacgatatattcgccttcgcttgcaacgcattagaacgctcaatgcagatctcatgacc
cttagccaccgggaacctaaagaactggatcctattgttaccagacgctattattattca
ataaaggacatttctgttggaggcatgtgtatctgctatggccatgctagtagctgccca
tgggatgaaactacaaagaaactgcagtgtcaatgtgagcataatacttgcggggagagc
tgtaacaggtgctgtcctgggtaccatcagcagccctggaggccgggaaccgtgtcctcc
ggcaatacatgtgaagcatgtaattgtcacaataaagccaaagactgttactatgatgaa
agtgttgcaaagcagaagaaaagtttgaatactgctggacagttcagaggaggaggggtt
tgcataaattgcttgcagaacaccatgggaatcaactgtgaaacctgtattgatggatat
tatagaccacacaaagtgtctccttatgaggatgagccttgccgcccctgtaattgtgac
cctgtggggtccctcagttctgtctgtattaaggatgacctccattctgacttacacaat
gggaagcagccaggtcagtgcccatgtaaggaaggttatacaggagaaaaatgtgatcgc
tgccaacttggctataaggattacccgacctgtgtctcctgtgggtgcaacccagtgggc
agtgccagtgatgagccctgcacagggccctgtgtttgtaaggaaaacgttgaggggaag
gcctgtgatcgctgcaagccaggattctataacttgaaggaaaaaaacccccggggctgc
tccgagtgcttctgctttggcgtttctgatgtctgcagcagcctctcttggcctgttggt
caggtaaacagtatgtccgggtggctggtcaccgacttgatcagtcccaggaagatcccg
tctcagcaagatgcactaggcgggcgccatcaggtcagcatcaacaacaccgcggtcatg
cagagactggctcccaagtactactgggcagcccccgaggcctaccttggaaataagctg
actgcgtttggcggattcctgaaatacacggtgtcctacgatattccggtagagacggta
gacagtaacctcatgtcgcatgctgacgtcatcattaagggaaacggactcactttaagc
acacaggctgagggtctgtcattgcagccttatgaagagtacctaaacgtggttagactt
gtgcctgaaaacttccaagattttcacagcaaaaggcagattgatcgtgaccagctgatg
actgtccttgccaatgtgacacatcttttgatcagagccaactacaattctgcaaaaatg
gctctttacaggttggagtccgtctctctggacatagccagctctaatgccatcgacctg
gtggtggccgctgatgtggagcactgtgaatgtccgcaaggctacacagggacctcctgt
gagtcgtgcctctctggctattaccgcgtggatggaatactctttggaggaatttgtcaa
ccctgtgaatgccacggccatgcagctgagtgtaatgttcacggcgtttgcattgcgtgt
gcgcacaacaccaccggcgtccactgtgagcagtgcttgcccggcttctacggggagcct
tcccgagggacacctggggactgccagccctgcgcctgccctctcaccatagcctccaac
aatttcagccccacctgccacctcaatgatggagatgaagtggtctgtgactggtgtgcc
ccgggctactcaggagcttggtgtgagagatgtgcagatggttactatggaaacccaaca
gtgcctggcgaatcttgtgttccctgtgactgcagcggcaacgtggacccctcggaggct
ggtcactgtgactcagtcaccggggagtgcctgaagtgcctggggaacacagatggcgcc
cactgtgaaaggtgtgctgacgggttctatggggacgctgtgacagccaagaactgccgc
gcctgtgaatgccatgtgaaaggctcccattctgccgtgtgccatcttgagaccgggctc
tgtgactgcaaaccaaacgtgactggacagcagtgtgaccagtgcttgcatggctattat
gggctggactcaggccatggctgccggccctgcaactgcagcgtggcaggctccgtgtca
gatggctgcacggatgaaggccagtgtcactgtgtcccaggtgtggcagggaaaaggtgt
gacaggtgtgcccatggcttctacgcctaccaggatggtagctgtacaccctgtgactgc
ccacacactcagaatacctgcgacccagaaactggagagtgtgtctgcccccctcacaca
cagggtgtgaagtgtgaagaatgtgaggatgggcactggggctacgatgcggaggtgggg
tgccaggcctgcaattgcagtctcgtggggtcgactcatcatcggtgcgatgtggtcacc
ggccattgccagtgcaagtcaaaatttggtggccgggcctgcgatcagtgttccttgggt
tacagagactttcccgactgtgttccctgtgactgtgacctgagggggacgtcgggggac
gcctgcaacctggagcagggtctctgcggctgtgtggaggaaaccggggcctgcccttgc
aaggaaaatgtctttggtcctcagtgcaacgaatgtcgagagggcaccttcgctctccgc
gcagacaaccccctgggctgcagcccgtgcttctgctccgggctgtcccacctctgctca
gagctggaggactacgtgaggaccccagtaacgctgggctccgatcagcctcttctgcgt
gtggtttctcagagtaacttgaggggcacgaccgagggggtttactaccaggcccccgac
ttcctgctggatgccgccaccgtccggcagcacatccgtgcagagccgttttactggcgg
ctgccgcagcagttccaaggagaccagctcatggcctatggtggcaaactgaagtacagc
gtggccttctattctttggatggcgtcggcacctccaattttgagcctcaagttctcatc
aaaggtggtcggatcagaaagcaagtcatttacatggatgcaccagccccagagaatgga
gtgagacaggaacaagaagtagcaatgagagagaatttttggaaatattttaactctgtt
tctgaaaaacctgtcacgcgagaggattttatgtctgtcctcagcgatattgagtacatc
ctcatcaaggcatcgtatggtcaaggattacagcagagcagaatctcagacatttcaatg
gaggttggcagaaaggctgaaaagctgcacccagaagaagaggttgcatctcttttagag
aattgtgtctgtcctcctggcactgtgggattctcatgtcaggactgcgcccctgggtac
cacagagggaagctcccagcagggagtgacaggggaccacgccctctggttgctccttgt
gttccctgcagttgcaacaaccacagtgacacctgtgaccccaacaccgggaagtgtctg
aactgtggcgataacacagcaggtgaccattgtgatgtgtgtacttctggctactacggg
aaggtgactggctcagcaagtgactgtgctctgtgtgcctgtcctcacagccctcctgcc
agttttagtcccacttgtgtcttggaaggggaccacgatttccgttgtgacgcctgtctc
ctgggctatgaaggaaaacactgtgaaaggtgctcctcaagctattatgggaaccctcaa
acaccaggtggcagttgccagaagtgtgactgcaacccgcacggctctgtccacggtgac
tgtgaccgcacatctgggcagtgcgtttgcaggctgggggcctcggggctccggtgcgat
gagtgtgaaccgaggcacattctgatggaaacagattgtgtttcctgtgatgatgagtgt
gtaggtgtgctgctgaatgacttggatgagattggtgatgccgttctttctctgaacctc
actggcattatccctgtcccatatggaattttgtcaaacctggaaaatacaactaaatat
ctccaggaatctttattaaaagaaaatatgcaaaaggacctgggaaaaattaagcttgaa
ggtgttgcagaagaaacggacaacctgcaaaagaagctcactaggatgttagcgagtacc
caaaaggtgaatagggcaactgagagaatcttcaaggagagtcaagacctggccatagcc
attgagaggctgcagatgagcatcacagaaattatggaaaagacaactttaaatcagact
ttggatgaagatttcctactacccaattctactcttcagaacatgcaacagaatggtaca
tctttgctagaaatcatgcagataagagacttcacacagttgcaccaaaatgccaccctt
gaactcaaggctgctgaagatttattgtcacaaattcaggaaaattaccagaagccgctg
gaagaattggaggtattgaaagaagcagcaagccacgtcctttcaaagcacaacaatgaa
ctaaaggcggctgaggcgctcgtgagggaagctgaggcaaagatgcaggaaagcaaccac
ctgctgctcatggtcaatgctaatctgagagaattcagtgataaaaagctgcatgttcaa
gaagaacaaaatctgacctcagagctcattgtccaaggaagaggattgatagatgctgct
gctgcacaaacagatgctgtacaagatgctctagagcacttagaggatcaccaggataag
ctacttttatggtctgccaaaatcaggcaccacatagatgacctggtcatgcacatgtcc
caaaggaacgcagtcgacctggtctacagagctgaggaccatgccgctgagttccagaga
ctagcagatgttctgtacagtggccttgaaaacatcagaaatgtgtccctgaatgccacc
agtgcagcctatgtccattacaacatccagagcctgattgaagaatcggaggaactggcc
agagatgctcacaggactgtgactgagacgagcctgctctcagaatcccttgtttctaac
gggaaagcggccgtgcagcgcagctccagatttctaaaagaaggcaacaacctcagcagg
aagcttccaggtattgcattggaactgagtgaattgagaaataagacaaacagatttcaa
gagaatgctgttgaaattaccaggcaaaccaatgaatcactcttgatacttagagcaatt
cctaaaggtataagagacaagggagccaaaaccaaagagctggccacgtctgcaagccag
agcgcggtgagcacgctgagggacgtggcggggctgagccaggagctgctgaacacatct
gccagcctgtccagggtcaacaccacattacgagagacacaccagcttctgcaggactcc
accatggccactctgttggctggaagaaaagtcaaagacgtggaaattcaagccaacctt
ttgtttgatcggttgaagcctttgaagatgttagaggagaatctgagcagaaacctatca
gaaattaaactgttgatcagccaggcccgcaaacaagcagcttctattaaagtcgccgtg
tctgcagacagagattgcatccgggcctaccagcctcagatttcctctaccaactacaat
accttaacactaaatgttaagacacaggaacccgataatcttctcttctacctcggtagc
agcaccgcttctgatttccttgcagtggagatgcggcgagggagagtggccttcctgtgg
gacctgggctccgggtccacacgcttggagtttccagactttcccattgatgacaacaga
tggcacagtatccatgtagccagatttggaaacattggttcactgagtgtaaaggaaatg
agctcaaatcaaaagtcaccaacaaaaacaagtaaatcccctgggacagctaatgttctg
gatgtaaacaattcaacactcatgtttgttggaggtcttggaggacaaatcaagaaatct
cctgctgtgaaggttactcattttaaaggctgcttgggggaggccttcctgaatggaaaa
tccataggcctatggaactatattgaaagggaaggcaagtgccgtgggtgcttcggaagc
tcccagaatgaagacccttccttccattttgacgggagtgggtactctgtcgtggagaag
tcacttccggctaccgtgacccagataatcatgctttttaataccttttcacctaatgga
cttcttctctacctgggttcatacggcacaaaagactttttatccatcgagctgtttcgt
ggcagagtgaaggttatgactgacctgggttcaggacccattacccttttgacagacaga
cgttataacaatggaacctggtacaaaattgccttccagcgaaaccggaagcaaggagtg
ctagcagttatcgatgcctataacaccagtaataaagaaaccaagcagggcgagactccg
ggagcatcttctgacctcaaccgcctagacaaggacccgatttatgtgggtggattacca
aggtcaagagttgtaaggagaggtgtcaccaccaaaagctttgtgggctgcatcaagaac
ctggaaatatccagatcaacctttgacttactcagaaattcctatggagtgagaaaaggc
tgtttactggagcccatccggagtgttagcttcctgaaaggcggctacattgaattgcca
cccaaatctttgtcaccagaatcagaatggctggtaacatttgccaccacgaacagcagt
ggcatcatcctggctgccctcggcggggatgtggagaagcggggtgatcgtgaggaagca
cacgtgcccttcttttccgtcatgctgatcggaggcaacattgaggtacatgtcaatcct
ggggatgggacaggcctgagaaaagctctcctgcacgctcccacgggtacctgcagtgat
ggacaagcgcattccatctccttggtcaggaatcggagaattatcactgtccaattggat
gagaacaatcctgtggaaatgaagttgggcacattagtagaaagcaggacgataaatgtg
tccaatctgtacgtcgggggaattccagagggagaggggacgtcactgctcacaatgaga
agatcgttccatggctgtatcaaaaacctgatcttcaatttggaacttttggatttcaac
agtgcagttggccatgagcaagtcgacctggacacctgctggctgtcagaaaggcctaag
ctggctcccgatgcagaggacagcaagctcttgccagagccccgggcttttccagaacag
tgtgtggtggatgcagctctggagtacgttcccggcgctcaccagtttggtctcacacaa
aacagccatttcatcttgccttttaatcagtcggctgtcagaaagaagctctcggttgag
ctaagcatccgcacgttcgcctccagcggcctgatttactacatggctcatcagaaccaa
gcagactacgctgtgctccagctgcacgggggccgcctccacttcatgtttgaccttggc
aaaggcagaacaaaggtctctcaccctgcactgctcagtgatggcaagtggcacacggtc
aagacagactatgttaaaagaaaaggcttcataactgtcgacggccgagagtctcccatg
gtgactgtggtgggagatggaaccatgctggatgtggagggtttgttctacctaggaggc
ctgccctcccagtaccaggccaggaaaattggaaatatcacccacagcatccctgcctgc
attggggatgtgacggttaacagcaaacagctggacaaggacagcccggtgtctgccttc
acggtgaacaggtgctacgcagtggcccaggaaggaacatactttgacggaagcggatat
gcagctcttgtcaaagagggctacaaagtccagtcagatgtgaacatcacactggagttt
cgaacctcctcgcagaatggcgtcctcctggggatcagcactgccaaagtggatgccatt
ggactagagcttgtggacggcaaggtcttgttccatgtcaacaatggtgctggcaggata
acagctgcatatgagcccaaaaccgccactgtgctctgtgatggaaaatggcacactctt
caagctaacaaaagcaaacaccgtatcactctgattgttgacgggaacgcagttggcgct
gaaagtccacacacccagtctacctcagtggacaccaacaatcccatttatgttggtggc
tatcctgctggtgtgaagcaaaaatgcctgcgcagccagacctcgttccgcgggtgtttg
aggaagctagctctgattaagagcccgcaggtgcagtcctttgacttcagcagagcgttc
gaactgcacggagttttccttcattcctgtcctgggaccgagtcctga

DBGET integrated database retrieval system