KEGG   Physeter catodon (sperm whale): 102978798
Entry
102978798         CDS       T06011                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04820  Cytoskeleton in muscle cells
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102978798 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102978798 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102978798 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    102978798 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102978798 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    102978798 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102978798 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102978798 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102978798 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102978798 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102978798 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102978798 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102978798 (COL4A1)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102978798 (COL4A1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102978798 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102978798
NCBI-ProteinID: XP_023978941
Ensembl: ENSPCTG00005018947
UniProt: A0A2Y9SJX4
Position
13:complement(79089312..79243216)
AA seq 1668 aa
MGPRLGVWLLLPAALLLHEESSRAAAKGVCAGSGCGKCDCHGVKGQKGERGLPGLQGVIG
FPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPPG
IPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGFSGPP
GAPGLPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPPG
VPGQAQVREKGEYAAKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGEKGEKGSL
GFPGDSGYPGLPGREGLKGDKGEAGPPGPPGIVIGTGPLGEKGERGYPGAPGLKGEPGPK
GFPGIQGLPGPPGFPIPGLIGAPGFPGERGEKGDQGLPGVSLPGPGGRDGLPGPLGPPGP
PGQPGHTNGIVECQPGPPGDQGPPGSAGQPGLTGEVGEKGQKGESCLICDSTGLRGPPGP
QGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGVPGLMGQPGAKGEPGEIYFETRLKG
DKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDVG
PPGPPGFGPIGPIGDKGQMGFPGNPGSPGLPGPKGEEGKVTPLPGPPGVSGLPGSPGFQG
PQGDRGFPGTPGRPGLSGEKGSVGQPGIGFPGPPGPKGVDGLPGDTGPPGNPGRQGFNGL
PGNPGLPGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGNIGGPGIPGEHGAIGPAGLQGIR
GDPGPPGLQGPKGASGVPGIGPPGALGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPGP
KGDKGPQGLPGLTGQSGLPGIPGQQGTPGQPGFPGPKGEMGIMGTPGQPGSPGPAGVPGL
PGEKGDHGFPGTSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGP
SGDKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGI
PGEKGVPGLPGLQGIPGSPGEKGAKGEKGQEGLPGIGIPGRPGEKGDQGVAGFPGSPGEK
GEKGSSGIPGMPGSPGPKGSPGSAGYPGSPGLPGEKGDKGLPGLDGIPGIKGEAGLPGKP
GTTGPAGQKGEPGSDGFPGSAGEKGEPGLPGRGFPGFPGAKGEKGSKGDVGFPGLAGSPG
IPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGRKGDRGPQGQPGLPGLPGPMGPPGLPGL
DGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGVGGSPGATGAKGDMGPPGVPGFQGQKGL
PGLQGVKGDQGDQGFPGTKGLPGPPGPPGPYDIIKGEPGLPGPEGPAGLKGLQGPPGPKG
QQGVTGSVGLPGPPGIPGFDGAPGQKGEAGPFGPPGPRGFPGPPGPDGLPGSMGPPGTPS
VDHGFLVTRHSQTTDEPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMP
FLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGEGIRPFISRCTVCEAPAMVMAVH
SQTIQIPQCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGT
CNYYANAYSFWLATIDRSQMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5007 nt   +upstreamnt  +downstreamnt
atggggccccggctcggcgtctggctgctgctgcccgccgccctcctgctccacgaggag
agcagccgggccgccgcgaagggtgtatgtgctggctctggctgcgggaaatgcgactgc
catggcgtaaagggacaaaagggagaaagaggtctcccagggttgcaaggtgtcatcggc
ttcccgggaatgcaaggacctgaggggccgcagggacccccgggacagaagggggacacc
ggagaaccaggactgccaggaactaaagggacgaggggacccccaggagcatctggttac
cctggaaacccaggacttcctggtattcctggccaagacggtcctccgggtcccccaggt
atcccaggatgcaacgggacgaagggtgagagagggcctctggggcctccgggtttgcct
ggattcgctggaaatcccggaccgccagggttaccgggaatgaagggggatccaggtgaa
atacttggccatataccagggaccctgttgaaaggtgaaagaggattttctggacccccc
ggagcacctggtttgccaggactgccagggctgcaaggtcctgttggccccccgggattc
actggaccaccaggtcccccaggccctcctggccctccaggtgaaaaggggcaaatgggc
ttgagttttcaagggccaaaaggtgacaagggtgatcaaggggtcagcgggccccccgga
gtaccaggacaagctcaagttcgagagaaaggagagtatgctgcaaaaggagagaagggc
caaaaaggtgaacctggatttcaggggatgccaggggttggagagaaaggtgaacccgga
aaaccaggaccccgaggaaaaccaggaaaagacggtgaaaaaggagaaaaagggagtcta
gggtttccgggggattcgggatacccaggactcccaggccgagagggtttaaagggagac
aaaggtgaagcaggccctcctggccctcctggaattgttatcggcacagggcccttggga
gagaagggagagcgggggtacccaggggctccagggttgaaaggggagccgggccccaaa
ggtttcccaggaatacaaggcctgccaggccctccaggcttcccgataccagggctgatt
ggtgcccccggcttccccggtgaaagaggagagaaaggtgaccagggcttgccaggcgtg
tccttgcccggaccaggtggaagggatgggctaccaggcccccttgggccccccggcccc
cctgggcagccaggccacacaaatggaattgtggaatgccagcccgggccgccaggtgac
cagggtcctcccggaagtgcggggcagccggggttgacaggcgaagttggagaaaaaggc
caaaaaggagaaagttgcctcatctgtgactcaacaggacttcgtgggcccccagggcca
cagggaccccccggagaaataggtttcccaggacagccaggggccaagggcgacagaggt
ttacccggcagggatggtctcgaaggattgcctggaccacaaggtgtgccagggctgatg
ggccagccaggagccaagggcgagcctggcgagatttacttcgaaactcgactcaagggc
gacaaaggagacccaggtttcccaggccagcccgggatgccaggcagagcaggctctccc
ggaagagacggccatccgggtctgcccggccccaaaggctccccgggttcagtaggatta
aaaggagaacgtggccccccgggaggagttggattccccggcagccgcggtgacgtcggc
cctcctgggcctccagggtttggccctattggccccattggtgacaaaggacagatgggc
tttccgggaaaccccgggtccccaggcctgccaggtcccaagggtgaagaaggaaaggtc
acgcccttacccggcccccctggagtctcaggcctgccggggtcccccggcttccaaggg
cctcaaggtgaccgaggttttcctggaaccccgggaaggccgggcctctctggagagaag
ggttcagtcggccagcctgggattggctttccagggcctcccggccccaaaggtgttgac
ggtttacctggagacactggacctcctggaaatcccggtcgccaaggttttaatggctta
cccggcaacccaggtctgcctggccaaaagggagagcctggagttggtctgccgggactc
aaaggtctgccaggactccctggcatccccggcacccctggagagaagggaaacatcggg
ggaccaggcattcccggagagcacggcgccatcggccctgcaggccttcagggaatcaga
ggtgacccgggacctcctggattgcaaggtcccaaaggagcttctggagtccccggaata
ggccctcctggagctttgggaccccctggaggacagggacccccagggtcatcaggcccc
cctggagtgaaaggagagaagggcttccccggattcccaggcctggacatgccgggcccc
aaaggagacaaagggccgcaggggctccccggcctgacgggacaatcggggctgcctggt
atccctggacagcagggcacacctggacagcccgggttcccaggtcccaagggagagatg
ggcatcatggggacccccgggcagcccggctcgccaggaccggcgggtgtgccaggattg
ccgggtgaaaaaggggaccacggcttcccgggcacctcgggacccaggggagaccctggc
ttcaagggagataaaggagatgtgggtcttcctggcaagccgggctccatggataaagtg
gacatgggcagcatgaagggccagaagggtgaccaaggagaaaaaggacaaatcggccca
agtggtgataaaggatcccggggagatcctggaaccccaggagtgcctggaaaggacggt
caggcaggacaacctgggcagccaggacctaaaggtgatccgggcataagtgggacccca
ggtgctccgggacttcctggacccaaaggatcggttggtggaatgggcctgccaggaata
cctggagaaaaaggtgtgcctggcctccctggcctgcagggcatccctggctcacctgga
gaaaagggagcaaaaggagagaaagggcaggagggtctgcctggcattggaattccagga
cggcccggggaaaagggagaccaaggggtagcaggttttccaggaagccctggagagaag
ggagagaaaggaagcagtgggatcccagggatgcccgggtctccaggccccaaaggctca
ccagggagtgctggctatccaggaagccctgggttgcctggagaaaaaggtgacaaaggc
ctcccgggattggatggtatccctggcatcaaaggagaagcaggtcttcctgggaagcct
ggtaccacaggcccggccggccagaaaggggagcccggcagtgatggattcccggggtca
gcaggagagaagggtgaaccaggtctacccggaagaggattcccagggtttccaggggcc
aaaggagagaaaggttcaaagggcgacgtgggtttcccaggcttagctgggagcccagga
attcctggatccaaaggagaacaaggattcatgggtcctccggggccacagggacagccg
ggattgccaggcaccccgggccacgcagtggaggggcgcaaaggagaccggggcccacag
ggacagcctggcctgccagggcttccgggacccatggggcctccagggctccctgggctt
gatgggctgaaaggtgacaagggaaacccaggctggccgggcactcctggggctccaggg
cccaagggagacccaggattccagggcatgccgggggtcggtggctctccaggagctaca
ggtgctaagggtgatatgggacctccaggagttccagggtttcaagggcagaaaggcctc
cctggccttcagggagttaaaggtgaccaaggagaccaaggtttccctggaactaaaggt
cttcctggccctccgggccccccgggtccatacgacatcatcaaaggggagccagggctc
cctggtcctgagggccccgcaggtctgaaagggcttcagggacctccaggccccaaagga
caacaaggtgtgacaggatctgtgggcttacctgggccaccaggtattcccgggtttgat
ggggcccctggccagaaaggagaggcaggaccctttggacctcctggtccaagaggcttc
ccgggtccacccggccctgatgggttgccgggatccatgggtcccccaggcaccccatct
gttgatcacggcttcctcgtgaccaggcacagtcagacaacagacgagccccagtgtcct
cccgggaccaaaatcctctaccatgggtactctctgctctacgtacaaggcaatgagcgg
gcacacggccaggacttgggcacggcgggcagctgtctgcgcaagttcagcacgatgccc
ttcctcttctgtaacatcaacaacgtctgcaacttcgcctcccgaaatgattactcgtac
tggctgtccaccccggagcctatgcccatgtccatggctcccatcaccggggagggcatc
aggcccttcatcagcaggtgtactgtgtgtgaggccccggccatggtcatggctgtgcac
agccaaaccatccagatcccgcagtgccccagcggctggtcctcgctatggattggctac
tcctttgtgatgcacaccagcgctggagccgaaggttctggccaagccctggcctctcct
gggtcctgtctggaagagttcagaagcgcccccttcatcgagtgccacggccgcgggact
tgcaattactatgcaaacgcttacagcttttggcttgccacgatagacagaagccagatg
ttcaagaagcccacgccgtccaccctgaaggctggagagctgcgcacccacgtcagtcgc
tgtcaagtgtgcatgcggagaacataa

DBGET integrated database retrieval system