KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100386176Help
Entry
100386176         CDS       T03264                                 

Gene name
COL1A1
Definition
(RefSeq) collagen type I alpha 1 chain
  KO
K06236  collagen, type I, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04611  Platelet activation
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100386176 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100386176 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100386176 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100386176 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100386176 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100386176 (COL1A1)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100386176 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100386176 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100386176 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100386176 (COL1A1)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100386176 (COL1A1)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen COLFI VWC
Motif
Other DBs
NCBI-GeneID: 100386176
NCBI-ProteinID: XP_008988203
Ensembl: ENSCJAG00000003818
Position
X
AA seq 1465 aa AA seqDB search
MFSFVDLRLLLLLAAAALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRALVEKPEPCR
ICVCDNGKVLCDDVICDETKNCPGAEIPEGECCPVCPDGSEATTDRETTGVEGPKGDTGP
RGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPG
PMGPSGPRGLPGPPGSPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPG
RPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPG
QMGPRGLPGERGRPGPPGPAGARGNDGATGAAGPPGPTGPAGPAGFPGAVGAKGEAGPQG
PRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSG
PQGPSGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTG
LPGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKG
LTGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERG
VPGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPG
EQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPG
SQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPG
DKGETGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAG
PPGPAGPAGPPGPIGNVGAPGPKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPG
PAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRG
VVGLPGQRGERGFPGLPGPSGEPGKQGPSGTSGERGPPGPMGPPGLAGPPGESGREGAPG
AEGSPGRDGSPGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIG
PVGSRGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAG
PRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSGGFDFS
FLPQPLKRRAHDGGRYYRADDANVVRDRDLEVDTTLKXLSQQIENIRSPEGSRKNPARTC
RDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPNVAQKNWYISKNPK
DKRHVWFGESMTDGFQFEYGGEGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQ
QTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPI
IDVAPLDIGAPDQEFGFDVGPACFL
NT seq 4398 nt NT seq  +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttagcggccgccgccctcctgacg
cacggccaagaggaaggccaagtcgagggccaagacgaagacatcccaccaatcacctgc
gtacagaacggcctcaggtaccatgaccgagccttagtagagaaacccgagccctgccgg
atctgcgtctgcgacaacggcaaggtgttgtgtgatgacgtgatctgtgatgagaccaag
aactgccccggtgccgaaatccccgagggcgagtgctgtcccgtctgccccgacggctca
gaggccaccaccgaccgagaaaccaccggcgtcgagggccccaagggagacactggcccc
cgaggcccaaggggacccgcaggcccccctggccgagatggcatccccggacagcctgga
cttcccggaccccccggaccccccggacctcccggaccgcctggcctcggaggaaacttt
gctccccagctgtcttatggctatgatgagaaatcaacaggaggaatttccgtgcctggc
cctatgggtccctctggtcctcgtggtctccctggcccccctggttcacctggtccccaa
ggcttccaaggtccccctggtgagcctggcgagcctggagcttcaggtcccatgggtccc
cgaggtccccccggtccccctggaaagaacggagatgatggggaagctggaaaacctggt
cgtcctggtgagcgtgggcctcctgggcctcagggtgctcgaggattgcctggaacagct
ggcctccctggaatgaagggacacagaggcttcagtggtttggatggtgccaagggagat
gctggtcctgctggtcccaagggtgagcctggcagccctggtgaaaatggagctcctggt
cagatgggcccccgtggtctgcctggtgagagaggtcgccctggaccccctggccctgct
ggtgctcgtggaaatgatggtgctactggtgctgctggcccccctggtcccactggcccc
gctggtcctgctggcttccctggtgctgttggtgctaagggtgaagctggtccccaaggg
ccccgaggctctgaaggtccccagggtgtgcgtggtgagcctggcccccctggccctgct
ggtgctgctggccctgctggaaaccctggtgctgacggacagcctggtgctaaaggtgcc
aatggtgctcctggtattgctggtgctcctggcttccctggtgcccgaggcccctctgga
ccccagggccccagcggccctcccggtcccaagggtaacagtggtgaacctggtgctccc
ggcagcaaaggagacactggtgccaaaggagaacccggccctgttggtgttcaaggaccc
cctggccctgctggagaggaaggaaagcgaggagcccgaggtgaacccggacccactggc
ctgcccggaccccctggcgaacgcggtggacctggtagccgtggtttccctggcgccgat
ggtgttgctggtcccaagggtcccgctggtgaacgtggttctcccggccctgctggcccc
aaaggatctcctggtgaagctggtcgtcccggtgaagctggcctgcctggtgccaagggt
ctgactggaagccctggcagccccggtcctgatggcaaaactggcccccctggtcccgct
ggtcaagatggtcgccccggacccccaggccctcctggtgcccgtggtcaggctggtgtg
atgggatttcctggacctaaaggtgctgctggagagcccggcaaggctggagagcgaggt
gttcctggaccccctggtgctgttggtcctgctggcaaagatggagaggctggagctcag
ggaccccctggccctgctggtcccgctggtgagagaggtgaacaaggtcctgctgggtcc
cccggattccagggtctccctggacccgctggtcctcctggtgaagcaggcaaacctggt
gaacagggtgttcctggagaccttggtgcccccggcccctctggagcaagaggcgagaga
ggcttccctggcgagcgtggtgtgcaaggtccccctggtcctgctggtccccgtggggcc
aatggtgctcccggcaacgatggtgctaagggtgatgctggtgctcctggagcccccggt
agccagggtgcccctggccttcagggaatgcctggtgaacgtggtgcagctggccttcca
gggcctaagggtgacagaggtgatgctggtcccaaaggtgctgatggctctcctggcaaa
gatggcgtccgtggtctaactggccccattggtcctcctggccctgctggtgcccctggt
gacaagggtgaaaccggtcccagcggccctgctggtcccactggagctcgtggcgcccct
ggagaccgtggtgagcctggtccccctggccctgctggcttcgctggcccccccggtgct
gatggccaacctggtgctaaaggcgaacctggagatgctggtgctaaaggcgatgctggt
ccccctggccctgccggacccgctggcccccctggccccattggtaacgttggtgctcct
ggacccaaaggtgctcgcggcagcgctggtccccctggtgctactggtttccctggtgct
gctggccgagtcggtcctcctggcccttccggaaatgctggaccccctggccctcctggt
cctgctggcaaagaaggcggcaaaggtccccgtggtgagactggccctgctggacgtcct
ggtgaagttggcccccctggtccccctggccctgctggagagaaaggatcccctggtgct
gatggacctgctggtgctcctggtactcccggacctcaaggtattgctggacagcgtggt
gtggtcggcctgcctggtcagagaggagaaagaggcttccctggtcttcctggcccctct
ggtgaacctggcaaacaaggtccctctggaacaagtggtgaacgtggtccccctggtccc
atgggcccacctggattggccggaccccctggtgaatctggacgtgagggagctcctggc
gctgaaggttcccctggacgagatggttctcctggccccaagggtgaccgtggtgagact
ggccccgctggaccccctggtgctcctggtgctcctggtgcccctggccccgttggccct
gctggcaagagtggtgatcgtggtgagactggtcctgctggtcccgccggtcctatcggc
cctgttggctcccgtggccccgctggaccccaaggcccccgtggtgacaagggcgagact
ggtgaacagggtgacagaggcataaagggtcaccgtggcttctctggcctccagggtccc
cctggccctcccggctctcctggtgaacaaggtccctctggagcctctggtcctgctggt
ccccgaggtccccctggctctgctggtgctcctggcaaagatggactcaacggtctccca
ggccccatcgggccccctggtcctcgcggtcgcactggtgatgctggtcctgttggtccc
cccggccctcctggccctcctggtccccctggtcctcccagcggtggtttcgatttcagc
ttcttgccccagccactcaagaggagggctcacgatggtggccgctactaccgggctgat
gatgccaatgtggttcgtgaccgtgacctcgaggtggacaccaccctcaagnagctgagc
cagcagatcgagaacatccggagccctgagggcagccgcaaaaaccccgcccgcacctgc
cgcgacctcaagatgtgccactctgactggaagagcggagagtactggattgaccccaac
caaggctgcaacctggatgccatcaaggtcttctgcaacatggagactggagagacctgc
gtgtaccccactcagcccaacgtggcccagaagaactggtacatcagcaagaaccccaag
gacaagaggcacgtgtggtttggcgagagcatgaccgacggattccagttcgagtatggt
ggtgagggctccgaccctgccgatgtggccatccagctgaccttcctgcgcctgatgtcc
accgaagcctcccagaacatcacctaccactgcaagaacagcgtggcctacatggaccag
cagactggcaacctcaagaaggctctgctcctccagggctccaacgagatcgagatccgc
gccgagggcaacagccgcttcacctacagcgtcaccgtcgatggctgcacgagtcacact
ggagcctggggcaagacagtgatcgaatacaaaaccaccaagacctcccgcctgcccatc
atcgatgtggcccccttggacattggcgccccagaccaggaattcggcttcgacgttggc
cctgcctgcttcctgtaa

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100388461Help
Entry
100388461         CDS       T03264                                 

Gene name
COL4A4
Definition
(RefSeq) collagen type IV alpha 4 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100388461 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100388461 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100388461 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100388461 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    100388461 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100388461 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100388461 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100388461 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100388461 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100388461 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100388461 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100388461 (COL4A4)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100388461 (COL4A4)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100388461 (COL4A4)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4
Motif
Other DBs
NCBI-GeneID: 100388461
NCBI-ProteinID: XP_017829263
Position
6
AA seq 1411 aa AA seqDB search
MVLMRCSFRLTKALGTDPWSLILILFSVHYVYGSGKKYVGPCGGRDCSVCHCIPEKGSRG
PPGPPGPQGPIGPLGAPGPIGLSGEKGMKGDHGPPGAAGDKGDKGPTGVPGFPGLDGIPG
YPGPPGPRGKPGMSGYNGSRGDPGFPGGRGALGPEGPPGHPGEKGEKGNSAFILGAIKGI
QGDRGDPGLPGLPGSWGARGPAGPTGHPGEPGLVGAPGQPGRPGLKGNPGVGVKGQMGDP
GEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMIGLPGPPGRKGESGIGAKGEKGIPG
FPGPQGDPGSYGSPGFPGLKGELGRVGDPGLFGFLGPKGDPGERGHPGPPGVLVTPPLPL
KGPPGDPGFPGRYGETGDVGLLGPPGLSGRPGEACAGMTGPPGPQGFPGLPGIPGEAGIP
GRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVTYCSVGNPGPQGIKGKVGPPGGRGSKGE
KGNEGLCACEPGPMGPPGSPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGLPGKPG
ASGRPGNKGAKGDMVVSRVKGDKGERGPDGPPGFPGQLGLHGQDGRAGEKGDPGPPGDHE
DATPGGKGFPGPLGPPGKTGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKGQKGDT
VSCNVTYPGRPGPPGFDGPPGPKGFPGPQGAPGRSGSDGHKGRRGTAGTSEIQGPPGFRG
DMGDPGFGGERGSSPIGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPR
GDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPGLPGDP
GSPGAPGEKGQLGDAGPPGPAGMKGLPGLPGRPGAHGPPGLLGIPGPFGDDGLRGPPGPK
GPRGLPGFPGFPGERGKPGAEGCPGTKGEPGEKGMPGFPGDRGVRGAKGAIGPPGDEGEM
AIISKKGKPGEPGSPGDEGLPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGGKGQPGP
PGPPGRPGSPGPPGFSGIDGSRGPKGPMGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGL
PGPRGAPGPPADTDDCPRIPGLPGVPGLRGPEGAMGLPGMRGPPGPGCKGEPGLDGRRGV
DGIPGSPGPPGDRGDTGEDGCPGGPGPPGPTGDPGPKGLDPGYLSGFLLVLHSQSDQEPT
CPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPMFSTLPFAYCNIHQVCHYAQRNDR
SYWLASAAPLPMMPLSEEVIRPYISRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGY
SFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQGTCHFFANEYSFWLTTVKADL
QFSPAPAPDTLKESQAQRQKTSRCQVCMKYS
NT seq 4236 nt NT seq  +upstreamnt  +downstreamnt
atggtgctaatgaggtgctctttcagattgaccaaggccttgggcacagacccctggtca
cttatactcattctcttttctgtacattatgtgtatgggagtggaaagaaatacgttggt
ccttgtggaggaagagattgctctgtttgccattgcattcctgaaaagggatctcggggt
ccaccaggaccaccagggccacagggtccaattgggcccctgggagccccaggacccatt
ggactttcaggagagaaaggaatgaaaggggaccacggccctcctggagcagcaggggac
aaaggagataagggtccaactggtgttcctggatttccaggtttagatggcatacctggg
tacccagggcctcctggacccagaggcaaacctggtatgagtggctacaatggctctaga
ggtgacccagggtttccaggaggaagaggagctcttggcccagaaggtccccccggccat
cctggggaaaagggagaaaaaggaaattcagcgttcattttaggtgccattaaaggtatt
cagggagacagaggggacccaggactgcctggcttaccaggatcttggggtgcaagagga
ccggcaggccccacaggacatcctggagagccagggttagtgggagctccaggtcaacca
gggcgtccaggtttgaagggaaatcccggtgtgggagtaaaggggcaaatgggagacccg
ggtgaggttggccagcaaggttctcctggacccaccctgttggtagagccacctgacttt
tgtctctataaaggagaaaagggtataaaaggaattcctggaatgattggactgccagga
ccaccaggacgcaagggagaatctggtattggggcaaaaggagaaaaaggtattcctgga
tttccagggcctcagggggatcctggttcctatggatctccaggttttccgggattaaag
ggagaactaggacgggttggagatcctgggctatttggatttcttggcccaaagggggat
cctggagaacgagggcacccaggaccaccaggtgttttggtgactccacctcttccgctc
aaaggtcccccaggggacccagggttccctggccgctatggagaaacaggggatgttgga
ctacttggtcccccaggtctctcgggcagaccaggggaagcctgtgcaggcatgacagga
ccccctgggccacaaggatttcctggtcttcctgggattccaggagaagctggtattcct
gggagacctgattctgctccaggaaaaccagggaagccaggatcacctggcctgcctgga
gcaccaggcctgcagggcctcccaggatcaagtgtgacatactgtagtgttgggaaccct
ggaccacaaggaataaaaggcaaagtgggtcctccaggaggaagaggctcaaaaggagaa
aaaggaaatgaaggactctgtgcctgtgagcctggtcccatgggccctcctggctctcca
ggacttcctgggaggcaggggagtaagggagacctggggctccctggctggcttggaaca
aaaggtgacccaggacctcctggtgctgaaggacctccagggctaccaggaaagcctggt
gcctctggtcgacctggcaacaaaggggcaaagggtgacatggtcgtatcaagagttaaa
ggggacaaaggagaaagaggtcctgatgggcccccaggatttccagggcagctgggatta
catggtcaggatggacgtgctggagaaaaaggggatccaggacccccaggcgatcatgaa
gatgcgaccccaggtggtaaaggatttcctggacctctgggacctccgggcaaaacagga
cctgtggggcccccaggtctgggatttcctggtccaccaggagagcgaggccacccagga
gttccaggccacccaggtgtgaggggccctgatggcttgaagggtcagaaaggtgacaca
gtttcttgcaacgtaacctaccccgggaggccaggtcctccaggttttgatggacctcca
ggtccaaagggatttccaggtccccaaggtgcccctgggcggagcggttcagatgggcat
aaaggcagacgtggcacagcaggaacatcggaaatacaaggtccacctggtttccgtggt
gacatgggagatccgggttttggaggggaaagggggtcctcccctattgggcccccaggc
cctcccgggtcaccaggagtgaatggtcagaaaggaatcccgggagaccctgcatttggt
cacctgggacccccgggaaaaaggggtctttcaggagtgccagggataaagggacccaga
ggcgatccaggatgtccaggcgctgaagggccagctggcattcctggattcccaggtctc
aaaggtcccaaaggcagagagggacatgctgggtttccaggtgtcccaggtccacctggg
cattcctgtgaaagaggtgctccagggataccagggcaaccaggacttcctggggatccg
ggtagtccaggtgctccaggtgagaaaggacagctgggagatgcggggcctcctggacca
gctggaatgaagggtctgcctggactcccaggacgacctggggcacatggacccccaggg
ctcctaggaatcccaggtccctttggggatgacgggctacgtggtcctcccggtccaaag
ggaccccgggggctgcctggtttcccaggttttcctggagaaagaggaaagcctggtgca
gagggatgtcctggcacaaagggagaacctggagagaagggcatgcctggctttcctgga
gatcggggagtgagaggggccaaaggagccataggacctcccggagatgaaggagaaatg
gctatcatttccaaaaagggaaaacctggggaacctggatctcctggagatgagggactc
ccaggagaaagaggtgataaaggaactcccgggatgcaagggagaagaggagagccagga
agatatggaccacctggatttcacagaggggaacctggtgggaaaggtcagccagggcct
cctggacccccaggtcgtccaggttctccaggtcccccaggattttcaggaattgatgga
tcaagaggacctaaaggaccaatgggattcccagggccacagggaccacatggatttcct
gggccacctggagagaagggtttacctggacctccagggaggaaagggcccactggtctt
ccaggtcccagaggtgcaccggggccacctgccgacacggatgactgtccccgaatccca
gggcttcctggggtaccaggcttgagaggaccagaaggagccatggggctccctggaatg
agaggccccccaggaccagggtgcaaaggagagcctgggctggatggcaggaggggtgtg
gatggcatccctgggtctcctgggcctcccggagacagaggtgacaccggagaagacggc
tgccctggaggaccagggcctcctggtcctactggggatccggggcccaaagggcttgac
cctggataccttagtggcttcctcctggttctccacagtcagtcggaccaggagcccacc
tgccccctgggcatgcccaggctctggactgggtacagcctgttatacctggaagggcaa
gagaaagctcacaatcaagaccttggtctggcagggtcttgccttcccatgttcagcacg
ctgccttttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgacaga
tcctactggctggccagtgctgcacccctccccatgatgccactctctgaagaggtgatc
cgcccctacatcagtcgctgtgcggtatgtgaggccccggctcaggcagtggcagtgcac
agccaggaccagtccatcccgccatgtccacagacctggaggagcctctggattggatat
tcattcctgatgcacacaggagctggggaccaaggaggagggcaggccctcatgtcaccc
ggcagctgcctggaagatttcagagcagcaccattccttgaatgccaaggccggcaggga
acttgccacttttttgcaaatgagtatagtttctggctaacaacggtgaaagcagacttg
caattttcccctgctccagcaccagacaccttaaaagaaagccaggcccaacgccagaaa
accagccggtgccaggtctgcatgaaatatagctag

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100388823Help
Entry
100388823         CDS       T03264                                 

Gene name
COL4A3
Definition
(RefSeq) collagen type IV alpha 3 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100388823 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100388823 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100388823 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100388823 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    100388823 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100388823 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100388823 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100388823 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100388823 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100388823 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100388823 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100388823 (COL4A3)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100388823 (COL4A3)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100388823 (COL4A3)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4
Motif
Other DBs
NCBI-GeneID: 100388823
NCBI-ProteinID: XP_008997864
Ensembl: ENSCJAG00000012821
Position
6
AA seq 1666 aa AA seqDB search
MALRPQVLLLPLLLVLLAAASAAGKGCICKDKGQCFCDGAKGQKGEKGFPGPPGSPGQKG
FTGPEGLPGPQGPKGSPGLPGLTGAKGVRGITGLPGFSGSPGLPGTPGNTGPYGLTGIPG
CNGSKGEQGFPGLPGTPGYPGILGAAGLKGQKGAPAKGEDTELDAKGDPGLPGAPGPQGL
PGPPGFPGPVGPPGPPGFFGFPGAVGPRGPKGHMGDRVIGRKGERGVKGLTGPPGPPGTV
TVTLTGPDNRTNLKGEKGDRGAMGEPGPPGPSGLPGESYGSEKGAPGEPGPQGKPGKDGV
PGFPGSEGVKGNRGFPGLTGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDKGIPGPPGP
KGARGPQGPSGPPGVPGSPGPSRPGLRGVPGWPGLKGSKGERGPPGKDAMGTPGSPGCPG
SPGPTGSPGPPGPPGDIVFRKGPPGDRGLPGYLGSPGILGVDGPKGEPGLLCTQCPYIPG
PPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEKGETL
QPKGQVGAPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDRGPPGDPGSP
GFPGPAGPAGPPGYGPQGKPGPKGTQGIPGAPGPPGEAGPRGEVSVSTPVPGPPGPPGAP
GLAGPQGPPGIPGSMGKYGDPGLPGPDGEPGIPGIGFPGPPGPKGEQGFPGTKGSLGCPG
KMGEPGLPGKPGLPGDKGEPALAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPGTPGN
EGLDGPRGDPGQPGPPGEQGPQGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDPGIP
GLDRSGFPGETGSPGRPGHQGEMGPPGQKGYPGNPGILGPPGEDGMVGMMGFPGAIGPPG
PPGIPGMPGQRGSLGIPGVKGQRGTPGPKGEQGDKGNPGPSQISHLIGDKGEPGLKGFAG
NMGEKGNRGVPGLPGLKGLQGLPGPPGQPGPRGDLGSIGNPGEPGPRGVPGSMGNMGMPG
SKGKKGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMGKKGE
MGQPGPPGHSGPAGPEGVIGSPGSPGLPGKPGPHGDLGFKGIKGFTGPPGIKGPPGLPGF
PGSPGPMGIRGDQGRDGIPGPAGEKGETGLLGAPPGPRGNPGAPGAKGDRGAPGLPGLPG
RKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQKGNRGPAGSRGSPGEPGPSGPPGSHIK
GIKGDKGSMGHPGPKGPPGTVGDMGPPGHPGAPGTPGLPGLRGDPGFQGFPGVKGEKGNP
GFLGSIGPPGPIGPKGPPGIRGDPGTLQIISLPGSPGPPGTPGEPGMRGEPGPPGPPGNL
GPCGPRGKPGKDGKPGIPGPVGEKGNKGSKGEQGPPGSDGLPGLKGKPGDIGSPATWTTR
GFVFTRHSQTTAIPSCPEGTVPLYSGFSFLYVQGNQRAHGQDLGTLGSCLQRFTTMPFLF
CNINDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIVIAVHSQT
TDIPPCPHGWISLWTGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFLECHGRGTCNY
YSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRQ
NT seq 5001 nt NT seq  +upstreamnt  +downstreamnt
atggcgctcaggccgcaggtgctcctgctgccgctcctgctggtgctcctggcggcggcg
agcgcggccggcaagggctgtatctgtaaagacaaaggccagtgcttctgtgatggagcc
aaaggacagaagggggaaaaaggttttcctggaccccctggttctcctggccagaaagga
ttcacaggtcctgaaggcttgcctggaccacagggacccaagggttctccaggacttcca
ggactcactggtgccaaaggtgtaaggggaataactggattgccaggattttccggttct
cctggacttccaggcaccccaggcaatactggaccatacggacttactggtataccagga
tgcaatggttctaagggtgagcaggggtttccaggactcccagggacaccaggctaccca
gggatcctgggtgctgctggtttgaaagggcaaaagggtgctcctgctaaaggagaagat
acagaacttgatgcaaaaggtgatcccgggttgccaggggctccaggaccgcagggtttg
ccaggccctccaggttttcctggccctgttggcccacctggtcctccgggattctttggc
tttccaggagccgtgggacctagaggacctaagggtcacatgggtgacagagtgatagga
cgaaaaggagagcggggtgtgaaagggttaacaggacccccagggccaccaggaacagtt
actgtgaccctcactggcccagataacagaacgaacctcaagggggaaaagggagacagg
ggagcgatgggcgagcctggacctcctggaccctcaggactgcctggagaatcatatgga
tctgaaaagggtgctcctggagaacctggcccacagggaaaacctggaaaagatggtgtt
cctggcttccctggaagtgagggagtcaagggcaacaggggtttccctgggttaacgggt
gaagatggcattaagggacagaaaggggacattggccctccaggatttcgtggtccaaca
gaatattatgacacataccaggaaaagggagataaaggcattccaggcccgccagggccc
aaaggagctcgtggcccacaaggtcccagtggtcccccaggagttcctggaagtcctgga
ccatcaaggcctggcctcagaggagtccctggatggccaggcttgaaaggaagtaaaggg
gaacgaggccccccaggaaaggatgccatggggactcctgggtccccaggatgtcctggt
tcaccaggccctacaggatctccgggacctccaggaccaccaggtgacatcgtttttcgc
aagggtccacctggagatcgtggactgccaggctatctaggctctccaggaatcctcgga
gttgatgggcccaaaggagaaccaggtctcttgtgtacacagtgcccttacatcccaggg
cctcccggtctcccaggattgccagggttacatggcgtaaaaggaattccaggaagacaa
ggtgcagctggcttgaaaggaagtccagggtccccaggaaatacaggtcttccaggattt
ccaggattcccaggtgctcagggtgacccaggacttaaaggagaaaaaggtgaaacgctt
cagcctaaggggcaagtgggtgccccaggggacccggggctaagagggcaacctgggaga
aagggcttggatggaattcctggaactccgggagtgaaaggattaccaggacctaaaggc
gaactggctctgagtggtgagaagggggaccgaggtcctccaggggatcctggctctcct
gggttcccaggacctgcaggaccagctggaccaccgggctatggaccccaaggaaagcct
ggtcccaagggcactcaaggaattcctggagcccctggaccacctggagaagccggtcct
agaggagaagtcagtgtttcaacaccagttccaggcccaccaggacctcccggggcccct
ggccttgctggcccccaaggtccacccggtatccctggatccatggggaaatatggagat
cctggtcttccagggcctgatggtgaaccaggaattccaggaattggatttcctgggcct
cctggacctaagggagagcaaggatttccaggtacaaaaggatccctggggtgtcctgga
aaaatgggagagcctgggttacctggaaagccaggcctcccaggagacaagggagagcca
gcactagccatgcctggaggaccaggaacaccaggttttccaggagaaagaggcaattct
ggggaacatggagaaattggactccctggacttccaggtctccctggaactccaggaaat
gaagggcttgatggaccacgaggagatccagggcagcctggaccacctggagaacaagga
ccccaaggaaggtgcatagagggtccaagaggagcccaaggacttccaggcttaaacgga
ttgaaagggcaacaaggcagaagaggtaaaacagggccaaagggagacccaggaattcca
ggcttggatagatcaggatttcctggagaaactggatcaccaggaaggccaggccatcaa
ggcgagatgggaccaccaggtcaaaaaggatatccaggaaatccaggaattttagggcca
ccaggtgaagatggaatggttgggatgatgggctttcctggagccattggccctccaggg
ccccctgggatcccaggcatgccagggcagaggggaagccttggaattccaggagtaaag
ggccagagaggaaccccaggacccaaaggggaacaaggagataaaggaaatcccgggcct
tctcagatatcccacttaataggggacaaaggagaaccaggtctcaaaggattcgcagga
aatatgggtgagaaaggaaacaggggtgttccagggctgccaggtttaaaaggcctccaa
ggactacctggaccaccaggacaaccaggccccagaggagatttgggcagcattgggaat
cctggagaaccaggaccacgtggtgtgccagggagcatggggaacatgggtatgccaggt
tctaaaggaaaaaagggaactttgggattcccaggtcgagctggaagaccaggccttcca
ggtattcacggtctccagggagataagggagagccaggttattcagaaggtacaagacca
ggaccaccaggaccaacgggggatccaggactgccaggtgatatgggaaagaaaggagaa
atggggcaacctggcccacctggacattcggggcctgccggacctgagggagtcattgga
agtcctggaagtcctggccttccaggaaagccaggtcctcatggtgatttgggttttaaa
ggaatcaaaggcttcacgggccctccaggaatcaaaggccctccaggtcttccaggattc
ccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgatggaattcctggt
ccagccggagaaaagggagaaacaggtttattgggggcccctccaggcccaagagggaac
cctggtgctccaggagccaaaggagacagaggagccccaggtttgcctggcctccctggc
agaaaaggggccatgggagatgctgggcctcgagggcccacaggcattgaaggattccca
gggccaccaggtctgccgggtgcaatcatccctggccagaaaggaaaccgtggtccggca
ggctcaagaggaagcccaggtgagcctggtccctctggacctccagggagtcacataaaa
ggcataaaaggagacaaagggtctatgggccaccccggcccaaaaggtccacctggaact
gtaggagacatgggaccaccaggtcatccgggagcaccagggactccaggtcttccaggc
ctcagaggtgatcctggattccaggggtttcccggtgtgaaaggagaaaagggtaatcct
ggatttctaggatccattggacctccaggaccaattgggccaaaaggaccacctggtata
cgtggagaccctggcacccttcagattatctcccttccaggaagcccaggaccacctggc
acacctggagaaccagggatgcggggagaacctgggccaccagggccgcctggaaaccta
ggaccttgtgggccaagaggtaaaccaggcaaggatggaaaaccaggaattcctggacca
gttggagaaaaaggcaacaaaggttctaaaggagagcaaggaccacctggatcagacgga
ttacccggtttgaagggaaaacctggagacattggatcacctgcaacctggacgacgaga
ggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgtccagaagggaca
gtgccactctacagtgggttttcttttctttatgtacaaggaaatcaacgagcccacgga
caagaccttggaactcttggcagctgcctgcagcgatttaccacaatgccattcttattc
tgcaatatcaatgatgtatgtaatttcgcatctcgaaatgattattcatactggctgtca
acaccagctctgatgccaatgaacatggctcccattactggcagggcccttgagccttac
ataagcagatgcactgtctgtgaaggtcctgcgatcgtcatagccgttcacagtcaaacc
actgacattcctccatgtcctcatggctggatttctctctggacaggattttctttcatc
atgttcacaagtgcaggttctgagggtgctgggcaagcactggcctcccctggctcctgc
ctggaagaattccgagccagcccatttcttgaatgtcatggaagaggaacgtgcaactac
tattcaaattcctacagtttctggctggcttcattaaacccagaaagaatgttcagaaag
cctattccatcaactgtgaaagctggggaattagaaaaaataataagtcgctgtcaggtg
tgcatgaagaaaagacaatga

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100394494Help
Entry
100394494         CDS       T03264                                 

Gene name
COL3A1
Definition
(RefSeq) collagen type III alpha 1 chain
  KO
K19720  collagen, type III, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04611  Platelet activation
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100394494 (COL3A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100394494 (COL3A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100394494 (COL3A1)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100394494 (COL3A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100394494 (COL3A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100394494 (COL3A1)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100394494 (COL3A1)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen COLFI VWC
Motif
Other DBs
NCBI-GeneID: 100394494
NCBI-ProteinID: XP_002749597
Ensembl: ENSCJAG00000005210
UniProt: F7FL95
Position
6
AA seq 1465 aa AA seqDB search
MMSFVQKGSWLLLALLHPTVILAQQEAVEGGCSHLGQTYADRDVWKPEPCQICVCDSGSV
LCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRN
GDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLGGYPGPAGPPG
PPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGTMGPSGPAGKDGESGRPGRPG
ERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMG
PRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPG
SSGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLIGARGPPGPPG
ANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGTPGIPGAKGEDGKDGSPGEPGANGLPG
AAGERGAPGFRGPAGPNGVPGEKGPAGERGAPGPAGPRGAAGEPGRDGLPGGPGMRGMPG
SPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPG
GPGPQGPAGKNGETGPQGPPGPTGPGGDKGDTGPPGPPGLQGLPGTGGPPGENGKPGEPG
PKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGPAGPPGPPGTAG
SPGLQGMPGERGGPGSPGPKGDKGEPGGAGADGVPGKDGPRGPTGPIGPPGPAGQPGDKG
EGGAPGLPGIAGPRGGPGERGEPGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPG
VAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGNNGNPGPPGPGGSPG
KDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAG
PPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDG
LPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPSGKSGDRGETGPAGPAGAPGPAG
ARGAPGPQGPRGDKGETGERGANGIKGHRGFPGNPGAPGSPGPAGHQGAVGSPGPAGPRG
PVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGG
AAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMNSLKSVNGQIESLISPDGSRKNPARN
CRDLNFCHPELKSGEYWIDPNQGCKLDAIKVFCNMETGETCISASPSSVPQKHWWTDSGA
EKKHVWFGESMDGGFQFSYGNPELPEDILDVQLAFLRLLSSRASQNITYHCKNSIAYMDQ
ASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPI
VDIAPYDIGGPDQEFGVDVGPVCFL
NT seq 4398 nt NT seq  +upstreamnt  +downstreamnt
atgatgagctttgtgcaaaaggggagctggctacttcttgctctgcttcatcccactgtt
attttggcacaacaggaagctgtcgaaggaggatgttcccatcttggccagacctatgcg
gatcgagatgtctggaagccagaaccatgccaaatatgtgtctgtgactcaggatccgtt
ctctgcgacgacataatatgtgacgatcaggaattagactgccccaacccagaaattcca
tttggagaatgttgcgcagtttgcccacagcctccaactgctcctactcgccctcctaat
ggccaaggacctcaaggcccaaagggagatccaggccctcctggtattcctgggagaaat
ggtgaccctggtattccaggacaaccaggttcccctggttctcctggccctcctggaatc
tgtgaatcatgccctactggtcctcagaactattctccccagtatgattcatatgatgtc
aagtctggagtagcagtaggaggactcggtggctatcctgggccagctggccccccaggt
cctcccggtccccctggtacatctggtcatcctggttcccctggatctccaggataccaa
ggaccccctggtgaacctgggcaagctggtcctgcaggccctccaggacctcctggtact
atgggtccatctggtcctgctggaaaagatggagagtcaggtagaccaggacgacctgga
gagcgaggattgcctggacctccaggtattaaaggtccagctggaatacctggattccct
ggtatgaaaggacacagaggctttgatggacgaaatggagaaaagggtgaaacaggtgct
cctggcttaaagggtgaaaacggtcttccaggtgaaaatggagctcctggacccatgggt
ccaagaggggctcctggtgagagaggacggccaggacttcctggagccgcaggtgctcgg
ggtaatgacggtgctcgaggcagtgatggtcaaccaggccctcctggtcctcctggaact
gccggattccctggatcccctggtgctaagggtgaagttggacctgcaggctctcctggt
tcaagtggtgcccctggacaaagaggagaacctggacctcagggacatgctggtgctcaa
ggtcctcctggccctcctgggattaacggtagtcctggtggtaaaggcgaaatgggtcct
gctggcattcctggagctcctggactgatcggagccaggggtcctccaggaccacccggt
gctaatggtgctcctggactgcgaggtggtgcaggtgagcctggtaagaatggtgccaaa
ggagagccaggaccacgaggtgaacgtggtgaggctggtactccaggtattccaggagct
aaaggtgaagatggcaaggatggatcacctggagaacctggtgcaaatgggcttccagga
gctgcaggagaaaggggtgcccctgggttccgaggacctgctggaccaaatggtgttcca
ggagaaaagggtcccgctggagagcgtggtgctccaggacccgcagggcccagaggagca
gctggagaacccggcagagatggtctccctggaggtccaggcatgaggggcatgcccgga
agtccaggaggaccaggcagtgatgggaaaccagggcctcccggaagtcaaggagaaagt
ggtcgaccaggtcctcctgggccatctggtccccgaggtcagcctggtgtcatgggtttc
cccggtcctaaaggaaatgatggtgctcctggtaagaatggagaacgaggtggccctgga
ggacctggccctcagggtcctgctggaaagaatggtgaaactggacctcagggaccccca
gggccaactgggcctggtggtgacaaaggagacacaggaccccctggtccacctgggtta
caaggcttgcctggaacaggtggtcctccaggagaaaacggaaaacctggggaaccaggt
ccaaagggtgatgctggtgcacctggagctccaggaggcaagggtgacgctggtgcccct
ggcgaacgtggacctcctggattggcaggggccccaggacttagaggtggagctggtcct
cctggtcccgaaggaggaaagggacctgctggtcctcctgggccacctggtactgctggt
agtcctggtctgcaaggaatgcctggagaaagaggaggtcctggaagtcctggtccaaaa
ggtgacaagggtgaaccaggcggcgcaggtgctgatggtgtcccagggaaagatggtcca
aggggtcctactggtcctattggtcctcctggtccagctggccagcctggagataagggt
gaaggtggtgcccccgggcttccaggtatagctggacctcgtggtggccctggtgagaga
ggtgaacctggccctccaggacctgctggcttccctggtgctcctggacaaaatggtgaa
cctggtggtaaaggagaaagaggggctccaggtgagaaaggtgaaggaggccctcctgga
gttgcaggaccccctggaggttctggacctgctggtcctcctggtccccaaggcgtcaaa
ggtgaacgtggcagtcctggtggacccggtgctgctggcttccctggtgctcgtggtctt
cctggtcctcctggtaataacggtaacccaggacccccaggtcctggcggttctccaggc
aaggatgggcccccaggtcctgcaggtaacactggtgctcctggcagccccggagtgtct
ggaccaaaaggtgatgctggccaaccaggagaaaagggatcacctggtgcccagggcccc
ccaggagctccgggcccacttggaattgctggaattacaggagcacggggtcttgcagga
ccaccaggcatgccaggtcctaggggaagccctggccctcagggcgtcaagggtgaaagt
ggaaaaccaggtgctaatggtctcagtggagaacgtggtccccctggaccccagggtctt
cctggtttggctggtacagctggtgaacctggaagagatggaaaccctggatcagatggt
cttccaggccgagatggatctcctggtggcaagggtgatcgtggtgaaaatggttctcct
ggtgcccctggtgctcctggtcatccaggcccacctggtcctgtcggtccatccggaaag
agtggtgacagaggagaaactggtcctgctggccctgctggtgctcctggtcctgctggt
gcccgaggtgctcctggtcctcaaggcccacgtggtgacaaaggtgaaacaggtgaacgt
ggcgctaatggcatcaaaggacatcgaggattccctggtaatccaggtgccccaggctct
ccaggtcctgctggtcaccagggtgcagtcggtagtccaggacctgcaggccccagagga
cctgttggacccagtgggcctcctggcaaagatggaaccagtggacatccaggtcccatt
ggaccaccagggcctcgaggtaacagaggtgaaagaggatctgagggctccccaggccac
ccagggcaaccaggcccccctggacctcctggtgcccctggtccttgctgtggtggtgga
gccgctgccattgctgggattggaggtgaaaaagctggcggttttgccccatattatgga
gatgaaccaatggatttcaaaatcaacaccgatgagattatgaattcactcaagtctgtt
aacggacaaatagaaagcctcattagtcctgatggttctcgtaaaaaccctgctagaaac
tgcagagacctgaatttctgccatcctgaactcaagagtggagaatattggattgaccct
aaccaaggatgcaaattggatgctatcaaggtattctgtaatatggaaactggggaaaca
tgtataagtgccagtccttcgagtgttccacagaaacactggtggacagactctggtgct
gagaagaaacacgtttggttcggagagtccatggatggtggttttcagtttagctatggc
aatcctgaacttcctgaagatatccttgatgtgcagctggcattccttcgacttctctcc
agccgagcctcccagaacatcacatatcactgcaaaaatagcattgcatacatggatcag
gccagtggaaatgtaaagaaggctctgaagctgatggggtcaaatgaaggtgaattcaag
gctgaaggaaatagcaaattcacctacacggttctggaggatggttgcacgaaacacact
ggggaatggagcaaaacagtctttgagtatcgaacacgcaaggccgtgagactacctatt
gtagatattgcaccctacgacattggtggtcctgatcaagaatttggtgtggacgttgga
cctgtttgctttttataa

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100394953Help
Entry
100394953         CDS       T03264                                 

Gene name
COL4A6
Definition
(RefSeq) collagen type IV alpha 6 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100394953 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100394953 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100394953 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100394953 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    100394953 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100394953 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100394953 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100394953 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100394953 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100394953 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100394953 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100394953 (COL4A6)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100394953 (COL4A6)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100394953 (COL4A6)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4
Motif
Other DBs
NCBI-GeneID: 100394953
NCBI-ProteinID: XP_008987997
Ensembl: ENSCJAG00000007564
Position
X
AA seq 1705 aa AA seqDB search
MHPGLWLLLVTLCLTEELAGAGEKSYGKPCGGQDCSGSCKCFPEKGARGRPGPIGIQGPT
GPQGFAGPTGLSGLKGERGSPGPLGXDGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGTP
GLDGYNGTQGAVGFPGPSGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLDG
ITGPQGAPGSPGAVGPTGPPGLQGLPGPPGPPGPDGNMGLGFQGEKGVKGDVGLPGPAGP
PPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGIPGLPGPRGP
MGSEGVQGPPGKQGKKGSPGFPGLNGFQGIEGEKGDIGLPGPDVFIDIDGAVISGNPGDP
GVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTIGAAG
LPGRDGLPGPPGPPGPPSPEFETETLHNKEPGFPGLRGEQGPKGNPGLKGVKGDSGFCAC
DSGVPSNGLPGEPGPPGPQGLIGLPGLKGARGDRGSGGAQGPAGSPGFHGRPGLSGPKGK
KGEPTLGTISGMKGDQGDPGSQGFPGVTGERGKDGIPGLPGLPGLPGDGGQGFPGEKGLP
GLPGEKGQPGLPGLPGIGLPGLPGPRGLPGDKGKDGSPGQQGTPGLKGDCCCRETVGKGD
LDTERGITLPCIIPRSYGPSGFPGTPGFPGPKGSRGLPGTRGPPGSHGNKGKPGSPGLVH
LPELPGFPGPRGEKGLPGFPGLPGKDGLPGIIGSPGLPGSKGATGDIFGAENGAPGEQGL
QGLPGDKGLIGDSGLPGLKGLHGKPGLLGPKGERGSPGTPGQVGEPGTPGSSGPYGIKGK
SGLPGAPGFQGTSGHPGNKGTRGEKGLPGSLVKKGLPGLKGLPGNPGLIGQKGSPGSPGI
SGLPALPGLKGEKGSVGSLGFPGMPGFPGIPGARGLKGIPGSTGKIGQSGHPGTPGEKGD
RGNPGPVGIPSPRHPMSNLWLKGDKGSQGSAGSDGFPGPRGDKGEAGRPGPPGLPGAPGL
PGTIKGVSGKPGPPGFMGIQGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPG
LKGDNGRTLEISGSPGLKGQPGESGFKGAKGRDGLIGNMGFPGNKGKDGKVGVPGDVGLP
GAPGFPGTTGIRGEPGLPGSSGHQGAIGPPGLPGLIGPKGFPGFPGLHGLNGLPGTKGTH
GTPGPSITGVPGPAGIPGPKGEKGNPGIGIGAPGKPGQRGQKGDRGFPGLQGPAGLPGAP
GISLPSLIAGQPGDPGRPGLDGERGRLGPPGAPGPPGPSSNQGDTGDPGFPGIPGPQGPK
GDLGIPGFSGLPGELGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGPQGAP
GQTPTTEAVQVPPGPLGLPGIDGIPGLIGDPGIQGPVGLKGFKGLSGVPGKDGPNGLPGP
PGALGDPGLPGLQGPPGFEGAPGQQGPFGMPGMPGQSVRVGYTLVKHSQSEQVPLCPIGM
SQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLS
TTAPIPMMPVGQTQIAQYISRCSVCEAPSQAIAVHSQDITIPKCPLGWRSLWIGYSFLMH
TAAGAEGGGQSLVSPGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGES
PMSETLKAGQLHTRVSRCQVCMKSL
NT seq 5118 nt NT seq  +upstreamnt  +downstreamnt
atgcaccctgggttgtggctgctcctggttacgttgtgcttgaccgaggaactggcagga
gcgggagagaagtcttatggaaagccatgtgggggccaagactgcagtgggagctgtaag
tgctttcctgagaaaggagcgagagggcgacctggaccaattggaattcaaggtccaaca
ggtcctcaaggattcgctggccctactggtttatcgggattgaaaggagaaaggggttcc
ccaggccctctggganctgatggaccaaaaggagataagggtcccatgggagttcctggc
tttcttggcatcaatgggattccggggcaccctggtcagccaggccccagaggcacacct
ggtctggacggctataatggaactcaaggagctgttggatttccaggccctagtggctat
cctgggcttctcggaccacctgggcttcctggtcagaaaggatcaaaaggtgaccctgtc
cttgctccaggtagtttcaaaggaatgaagggggatcctgggctacctggacttgatgga
atcactggcccacaaggagcacccggatctcctggagctgtaggacccacaggaccacca
ggattacaaggtcttccagggcctcctggtcctcctggtcctgatgggaatatggggcta
ggttttcaaggagagaaaggagtcaagggggatgtaggcctccctggcccagcaggacct
ccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatccaag
ggtgaaccagggcctaagggttttccaggcataagtggccctccaggctttccgggcctt
ggaactactggagaaaagggagaaaagggaatccctggtttgccaggacctaggggtcca
atgggttcagaaggagtccaaggccctccagggaaacagggcaagaaggggtccccagga
tttcctgggcttaatggattccaaggaattgagggtgaaaagggtgacattggcctgcca
ggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaatcctggagaccct
ggtgtacccggcctcccaggccttaaaggagatgaaggcatccagggcctgcgtggccct
tctggtgtccctggcttgccagcattatcaggtgtcccaggagccctagggcctcaggga
tttccaggactgaagggggaccaaggaaacccaggccgtaccacaatcggggcagctggc
ctccctggcagagatggtttgccaggcccaccaggtccaccaggcccacctagtccagaa
tttgagactgagaccctacacaacaaagaaccagggttccctggtctccgaggagaacaa
ggtccaaaaggaaacccaggcctcaaaggagtaaaaggagactcaggtttctgtgcttgt
gacagcggtgtccccagcaatggactacccggggaaccaggcccacctggtccacaaggt
ctcataggccttccaggccttaaaggagccagaggagatcgaggttctggaggtgcacag
ggcccagcagggtctccaggctttcacgggcgtccaggtctttcaggacccaaaggaaag
aagggcgaaccaactctcggtacaatctcaggaatgaaaggggatcagggtgatcctggc
tcccagggttttcctggtgtgacaggagaacgaggaaaggatggaataccaggtttacca
ggtctgccaggtcttccgggtgatggtggacagggctttccaggtgaaaagggattacca
ggacttcctggtgaaaaaggccaacctggtctacctggcctcccaggaattgggttacca
ggacttcctggaccccgtgggcttcctggagataaaggcaaggatggatcaccaggacaa
caaggcacccctggattgaagggtgactgttgctgcagagagacggttggtaaaggagac
ttagacacagagagaggaatcaccttgccttgtattattccccggtcatacggtccatca
ggatttccaggcactcccggattcccaggccctaaagggtcccgtggcctccctgggacc
cgaggccctcctggatcacatggaaataaaggaaagccagggagtccaggactggttcat
cttcctgaactaccaggatttcctggacctcgtggggagaagggcttgcccgggtttcct
ggtctcccaggaaaagatggcttgcctgggataattggcagtccaggtttacctggttcc
aagggagccactggtgacatcttcggtgctgaaaatggtgctccaggggaacaaggccta
caaggattgccaggagacaaaggattgattggagactctggccttccaggactcaagggt
ttgcatgggaagcctggcttgctgggccccaaaggtgagcggggcagccctggaacacca
ggacaggtgggagagccaggcactccaggatctagtggcccatatggcatcaagggcaaa
tctggactcccaggagctccaggcttccaaggtacttcaggacaccctggaaacaaagga
actagaggagagaaaggtcttcccggatcacttgtaaagaaagggctgccagggctgaaa
ggccttcccgggaatccaggcctaataggacaaaaaggaagcccaggctctccagggatc
agtgggttgccagccctccctggactcaagggagagaaggggtctgttggatccttgggt
tttccaggaatgccaggttttcctggtattcccggagcaagaggtttaaaggggattccg
ggatcaacaggaaaaattggacaatctggacaccctggtactcctggtgaaaagggagac
agaggcaatccagggccagtcggaatacctagtccaagacatccaatgtcaaacctttgg
ttaaaaggagacaaaggctctcaaggctcagcaggatccgatggatttcctgggcctaga
ggtgacaaaggagaggctggtcgacctgggccaccaggcctgcctggagctcccggcctg
ccaggcactatcaaaggagttagtggaaagccagggccccctggcttcatgggaatccag
ggcttacctggcctgaaggggtcttccgggatcacaggtttcccaggaatgccaggagaa
agtggttcacaaggtatcagagggtcacctggactcccaggagcatctggtctcccgggc
ctgaaaggagacaatggccggacacttgaaatttccggtagcccaggacttaagggacaa
cctggtgaatctggttttaaaggtgcaaaaggaagagatggactaataggaaacatgggc
ttccctggaaacaaaggcaaagatggaaaagttggtgttcctggagatgttggccttcct
ggagctccaggatttccagggactacaggcatcagaggagaaccaggacttccaggttct
tctggtcatcaaggggcaattgggcctccgggactccctggattaataggacccaaaggc
ttccctggatttcctggtttacatggactgaatgggcttccaggcaccaagggtacccat
ggcactccaggacctagtatcaccggtgtgcctgggcctgctggtatccctggacccaaa
ggagaaaaaggaaatccaggaattggcatcggagctccagggaagccaggccagagaggg
caaaaaggtgatcgaggtttccccggtctccagggccctgctggtctccctggtgcccca
ggcatctccttgccctcactcatagcaggacagcctggtgaccctgggcgaccaggccta
gatggagaacgaggccgcctaggcccccctggggccccaggtccccctgggccatcctcg
aatcaaggtgacactggagaccctggcttccctggaattcctggacctcaggggcctaag
ggagacctaggaattccaggtttttctggcctccctggagagctaggactgaaaggcatg
agaggtgagcctggcttcatggggactccaggcaaggttgggccacctggagacccagga
tttcccgggatgaaggggaaggctgggccaagaggctcttccggcccccaaggtgctcct
ggacaaacaccaactacagaagctgtccaggttcctcccggacccttgggtctaccaggg
atagatggcatccctggcctcattggggaccctgggattcaaggccctgtaggcctaaaa
ggcttcaaaggtttatctggtgtccctggcaaagatggccccaatgggctcccaggccca
cctggggctcttggtgatcctggtctgcctggactgcaaggccctccaggatttgaagga
gctccagggcagcaaggccccttcgggatgcctggaatgcctggccagagcgtgagagtg
ggctacacgttggtaaagcacagccagtcggaacaggtgcccttgtgtcccatcgggatg
agtcagctgtgggtggggtacagcttactgttcgtggagggacaagagaaagcccacaac
caggacctgggttttgctggctcctgtctgccccgcttcagcaccatgcccttcatctac
tgcaacatcaacgaggtgtgccactatgccaggcgcaatgataaatcctactggctctcc
actaccgcccctatccccatgatgcccgtcggccagacccagattgcccagtacatcagc
cgctgctctgtgtgtgaggcgccatcacaagccattgctgtgcacagccaggacatcacc
atcccaaagtgccccctgggctggcgcagcctctggatcggatactccttcctcatgcac
actgccgctggtgccgagggtggaggccagtccctggtctcacctggatcctgcctagag
gactttcgggccactcctttcatcgagtgcagtggtgcccgaggcacctgccactacttt
gcaaacaagtacagtttctggttgactacagtggaggagaggcagcagtttggagagtcg
cctatgtctgaaacgctgaaagctgggcagctccacacccgagtaagtcgctgccaggtg
tgtatgaaaagcctatag

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100400812Help
Entry
100400812         CDS       T03264                                 

Gene name
COL4A1
Definition
(RefSeq) collagen type IV alpha 1 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100400812 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100400812 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100400812 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100400812 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100400812 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100400812 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100400812 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100400812 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100400812 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100400812 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100400812 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100400812 (COL4A1)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100400812 (COL4A1)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100400812 (COL4A1)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4
Motif
Other DBs
NCBI-GeneID: 100400812
NCBI-ProteinID: XP_017825949
Ensembl: ENSCJAG00000014430
Position
1
AA seq 1669 aa AA seqDB search
MGPRLGVWLLLLPAALLLHEERSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFTGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGP
PGMTGPPGLPGLTGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDYATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGDPGYPGLVGRQGPQGDKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGIQGQPGPPGLPVPGQIGAPGFPGERGEKGDRGFPGVSLPGPSGRDGFPGPPGLPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGLAGEIGEKGQKGESCLICDTTGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLMGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGVPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGEAGFPGGPGSPGLPGPKGEAGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGIPGTPGTPGEKGSVGAPGVPGEHGAIGPPGLQGI
RGDPGPPGLPGPMGSPGVPGIGPPGARGPPGGQGPPGLAGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGVTGQSGLPGLPGQQGTPGLPGFPGSKGEMGVMGTPGQPGSPGPVGSPG
IPGVKGDHGFPGSSGPRGDPGLKGDKGDVGLPGQPGSMDKVDMGTMKGQKGDQGEKGQIG
PIGEKGSRGDPGSPGVPGKDGQAGQPGQPGPKGDPGISGTPGSPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGEKGAKGEKGQAGPPGIGIPGLPGDKGDQGIAGFPGSPGE
KGEKGSVGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLLGT
PGPTGPAGQKGXPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGAKGEVGFPGLAGSP
GIPGAKGEQGFMGPPGPQGQPGLPGSPGHAREGPKGDRGPQGQPGQPGLPGPMGPPGFPG
IDGIKGDKGQPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGLPGVQGFQGAKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGPPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFRKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt NT seq  +upstreamnt  +downstreamnt
atggggccccggctcggcgtctggctgctgttgctgcccgccgcccttctgctccacgag
gagcgcagccgggccgctgcgaagggtggctgtgctggctctggctgtggtaaatgtgac
tgccatggagtgaagggacagaagggtgaaagaggcctcccagggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacaaggaccaccaggacaaaagggtgat
actggagagccaggactacctggaacaaaagggacaagagggcccccaggagcatctggc
taccctggaaacccaggacttcccggtattcctggccaagacggccctccaggcccccca
ggtattccaggatgcaacggcacaaagggagagagagggccgctcgggccccccggcttg
cctggcttcaccggaaatcccggaccaccagggttaccaggaatgaagggtgatccaggc
gagatacttggccatgtgcccgggatgctgttgaaaggtgaacgaggatttcccggaccc
ccagggatgacaggcccaccaggactgccggggcttacaggccctgttgggcctccagga
tttacgggaccaccaggtcccccaggccctcccggccctccaggtgaaaaggggcaaatg
ggcttaagttttcaaggaccgaaaggtgacaagggtgatcagggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaggaaaaaggcgactatgccactaagggggaaaag
ggccaaaaaggtgaacctggatttcaaggaatgccaggggtcggagagaaaggtgaacct
ggaaaaccaggacccagaggaaaacccggaaaagatggtgacaaaggggagaaggggagt
cccggttttcctggagatcccgggtacccaggactcgtaggccgccagggcccgcaggga
gataaaggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagcggggctaccctggaactccgggaccaagaggagagccaggcccg
aaaggtttcccgggaatacaaggccaacccggacctccaggcctccctgtacctgggcag
attggtgcccctggcttccctggcgaaagaggagaaaaaggtgaccgaggatttccaggc
gtatctctgccaggaccaagtggaagagatgggttcccgggtcctcctggcctccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggattggcaggtgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatacaacaggttatcgggggcctcctggg
ccacagggacccccaggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagacggtgttgccggagtgcccgggcctcaaggtacaccagggctg
atgggccagccaggagccaagggggagcctggtgagatttatttcgacctgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcctggtgtgccagggagagcaggttct
cctggaagagacggccatccaggtcttcctggccccaagggctcgccgggttccgtggga
ctgaaaggagagcgtggcccccctggaggggttggattcccaggcagtcgtggtgacacc
ggcccccctgggcccccaggatatggtcctgctggtcccattggtgacaaaggagaagca
ggctttcctggaggtcctgggtccccaggcctgccaggtccaaagggtgaagcaggaaag
gttgttcctttaccaggcccccctggagcagaaggactgccgggatccccaggattccca
ggtccccaaggagaccgaggctttcctggaaccccaggaaggccaggcctgccgggagag
aagggtgctgtgggccagccggggattggatttccagggccccccggccccaaaggtgtt
gatggcttacctggagacatggggcctccggggactccaggtcgcccaggatttaacggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccagga
ctcaaaggcttgccaggtatccccggcaccccgggcacccccggggagaaggggagtgtt
ggagcaccaggcgttcctggagaacacggagcgatcggcccccctgggcttcaggggatc
agaggtgacccaggacctcctggattgccaggccccatggggtctccaggagttccagga
ataggccctcctggagctaggggcccccctggcggacagggaccaccagggttggcaggc
cctcctggaataaaaggcgagaagggtttccctggattccccggactggacatgcctggc
cctaaaggagataaaggggctcaaggacttcccggcgtaacagggcagtcagggctccct
ggccttcctggacagcaggggactccagggcttcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccctgggcagccgggctcaccaggaccagtgggttctccagga
ataccgggtgtaaaaggggaccatggctttccaggctcctcgggacccaggggagaccct
ggcttgaaaggtgataagggtgatgtcggtctccccggccagcctggctccatggataaa
gtggacatgggcacaatgaagggccagaagggagaccaaggagagaaaggacaaatcgga
ccaattggtgagaaaggttcccgaggagaccctgggagcccaggagtgcctggaaaggac
gggcaggcgggacagcccgggcagccaggacccaaaggtgatccaggcataagcggaacc
ccaggttctccgggacttccaggaccaaaagggtcggttggtggcatgggcttgccagga
acacctggagagaaaggcgtgcctggcatccctggcccacaaggttcccctggcttacct
ggagaaaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggtatccct
gggctgcctggtgacaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcgttgggatcccaggaatgccagggtccccaggccttaaaggc
tctcctggaagcgttggctatccaggaagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggcgtcaaaggagaagcaggtctccttgggacg
cctggccccacaggcccagctggccagaaagggnaaccaggcagtgatggaatcccaggg
tcagcaggagagaagggtgaaccaggtcttccaggaagagggttcccaggctttccaggg
gccaaaggagacaaaggtgcaaagggtgaggtgggttttccaggattagctgggagccca
ggaattcctggagccaaaggagagcaaggattcatgggtcctccggggcctcagggacag
ccagggttaccaggatccccaggccacgccagggagggacccaaaggagaccgtggacct
cagggccagcctggccagccaggacttccgggacccatggggcctccagggtttcctggg
attgatggaataaaaggtgacaaaggacaacccggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggaattggtggctctccaggaatc
acaggttctaagggtgatatgggacttccaggagttcaaggatttcaaggtgcaaaaggg
cttcctggcctccagggaattaaaggtgatcaaggagatcagggtgtcccaggagctaaa
ggtctcccgggtcctcctggccccccaggtccttatgacatcatcaaaggggagcctggg
ctccctggtccggagggccccccagggttgaaaggacttcagggacctccaggccctaaa
ggccagcaaggtgtgacaggactggtgggtatacctggacccccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccagacacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctctgctctatgtgcaaggcaatgaa
cgggcccacggccaggacttgggcacggccggcagctgcctgcgcaagttcagcaccatg
cccttcctgttctgcaatatcaacaacgtgtgcaacttcgcatcacggaacgactactcg
tactggctgtccactcccgagcccatgcccatgtcaatggcacccatcacaggggacaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcacccgccatggtgatggctgtg
cacagtcagaccattcagatccctccgtgccccagtgggtggtcctcgctgtggatcggc
tactcatttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcatcc
cccggctcctgcctggaggagtttagaagcgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaatgcttacagcttttggctcgccaccatagagagaagcgag
atgttcaggaagcctacgccgtccaccttgaaggcaggggagctacgcacccacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100405184Help
Entry
100405184         CDS       T03264                                 

Gene name
COL4A2
Definition
(RefSeq) collagen type IV alpha 2 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100405184 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100405184 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100405184 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100405184 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100405184 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100405184 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100405184 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100405184 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100405184 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100405184 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100405184 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100405184 (COL4A2)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100405184 (COL4A2)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100405184 (COL4A2)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4 DUF3930
Motif
Other DBs
NCBI-GeneID: 100405184
NCBI-ProteinID: XP_017825702
Ensembl: ENSCJAG00000014238
Position
1
AA seq 1905 aa AA seqDB search
MGRNQRAAAGPALPRWLLLGTVTVGFLSQSVLAGVKKSDVPCGGRDCSGGCQCFPEKGGR
GQPGPVGPQGYTGPPGLQGFPGLQGRKGDKGERGAPGITGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGPVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPPETHPVIAPTRVTHHPEQDKGEKGSEGEPGIKGISLKGEEGIMGFP
GPRGYPGLSGEKGSPGQKGSRGLDGYQGSDGPRGPKGETGDPGPPGLPAYSPHPSLAKGA
RGDPGFPGARGEPGSQGEPGDPGPRGPPGISITAEDLGRGLPGEMGPKGFIGDPGIPALY
PGPPGLDGKPGLPGPPGLPGPPGPDGFLFGLKGAEGRGGFPGLSGSPGARGQKGRKGDAG
DCRCVEGDEAVRGLPGLPGPKGFPGINGEPGRKGDRGDPGQHGLPGFSGLKGVPGNLGAP
GPKGAKGDSRTITTKGERGQPGIPGVPGMKGDDGIPGREGLDGFPGLPGPPGDGIKGPPG
DPGYPGIPGTKGSPGEMGPPGLGLPGFKGQRGFPGDAGLPGPRGFPGPPGPPGTPGQADC
DPDVRRPIAGDRQEAVQPGCAGGPKGLPGLPGPPGPTGAKGLRGTPGFSGADGGAGPKGL
PGDPGREGFPGPPGFIGPRGSKGAAGLPGPDGLPGPVGLPGPVGPPGXKGLPGEVLGAQP
GPRGDAGVPGHPGLKGLPGDRGTPGFRGKCPMGEQRPLPSYTVASKEEPGPFLGIWCFGT
FQHEDKWQQAYFQHLIKVLTTMGVTFMTLSLVCLVRQNVLDGKNPDSQNIKIPGHYFSFS
SAHQCDGSSVQTGPASVRLAVRGSHKHTQIEFQKITVSSFPTVLREGVRIDASMHLQASH
SRHGFECASRSVSLLPRCRLLQSDNAVSSLTLSPSQATALSRTLAGLAREPPCHSKTHTG
FPHVEPRLCARAGGWTQKRSFLCVCPPSDLLLFPTGLPGDRGEPGDVGAPGPVGMKGVSG
DTGDAGLAGERGRPGSPGFKGIDGMPGAPGLKGERGSPGMDGFQGMPGLKGRPGIPGSKG
EAGFFGIPGLKGLAGEPGFKGSRGDPGPPGPPPIILPGMKDIKGEKGDEGPMGLKGYLGA
KGTQGMPGIPGLSGIPGLPGRPGHIKGLKGDIGAPGIPGLPGFPGVAGPPGITGFPGFTG
SRGDKGAPGRAGLYGEAGQTGDFGDIGDTINLPGRPGLKGERGTAGIPGLKGFFGEKGTE
GDIGFPGITGVTGVQGPPGLKGQAGFPGLTGPPGPQGEPGRIGLPGGKGDDGWPGAPGLP
GFPGPRGISGLHGLPGTKGFPGSPGADIHGDPGYSGPPGERGDPGEANTLPGPVGVPGQK
GEQGAPGQRGPPGSPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALP
GSKGDTGNPGAPGTPGTKGWVGDPGPQGRPGVLGLPGEKGPRGEQGFMGNPGLPGPVGDR
GPKGPKGDPGFPGAPGIVGAPGIAGIPQKIATQPGTVGPQGRRGPPGAPGEMGPQGPPGE
PGFRGAPGKAGPQGRGGVSAVPGFRGDEGPTGHQGPIGQEGVPGRPGSPGLPGMPGRSVS
IGYLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFL
YCNPGDVCYYASRNDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAVAIAVHSQDV
SIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHY
YANKYSFWLTTIPEQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5718 nt NT seq  +upstreamnt  +downstreamnt
atggggagaaaccagcgcgcggcggccggccctgccctaccacggtggctgctgctgggg
accgtgaccgtggggttcctctcccagagcgtcttggcgggtgtgaagaagtctgatgtg
ccgtgtggcggaagagattgcagtgggggctgccagtgcttccctgagaaaggcggacgc
ggtcagcctgggccagtgggcccccaggggtacactgggccaccaggactgcaaggattc
ccgggactgcagggccgcaaaggagacaagggtgaaagaggagcccccggaatcacagga
cccaagggagacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggcccggctacgatggctgcaacggaacc
cagggagactcaggtccacagggtccccccggctcggaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttacgcactgcctaaagaggagcgcgac
agatatcggggtgaacctggagagcctggattggttggtttccagggacctcccggccgc
cctgggcctgtgggacagatgggtccagttggagctccagggagaccaggaccacctgga
ccccctggaccaaaaggacagcaaggaaacagaggacttggtttctacggagtgaagggt
gaaaagggtgacgtagggcagccaggacccaacgggattccaccagaaacccaccccgtc
atcgcgcccacgagagtcacccaccacccagaacaggacaagggtgaaaaaggcagcgag
ggcgaaccgggaataaaaggcatttccttgaagggagaagaaggaatcatgggctttcct
ggaccaaggggttaccctggcttgagtggtgaaaaaggatcgccgggacagaagggaagc
cgaggcctggatggttatcaaggctctgatgggccccggggccccaagggagaaaccgga
gacccaggaccccctggactacctgcctactcccctcacccttccctagcaaaaggtgcc
agaggtgacccaggattcccaggggcccgaggggagccaggaagccagggtgagccagga
gacccaggcccccgaggcccacctggcatctccatcacagctgaagatctggggagaggc
ctgccgggtgagatgggacccaaaggcttcatcggagaccccggcatcccggctctctac
ccgggcccacccggacttgacggaaagccggggcttccaggaccccccgggctccctgga
ccgcctgggcctgatggcttcctgtttggcctgaaaggagcagaaggaagagggggcttc
cctgggctttctggctcccctggagcccgtggacagaagggacggaaaggtgatgctgga
gactgcagatgtgtagaaggcgacgaagctgtcagaggtcttccgggactgccaggaccc
aagggcttcccaggcatcaacggggagccagggaggaaaggggacagaggagaccccggc
caacacggccttcctgggttctcagggctcaagggagtccctggcaatcttggtgctccc
gggcccaagggagcaaaaggagattccagaacaatcacaacgaaaggtgagcggggacag
cccggcatcccaggtgtccccgggatgaaaggtgacgacggcatcccaggccgcgagggg
ctcgatggattccccggcctcccgggccctcccggcgatggcatcaaaggccctccaggg
gacccgggctatccaggaatacctggaacaaagggttctccaggagaaatgggtccccca
ggactgggccttcccggcttcaaaggccaacgtggtttccctggagacgccggcttacct
ggaccacgcggcttcccaggccctcccggccctccagggaccccaggacaggcagattgt
gacccagatgtgagaaggcccattgcaggtgacagacaggaggccgtccagccaggttgc
gcaggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtgccaaa
ggcctccgaggaaccccaggcttctcaggagctgatggaggagcaggacccaaaggcttg
ccaggagacccaggtcgtgaagggttcccaggacccccagggttcataggaccccgagga
tccaaaggtgcagcgggtctccctggcccagatggactcccaggtcccgtcggcctgcca
ggtccagtcgggccccccgggnacaagggccttcctggagaagtcctgggagcccaacct
ggaccacggggagatgctggcgtgcctggacaccctgggctgaaaggccttcctggagac
agaggcacccctggattcagaggtaagtgccccatgggggagcagaggccccttcccagc
tacacagtggcttccaaggaggagcctggtcccttcctggggatttggtgcttcggaacg
tttcagcatgaagacaaatggcagcaggcttactttcaacatctcattaaggtcctcacc
acgatgggcgtcacctttatgacgttaagcctcgtatgccttgttaggcagaatgttttg
gatggtaaaaatcctgactcccaaaacatcaaaattccaggtcactatttcagtttcagt
tctgctcaccagtgcgacgggagcagcgtgcagacgggtcctgcttctgttcgactggct
gtgcggggcagccacaaacacacacagatagaattccagaaaataaccgtgtcctcattt
cccacagtactacgggagggggtgagaatcgatgcatccatgcatctccaagcatctcat
tccagacatggcttcgagtgcgcgtcccgctcagtcagccttctccctcgctgcaggttg
ctgcagtcagacaacgcagtcagcagcctgactctcagtcctagtcaggcgacagctttg
tccaggaccttagcaggtcttgctagggagccgccatgccactcgaagacgcacactggc
ttcccccatgtggaacctcgcctgtgtgcaagagctgggggctggactcagaagagaagt
ttcctgtgtgtgtgtccaccctctgatttgctcctcttcccgacaggtctgcctggagat
agaggggagcccggcgacgtgggtgctcccggccctgtgggcatgaaaggtgtctctggt
gacacaggagatgctggcttggcgggagagcgaggccgtccaggaagccctgggtttaaa
ggaattgatggaatgcctggggcccccgggctcaaaggagagagaggctcgcctgggatg
gacggtttccagggcatgcctggactcaaagggagacccgggattccggggagcaaaggc
gaggcaggatttttcggaatacctggtctgaagggtctggctggtgaaccaggttttaaa
ggcagccgaggggatcctgggcccccaggaccacctcccatcatcctgccaggaatgaaa
gacatcaagggggagaaaggagatgaagggcctatggggctgaaaggatacctgggcgcg
aaaggtacccaaggaatgccaggcatcccggggctgtcaggaatccctggactgcctggg
cggcctggccacatcaaaggactcaagggagacattggagcccccggcatccctggtttg
ccaggattccctggggtggccggcccccctggaattacaggattcccaggattcacagga
agccggggtgacaaaggtgccccggggagagcaggcctgtacggcgaggctggccagacc
ggtgatttcggtgatatcggggacactataaatttgccaggaagaccaggcctgaagggg
gagcggggcaccgctggaataccaggtctgaagggattctttggagaaaagggaacagaa
ggtgacattggcttccctgggataacaggcgtgactggagtccaaggtcctcccggactt
aaaggacaagcaggctttccagggctgactgggccaccagggcctcagggagagcccggg
cggattggactgcctggtggcaaaggggatgatggctggccaggagctccaggcttacca
ggttttccaggaccccgtgggatcagcggcttacacggcttgccaggcaccaaaggcttc
ccgggatccccaggtgctgacatccacggagacccaggctactcaggtcctcctggggag
agaggtgacccaggagaggccaacaccctgccaggccctgtgggagtcccaggacagaaa
ggagaacaaggagctccagggcaacgaggcccacccgggagtccaggacttcaggggttc
cccggcatcacgcccccttccaacatctctggggcacctggtgacaaaggggcgccaggg
atatttggcctgaaaggttatcggggcccacccgggccaccgggatctgctgctcttcct
ggaagcaaaggtgacacagggaacccaggagctccaggaaccccagggaccaaaggatgg
gttggggaccccgggccccagggcaggcccggcgtgctcggtctcccaggagaaaaaggg
cccaggggtgaacaaggcttcatggggaaccctggactccccggacctgtgggtgacaga
ggccccaagggacccaagggagacccaggattccccggtgcccccggcatcgtgggagcc
cccgggattgcaggaattccccagaagatcgctacccaaccagggacagtgggtccccag
gggaggcgaggccctcctggggcacccggggagatggggccccagggcccccccggagaa
ccaggtttccgtggggctccagggaaagccgggccccagggaagaggtggcgtgtctgct
gttcctggcttccggggagatgaagggcccacaggccaccaggggccgattggccaagaa
ggtgtgccaggccgtccagggagcccgggcctgcccggcatgccaggccgcagtgtgagc
atcggctacctcttagtgaagcacagccagacggaccaggagcccatgtgccctgtgggc
atgaacaagctctggagcgggtacagcctgctgtactttgagggccaggagaaagcacac
aaccaggacctggggctagcgggctcctgcctggcacggttcagcaccatgccctttctc
tactgcaaccctggggacgtctgctactatgccagccggaacgacaagtcttactggctc
tccaccaccgccccgctgcccatgatgcccgtggccgaagacgagatcaagccttacatc
agccgctgctctgtgtgtgaggccccagccgtcgccattgctgtccacagccaagatgtc
tccatcccccactgcccagctgggtggcggagtttgtggatcggatattccttcctcatg
cacacggcagcgggagacgaaggcggtggccagtcactggtgtcaccaggcagttgtctg
gaggacttccgcgccacgccattcatcgagtgcaatggaggccgcggcacctgccactac
tacgccaacaagtacagcttctggctgaccaccatccccgagcagagcttccagggctcg
ccctcggctgacacactcaaggctggcctcatccgcacacacatcagccgctgccaggtg
tgcatgaagaacctgtga

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100409842Help
Entry
100409842         CDS       T03264                                 

Gene name
COL4A5
Definition
(RefSeq) collagen type IV alpha 5 chain
  KO
K06237  collagen, type IV, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100409842 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100409842 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100409842 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100409842 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    100409842 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100409842 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100409842 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100409842 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100409842 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100409842 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cjc04147]
    100409842 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100409842 (COL4A5)
Exosome [BR:cjc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100409842 (COL4A5)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100409842 (COL4A5)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen C4
Motif
Other DBs
NCBI-GeneID: 100409842
NCBI-ProteinID: XP_017824036
Ensembl: ENSCJAG00000007434
UniProt: F7I070
Position
X
AA seq 1691 aa AA seqDB search
MKLRGVSLAAGLFLLALSLWGQPAEAATCYGCSPESKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
VPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGMHGPPGPPGGVKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAAMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPPGPHIPPSDELCEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQNGLPGLPGPKGEPGRITSKG
ERGPPGNPGLPGHPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGITGLEGPPGPPGFPGPKGEPGFAVPGPPGPPGLPGFKGTLGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPSGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGPPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPAGEKGSKGEPGFPGLPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQHGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPPGVPGQPGSPGLPGQKGDKGDPGISGIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGSPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPIGGGGRPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGPTGFQGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGQPGLQGNPGRP
GLNGMKGDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLTGPPGPPGLPGPSGQSIIIKGD
AGPPGIPGQPGLKGLPGPPGPQGLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGT
RGLDGPPGPDGLQGPPGPPGTSSIAHGFLITRHSQTMDAPQCPQGTLQVYEGFSLLYVQG
NKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKG
QGIQPFISRCAVCEAPAMVIAVHSQTIQIPRCPQGWNSLWIGYSFMMHTSAGAEGSGQAL
ASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTR
ISRCQVCMKRT
NT seq 5076 nt NT seq  +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcgacttgctatgggtgttctccagaatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttgcctggatttcctggtccagaagggcctccaggacctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccatgacggggccccaggacctcaaggt
gttcctggatgcaatggaacaaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcctcactgccaggaccaaagggtaatccgggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggcttgatgggccctcctggtccaccaggacttccaggaccaaaggggaatatgggctta
aatttccagggacccaaaggtgaaaagggtgagcaaggtcttcaaggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatgcatggtcctcccggtcct
ccaggtggtgtgaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaatccccggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccagaaaggtgacattggcccacctgga
cctcctggacttgtaattcctagacctggaactggcataactataggagaaaaaggaaac
attgggttgcctggcttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggcctccctggacctccaggggctgcagctatgggtcctcctggcccccctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
ccccctggacttgatggacagcctggggctcctgggcttccagggcctcctggccctcct
ggccctcacattcctcctagtgatgagctatgtgaaccaggccctccagggcccccggga
tctccaggcgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggttttcctggacagaaaggtgaaaaaggacaagct
ggagcaactggtcccaaaggattgcccggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttctcctggagctccagggcttccgggtttacctggcactcctggacag
aatggattgccagggcttcctggcccgaaaggagagcctggcagaattacttctaagggt
gaaagaggtccccctgggaacccaggtttaccaggccacccagggaacatagggcctatg
ggtcctcctggttttggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagaccataacc
cagccaggaaagcctggcttgcctggtaacccaggcagagatggtgaagtaggtcttcca
ggtgaccctggactcccaggacaaccaggcttgccaggaatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaataactggtctagaaggtcctcctggg
ccacccggctttccaggaccaaagggtgagccaggatttgcagtacctgggccacctgga
cctccaggacttccaggtttcaaaggaacacttggtccaaaaggcgatcgtggtttccca
ggacctccaggtcctccaggacgcactggcttggatgggcttcctggaccaaaaggtgat
gttggaccaagtggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggccaacctggtttacatgga
atacccggagagaagggggatccaggacctcctggacttgatgttccaggacctccaggt
gaaagaggcagtccagggatccctggagcacctggtcccataggacctccaggaccacca
ggacttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggcatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgttcctggt
cttaaaggtgatgatggcttgcaaggtcagccaggacttcctggccctgcgggagaaaaa
ggtagtaaaggagagcctggctttccaggccttcctggacccatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacatgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggacttata
ggacctcctggacttaaaggaaccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggacctcctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaggcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggttcgcctggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaacccctggccctcctggaccaaaaggtattagtggccctcctggg
aaccctggccttccaggagaacctggtcctataggtggtggaggtcgtcctggacaacca
gggcctccaggtgaaaaaggcaaacctggtcaagatggtattcctggaccagctgggcag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcatggtttccctggtgttcagggtcctccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtcctacaggttttcaaggtctaccaggtccagaaggtcccccgggtctc
cctggaaatggaggtattaaaggagagaaggggaacccaggccaacctgggctacctggt
ttgcctggtttgaaaggagatcaaggacaaccaggactccagggtaatcctggccggccg
ggtctcaatggaatgaaaggagatcctggtctccctggtgttccaggattcccaggcatg
aaaggacccagtggagtacctggatcagctggccctgagggagaaccaggacttactggt
cctccaggtcctcctggattacctggtccttcaggacagagtattataatcaaaggagat
gctggtcctccaggaatcccaggccagcctgggttaaaaggtctaccaggacccccagga
cctcaaggtttaccaggtccaattggtcctccaggagatcctggacgcaatggactccct
ggctttgatggtgcaggagggcgcaaaggagacccaggtcttccaggacagccaggtacc
cgtggtttggacggtccccccgggccagatggattgcaaggtcccccaggtccccctgga
acctcatctattgcgcatggatttcttattacacgccacagccagacaatggatgcgcca
cagtgcccgcagggaacacttcaggtctatgaaggcttttctctcctgtatgtacaagga
aataaaagagcccatggtcaagacttggggacggctggcagctgccttcgtcgttttagt
accatgcctttcatgttctgcaacatcaataatgtttgcaactttgcttcaagaaatgac
tattcttactggctctctaccccagagcccatgccaatgagcatgcaacccttaaagggc
cagggcatccagccattcattagtcgatgtgcagtatgtgaagccccagccatggtgatc
gcagttcacagtcagacgatccagattccccgttgtcctcagggatggaattctctgtgg
attggttattccttcatgatgcatacaagtgcaggagcagaaggctcaggtcaagcccta
gcctcccctggttcctgcttggaagagtttcgttcagctcccttcatcgaatgtcatgga
aggggtacctgtaactactacgccaactcctacagcttttggctggcaactgtagatgtg
tcagacatgttcagtaaacctcagtcagaaacgctgaaagcaggagacttgaggacacgc
attagccgatgtcaagtgtgcatgaagaggacataa

KEGG   Callithrix jacchus (white-tufted-ear marmoset): 100411438Help
Entry
100411438         CDS       T03264                                 

Gene name
COL1A2
Definition
(RefSeq) collagen type I alpha 2 chain
  KO
K06236  collagen, type I, alpha
Organism
cjc  Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04512  ECM-receptor interaction
cjc04611  Platelet activation
cjc04926  Relaxin signaling pathway
cjc04933  AGE-RAGE signaling pathway in diabetic complications
cjc04974  Protein digestion and absorption
cjc05146  Amoebiasis
cjc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:cjc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100411438 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100411438 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100411438 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100411438 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100411438 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100411438 (COL1A2)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100411438 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100411438 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100411438 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:cjc00536]
    100411438 (COL1A2)
Glycosaminoglycan binding proteins [BR:cjc00536]
 Heparan sulfate/Haparin
  Extracellular matrix molecules
   100411438 (COL1A2)
BRITE hierarchy
SSDB OrthologParalogGFIT
Motif
Pfam: Collagen COLFI
Motif
Other DBs
NCBI-GeneID: 100411438
NCBI-ProteinID: XP_003733643
Ensembl: ENSCJAG00000013547
UniProt: U3D607
Position
8
AA seq 1366 aa AA seqDB search
MLSFVDTRTLLLLAVTSCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPPGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAIGNPGIAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPA
GSRGATGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGSIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPAGPPGFQGLPGPSGPAGELGKPGERGLPGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAAGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPSGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGETGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLP
GLKGHNGLQGLPGLAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFFRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINGGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
NT seq 4101 nt NT seq  +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactttgttgctgcttgcagtaacctcgtgcctagca
acatgccaatctttacaagaggaaactgtaagaaagggcccagctggagatagaggacca
cgtggagaaaggggtccaccaggccccccaggcagagatggtgaagatggtcccacaggc
cctcctggtccacctggtcctcctggtccccctggtctcggtgggaactttgctgctcag
tatgatggaaaaggagttggacttggccctggaccaatgggtttaatgggacctagaggc
cctcctggtgcagctggagccccaggtcctcaaggtttccaaggacctgctggtgagcct
ggtgaacctggtcaaactggtcctgcaggtgctcgtggtccacctggccctcctggcaag
gctggcgaagatggtcaccctggaaaacccggacgacctggtgagagaggagttgttgga
ccacagggtgctcgtggtttccctggaactcctggacttcctggcttcaaaggcattagg
ggacacaatggtctggatggattgaagggacagcctggtgctcctggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaactccaggtcaaacaggagcccgtgggcttcctggt
gagagaggacgtgttggtgcccctggcccagctggtgcccgtggcagtgacggaagtgtg
ggtcccgtgggtcctgctggtcccattgggtctgctggccctccaggcttcccaggtgcc
cctggtcccaagggtgaaattggagctattggtaaccctggtattgctggtcctgccggt
ccccgtggtgaagtgggtcttccaggcctctctggccccgttggacctcctggtaatcct
ggagcaaacggccttactggtgccaagggtgctgctggccttcctggtgttgctggggct
cccggcctccctggaccccgtggtattcctggccctgttggtgctgctggtgctactggt
gccagaggacttgttggtgagcctggtccagctggctccaaaggagagagtggtaacaag
ggtgagcccggctctgctggaccccaaggtcctcctggtcccagtggtgaagaaggaaag
agaggccccaatggggaagctggatctgctggccctccaggacctcctgggctgagaggt
agtcctggttctcgtggtcttcctggagctgatggcagagctggcgtcatgggccctgct
ggtagtcgaggtgcaactggccctgctggagtccgaggccccaatggagatgctggtcgc
cctggtgagcctggtctcatgggacccagaggtcttcctggttcccctggaaatattggc
cccgctggaaaggaaggtcctgttggcctccctggtatcgacggcaggcctggcccaatt
ggcccagctggagcaagaggagagcctggcagcattggattccctggacccaaaggcccc
actggtgatcctggcaaaaacggagataaaggtcatgctggtcttgctggtgctcggggt
gctccaggtcctgatggaaacaatggtgctcaaggacctcccggaccgcagggtgtccaa
ggtggaaaaggtgaacagggtcccgctggtcctccaggcttccagggtctgcctggcccc
tcaggtccagctggtgaacttggcaaaccaggagaaaggggtctccctggtgagtttggt
ctccctggtcctgctggtccaagaggggaacgtggtcccccaggtgagagtggtgctgct
ggtcctactggtcctattggaagccgaggtccttctggacccccagggcctgatggaaac
aagggtgaacctggtgttgttggtgctgcgggcactgctggtccatctggtcctagtgga
ctcccaggagagaggggtgctgctggcatacctggaggcaagggagaaaagggtgaacct
ggtctcagaggtgaaattggtaaccctggcagagatggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccacaggtgaccggggagaagctggtgctgctggt
cctgctggtcctgctggtcctcggggaagccctggtgaacgtggtgaggttggtcctgct
ggccccaatggatttgctggtcccgctggtgctgctggtcaaccaggtgctaaaggagaa
agaggagccaaagggcctaagggtgaaaatggtgttgttgggcccacaggccccgttgga
gctgctggcccatctggtccaaatggtccccccggtcctgctggaagtcgtggtgatgga
ggtccccctggtatgactggtttccctggtgctgctggacggactggtcccccaggaccc
tctggtatttctggccctcctggtccccctggtcctgctggaaaagaagggcttcgtggt
cctcgtggtgaccaaggtccagttggccgaactggagaaacaggtgcagttggtccccct
ggctttgctggtgagaagggtccttctggagaggctggtactgctggacctccaggtact
ccaggtcctcagggtcttcttggtgctcctggtattctgggtctgcctggctcgagaggt
gaacgtggtctgccaggtgttgctggtgctgtgggtgaacctggtcctcttggcattgct
ggccctcctggagctcgtggtccccctggtgctgtgggtagtcctggagtcaatggcgct
cctggtgaagctggtcgtgatggcaatcctgggaatgatggtcccccaggtcgcgatggt
caacctggacacaagggagagcgtggttaccctggcaacattggtccagttggcgctgca
ggtgcacctggtcctcatggccccgtgggtcctgctggcaaacatggaaaccgtggtgaa
actggtccttctggtcctgttggtcctgctggtgctgttggtccaagaggtcctagtggc
ccacaaggcattcgtggtgataagggagagcctggtgataaagggcccagaggtcttcct
ggcttaaagggacacaatggattgcagggtctgcctggtcttgctggtcaccatggtgat
caaggtgctcctggctctgtgggtcctgctggtcctaggggccctgctggtccttctggc
cctgctgggaaagatggtcgcactggacatcctggcacagttggacctgctggcattcga
ggccctcagggtcaccaaggtcctgctggcccccctggtccccctggccctcctgggcct
ccaggtgtaagcggtggtggttatgactttggttacgatggagacttcttcagagctgac
cagcctcgctcagcaccttctctcagacccaaggactatgaagttgatgctactctgaag
tctctcaacaaccagattgagacccttcttactcctgaaggttccagaaagaacccagct
cgcacatgccgtgacttgagactcagccacccagagtggagcagcggttactactggatt
gaccctaaccaaggatgcactatggatgctatcaaagtatattgtgatttctctactggc
gaaacctgtatccgggcccaacctgaaaacatcccagccaagaactggtataggagctcc
aaggacaagaagcatgtctggctaggagaaaccatcaatggtggcagccagtttgaatat
aatgtagaaggagtgacttccaaggaaatggctacccaacttgccttcatgcgcctgctg
gccaactatgcctctcagaacatcacctaccactgcaagaacagcatcgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctgcaaggatctaatgatgttgaactt
gtggctgagggcaacagcaggttcacttacactgttcttgtagatggctgctctaaaaag
acaaatgaatggggaaagacaataattgaatacaaaacaaataagccatctcgcctgccc
ttccttgatattgcacctttggacatcggtggtgctgaccaggaattctttgtggacatt
ggcccagtctgtttcaaataa

DBGET integrated database retrieval system