KEGG   Rattus norvegicus (rat): 116640
Entry
116640            CDS       T01003                                 

Gene name
Tnc
Definition
(RefSeq) tenascin precursor
  KO
K06252  tenascin
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05165  Human papillomavirus infection
rno05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    116640 (Tnc)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    116640 (Tnc)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    116640 (Tnc)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    116640 (Tnc)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    116640 (Tnc)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    116640 (Tnc)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   116640 (Tnc)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 EGF_Tenascin EGF Pur_ac_phosph_N NDNF Interfer-bind fn3_2 DUF4998 hEGF
Other DBs
NCBI-GeneID: 116640
NCBI-ProteinID: NP_446313
RGD: 621057
Ensembl: ENSRNOG00000058645
UniProt: B2LYI9
Position
5
AA seq 2019 aa
MGAVXWLLPGIFLALFALTPEGGVLKKVIRHKRESGLNLTLPEENQPVVFNHVYNIKLPM
GSQCSVDLESTSGEKDLSPTPESSGSFQEHTVDGENQIVFTHRINIPRRACGCAAAPDVK
ELLSRLEELEMLVSSLREQCTMGTGCCLQPAEGRLDSRPFCSGRGNFSAEGCGCVCEPGW
KGPNCSEPECPGNCNLRGQCLDGQCICDQGFTGEDCSQLACPNDCNDQGKCVNGVCVCFE
GYAGLDCGLEVCPVPCSEEHGMCVDGRCVCKDGFAGEDCNEPLCFNNCNNRGRCVENECV
CDEGFTGEDCSELICPNDCFDRGRCINGTCYCEEGFTGEDCGELTCPNNCQGHGQCEEGQ
CVCNEGFAGADCSEKRCPADCHHHGRCLNGQCECDDGFTGADCGELQCPNGCSGHGRCVN
GQCVCDEGYTGEDCSQRRCPNDCHNRGHCVQGKCICEQGFKGFDCSEMSCPNDCHQHGRC
VNGMCICDDDYTGEDCRDRRCPRDCSQRGRCVDGQCICEDGFTGSDCAELSCPGDCHGHG
RCVNGQCICHEGFTGXDCKEQRCPSDCHGQGRCEDGQCICHEGFTGLDCGQRSCPNDCSN
QGQCVAGRCICNEGYTGVDCSEVSPPKDLIVTEVTEETVBLAWDNEMRVTEYLIMYTPTH
ADGLEMQFRVPGDQTSTTIRELEPGVEYFIRVFAILENKRSIPVSARVATYLPAPEGLKF
KSIKETSVEVEWDPLDIAFETWEIIFRNMNKEDEGEITKSLRRPETSYRQTGLAPGQEYE
ISLHIVKNNTRGPGLKKVTTTRLDAPSQIEVRDITDTTALITWSKPLAEIDSIELSYGIK
DVPGDRTTIDLTHEDNQYSIGNLKPDTEYEVSLISRRVDMASNPAKETFITGLDAPRNLR
RVSQTDNSITLEWRNVKADIDSYRIKYAPISGGDHAEIDVPKSQQATTKTTLTGLRPGTE
YGVGVSAVKGDKEGDPATINAATEIDAPRDLQVSETTQDSLTLFWKTPLAKFDRYRLNYS
LPTGQSIEIQLPKDATSHVLTDLEPGQEYTVLLTAEKGRHKSKPARVKASTEEVPSLENL
TVTEAGWDGLRLNWTADDLAYEYFVIQVQEANKVETAHNFTVLGNLRTADIPGLKAATPY
RVSIYGVARGYKTPVLSAETFTGKTPSLGEVTVAEVGWDALKLNWTAPEGAYTNFFIQVL
EADMTQTVQNLTVPGGLRSVDLPGLKAATRYYITVRGVTQDFSTAPLSVEVLTEEIPQLG
GLSVTEVSWDSLTLNWTTDDLAYKHFIIQVQEANNVEAAQNLTVSGSLRVVDIPGLKADT
PYRVSIYGVIQGYRTPMLSADVSTAKEPEIGNLNISDVTPESFNLSWTATDGIFDMFTIE
IIDSNRLLQTAEHNISGAERTAHISGLPPSTDFIVYLSGIAPSIRTKTISTTATTEAEPE
VDNLLVSDATPDGFCLSWTADEGIFDSFVIRIRDTKKQSEPQEITLPSPDRTRDITGLRE
ATEYEIELYGISRGRRSQPVSAIATTAMGSPKEIMFSDITENAATVSWRAPTAQVESFRI
TYVPVTGGPPSMVTVDGTDTETRLVRLTPGVEYHVSVIAMKGFEESDPVSGSLITALDGP
SGLLTANITDSEALAMWQPAIATVDSYVISYTGERVPEITRTVSGNTVEFELHDLEPATE
YTLSVFAEKGHQKSSTIATKFTTDLDSPRELTATEVQSETAFLTWRPPRASVTGYLLVYE
SVDGTVKEVIVGPDTTSYSLADLSPSTHYTARIQALSGSLRSKLIQTIFTTIGLLYPFPR
DCSQAMLNGDTTSGLYTIYINGDKTQALEVYCDMTSDGGGWIVFLRRKNGREDFYRNWKA
YATGFGDRREEFWLGLDNLSKITAQGQYELRVDLQDHGESAYAVYDRFSVGDAKSRYKLK
VEGYSGTAGDSMNYHNGRSFSTYDKDTDSAITNCALSYKGAFWYKNCHRVNLMGRYGDNN
HSQGVNWFHWKGHEYSIQFAEMKLRPSNFRNLEGRRKRA
NT seq 6060 nt   +upstreamnt  +downstreamnt
atgggggccgtgrcctggctattgccaggcatcttcctagctttgtttgccctcactcct
gaaggtggggtcctcaagaaagtcatcaggcacaagcgagagagtgggctgaacttgacc
ttgccagaggagaatcagccagtggtgttcaaccatgtctacaacatcaagttgcccatg
ggttctcagtgctcagtggatctagagtcaacgagtggagagaaagacctgagccccaca
ccagagtccagtggaagcttccaggagcatacagtggatggggaaaatcagattgtgttc
acacaccgcatcaacatccctcgtcgggcctgtggctgtgccgcagctccagatgtgaag
gagctcctgagcagactggaggaactggagatgttggtatcttctctaagggagcagtgc
accatgggtacaggctgttgcctccaacctgcagaaggccgtctggacagcaggcccttc
tgcagcggcaggggtaacttcagtgctgaaggttgtggctgtgtctgtgaaccaggctgg
aaaggtcccaactgctctgagcctgaatgccctggaaactgtaatctcagaggccagtgc
cttgatggacagtgtatctgtgaccagggtttcactggggaagactgcagccagctagcc
tgtcccaatgactgcaatgaccagggcaagtgtgtgaatggggtctgtgtgtgcttcgaa
ggctatgccggccttgactgtggcctggaagtctgcccagtgccgtgcagcgaggaacac
gggatgtgtgtggatggcaggtgtgtgtgcaaagatggctttgctggtgaagactgcaat
gagcccctttgcttcaacaactgcaacaaccgtgggcggtgtgtggagaacgaatgcgtc
tgtgatgagggcttcacgggcgaagactgcagcgagctcatttgccccaatgactgcttc
gaccgaggtcgctgcatcaacggcacctgctactgtgaagaaggtttcacaggtgaagac
tgtggtgagctcacctgccccaacaactgtcagggccatggccagtgtgaagagggacag
tgtgtttgcaacgagggctttgcaggggcagactgcagtgaaaagcggtgtcccgcagac
tgtcaccaccatggccgctgtctcaacggacagtgtgagtgtgacgatgggttcacaggg
gctgactgtggggaactccagtgccccaatggctgcagtgggcatggccgctgtgtcaat
gggcagtgtgtgtgtgacgagggctacaccggagaagactgtagccaacggcgatgcccc
aatgactgccacaaccggggtcactgtgtacagggcaaatgcatatgtgagcaaggcttc
aaaggctttgactgtagtgagatgagctgtcccaatgactgccaccagcatggccgctgt
gtgaatggcatgtgcatctgtgatgatgactacactggggaagactgcagagaccgccgc
tgtccccgggactgtagccagcgggggcgctgtgtggacggacagtgcatttgtgaggat
ggtttcactggttctgactgtgctgagctctcctgccctggtgactgccacggccatggc
cgctgtgtgaatggccaatgcatctgccacgagggcttcaccggcraagactgcaaagag
caaaggtgccccagtgactgccatggccaaggccgctgtgaggacggccaatgtatctgc
catgagggcttcacaggcctggactgtgggcagcgctcctgtcccaatgactgcagcaac
cagggacaatgtgtggcaggccgctgcatctgcaatgaaggctacacaggggtagactgc
tctgaggtgtcccctcccaaagaccttattgtgacagaagtaacagaggagactgtarat
ctggcatgggacaatgagatgcgggtcactgagtacctcattatgtacacacccacccat
gctgatggcctggagatgcagttccgtgtgcctggggaccagacatctaccaccatccgg
gagctggagccaggagtggagtacttcattcgtgtgttcgccatcttggagaacaagagg
agcatccctgtcagtgccagagttgccacctacttgcctgcacctgaaggcctaaaattc
aagtctatcaaggagacatctgtggaagtagagtgggatcctctggacatcgcttttgaa
acgtgggagatcattttcagaaatatgaacaaagaagatgagggagagatcacaaaaagc
ttgaggaggccagagacctcctaccgccaaactggcctggctcctggccaagaatatgaa
atatctcttcacattgtgaaaaacaacacccgaggccctggcttgaagaaagtgaccaca
actcgcctggatgcccccagccaaattgaggtgagagacatcacagacactacagcactg
atcacctggtccaagcccttggctgaaattgatagcattgagctctcctatggcatcaag
gatgtgcctggagaccggaccaccatcgacctcacacatgaagacaaccagtactccatc
gggaacctgaaacctgacacggagtatgaggtgtccctcatctcccgcagagtggacatg
gcaagcaaccctgccaaggagaccttcatcacaggcctggatgctcccaggaatctccgt
cgcgtctcacagacagacaacagcatcaccttggagtggaggaatgtcaaggcagacatt
gatagttatagaattaagtatgcacctatctctggaggtgaccatgctgagatagatgtt
ccaaagagccagcaagccacaaccaaaaccacactcacaggtctaaggccaggaactgaa
tatggagtgggcgtctctgctgttaagggagacaaggagggcgatccagcaaccatcaat
gcagccacagaaattgatgcacccagggacttacaggtgtctgagaccacacaagacagt
ctgactttattctggaagacacctctggccaagtttgatcgttaccgcctcaactacagc
ctccccacaggccagtcaatagagattcagctgccaaaggatgccacctcccatgtcctg
acagacctggagccagggcaagaatacactgttctcctcaccgctgagaagggcagacat
aagagcaagcctgcgcgtgtgaaggcatccacagaagaagtaccttctctggaaaatctc
accgtgactgaggccggctgggatggcctcagactcaactggactgcagatgacctggcc
tatgagtactttgttattcaggtacaggaagccaacaaggtggagactgctcacaacttc
acagtacttggtaacctccggactgcagacatcccgggcctcaaggctgccactccttat
agagtctccatttatggggtagctcggggctataaaacacccgtgctctctgctgagacc
ttcacagggaaaactcccagtttgggagaggtgactgtggctgaggtgggctgggatgcc
ctcaagctcaactggactgctccagaaggagcctatacgaactttttcattcaggtgcta
gaggctgacatgacccagactgtccagaacctcacagtcccagggggactgaggtcagtg
gacctacctgggctcaaagcagccacccgctattacatcaccgttcgaggggtcacccag
gacttcagcacagcccctctctctgttgaggtcttgacagaggagatccctcaactggga
ggcttgtccgtgactgaggtcagctgggacagccttacactcaactggaccacagatgat
ctggcctataagcactttatcattcaggtgcaggaggccaacaatgtagaggctgctcag
aacctcacagtatctggtagcctcagagttgtggacatcccaggcctcaaggctgatacc
ccttatagagtctccatctatggggtgatccagggctatagaacaccaatgctctctgct
gatgtctccacagccaaagaacctgaaattggaaacttaaatatttctgatgtaactcct
gagagcttcaatctctcctggacagctaccgacgggatcttcgacatgtttactatcgaa
attattgattctaataggttgctgcagacagcagaacataatatatctggtgctgaacga
actgcccacatctcgggccttccacctagtactgatttcattgtctacctctctggaatt
gctcccagcatccgtaccaaaaccatcagtaccacagccaccacagaagctgaaccggaa
gttgacaaccttctagtttcagatgctactccagacggtttctgtctgtcctggactgct
gatgaagggatattcgacagttttgttatcaggatcagagataccaaaaagcaatctgaa
ccacaagaaataaccctcccttcccccgaccgtaccagggacataacaggtctcagagag
gccactgagtacgaaattgaactctatggaataagccgtggaaggcgatcccagccagtc
agtgccatagcaacaacagccatgggctctccgaaggaaatcatgttctcagacatcact
gaaaatgcagccacagtcagctggagggcaccgactgctcaggtggagagtttccggatt
acttatgtgcctgtaacaggaggtcccccatccatggtgaccgtggatgggacggatact
gagacccgactggtgaggctcacccctggagtggagtaccacgtcagtgtgattgccatg
aagggatttgaagaaagtgatccggtctcggggtctctaatcacagctctcgatggtcca
tctggccttctgacagccaacatcacagactcagaagccttggccatgtggcagccggcc
attgccactgtggacagttatgtcatctcctacacaggggagagagtgccagaaattacc
cgcacagtgtctgggaatacagtggagtttgaactacatgacctggagcctgccacagag
tacacactgagtgtctttgcagagaaaggtcaccagaagagttctaccatcgctacgaaa
tttaccacagacctggattccccaagagaattgactgctactgaggttcagtcagaaact
gccttcctcacctggagacctccccgggcatcggtcactggatacctcctggtctatgag
tctgtggatggtacagtcaaggaagtcattgtggggcctgacaccacctcctacagcctg
gcagacctgagcccatccacccactacacagcaagaatccaggcattgagtgggtccctg
aggagcaaactgatccaaaccatcttcaccacaattggactcctatatccattccctagg
gattgctctcaagcaatgttgaatggcgacaccacctctggcctctataccatctatata
aatggtgacaagactcaagcactggaagtctactgtgatatgacctctgatggaggtgga
tggatcgttttcctgagacgcaaaaatggacgtgaggacttctatcgaaactggaaggcc
tatgccactgggtttggagaccgcagagaagaattctggcttggactggataacctgagc
aaaatcacagcccaagggcaatatgagctccgggtggacctacaagaccatggggagtca
gcctatgctgtgtatgacaggttcagtgttggagatgccaagagtcgctacaagctgaag
gtagaaggatacagtggaacggcaggtgactccatgaactatcacaatggtagatccttc
tctacctatgacaaggacacagactcagccatcaccaactgtgccctgtcctacaaagga
gctttctggtataagaactgtcatcgtgtcaacctgatgggcagatatggggacaataat
cacagtcagggtgttaactggttccactggaagggccatgagtactcaatccagtttgcg
gagatgaaactaagacccagcaacttccgaaatctggaaggcaggcgtaagcgggcataa

KEGG   Rattus norvegicus (rat): 116669
Entry
116669            CDS       T01003                                 

Gene name
Vwf
Definition
(RefSeq) von Willebrand factor precursor
  KO
K03900  von Willebrand factor
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04610  Complement and coagulation cascades
rno04611  Platelet activation
rno04613  Neutrophil extracellular trap formation
rno05165  Human papillomavirus infection
rno05171  Coronavirus disease - COVID-19
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    116669 (Vwf)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    116669 (Vwf)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    116669 (Vwf)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    116669 (Vwf)
   04611 Platelet activation
    116669 (Vwf)
   04613 Neutrophil extracellular trap formation
    116669 (Vwf)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease - COVID-19
    116669 (Vwf)
   05165 Human papillomavirus infection
    116669 (Vwf)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:rno03110]
    116669 (Vwf)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    116669 (Vwf)
Chaperones and folding catalysts [BR:rno03110]
 Intramolecular chaperones
  Others
   116669 (Vwf)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   116669 (Vwf)
SSDB
Motif
Pfam: VWD VWA C8 VWA_N2 TIL VWA_2 VWC Pacifastin_I MCR_beta_N
Other DBs
NCBI-GeneID: 116669
NCBI-ProteinID: NP_446341
RGD: 621759
Ensembl: ENSRNOG00000019689
UniProt: F1M957
Position
4
AA seq 2812 aa
MNPFGYEICLLVLALTWPGTLCTEDPRDRPSTARCSLFGDGFINTFDEAMYSFAGGCSYL
LAGDCQKRSFSILGNFQDGKRVGLSVYLGEFFDIHLFANGTVMQGDQSISMPYASKGLYV
EHEAGYYKISSEAFGFAARIDGDGNFQVLMSDRHFNKTCGLCGDFNIFAEDDFRTQEGTL
TSDPYDFANSWALSSENQRCKRTSPPSTNCTSSSGDMQQAMWEQCQLLKTASVFARCHPL
VDPEPFVALCEKTLCTCVTGPECACPALLEYARTCAQEGMVLYGWADHSACRPACPAGME
YKECVSPCTRTCQSLPINEVCQQQCVDGCGCPEGELLDEDRCVQSSDCSCVHAGKRYPPG
TSLPQDCNTCICRHSLWICSNEECPGECLVTGQSHFKSFDNRHFTFSGICQYLLARDCQD
HSFSIVIETMQCADDPDAVCTRSVSVRLSALHNSLMKLKHGGGVAIDGQDVQIPFLQGDL
RIQHTVMASVRLSYGEDLQMEWDGRGRLLVKLSPIYSGKTCGLCGNYNGNKGDDFLTPAG
LVEPLVVDFGNAWKLQGDCPDLHRQHSDPCSLNPRLTRFAEEACALLTSSKFETCHHAVS
PLPYLQNCRYDVCSCADSQDCLCSAVANYAAACARKGVHIGWREPDFCALSCPQGQVYLQ
CGNSCNMTCRSLSLPDEECSEVCLEGCFCPPGLYQDERGDCVPKAQCPCYHDGELFQPAD
IFSDHHTMCYCEDGFMHCTTSGASGSLLPNIVLSSPLSHRSKRSLSCRPPMVKLVCPADN
PRAQGLECAKTCQNYDLECMSLGCVSGCLCPPGMVRHENRCVALERCPCFHQGEAYSPGD
TVKIGCNTCVCRDRRWNCTTHVCDATCSALGMAHYLTFDGLKYLFPGECQYVLVQDYCGG
SSGTFRILVGNEGCTYPSVKCKKRVTILVDGGEIELFNGEVNVKRPLKDESHFEMVESGQ
YVILLLGQGLSVVWDHRLSISVVLKYTYQEHVCGLCGNFDGIQNNDFTSSSLQVEEDPID
FGNSWKVSPQCADTRKLSLDVSLAACHNNIMKQTMVDSSCRILTSDIFQGCNRLVDPEPY
LDICIYDTCSCESIGDCSCFCDTIAAYAHVCAQHGQVVAWRKPTLCPQSCEERNVRENSY
ECEWRYNSCAPACPVTCQHPEPLACPVQCVEGCHAHCPPGKILDELLQTCVDAQDCPVCE
VAGRRLASGKKITLSPNDPQHCQNCHCDGVNLTCEACQEPGGLVVPPTDAPVSSTTPYVE
DTPEPPLHNFYCSKLLDLVFLLDGSYRLSEAEFEVLKAFVVGTMERLHISQKRIRVAVVE
YHDGSHAYLELRARKRPSELRRIASQIKYVGSQLASTSEVLKYTLFQIFGKIDRPEASRV
ILLLTASQEPQRMARYFTRYLQGFKKKKVILIPVGIGPHANLKQIRLIEKQAPENKAFLL
SGVDELEQRRDEIINYLCDLAPEAPAPTKPPQVAHITVSPGISGVSSPGPKRKSLVLDVV
FVLEASDEVGEANFNKSKEFLEEVIQRMDVSPAGTHIAVLQYSYTVNVEYTFKEAQSKED
VLRHVREIRYQGGNRTNTGQALQYLSEHSFSPSQGDREQAPNLVYMVTGNPASDEIRRLP
GDIQVVPIGVGSRANLQELERISRPIAPIFIQDFETLPREAPDLVLRTCCSKEGLQLPTL
PPLPDCSQPLDVVLLLDGSSSLPASSFDEMKSFAKAFISKANIGPHLTQVSVIQYGSINT
IDVPWNVAQEKAYLQSLVDLMQQEGGPSQIGNALAFAVRYVTSQIHGARPGASKAVVMII
MDTSLDSVDTAVDAARSNRVAVFPIGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLLTM
VTPGNSFFHRLCSGFSGVCVDEDGNEKRPGDVWTLPDQCHTVTCLANGQTLLQSHRVNCD
HGPRPSCSNSQSPVRVEETCGCRWTCPCVCTGSSTRHIVTFDGQNFKLTGNCSYVIFQNK
EQDLEVVLHNGACSPGAVQTCMKTIEVKHAGLSVELRSDMEMAVNGRPVLAPYVGGNMQV
SIYGAIMYEVRFTHLGHTLTFTPQNNEFQLQLSPKTFASKMYGLCGICDENGANDFTLRD
GTVTTDWKRLIQEWTVQQPGSTCQSVPEEQCPISDSSHCQVLLSASFAECHKVIAPATFH
TICQQDSCHQERVCEVIASYAHLCRTNGVCVDWRTTDFCAMSCPPSLIYNHCERGCPRYC
DGNTSFCGDHPSEGCFCPQHQVLLEGSCVPEEACTQCVGDDGVRHQFLETWVPDHQPCQI
CMCLSGRKINCTAQPCPTARAPTCGPCEVARLKQSADLCCPEYECVCDLVNCNLPPVPPC
EGGLQPTLTNPGECRPTFTCACRKEECKRVSPPSCPPHRTPTLRKTQCCDEYECTCSCVN
STLSCPLGYLASATTNDCGCTTTTCLPDKVCVHRGTVYPVGQFWEEGCDTCTCTDMEDTV
VGLRVAQCSQKPCEDSCQPGFSYVLHEGECCGKCLPSACKVAGSPRGDSLSSWKSVGSQW
AVPENPCLINECVRVEDAVFVQQRNISCPQLAVPTCPTGFQLNCETSACCPSCHCEPVEA
CLLNGTIIGPGKSVMVDLCTTCRCIVQRGAIFRFKLECRKTTCEACPVGYREEKSQSECC
GRCLPTACTIQLRGGRVMTLKQDETFQDGCDSHLCRVNERGEYIWEKRVTGCPPFDERKC
LAEGGKIVKIPGTCCDTCEEPDCKDITAKVQYIKVGDCKSEEEVDIHYCQGKCASKAVYS
IDIEDLQEQCSCCWPSSTERMRVPLLCTNGSVVHHEVINAMQCRCSPRNCGK
NT seq 8439 nt   +upstreamnt  +downstreamnt
atgaatcctttcggatatgagatctgcctgttagttctggccctcacctggccagggacc
ctctgcacagaagatcctcgtgacaggccgtcgacagcccgatgcagcctctttggggat
ggcttcatcaacactttcgatgaagccatgtacagttttgcaggaggctgcagttatctt
ctggctggggactgccagaaacgttccttctcaattctcgggaacttccaagatggcaag
agagtgggcctgtctgtgtatcttggggagttttttgacatccatttgtttgccaacggc
accgtgatgcagggggaccaaagcatctccatgccctatgcctccaagggactctatgta
gagcatgaggctgggtactacaaaatttccagtgaggccttcggctttgcggccagaatt
gacggcgacggcaacttccaagtcctgatgtcagacagacacttcaacaagacctgtggg
ctgtgtggtgacttcaacatcttcgctgaagatgactttaggacgcaggaggggaccttg
acctcagacccctacgactttgccaactcctgggccctgagtagtgagaaccagcggtgt
aaacggacatctcctcccagcactaactgcaccagctcctctggggacatgcagcaggcc
atgtgggagcagtgccagttactgaagacggcgtcggtgtttgcccgctgccaccctctg
gtggatcccgagcccttcgtggctctctgtgagaagactttgtgtacatgcgtcacgggg
ccagagtgcgcgtgtcctgctctccttgagtatgcccgaacctgtgcccaggaagggatg
gtgctgtatggctgggctgaccacagtgcctgtcgcccagcatgcccggctggcatggag
tataaggagtgtgtgtctccttgcaccagaacctgccagagcctgcctatcaatgaagtg
tgtcagcagcaatgtgtagatggctgtggctgccctgagggagagctcctggacgaagac
cgctgtgtgcagagctctgactgttcttgtgtacacgccggaaagcggtacccacctggc
acctccctccctcaagactgcaacacttgcatctgcagacacagtctctggatctgcagc
aatgaggaatgcccaggggagtgtcttgtcacaggccaatcccacttcaagagcttcgac
aacagacacttcaccttcagtggcatctgccagtacctgctggcccgggactgccaagac
cactctttctccattgtcatagagaccatgcagtgcgctgatgaccctgatgctgtctgc
acccgctcagtcagtgtgcggctgtctgccctgcacaacagcctcatgaagctgaagcac
gggggaggggtggccatcgatggtcaggatgtccagattcccttcctgcaaggtgacctc
cgtatccagcacacggtgatggcttctgtacgcctcagctatggggaggacctgcagatg
gaatgggatggccgcgggcggctactggttaagctgtccccaatctattctgggaagacc
tgtggtttgtgtgggaattacaatggcaacaagggagacgacttcctcacgccggccggc
ctggtggagcctctggtggtagactttggaaacgcctggaagcttcaaggggattgtccg
gacctgcacagacaacacagcgacccctgcagcctgaacccacgcttgaccaggtttgcg
gaggaggcttgtgcgctcctgacgtcctccaagttcgagacctgccaccatgcggtcagc
cctctgccctacctgcagaactgtcgttatgatgtttgctcctgcgctgacagccaggat
tgcctgtgtagcgcagtagctaactacgctgctgcgtgtgcccgaaaaggcgtgcacatc
gggtggcgggagccagacttctgtgctctgagctgcccacagggccaggtgtacctgcag
tgtggcaattcctgcaacatgacctgccgctccctctcccttccggatgaagaatgcagt
gaagtctgtcttgaaggctgcttctgccccccagggctctaccaggatgaaagaggggac
tgtgtgcccaaggcccagtgtccctgttaccacgacggtgagctcttccagcctgcagac
atcttctcagaccaccacaccatgtgttactgtgaagatggcttcatgcactgtaccacg
agtggtgcctcggggagcctgttgcctaacattgtcctcagcagccccttgtctcaccgc
agcaaaaggagcctctcctgtcggccacccatggtcaagttggtatgtcctgctgacaac
ccacgggctcaagggctggagtgtgctaagacgtgccagaactatgacctggaatgcatg
agtctgggctgtgtgtcaggctgcctctgtcccccaggcatggtccggcacgaaaacagg
tgtgtggccttggagcgctgtccctgcttccaccagggtgaggcgtactccccaggagat
acagtgaagatcggctgcaacacctgtgtctgccgggaccggaggtggaactgtacaacc
cacgtgtgtgacgccacttgctctgcccttggtatggctcactacctcaccttcgatggt
ctcaagtacctgttccctggggagtgtcagtacgttctggtgcaggattactgtggcggt
agctctgggacctttcggatcctggtggggaatgagggttgcacttatccctcggtgaaa
tgcaagaaacgggtgaccatcctggtggatggaggggagattgaactgttcaatggagag
gtgaatgttaagagacctctgaaagatgagtctcactttgagatggtggagtcgggccag
tacgtcatcctgctgctgggtcagggcctttctgtggtctgggaccaccgcctcagcatc
tccgtggtcctgaagtacacataccaggaacacgtgtgtggcctgtgtgggaacttcgat
ggtatccagaacaatgacttcaccagcagcagccttcaggtggaggaagaccccattgac
tttgggaactcctggaaagtgagcccacagtgtgccgacacaagaaagctgtcactagac
gtttcccttgccgcctgccacaacaatatcatgaaacagacaatggtggactcatcttgc
agaatccttaccagtgacatcttccagggctgcaacaggctggtggaccccgagccatac
ctggacatctgcatttatgacacttgctcctgtgagtccatcggggactgctcctgtttc
tgtgacaccatcgctgcctatgcccacgtgtgcgcccagcatggccaggtggtagcctgg
aggaaacccacactgtgcccccagagctgtgaagaaaggaacgttcgggaaaacagctat
gagtgtgagtggcgctataacagctgtgcacccgcctgcccggtcacatgccagcaccct
gagcctctggcttgtcctgtacagtgtgtggagggctgccatgcacactgccctccaggg
aaaatcctggatgaacttctgcagacctgtgtggatgcccaagactgccctgtgtgtgag
gtggctggtcggcgcttggcttccgggaagaaaatcaccctgagtcccaatgaccctcag
cactgtcagaattgtcactgtgatggtgtgaaccttacctgtgaagcctgccaagagccc
ggaggcctggtggtacccccaacagatgccccagtcagctccaccaccccatatgttgag
gacacccccgaaccccccctgcacaacttctactgcagcaagctgctggatctcgtcttc
ctcctggatggctcctacaggctgtctgaggctgagtttgaagtgctcaaagcttttgtg
gtgggtacgatggagaggctacacatctctcagaagcgcatccgcgtggcagtggtagag
taccatgatggctcccatgcctaccttgagctcagggcccggaagcgaccctcagagctt
cggcgcatcgccagccagattaagtatgtgggcagccagttggcctctaccagtgaggtt
ttgaagtacacactgttccagatctttggcaaaattgaccgtcctgaagcctcccgtgtc
attctgctcctgaccgctagccaggagccccaacggatggctaggtatttcacccgctat
ctccaaggtttcaagaagaagaaggtcattctgatccctgtgggcattgggccgcacgcc
aacctcaaacagatccgcctcattgagaagcaggcccctgagaacaaggcctttctgctt
agtggggtggatgagctggagcagagaagagatgagataattaactacctctgtgacctt
gcccccgaggccccagcccctactaagcctccacaggtagcacacatcactgtgagccct
ggtatctctggggtttcatcaccaggacccaaaaggaagtctctggttctggatgtggtg
tttgtcctggaggcgtcagacgaagttggtgaagccaacttcaataagagcaaggagttc
ctggaggaggttatccagcgcatggatgtgagcccggctgggacgcacatcgcagttctg
cagtactcatacacagtaaacgtggagtacaccttcaaggaggcccagtccaaggaggat
gtgctgcgacatgtgcgagagatccgctaccaggggggcaacaggacaaacactgggcag
gccctgcagtacctttctgagcacagcttctctcccagccagggggaccgggagcaggca
cctaacctggtctacatggtcacggggaaccctgcctctgatgagatcaggaggttgcct
ggagacatccaggtggtacccattggggtgggctccagggccaacctgcaggaactggag
aggatcagcagacccatcgctcccatcttcatccaagactttgagacacttccccgagag
gctcccgacctggtcctgaggacgtgctgctccaaggaggggctgcagctgcccaccctc
ccccctctccctgactgcagccaacccctggatgtggtcctgctcctggatggctcctct
agcttgcctgcatcttcctttgatgagatgaagagttttgccaaagctttcatctcaaag
gccaacattgggccccatctcactcaggtgtccgtgatacaatacggaagcatcaatacc
atcgatgtaccgtggaatgtggcccaggagaaagcctatctacagagtttggtggacctc
atgcagcaggagggcggccccagccagattgggaatgctctggcctttgccgtgcgctat
gtaacttcacagatccatggagccaggcctggagcctccaaagcagtggtcatgatcatc
atggatacctccttggattcggtggacacagcagtggatgctgccagatccaaccgggtg
gcagtgttccccattggggttggggatcgatatgatgaagcccagctgaggatcttggca
ggccctggggccagctccaatgtggtaaagctccagcaagttgaagacctcctcaccatg
gtcaccccgggaaactccttcttccacagactgtgttctgggttttctggagtttgtgtg
gatgaggatgggaacgagaagaggcctggagacgtctggaccttgccagatcagtgccac
acagtgacttgcttggcaaatggtcagaccttgttgcagagtcatcgggtcaattgtgac
catggaccccggccttcatgttccaacagccagtctcccgttcgggtggaggagacctgt
ggctgccgctggacctgcccttgtgtgtgcacgggcagttccactcggcacatcgtcacc
ttcgatggacagaatttcaagcttaccggcaactgctcctacgtcatctttcaaaacaag
gagcaggacctggaagtggtcctccataatggggcctgcagccctggggccgtgcaaacc
tgcatgaagaccatcgaggttaagcatgctggcctctctgttgagctgcgcagtgacatg
gagatggcagtgaatgggagacccgtccttgccccgtacgtgggtggaaatatgcaagtc
agcatctatggtgctatcatgtatgaagtcaggttcacccatcttggtcacactctcaca
tttactccacaaaacaacgagttccaactgcagcttagccccaagacctttgcttcgaag
atgtatggtctctgtggaatctgtgacgaaaatggggccaatgacttcacattgcgagat
ggcacggtcaccacagactggaagaggcttatccaggaatggaccgtgcagcagccaggg
agcacatgccagtctgttcctgaagagcagtgtcccatctctgacagctcccactgccag
gtcctcctctcagcatcgtttgctgaatgccacaaggtcatcgctccagccacattccat
acaatctgccagcaagacagttgccaccaggagcgagtgtgtgaggtgattgcttcttac
gcccatctctgtcggactaatggagtctgtgttgattggaggacaactgacttctgtgct
atgtcatgcccaccatccctgatatataaccactgtgagcgtggctgccctcggtactgt
gatggaaacactagcttctgtggcgaccacccctcagaaggctgcttctgcccccaacac
caagtcttgctggaaggcagctgtgtccccgaagaggcctgcactcagtgtgttggggac
gatggagtccgacatcagttcctagagacctgggtcccagaccatcagccctgtcagatc
tgtatgtgcctcagtgggaggaagattaactgcactgcacagccatgtcccacagcccgg
gctcccacctgtggcccttgtgaagtggctcgtctcaagcagagtgcagacctgtgctgc
ccagagtatgagtgtgtgtgtgacctggtcaactgcaacttgccaccagtgcctccgtgt
gaaggaggactccagccaaccctgaccaaccctggagaatgcagacccacctttacctgt
gcctgcaggaaagaagagtgcaaaagagtgtccccaccctcctgcccccctcaccggaca
cccacccttcggaagacccagtgctgtgatgagtatgaatgcacttgcagctgtgtcaac
tccacgctgagctgcccacttggctacctggcctcggccaccaccaatgactgtggctgc
accacaaccacctgtctccctgacaaggtttgtgtccaccgaggcaccgtctaccctgtg
ggccagttctgggaggagggctgtgacacatgcacctgcaccgacatggaggatactgtt
gtgggcctgcgtgtggctcaatgctctcagaagccctgtgaagacagctgtcagccaggt
ttttcttatgttctccacgaaggggagtgctgtggaaagtgcctgccgtctgcttgcaag
gtggctggctcaccgaggggtgattccctctcttcctggaagagcgttgggtctcagtgg
gctgttcctgagaacccctgcctcatcaatgagtgtgtccgagtggaggacgcagtgttt
gtgcagcagaggaacatctcctgcccacagctggctgtccctacctgtcccacgggcttc
cagctgaactgtgagacctcagcatgctgtcccagctgtcactgtgagcctgtggaggcc
tgcctgctcaatggcaccatcattgggcccgggaagagtgtgatggttgacttatgcacg
acctgccgttgcattgttcagagaggagccatcttcagattcaagctggagtgcagaaag
accacctgtgaggcctgccctgtgggctatcgggaggagaagagtcagagcgaatgctgt
ggccggtgcctgcctacggcttgcactattcagctgagaggaggacgggtcatgaccctg
aagcaagatgagacattccaggatggctgtgacagtcatctgtgcagggtcaacgagaga
ggggagtacatttgggagaagagggtcacgggctgccccccattcgatgaacgcaagtgt
ctggctgaaggaggcaaaatcgtgaaaattccaggcacctgctgtgacacgtgtgaggag
cccgattgcaaagacatcacagccaaggtgcagtatatcaaagtgggagattgtaagtcg
gaagaggaagtggacattcactattgccagggaaagtgcgccagcaaagctgtgtactcc
attgacatcgaggatcttcaggaacagtgctcctgctgctggccctcaagtacagagcgc
atgcgtgtgcccttgctctgcaccaatggctcggttgtgcaccacgaggtcatcaatgcc
atgcaatgcagatgttctccccgtaattgtggcaagtga

KEGG   Rattus norvegicus (rat): 117036
Entry
117036            CDS       T01003                                 

Gene name
Lamc1
Definition
(RefSeq) laminin subunit gamma-1 precursor
  KO
K05635  laminin, gamma 1
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05020  Prion disease
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    117036 (Lamc1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    117036 (Lamc1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    117036 (Lamc1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    117036 (Lamc1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    117036 (Lamc1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    117036 (Lamc1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    117036 (Lamc1)
   05145 Toxoplasmosis
    117036 (Lamc1)
  09164 Neurodegenerative disease
   05020 Prion disease
    117036 (Lamc1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 117036
NCBI-ProteinID: NP_446418
RGD: 621052
Ensembl: ENSRNOG00000002680
Vega: OTTRNOG00000002096
UniProt: F1MAA7
Position
13
AA seq 1607 aa
MTGGRRAALALQLQRRLWLLLAVLAAVACWVRAAMDECADEGGRPQRCMPEFVNAAFNVT
VVATNTCGTPPEEYCVQTGVTGVTKSCHLCDAGQLHLQHGAAFLTDYNNQADTTWWQSQT
MLAGVQYPNSINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGSC
ENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQE
WVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGGRCKCNGHASECVKNEFGK
LVCNCKHNTYGVDCEKCLPFFNDRPWRRATAESANQCLPCDCNGRSQECYFDPELYRSTG
HGGHCTNCRDNTDGAKCERCRENFFRLGNTEACSPCHCSPVGSLSTQCDSYGRCSCKPGV
MGDKCDRCQPGFHSLTEAGCRPCSCDPSGSTDECNVETGRCVCKDNVEGFSCERCKPGFF
NLESSNPKGCTPCFCFGHSSVCTNAVGYSVYDISSTFQIDEDGWRVEQRDGSEASLEWSS
DRQDIAVISDTYFPRYFVAPAKFLGNQVLSYGQNLSFSFRVDRRDTRLSAEDLVLEGAGL
RVSVPLIAQGNSYPSETTVKYIFRLHEATDYPWRPALSPFEFQKLLNNLTSIKIRGTYSE
RSAGYLDDVTLQSARPGPGVPATWVESCTCPVGYGGQFCETCLPGYRRETPSLGPYSPCV
LCTCNGHSETCDPETGVCDCRDNTAGPHCEKCSDGYYGDSTLGTSSDCQPCPCPGGSSCA
IVPKTKEVVCTHCPTGTAGKRCELCDDGYFGDPLGSNGPVRLCRPCQCNDNIDPNAVGNC
NRLTGECLKCIYNTAGFYCDRCKEGFFGNPLAPNPADKCKACACNPYGTVQQQSSCNPVT
GQCQCLPHVSGRDCGACDPGFYNLQSGQGCESCDCHALGSTNGQCDIRTGQCECQPGITG
QHCERCETNHFGFGPEGCKPCDCHHEGSLSLQCKDDGRCECREGFVGNRCDQCEENYFYN
RSWPGCQECPACYRLVKDKVAEHRVKLQELESLIANLGTGDETVTDQAFEDRLKEAEREV
TDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETGILAERARSRVESTE
QLIEIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAK
TANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGD
KAVEIYASVAQLTPVDSEALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVK
NLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKT
AAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATS
TKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAELN
ARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNLE
NEARKQEAAILDYDRDIAEIIKDIHNLEDIKKTLPTGCFNTPSIEKP
NT seq 4824 nt   +upstreamnt  +downstreamnt
atgacgggcggcaggcgggccgctctggccctgcagctccagcggcggctgtggctgctg
ctggctgtgctggcggccgtggcgtgctgggtccgggcggccatggacgagtgcgcggat
gagggcggccggccgcagcgctgcatgccggagtttgttaatgccgccttcaatgtgacc
gtggtggctaccaacacgtgtgggactccacccgaggagtactgcgtgcagactggagtg
accggagtcaccaagtcctgtcacctgtgcgacgccggccagctccacctgcaacacggg
gcagccttcttgaccgactacaacaaccaggccgacaccacctggtggcaaagccagact
atgctggccggggtgcagtaccccaactccatcaacctcacgctgcacctgggaaaggct
tttgacatcacttacgtgcgtctcaagttccacaccagccgcccagagagtttcgccatt
tataagcgcactcgggaagacgggccctggattccttaccagtactacagtggctcctgt
gagaacacgtactcaaaggctaaccgtggcttcatcaggacgggaggagatgagcagcag
gccttgtgtaccgatgagttcagtgacatttcccccctcaccggcggcaacgtggccttt
tcaactctggaaggaaggccgagtgcctacaactttgacaacagccccgtgctacaggaa
tgggtgacagccactgacatcagagtcacactcaatcgcctgaacacctttggagatgaa
gtattcaatgaccccaaagtcctcaagtcttactattatgcaatctcggactttgctgtg
ggtggcaggtgtaaatgtaatggacatgccagtgagtgtgtaaagaacgagtttgggaag
ctcgtgtgcaactgcaagcataacacatacggagttgactgtgaaaagtgcctgcctttc
ttcaacgatcggccgtggaggagagcaactgccgagagtgccaaccagtgcctgccttgc
gactgcaatggccgatcccaagaatgctactttgaccctgagctataccgttccactggg
cacggtggccactgtaccaactgccgggataacacagacggtgccaagtgcgagaggtgc
cgggagaacttcttccgcctggggaacactgaagcctgctctccgtgccactgcagcccc
gttggttctctcagcacgcagtgtgacagttacggcaggtgcagctgtaagcccggagtg
atgggtgacaagtgtgaccgttgccagcctgggttccattccctcaccgaggcaggatgc
aggccatgctcctgcgacccttctggcagcaccgacgaatgtaatgttgaaacaggaaga
tgcgtttgcaaagacaacgtcgaaggcttcagctgtgagagatgcaaacctggatttttt
aatctggaatcctccaatcctaagggttgcacaccctgcttctgctttgggcattcttct
gtgtgcaccaatgctgttggctacagtgtttatgacatctcctcgacctttcagattgat
gaggacgggtggcgtgtggagcagagagatggctcggaggcgtctcttgagtggtcctca
gaccggcaggatattgctgtgatctcagacacttacttccccagatactttgtcgcccct
gccaagttcctgggcaaccaggtcctgagttacgggcagaacctctcattctccttccga
gtggacagacgggacactcgcctctccgcagaggatctcgtgctggaaggagctggcttg
agagtatccgtgcccttgatcgctcagggcaactcgtaccctagcgagaccactgtgaag
tacatcttcagactccatgaagcaacagattacccttggaggcctgctctctccccattt
gaatttcagaagctcctaaacaacttgacctctatcaagatccgtggtacatacagtgag
aggagcgctggatatttggatgatgtcaccttgcaaagtgctcgccctgggcccggagtc
cccgcaacgtgggtggagtcctgcacctgtccagtggggtatggagggcagttctgtgag
acatgcctcccaggctacagaagagaaaccccaagtcttggaccttacagcccgtgtgtg
ctctgtacctgtaatgggcacagtgagacctgtgaccccgagacaggtgtctgtgactgc
agagacaatactgccggcccgcactgtgagaaatgtagcgatgggtactatggagactca
accctgggcacctcctctgactgccagccttgtccctgccctggcggctcaagttgtgcc
attgtcccaaagacaaaggaagtggtgtgcacacactgtccaactggcactgccggcaag
agatgtgagctctgtgatgacggctactttggagaccctctgggcagcaatggacccgtg
agactctgccgcccatgtcagtgcaacgacaacatagaccccaacgcagttggcaactgc
aaccgtctgactggagaatgcctgaagtgcatctataacacggctggcttctactgcgac
cggtgcaaagaagggtttttcggaaatcccctggctcccaatccagcagacaagtgcaaa
gcctgcgcctgcaatccctatggtacagtacagcagcagagcagctgtaacccagtgacc
gggcagtgccagtgtctgcctcatgtgtcaggccgggactgtggcgcttgtgaccctggc
ttctacaacctgcagagcgggcaaggctgcgagagttgtgactgccatgctttgggctcc
accaacgggcaatgtgacatccgcaccgggcagtgtgagtgtcagcccggcatcaccggt
cagcactgtgagcgctgtgagaccaaccactttgggtttggacctgaaggctgcaaacct
tgcgactgtcaccacgaaggatccctttcgctccaatgtaaggatgacggccgttgtgaa
tgcagggaaggcttcgtgggcaatcgctgtgaccagtgtgaagagaactatttctacaat
cggtcttggcctggctgccaagagtgtccggcttgttaccggctggtgaaggataaggtt
gctgagcatcgagtgaaactccaggagttggagagcctcatagcaaacctcgggactggg
gatgagacggtgacagatcaagcctttgaggacagactgaaggaagcagagagggaggtt
acagacctcctccgtgaggctcaggaagtcaaagatgtagaccaaaatctgatggatcgc
cttcagagagtaaataacagtctgcacagccaaatcagccggctgcacaatatccggaac
actatagaagagacggggatcttggctgagcgagcgcggtcccgagtggagagcacagag
cagctgattgagatcgcctccagggagctcgagaaagcaaaaatggctgccgccaatgtg
tcaatcactcagccagagtctacaggggaaccaaacaacatgaccctcttggcagaagaa
gcccgaaggcttgcagagcgtcataaacaggaagctgatgacattgtacgagtggcaaag
acagccaacgagacctcagctgaggcatataatctgcttttgaggaccctggcaggagaa
aatcaaactgcattggagattgaagagcttaacaggaagtatgaacaagcaaagaacatc
tctcaggacctggagaagcaggctgccagagtccatgaggaagccaagcgagcaggtgac
aaagccgtggagatctatgccagtgtggcccagctgacccctgtggactcggaggctcta
gagagtgaagcaaataaaatcaagaaagaagctgcagatctggaccgtctgattgaccag
aagctaaaagattacgaggaccttagggaagacatgcgaggaaaggaacatgaagtaaag
aaccttctagagaaggggaaagccgaacagcagaccgctgaccaactcctagctcgagcc
gacgctgccaaggctcttgctgaagaagctgctaaaaagggacgcaataccttacaagaa
gctaacgacattctcaacaacctgaaagattttgatagacgagtgaacgataacaagaca
gctgctgaagaagctctaaggagaattcccgccatcaaccggaccatagctgaagccaat
gagaagacaagggaggcacagctggcgctgggcaatgcagcagctgacgccacggaggcc
aagaacaaagcccacgaggcggagaggattgccagtggcgtgcagaagaatgccaccagt
accaaggcggatgcggagagaaccttctcggaagtgacagacctggataacgaggtgaat
ggcatgttgaggcagctagaggaagcggagaatgagctgaagaggaagcaagatgatgcc
gaccaggacatgatgatggcgggaatggcttcacaggctgctcaggaagctgaacttaat
gccagaaaagccaaaaactctgtcagcagcctcctcagccaactgaccgacctcttggag
cagctgggacagctggacaccgtggacctgaacaagctcaacgagatcgaaggctctctg
aacaaagccaaagacgaaatgaaggtgagcgacctggacaggaaggtgtccaacctggag
aacgaggcccggaagcaggaggcagccatcttggactatgaccgagacatagcagagatc
ataaaggacattcacaacctggaggacatcaagaagacactaccaacaggctgctttaac
accccgtccatcgagaaaccctag

KEGG   Rattus norvegicus (rat): 140433
Entry
140433            CDS       T01003                                 

Gene name
Lama5
Definition
(RefSeq) laminin subunit alpha-5 precursor
  KO
K06240  laminin, alpha 3/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    140433 (Lama5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    140433 (Lama5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    140433 (Lama5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    140433 (Lama5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    140433 (Lama5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    140433 (Lama5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    140433 (Lama5)
   05145 Toxoplasmosis
    140433 (Lama5)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_G_2 Laminin_I Laminin_G_1 Laminin_II Laminin_B Laminin_G_3
Other DBs
NCBI-GeneID: 140433
NCBI-ProteinID: NP_001178538
RGD: 621023
Ensembl: ENSRNOG00000053691
UniProt: F1MAN8
Position
3
AA seq 3713 aa
MAKRGGQLCVGCAPGARGPRSLAPLPLLLLAGLALVGEARIPGGDGFSLHPPYFNLAEGA
RITASATCGEEAPTRSASRPTEDLYCKLVGGPVAGGDPNQTIQGQYCDICIAANSNKAHP
VSNAIDGTERWWQSPPLSRGPEYNEVNVTLDLGQVFHVAYVLIKFANSPRPDLWVLERST
DFGHTYQPWQFFASSKRDCLERFGPRTLERITQDDDVICTTEYSRIVPLENGEIVVSLVN
GRPGARNFSYSPVLRDFTKATNIRLRFLRTNTLLGHLMGKALRDPTVTRRYYYSIKDISI
GGRCVCHGHADVCDAKDPSDPFRLQCACQHNTCGGSCDRCCPGFNQQPWKPATTDSANEC
QSCNCHGHAYDCYYDPEVDRRNASQNLDNVYQGGGVCLDCQHHTTGINCERCVPGFYRAP
DQPLDSPHVCQPCDCESDFTDGTCEDLTGRCYCRPNFTGERCSACAEGYVDFPHCYPLPS
FPHNDTGEQVLPAGQIVNCDCNAAGTQGNACRKDPRLGRCVCKPNFQGNHCELCAPGFYG
PSCHPCQCSSPGVANGLCDPESGQCTCRTGFEGDKCDHCALGYFHFPLCQLCGCSPAGTL
PQGCDESGHCQCRPGFDGPHCDRCLPGYHGYPDCHACACDPRGSLDQQCGAGGLCHCRPG
YTGATCQECSPGFYGFPSCIPCHCSADGSMHTTCDPTTGQCRCRPRVTGLRCDMCVPGAY
NFPYCEAGSCHPAGLAPANPALPEAQAPCTCRAHVEGPSCDRCKPGYWGLSSSNPEGCTR
CSCDPRGTLGGVTECQGNGQCFCKAHVCGKTCAACKDGFFGLDHADYYGCRSCRCDVGGA
LGQGCEPKTGACRCRPNTQGPSCSEPAKDHYLPDLHHIRLELEEAATPEGHAVRFGFNPL
EFENFSWRGYAHMTAIQPRIVARLNVTSPDLFRLVFRYVNRGSTSVNGQVSVREEGKFSS
CTNCTEQSQPVAFPPSTEPAFVTVPQRGFGEPFVLNPGTWALLVEAEGVLLDYVVLLPST
YYEAALLQHRVTEACTYHPSALHPTENCLLYAHLPLDGFPSAAGTEALCRHDNSLPRPCP
TEQLSPLHPPLATCAGSDVDIQLEMAVPRPGRYALVVEYVSEYSHQEMGVAVHTPQRAPQ
QGMIDLHPCPYSSLCRGPARDTQHHLAVFHLDSEASIRLTAEQAHFFLHSVTLIPVEEFS
AEFVEPRVFCVSSHGAFNPGSAACLASRFPKPPQPIVLKDCQVLPLPPDLPLTQSQELSP
GASPVGPQPRPPTAVDPNAEPTLIRHPQGTVVFTTQVPTLGRYAFLLHSYQPVHPSFPVE
VLIDGGRIWQGHANASFCPHGYGCRTLVSCEGQTMLDVTDNELAVTVRVPEGRWLWLDYI
LVVPEDAYSSSYLQEEPLDKSYDFISHCATQGYHISPSSSSLFCRNAATSLSLFYNNGAL
PCGCHEVGAVGPTCEPFGGQCPCRGHVIGRDCSRCATGYWGFPNCRPCDCGARLCDELTG
QCICPPRTVPPDCLVCQPQSFGCHPLVGCEECNCSGPGVQELTDPTCDMDSGQCRCRPNV
AGRRCDTCAPGFYSYPSCRPCDCHEAGTMASVCDPFTGQCHCKENVQGSRCDQCRVGTFS
LDAANPKGCTRCFCFGATERCGNSNYARHEFMDMEGWVLLSSDRQVVPHEHRPEMELLHA
DLRLVADTFPELYWQAPPSYLGDRVSSYGGTLHYELHSETRRGDVFIPYESRPDVVLQGN
QMSIAFLELAYPSPGQVHRGQLQLVEGNFRHLETHNPVSREELMMVLAGLEQLQIRALFS
QTSSTVSLRRVVLEVASEAGGGPPASNVELCMCPANYRGDSCQECAPGYYRDTKGLFLGR
CVPCQCHGHSDRCLPGSGTCVGCQHNTEGDQCERCRPGFVSSDPSDPASPCVSCPCPLAV
PSNNFADGCILRNGRTQCLCRPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNM
IFSDCDPLTGACRGCLRHTTGPRCESCAPGFYGNALLPGNCTRCDCSPCGTETCDPQTGR
CLCKAGVTGQRCDHCLEGHFGFEQCQGCHPCACGPAAEGSECNPQTGQCHCRPGTTGPQC
LECAPGYWGLPEKGCRRCQCPRGHCDPHTGRCTCPPGLSGERCDTCSQQHQVPVPGRPGS
HGIHCEVCDHCVVLLLDDLERAGALLPSIREQLRGINASSTAWARLHRLNASIADLQSKL
RSPLGPHNQTAEQLQTLEQQSISLQQDTERLGSQATGVQDQAGRLLDNTESTLVRAQKLL
EIVQAVSRALNELASQGFPGNATTPSGEQLRWALAEVERLLWDMRTRDLGAPQAVAEAEL
AEAQRLMARVQEQLTSFWEENQALATHIRDQLAQYESGLMDLREALNHAVNTTREADELN
SRNEERLREALQWKQELSQDNATLKATLQAASLTLAHVSELLQGIDKAKEDLEHLAASLD
GAWTPLLKRMQAFSPASSKVDLVEAAEAHAQKLNQLAINLSGIIQGINQDRFIQRAVEAS
NAYSSILQAVQAAEDAAGHALRQASRTWEMVVQRGLAAGARQLLANSSALVETILGHQER
LGLAHGRLQAAGTQLRDVRAKKNQLAAQIQEAQAMLAMDTSETSEKIAHAKAVAAEARDT
AAHVQSQLQGMQKNVERWQSQLGGLRGQDLSQAERDASSSVSTLEKTLPQLLAKLSHLEN
RGVHNASLALSANIGRVRKLIAQARSAANKVKVSMKFNGRSGVRLRTPRDLADLAAYTAL
KFYIQSPVPAPEPGENTGDRFVLYMGSRQATGDYMGVSLRNQKVHWVYRLGEAGPTTLSI
DENIGEQFAAVSIDRTLQFGHMSVTVEKHMVHEIKGDTVAPGREGLLNLNPDDFVFYVGG
YPSNFTPPEPLRFPGYLGCIEMDTLNEEVVSLYNFEQAFKLTTAVDKPCARSKTTGDPWL
TDGSYLDGSGFARITFEKQFSNTKRFEQELRLVSYNGIIFFLKQENQFLCLAVQDGTLVL
LYDFGSGLKRADPLQPPQALTAASKAIQVFLLAGARKRVLVRVERATVFSVDQDNELEMA
DAYYLGGVPPEELPPSLRRLFPSGGSVRGCIKGIKALGKYVDLKRLNTTGISFSCTADLL
MGRTMTFHGHGFLPLALPDVAPITGEVYSGFGFRGTQDDNLLYYRTSPDGPYQVSLRGGH
VTLQFMNREVETQRVFADGAPHYVAFYSNVTRVWLYVDDQLQPVKSREGTTPMLQLQPEE
PPRLLLGGLPMSGTFHNFSGCISNVFVQRLRGPQRVFDLHQNMGSVNVSVGCTPAQLQTS
GATAQKVSRRSRQPSQDLTCMTPWLLGTIQDAYQFGGPGPSYLQFVGVSPSHRNRLHLSM
LVRPHGASQGLLLYVAPLSSHSPSLVLFLNHGRFVAQTEGPGPQLQAQSRQHSRAGQWHR
VSVRWGMQQVQLVVDGSQTWSQKAPHRRVHRAEGPQHYTLFVGGLPAGSYSSKLPVSVGF
SGCMKKLQLDKRPLRAPTRMVGVTPCVSGPLEDGLFFPGSEGAVTLELPKAKMPHVSLEL
EVRPLAAAGLIFHLGQAHATPYVQLQLLTEQVLLRANDGAGEFSTWVTYPKLCDGQWHQV
TVIKGRNTLRLEVDTHSNHTTGRLPETLADSPALLHLGSPPKSEAAWPEPPAYRGCLRKL
LFNGAPVNMTASARIQGAVGMSGCPSGTLAISKQGKALTQRQANPSVFPLRWH
NT seq 11142 nt   +upstreamnt  +downstreamnt
atggcgaagcgcggagggcagctctgcgtggggtgcgcaccgggagctcgtggcccccgg
agcctcgcgccgctgcccttgctgctgctggctgggctggcgctggtaggcgaggcgcgc
attcccgggggcgatggcttcagcctgcacccgccctacttcaacctggcggagggcgct
cgcatcaccgcgtccgcgacctgcggcgaggaggccccgacgcgcagtgcctcacgcccc
accgaggacctctactgcaagctggttgggggtccggtggcgggcggagatcccaaccag
acaatccagggccagtactgtgacatctgtatagctgccaacagcaacaaggcacaccct
gtgagcaacgccatcgatggcacggagcgctggtggcagagcccacccctgtcccgtggc
ccggagtacaatgaggtcaacgtcacactggacctgggccaggtgttccacgtggcctat
gtgctcatcaagtttgctaactcacctcggcctgacctctgggtactggaacggtccaca
gacttcggccacacttaccagccatggcagttctttgcctcctccaagagggattgtttg
gagcggtttggacctcggactctagagcgcatcacgcaggacgacgatgtcatctgcacc
acagagtactcccgaatagtgcctctggagaacggcgagatcgtggtgtccctggtgaac
ggacgccctggggccaggaacttctcctattcaccagtacttcgagacttcaccaaggcc
accaacatccgcttgcggtttcttcgaaccaacacgctcttgggccaccttatgggcaag
gcgctgcgggaccccaccgtcacccgcaggtattattacagcatcaaagacatcagcatc
ggcggccgctgcgtctgtcatggccacgcagatgtctgtgacgctaaggacccctcggat
cctttcaggctgcagtgtgcctgccagcacaacacgtgtggaggctcttgtgaccgatgc
tgtccaggcttcaaccagcagccgtggaagcccgccaccacggacagcgccaatgagtgc
cagtcctgcaactgccatggccatgcctacgactgttactatgaccccgaggtggaccgg
cgcaacgccagccagaacctggacaacgtgtaccagggtggaggtgtctgcctggattgc
cagcaccacaccacgggtatcaactgcgagcgctgtgttcctggcttctaccgcgcccct
gaccagccactggactcgcctcatgtctgtcagccctgtgactgtgagtcagacttcacc
gatgggacctgcgaagacctgacgggccgctgttactgcaggccgaacttcacaggagag
cggtgttccgcctgcgctgagggctacgtggacttcccgcactgctaccctctgccttca
tttccccataatgacacgggagaacaggtgcttcctgccggacagatcgtgaactgtgac
tgcaacgccgctgggacccagggcaatgcctgccggaaggacccgaggttgggacgctgt
gtctgcaaacccaacttccagggcaaccactgcgagctctgtgctcccggattctacgga
cctagctgccacccatgccagtgttccagtcctggggtagccaatggcctctgtgaccca
gagtcaggccagtgcacgtgccgcaccggcttcgagggggacaagtgtgaccactgtgcc
cttggctatttccacttccctctctgtcagctatgtggctgcagtccagcagggaccttg
cctcaaggctgtgacgagtctggccattgccaatgccgacctggctttgatggtcctcac
tgtgaccgctgccttccaggataccatgggtaccccgactgtcatgcttgtgcctgtgac
cctcgggggtccctggatcaacagtgtggagcgggtggtttgtgccactgccgtcctggc
tacacgggcgccacttgtcaggaatgtagccccggattctacggctttcccagctgcatc
ccttgccactgctctgccgatggctccatgcatacaacctgtgaccctacaactggccag
tgtaggtgccgtccccgagtgacaggacttcgttgtgatatgtgtgtgccgggggcctat
aacttcccctactgcgaagctggctcttgtcatcctgctggtctggccccagccaatcct
gcccttcctgaggcacaggctccctgtacgtgccgggctcatgtggaggggccaagctgt
gaccgctgtaaacctgggtattggggactgagctccagcaaccctgaaggctgcacacgc
tgcagctgtgacccacgaggcacactgggtggagttactgagtgccagggcaatgggcag
tgtttctgcaaggctcatgtgtgtggcaagacctgtgcagcctgcaaggatggcttcttt
ggcctggatcatgctgactactatggctgccgtagctgcaggtgtgatgttggaggtgcc
ctgggtcagggctgtgaaccaaagacgggtgcctgcaggtgccgccctaacacccagggc
ccttcctgcagcgagccagcgaaggaccactacttgccagacctgcatcacatacggctg
gaactagaggaggcggccactcccgagggccatgccgtccgctttggcttcaaccccctg
gagtttgagaacttcagctggagaggctacgcacatatgacggctatccagcccaggatc
gtggccaggctgaacgtgacctcccctgacctctttcggctggtcttccgatatgttaac
cgtggatcgaccagcgtgaatgggcaggtctctgttcgggaagagggcaagttttccagc
tgtaccaactgcacagagcagagccagccagtggccttcccacccagcacagagcctgcc
tttgtcaccgtgccccagaggggctttggggagccctttgtgctgaaccccggcacctgg
gccttgctggttgaggctgaaggtgtactcttggactacgtggtcctcctgcccagcacc
tactacgaggcagctctcctacagcatcgagtaaccgaggcctgtacctaccatccctca
gccctgcaccccacagagaactgtctcctctatgctcacctacccctggatggcttccct
tccgctgctggaactgaggccctgtgtcgccatgacaacagcctgccccggccctgcccc
acagagcagctcagccccttacacccgcccttggcgacctgtgccggcagtgatgtggac
atccagcttgagatggcggtgcctcggccaggccgatatgctctcgtggtggagtacgtc
agtgagtattcacaccaggagatgggcgtggctgtgcacacccctcagagagccccgcag
caggggatgatcgacctccacccctgtccctacagctccctgtgccggggtccggctcgg
gatacccagcatcatttagctgtcttccacctggactctgaggccagcatccggctcacg
gctgagcaggctcacttcttcctgcacagtgtcaccctgatacctgtggaggagttcagc
gctgagtttgtggagccccgggtcttctgtgtgagcagtcatggagcgttcaaccccggc
agcgctgcctgtctagcctcccgattcccgaagccaccgcagcccatcgtccttaaggac
tgccaggtcttgccactgcctccagacctgcctctgacccagtctcaggagctctcaccg
ggagcatctccagtgggaccacagcctcggccgccaaccgcggtggatcctaatgcagaa
cccaccttgatacgccacccacagggcacggtggtcttcaccacccaggtgcccaccctg
ggccgttacgccttcctgctgcatagctaccagccggtccacccctccttccctgtggag
gtactcatcgacggtggccgcatctggcagggccacgccaatgccagcttctgtccacat
ggttatggctgccgtaccctggtgtcgtgcgagggtcaaacaatgttggatgttacggac
aacgagctcgccgtgaccgtgcgtgtgccagaaggccggtggctctggctggactacata
cttgtcgtccctgaggatgcctacagctccagttacctccaagaggagcccctggacaaa
tcctatgacttcatcagccactgcgccacccagggctaccacattagccccagcagctcg
tctctgttctgccggaacgctgccacctccttgtctctcttctacaacaatggggccctc
ccttgtggctgccacgaggtgggtgccgtaggccccacatgcgaacccttcgggggccaa
tgcccctgccggggccacgtcattggccgtgactgctcccgctgcgccactggctactgg
ggcttccccaactgcaggccctgtgactgcggagcccgcctgtgtgacgagctcacaggt
cagtgtatctgtcctccacgcactgtgccccccgactgcctggtctgccagccacagagc
tttggttgccaccccttggtgggctgtgaggagtgtaactgctcaggacccggcgtccag
gagctgacggaccccacctgtgacatggacagcggccagtgcagatgcagacccaatgta
gctgggcgtcgctgtgacacctgtgcaccaggcttctatagttatcctagctgtcgcccc
tgtgactgtcatgaggcaggcaccatggccagtgtgtgtgaccccttcacaggccagtgc
cattgcaaggagaatgtacagggctccagatgtgaccagtgtcgcgtggggaccttctcc
ttggatgctgctaaccccaagggctgtacccgctgcttctgtttcggagccacagagcgc
tgtggaaactctaactatgcccgccatgagttcatggacatggagggctgggtgcttttg
agcagcgaccgacaggttgtaccccacgagcatcggcccgagatggagctgctgcacgca
gatctgcggcttgtggccgacactttcccagagctgtactggcaggccccgccctcctac
ctgggagacagggtgtcatcctacggtgggaccctccactatgagctgcactcagagacc
cgtcgaggtgatgtcttcattccctacgagagccggccagatgtcgtgctgcagggcaac
caaatgagcatcgccttcctggagctggcgtacccttcacctgggcaggttcaccgaggg
cagctacagctggtagaggggaacttccggcacttggagactcacaacccagtgtcccga
gaggaactcatgatggtgctggccggcctggagcagctgcagatccgtgccctcttctca
cagacctcttccactgtctccttgcgtagagtggtactagaggtggccagcgaagctggc
ggggggcctccagccagcaatgtggaactgtgtatgtgccctgccaactaccgaggggac
tcatgccaggagtgcgcccctggctattaccgggacaccaagggtctcttcctgggccga
tgtgtcccctgccagtgccatgggcattcagaccgctgccttcctggctctggcacttgt
gtgggctgccagcacaacacagaaggggaccagtgtgagcgctgcagacctggcttcgtc
agcagtgatcccagtgaccctgcatctccctgtgtgagctgcccttgccccttggcggtg
ccctccaataattttgcagacggctgcatcttaagaaatggccgaacccagtgcctctgc
aggcctggctatgccggtgcctcctgtgaacggtgtgcacctggcttttttgggaaccct
ctggtgctaggcagctcctgtcagccctgcgactgcagcggtaatggagaccccaacatg
atcttcagtgactgcgaccccctgacaggcgcttgtcgaggctgcctccgtcatactacc
gggccccgctgtgaaagctgtgcccctggcttctacggcaatgccttgttgccaggcaac
tgcacccggtgtgactgttccccatgcgggacagaaacctgtgatccccagactgggcgc
tgcctgtgtaaggcaggcgtgactggacaacgttgtgaccactgtttggaaggacacttc
ggttttgagcaatgccagggctgccacccttgtgcctgtggaccagctgctgagggctct
gagtgcaaccctcagactggtcagtgtcactgccggccagggaccacaggaccccagtgc
cttgagtgtgctcctggctactggggactcccggagaaaggttgcaggcgctgccaatgt
ccacgaggccactgtgacccacacacaggccgctgcacctgtcctccagggctcagtggg
gagcgctgtgacacctgcagccagcagcaccaggtgcctgtaccgggcaggcctgggagc
catggcatacactgtgaggtgtgtgaccactgtgtggtcctgcttctggatgacctcgag
cgggctggtgccctccttccctccatccgtgagcagctgcggggtatcaatgccagctcc
acggcctgggccaggctgcacaggctgaatgcctccattgctgacctgcagagtaagctc
cggagcccactgggaccccacaaccagacagcagagcagctacagactctagagcagcaa
agtataagccttcaacaggatacagagaggctgggcagtcaggccacaggggtccaagat
caggcaggccggctgctggacaacacggagtccacgctggtccgggcacagaaattgttg
gagatcgtgcaagctgtgagccgtgccctgaatgaactggcatctcaaggatttccaggc
aatgccacaacaccgtctggcgagcagctgcgctgggcactggctgaagtggagcggcta
ctctgggatatgcggactcgtgacctgggggccccacaggcagtggcagaagctgaactg
gcagaagcccagaggctgatggcgcgtgtccaggagcagctgactagcttctgggaggag
aaccaggcattggccacacacattcgggaccagctggctcagtacgagtctggcctcatg
gacctccgtgaggccctgaaccatgcagtcaatactacccgggaggctgatgaactcaac
agccgcaatgaggagcggctgagagaagccctgcaatggaaacaggagctgtcccaggac
aacgccaccctaaaggccactctccaagctgccagtctcaccttggcccatgtttctgag
cttctgcagggcatagacaaggctaaggaggacctagagcacctggcggccagcctggat
ggagcctggacaccgctactgaagaggatgcaggccttttcccccgccagcagcaaggtg
gacttagtagaggctgctgaggcccacgctcagaagctgaatcagctggcgatcaacctg
tctggcattatccagggaatcaatcaggaccgcttcatccagagggcggtggaagcctcc
aatgcctatagcagcatcctgcaggccgtacaggctgccgaagatgccgcaggccacgcg
ctgaggcaggccagccgcacgtgggagatggtggtgcaacggggcctagcagctggagcc
cggcagctgttagccaacagcagtgccttggtggagaccatcttggggcaccaggagagg
ctgggccttgctcatggccgcctgcaggccgcggggacccagcttcgcgatgtccgtgcc
aagaagaaccagctggccgcccagatccaggaggcacaagccatgctggccatggacaca
agcgagaccagcgagaagatcgctcacgccaaggctgtggctgcagaagcccgcgacacg
gccgcccatgtgcagtctcagcttcagggtatgcagaagaatgtggagcggtggcagagc
cagctggggggcctgcgaggccaggacctgagccaggcggaacgggatgcaagcagttca
gtgtctaccctggagaagacattgccacagctgctggccaaactgagccatctggagaac
cgaggggtgcacaacgccagcctggccttgtctgccaacatcggtcgtgtgcgcaagctc
attgcccaggcccggagtgccgccaacaaggtcaaggtgtccatgaagttcaacgggcgt
tcgggggtacggctgcgcaccccacgagacctcgctgaccttgctgcctacaccgccctc
aagttctacatccagagcccagtgccagcgcctgaacctggcgagaacactggggaccgc
tttgttctgtacatggggagccgccaggccaccggggactacatgggagtgtctctgcgt
aatcagaaggtgcactgggtgtacaggttaggtgaggctggccccacaactctcagcatt
gacgagaacattggggagcagtttgcagccgtcagcattgacaggaccctccagtttggc
cacatgtctgtcacggtggagaagcatatggttcatgagatcaagggggacacagtggcc
cctgggagggagggactactcaacctgaatcctgacgattttgtcttctacgtgggagga
taccccagcaacttcacgccccctgaacccctccgattccctggctacctgggttgcatc
gagatggacacactgaatgaagaggtggtcagcctctacaactttgagcaggccttcaag
ctgaccactgcggtggataagccttgtgctcgctccaagaccaccggtgacccatggctc
acagatggctcctacctggacggcagtggctttgcccgcatcacctttgagaagcagttc
agcaacacaaaacgctttgaacaggagctgcgtcttgtgtcctacaatgggatcatcttt
ttcctcaagcaagagaaccagttcttgtgcctggcagtacaggacggcactcttgtactc
ctctatgacttcggctccggcctgaagagggcggacccactgcagcccccacaagccttg
acggcagccagcaaggcgatccaagtgtttctattggctggcgcccgcaaacgtgtgctg
gtgcgtgtggagcgggccactgtgttcagcgtagatcaggataacgagctggagatggct
gatgcctactacctgggaggtgtgccacctgaggagctgcccccaagcctacggcggctc
ttcccctcgggaggctctgtccgtggctgcatcaaaggtatcaaggctctgggcaagtac
gtggacctcaaacgactgaacaccacgggcatcagcttcagctgcaccgctgacctgcta
atgggacgcaccatgactttccacggccacggcttcctgcccctggcacttcccgatgtg
gcgccgatcactggagaggtctattctggcttcggctttcgtggcactcaggatgacaac
ctgctgtattaccgcacctccccggatgggccataccaggtatccctgagggggggtcac
gtgacactccagtttatgaacagagaggtggaaactcaacgggtctttgctgatggcgcc
cctcactatgttgccttctacagcaatgtcaccagggtatggctgtatgtggatgaccag
ctacaaccggtgaagtcccgtgagggaacaacccccatgctccaactacagcctgaggag
cccccacggcttctcctgggaggcctgcctatgtctggtaccttccacaacttcagtggt
tgcatcagcaacgtgtttgtgcagcgacttcggggaccacagcgtgtgtttgacctacac
cagaacatggggagtgtcaacgtaagcgtaggctgcacaccagcccaactccagacctca
ggggccacagctcagaaggtttcccgccgcagtcgccaacccagccaggacctcacctgc
atgacaccctggctccttgggactattcaggatgcgtaccagtttgggggacccgggccc
agttacctgcagtttgtgggtgtctccccatcccacaggaataggctccacctctcgatg
ctcgtccgcccacatggggcttcccagggcctcttgctctatgtagcccccttgtcgagc
cacagtccttcactggtactctttctaaaccatggacgctttgtcgcacagactgagggc
cctgggccccagctccaggcccagagtcgccagcactcacgggctggccagtggcacagg
gtgtctgtccgctggggaatgcagcaggtccagcttgtggtggatggcagccagacctgg
agccagaaggcgccccaccgccgggtccacagggcagagggtccacagcactacaccctc
tttgtaggaggtctccctgccggcagttacagttccaagcttcctgtgtctgtggggttc
agcggctgtatgaagaagttacaactggataagcggccactgagggccccaacaaggatg
gtgggggtcacaccctgtgtctcaggccccctggaagatggcctgttcttcccaggcagt
gagggagccgtcacattagagctccccaaggccaagatgccccatgtgagcctggagctg
gaggtgcggcccttggccgctgctggcctcatctttcacctgggccaggcccatgctact
ccctacgtgcagctgcagttgctaacagaacaggtcctgctacgggcaaatgatggggca
ggggagttctccacgtgggtgacctaccccaagctttgtgatggacagtggcaccaagtg
acagtgatcaagggcaggaacacactgcggctagaagtcgacacacacagcaaccacacc
acaggccgtttgccagagaccttggctgattctccagcgcttttgcaccttgggagcccg
cccaagtctgaagctgcctggccagagccccctgcctaccgaggatgtctgaggaagctg
ctgttcaacggggcccctgtcaacatgactgcttctgcacgcatccagggagcagtgggg
atgagcggatgcccctcaggaaccctagcaatttccaagcagggaaaggcactgacccag
aggcaggccaaccccagtgtcttcccgctacgctggcattga

KEGG   Rattus norvegicus (rat): 192362
Entry
192362            CDS       T01003                                 

Gene name
Lamc2
Definition
(RefSeq) laminin subunit gamma-2 precursor
  KO
K06246  laminin, gamma 2
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    192362 (Lamc2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    192362 (Lamc2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    192362 (Lamc2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    192362 (Lamc2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    192362 (Lamc2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    192362 (Lamc2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    192362 (Lamc2)
   05145 Toxoplasmosis
    192362 (Lamc2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_B
Other DBs
NCBI-GeneID: 192362
NCBI-ProteinID: NP_001094110
RGD: 621053
Ensembl: ENSRNOG00000002667
Vega: OTTRNOG00000002095
UniProt: F1LRH4
Position
13
AA seq 1192 aa
MPALWLSCCLGVALLLPAAQATSRREVCDCNGKSRQCVFDQELHRQTGNGFRCLNCNDNT
AGVHCERCREGFYRQRDRDRCLPCNCHSKGSLGAGCDNSGQCRCKPGVVGQRCDRCQPGF
HMLTDAGCTRDQRQLDSKCDCDPAGISGPCDSGRCVCKPAVTGERCDRCRAGYYHLDRAN
PEGCTQCFCYGHSDSCHTSADFSVHKITSTFNQDEDGWKAVQRNWAPAKLHWSPRHQDVF
SSARRSDPVYFVAPAKFLGNQQVSYGQSLSFDYRVDRGGRHPSAYDVILEGAGLQIRAPL
MPPGKTLPCGITKTYTFRLNEHPSSHWSPQLSYFEYRRLLRNLTALLIRATYGEYSTGYI
DNVTLISARPVSGAPAPWVERCVCPAGYKGQFCQECASGYKRDSARLGPFGTCVPCNCQG
GGACDPDTGDCYSGDENPDIECADCPIGFYNDPHDPRSCKPCPCHNGFSCSVMPETEEVV
CNNCPPGVTGARCELCADGFFGDPFGERGPVRPCQRCQCNNNVDPSASGNCDQLTGRCLK
CIYNTAGIYCDQCKAGYFGDPLAPNPADKCRACNCNPVGSEPGECRSDGSCVCKPGFGGL
NCAHAAALTSCPACYNQVKTQMDQFAQQLQNLEALVSKAQGGNGAVPSEELEGRMQQAEQ
ALRDILREAQISEGAMRALSLQLAKARSQENNYKNRLDDLKMTVERIRALGSQYQNRVQD
TSRLISQMRLSLAESEASLQNTNIHSSEHYVGPNGFKSLAQEAARLADSHVESANTMKQL
TRETEDYSKQALSLARKALSGGGGSGVLDSSVVQGLMGKLEKTKSLTQQLSREGTQADIE
ADRSYQHSLRLLDSASQLQGIRDSSFQAEAKRIRQKADSLSNLVTKQMDAFTSVRNNLGN
WEKETRQLLQTGKDRRQTSDQLLSRANLAKSRAQEALSMGNATFYEVENILKNLREFDLQ
VEDRKAEAEEAMKRLSYISQKVADASDKTQQAETALGSATADTQRAKNAATEALEITSEI
EQEIGSLNLEANVTADGALAMEKGLATLKSEMREVEGELARKELEFHTDKDSLQLVITEA
QQADARAKSTGVTIQDTLNTLEGILRLIDQPDAVDEEGLMLLEQELFQAKTQINSRLRPL
MSELEDRARRQSNHLHLLETSIDGILADVKNLENIRDNLPPGCYNTQALEQQ
NT seq 3579 nt   +upstreamnt  +downstreamnt
atgcctgcgctctggctcagctgctgcctcggggtcgcgctcctgctgccagcagcccag
gccacctccaggagagaagtctgtgattgcaatgggaagtccaggcagtgcgtctttgac
caggagctccatcgacagacaggcaatgggttccgttgcctcaactgcaatgacaatacg
gccggggttcactgcgagaggtgcagggaggggttttaccggcagcgagacagagatcgc
tgcctgccctgcaactgccactcaaaaggttccctcggtgctggatgtgacaactctgga
caatgcaggtgtaagccaggtgtggtaggacagaggtgtgaccgatgtcagccaggcttc
catatgctcaccgatgctggatgtacccgagaccagaggcaactagattccaagtgtgac
tgtgacccagccggcatctctggaccctgtgattctggccgatgtgtgtgcaaaccagcc
gtcactggagaacgctgtgacaggtgccgagcaggctactatcatctggacagggcaaac
cctgagggctgcacccagtgtttctgctatgggcattcagacagctgccacacctctgct
gacttcagtgtccacaaaatcacctcaactttcaatcaggatgaagatggttggaaagca
gttcagagaaattgggcgcctgcaaaactccactggtcaccgcgccatcaggatgtattt
agttctgcccgaagatcagacccagtctatttcgtggcacccgccaaatttctcggtaat
cagcaagtgagttacgggcagagcctgtcttttgactaccgcgtggacagaggaggtcga
cacccatctgcctatgatgtgatcctggaaggtgctggtctacagatcagagctcctctg
atgccaccaggcaagacacttccctgtgggatcacaaagacttacacgttcagattaaac
gaacatccaagcagtcactggagcccccagctgagttatttcgagtatcgaaggttactg
cggaacctcacagccctcctcatccgagctacgtacggagaatacagtacagggtacatt
gacaatgtgaccctgatttcagcccgccctgtctctggagccccagcaccctgggttgaa
cgttgtgtatgccctgctgggtacaagggacaattctgccaggaatgtgcttctggttac
aaaagagattcagcaagactggggccttttggcacctgtgttccctgtaactgccaaggg
ggaggggcctgtgacccagacacaggggattgctactcaggggacgagaatcctgacatt
gagtgtgccgactgccccattggtttctataacgacccgcacgacccccgcagctgcaag
ccgtgtccctgccacaatggcttcagctgttccgtgatgcctgagacagaggaggtggtg
tgcaacaactgcccccctggggtcacaggtgcccgctgtgagctctgtgcagatggcttc
tttggggacccctttggggaacgtggcccagtgaggccttgtcagcggtgccaatgcaac
aacaatgtggaccccagtgcttctgggaactgtgaccagttgacaggcagatgtttgaaa
tgcatctacaacacagcgggaatctactgtgaccagtgcaaagcaggttacttcggagac
ccattggctcccaacccagcagacaagtgtcgagcttgcaactgcaaccctgtgggctcg
gagcctggagagtgtcgaagtgatggcagctgtgtttgcaagccaggctttggaggcctc
aactgtgcgcacgccgccgccctaaccagttgccctgcttgctacaatcaagtgaagact
cagatggaccagtttgcccagcagctccagaacttggaggccctggtttcaaaggctcag
ggtggtaatggtgcagtccccagcgaggagctggaaggcaggatgcagcaggctgagcag
gcccttcgagacattctgagagaagctcagatttcagaaggtgctatgagagccctcagc
ctccagctggccaaggcaaggagccaagagaacaactacaagaaccgcctggatgacctc
aagatgactgtggaaaggattcgggccctgggcagtcagtatcagaaccgagttcaggat
acaagcagactcatctctcagatgcgcctgagcctggcagaaagtgaagcttccctgcaa
aacactaacatccattcctctgagcactacgtggggccaaatggatttaaaagtctggct
caggaggccgcaagattggcagatagccatgttgagtcagctaacacaatgaagcagcta
acgagggaaaccgaggactattccaaacaagcactttcattggcccgcaaggctctgagt
ggaggaggtggaagcggtgtcctggacagctccgtggtacaaggtcttatgggaaaatta
gagaaaaccaagtccctgacccagcagttgtcaagggagggcacccaagccgacatcgaa
gctgataggtcctatcaacatagtctccgcctcctggattcagcgtctcagcttcaggga
atccgtgattcctcttttcaggcagaggcaaagaggatcaggcagaaggctgattccctc
tccaacttggtgaccaaacaaatggatgcattcacgagtgtgcgaaacaatctggggaac
tgggaaaaagaaacacggcagctcttgcagactgggaaggacagaagacagacttcggat
cagctgctttcccgtgccaaccttgctaaaagcagagcccaagaagcgctaagtatgggc
aatgccactttttatgaagttgagaacatcctaaagaacctcagagagtttgatctgcag
gttgaagacagaaaagcagaggctgaggaggccatgaagagactctcctatattagccag
aaggttgcagatgccagtgacaagacccagcaagcagaaacggccctggggagcgccact
gctgacacccaacgggcaaagaacgcagctacggaggccctagagatcaccagcgagata
gagcaggagatagggagtctgaacttggaagccaatgtgacagcagatggggccttggcc
atggaaaagggactggccactctgaagagcgagatgagagaggtggaaggggagttggcc
aggaaggagcttgagttccacacggataaagactccttgcagctggtgattactgaagcc
cagcaagctgatgccagagccaagagcacgggagttacgatccaagacacgctcaacaca
ttggaaggcatcctgcgcctaatagaccagcctgatgctgtggatgaagaggggctgatg
ctgttagaacaagagcttttccaagccaagacccagatcaacagtcgacttcggcccttg
atgtctgaactggaggacagggcgcgtcgacagagcaaccacctccacctgctggagacc
agcatagatgggattctagccgatgtgaagaacctggagaacattcgagacaacctgccc
ccaggctgctacaacacccaagctcttgagcagcagtga

KEGG   Rattus norvegicus (rat): 24477
Entry
24477             CDS       T01003                                 

Gene name
Ibsp
Definition
(RefSeq) bone sialoprotein 2 precursor
  KO
K06253  integrin binding sialoprotein
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    24477 (Ibsp)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    24477 (Ibsp)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    24477 (Ibsp)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    24477 (Ibsp)
SSDB
Motif
Pfam: BSP_II Presenilin
Other DBs
NCBI-GeneID: 24477
NCBI-ProteinID: NP_036719
RGD: 2855
Ensembl: ENSRNOG00000002158
UniProt: A0JPM6
Position
14
AA seq 319 aa
MKTALILLCILGMASAFSMKNFHRRIKAEDSEENGVFKYRPRYFLYKHAYFYPPLKRFPV
QGGSDSSEENGDGDSSEEEGEEEETSNEEENNEDSEGNEDQEAEAENATLSGVTASYGVE
TTADAGKLELAALQLPKKAGDAEGKAPKMKESDEEEEEEEEEENENEEAEVDENEQVVNG
TSTNSTEVDGGNGPSGGDNGEEAEEASVTEAGAEGTTAGARELTSYGTTTAVLLNGFQQT
TPPPEAYGTTSPPARKSSTVEYGEEYEQIGNEYNTAYETYDENNGEPRGDTYRAYEDEYS
YYKGHGYEGYEGQDYYYHQ
NT seq 960 nt   +upstreamnt  +downstreamnt
atgaagactgctttaatcttgctctgcattttggggatggcgagtgctttctcgatgaaa
aatttccatagaagaatcaaagcagaggattctgaagaaaacggggtctttaagtaccgg
ccacgctactttctttataagcatgcctacttttatcctcctctgaaacggtttccagtc
cagggaggcagcgactcttcggaagaaaatggagatggcgatagttcggaggaggagggg
gaggaagaggagacttcaaatgaagaggaaaacaacgaagactctgaggggaatgaagac
caggaggcggaggcagagaacgccacactctcaggggtaaccgctagctacggggtagag
accacagctgacgctggaaagttggagttagctgcgctccagctacccaagaaggctgga
gatgcagagggcaaggctccaaaaatgaaggaaagcgacgaggaagaggaagaggaagag
gaggaagaaaacgagaacgaagaagcagaagtggatgaaaatgagcaggtcgtcaacggc
accagcaccaactccacggaggtggacggggggaatggccccagcggaggagacaacgga
gaagaagccgaggaggcaagcgtcactgaagcaggtgcagaaggaacaacagcaggggcc
agggagctgaccagttatggcaccacgacagctgtccttctgaacgggtttcagcagacg
accccgccacccgaagcctatgggaccacctccccaccagccagaaagagcagcacggtt
gagtatggggaagaatacgaacaaataggcaacgagtacaacactgcgtatgaaacctat
gacgagaacaacggggagcctcgcggggacacttaccgagcttatgaggacgaatacagc
tactacaaggggcatggctatgaaggctacgagggtcaggattattactaccaccagtga

KEGG   Rattus norvegicus (rat): 24718
Entry
24718             CDS       T01003                                 

Gene name
Reln
Definition
(RefSeq) reelin precursor
  KO
K06249  reelin [EC:3.4.21.-]
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05017  Spinocerebellar ataxia
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    24718 (Reln)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    24718 (Reln)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    24718 (Reln)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    24718 (Reln)
  09164 Neurodegenerative disease
   05017 Spinocerebellar ataxia
    24718 (Reln)
Enzymes [BR:rno01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.21  Serine endopeptidases
    3.4.21.-  
     24718 (Reln)
SSDB
Motif
Pfam: EGF_2 BNR Reeler EGF_Tenascin
Other DBs
NCBI-GeneID: 24718
NCBI-ProteinID: NP_536319
RGD: 3553
Ensembl: ENSRNOG00000021441
UniProt: P58751
Position
4
AA seq 3462 aa
MERGCWAPRTLVLAVLLLLLATLRARAATGYYPRFSPFFFLCTHHGELEGDGEQGEVLIS
LHIAGNPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSIQSSQSIGGSSAFGFGIMSDH
QFGNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHRGQVIFKDALAQQLC
EQGAPTEATAYSHLAEIHSDSVILRDDFDSYHQLELNPNIWAECSNCDTGEQCGTIMHGN
AVTFCEPYGPRELTTTYLNTTTASVLQFSIGSGSCRFSYSDPSIIVSYAKNNTADWIQLE
KIRAPSNVSTIIHILYLPEDAKGENVQFQWKQDSLRVGEVYEACWALDNILVINSAHRQV
ILEDSLDPVDTGNWLFFPGATVKHSCQSDGNAIYFHGNEGSQLNFATTRDVDLSTEDIQE
QWSEEFESQPTGWDILGAVVGSECGTIESGLSLVFLKDGERKLCTPYMDTTGYGNLRFYF
AMGGTCDPGDSHENDVILYAKIEGKKEHIALDTLSYSSYKVPTLVSVVINPELQTPATKF
CLRQKNHQGHNQNVWAVDFFHVLPILPSTMSHMIQFSINLGCGTHQPGNSVSLEFSTNHG
RSWSLLHTECLPEICAGPHLPHSTIYSSENYSGWNRVTIPLPNAALTRDTRIRWRQTGPI
LGNMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSS
RLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGS
KSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDAKQFGIQ
FRWWQPYHSSQGEDVWAIDEILMTSVLFNSISLDFTNLVEVTQSLGFYLGNIQPYCGHDW
TLCFTGDSKLASSMRYVETQSMQIGASYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGL
TWHLVQDECLPSMPSCQEFTSASIYHASEFTQWRRVTVILPQKTWSGATRFRWSQSYYTA
QDEWALDDIYIGQQCPNMCSGHGSCDHGVCRCDQGYQGTECHPEAALPSTIMSDFENPSS
WDSDWQEVIGGEVVKPEQGCGVVSSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGG
ESAACNKPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFGKPRFVYLELPAAAKTPCTRFR
WWQPVFSGEDYDQWAVDDIIILSEKQKQVIPVVNPTLPQNFYEKPAFDYPINQMSVWLML
ANEGMAKNDSFCATTPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCASQFSSTAP
VLLQYSHDAGMSWFLVKEGCFPASAGKGCEGNSRELSEPTVYYTGDFEEWTRVTIAIPRS
LASSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYTAAQG
TCVSNIPNHSEMFDRFEGKLSPLWYKISGGQVGTGCGTLSDGRSLYFNGLGKREARTVPL
DTRNIRLVQFYIQIGSKTSGITCIKPRARNEGLVVQYSNDNGILWHLLRELDFLSFLEPQ
IISIDLPREAKTPATAFRWWQPQHGKHSAQWALDDVLIGVNDSSQTGFQDKFDGSIDLQA
NWYRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASSFLQFDMSMGCSKPFSA
THSVQLQYSLNNGKDWHPVTEECVPPTIGCVHYTESSTYTSERFQNWRRVTVYLPLATNS
PRTRFRWIQANYTVGADAWAIDNVLLASGCPWLCSGRGICDSGRCVCDRGFGGPFCVPVV
PLPSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLIFKGEGLRMLISRDLDCTN
TMYVQFSLRFIAKGTPERSHSILLQSSINGGVTWRLMDEFYFPQTTSILFINVPLPYSAQ
TNATRFRLWQPYNNGKKEEIWIIDDFIIDGDNLNNPVMLLDTFDFGPREDNWFFYPGGNI
GLYCPYSSKGAPEEDSAMVFVSNEIGEHSITTRDLSVNENTIIQFEINVGCSTDSSSADP
VRLEFSRDFGATWHLLLPLCYHSSSLVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLH
LCGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCCGHGSCVNGTKCICDPGYSGPT
CKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLVTRDL
DLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIAL
EMPLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFSTLDSRKWLLH
PGGTKMPVCGSTGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELE
YSVDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPAYTRSQATRFRW
HQPAPFDKQQTWAIDNVYIGDGCLDMCSGHGRCIQGSCVCDEQWGGLYCDEPETSLPTQL
KDNFNRAPSNQNWLTVNGGKLSTVCGAVASGLALHFSGGCSRLLVTVDLNLTNAEFIQFY
FMYGCLITPSNRNQGVLLEYSVNGGITWTLLMEIFYDQYSKPGFVNILLPPDAKEIGTRF
RWWQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRIAFD
MFMEDKTSVNENWLFHDDCTVERFCDSPDGVMLCGSHDGREVYAVTHDLTPTENWIMQFK
ISVGCKVPEKIAQNQIHVQFSTDFGVSWSYLVPQCLPADPKCSGTVSQPSVFFPTKGWKR
ITYPLPESLMGNPVRFRFYQKYSDVQWAIDNFYLGPGCLDNCGGHGDCLKEQCICDPGYS
GPHCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAENTALYFGGSTVRQAI
TQDLDLRGAKFLQYWGRIGSENNMTSCHRPVCRKEGVLLDYSKDGGITWTLLHEMDFQKY
ISVRHDYILLPEGALTNTTRLRWWQPFVISNGLVVSGVERAQWALDNILIGGAEINPSQL
VDTFDDEGSSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGY
MMQFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEAT
IYNAVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPRLCSGH
GYCTTGAVCICDESFQGDDCSVFSHELPSYIKDNFESARVTEANWETIQGGAIGSGCGQL
APYAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTAQTDSCNSDLSGPHTVDKA
VLLQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQW
ALDHVEVVLVSTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
NT seq 10389 nt   +upstreamnt  +downstreamnt
atggagcgcggctgctgggcgccgcggactctcgtcctggccgtgctgctgctgctgctg
gcgacgctgagggcgcgcgccgccacgggctactacccgcgcttctcgcccttctttttc
ctgtgcacccaccacggggagctggaaggggatggggagcagggcgaggtgctcatttcc
ctgcacattgcgggcaaccccacctactacgtaccgggacaggaataccatgttacaatt
tcaacaagcaccttcttcgatggcttgctggtgacgggactgtacacctcgacaagcatc
cagtcttctcagagcattggaggatccagtgcctttggatttgggatcatgtccgaccac
cagtttggtaaccagtttatgtgcagcgtggtggcctctcatgtgagtcacctgcctaca
acaaacctcagcttcgtctggatcgccccaccagctggcacgggctgtgtgaatttcatg
gctactgcaacacacaggggtcaggtgattttcaaagacgccttagcccagcagctctgt
gaacaaggagctcccacagaggccactgcttactcgcacctcgctgaaatacacagtgac
agtgtgattctacgagatgactttgactcctaccaccaactagaattgaaccccaatata
tgggctgaatgcagtaactgtgacactggagaacagtgtggcaccatcatgcatggcaac
gctgtcaccttctgtgagccatatggtcctcgagagttgactaccacatacctgaacacg
acaacagcatccgtcctccagttctccattgggtcaggatcctgtcgatttagctactct
gaccccagcatcattgtgtcatatgccaagaacaatactgccgattggattcagctcgag
aagattagagccccttccaacgtaagcaccatcatccacatcctctaccttcctgaggac
gccaaaggggagaacgtgcagttccagtggaaacaggacagcctgcgtgtgggtgaggtg
tacgaagcctgctgggccctggataacatcctggtcatcaattcggcccacagacaagtc
attctggaggacagcctcgacccagtagacacgggcaactggcttttcttccctggagca
acggtcaagcacagctgccagtcagatgggaacgccatatatttccacggaaatgaagga
agccagctcaattttgccaccacgcgggatgtagatctctccacagaggatattcaggag
cagtggtcagaagaatttgagagccagcccacaggatgggacatcttgggagcagtagtt
ggttcggaatgtggaaccattgagtcaggactctcactggtgtttctcaaagatggagag
aggaagctatgcaccccctacatggatacaactggttatggcaacctgaggttctacttc
gctatgggagggacctgtgaccctggagactctcatgaaaacgatgtcatcttatatgca
aagattgaaggaaaaaaagaacacatcgcactggacaccctttcctattcttcctataag
gttccaactttggtttctgtggtcatcaatcctgaacttcagacacctgccaccaaattt
tgtctcagacaaaagaaccaccaagggcataaccagaatgtctgggctgtggacttcttc
catgtgctgcccattctcccttccacaatgtctcacatgatccagttttccattaatctg
ggctgcggcacacaccagcctggtaacagcgtcagcttggagttttccactaaccatggc
cggtcctggtccctactccacaccgagtgtttgccggagatctgtgcaggtccccacctc
ccgcacagcaccatctactcctcggaaaactacagcgggtggaaccgagtcacgattccc
ctccctaatgcggcactcacccgagacaccaggattcgctggagacaaacaggacccatc
ctgggaaacatgtgggcaattgataatgtttatataggtccgtcatgcctcaaattctgt
tctggcagaggacaatgcactcgacatggttgcaagtgcgaccccggattttctggcccc
gcttgtgagatggcgtctcagaccttcccaatgtttatttcagaaagctttggcagttcc
agactttcctcctaccataacttttactctatccgtggtgctgaagtcagctttggctgt
ggtgtcttagccagtggtaaggctctggttttcaacaaagatgggcgacggcagctgatc
acctcctttctggatagctcgcagtccaggtttctgcagttcaccctgaggctggggagc
aagtctgttctgagcacgtgcagggcccctgaccagccaggggagggagttttgctgcat
tattcatatgacaatgggataacatggaaactcttggagcactattcatatctcaactac
cacgagcccaggatcatctccgtagagctaccagatgacgcgaagcagtttggaatccag
ttcagatggtggcagccgtaccattcttcccagggagaagatgtgtgggccattgacgag
atcctcatgacatcggtcctgttcaacagcatcagcctcgacttcaccaatcttgtggaa
gtcactcagtccctggggttctacctcggcaacatccagccatactgcggccatgactgg
acactttgttttacgggagattcgaaacttgcctcgagcatgcgctatgtggaaacacag
tccatgcagattggagcgtcctatatgattcagttcagcctggtgatggggtgtggccag
aagtacactcctcacatggacaaccaggtgaagctggagtattccaccaaccacgggctc
acgtggcaccttgtacaagacgagtgccttcccagtatgccaagttgccaggaatttaca
tcggccagcatttaccacgccagtgagttcacacagtggagaagagtgaccgtcatcctt
ccccagaaaacatggtctggggccacccgcttccgttggagtcagagctattacacagcc
caggatgagtgggctttagacgacatttacattgggcagcaatgccccaacatgtgcagt
gggcatggctcatgtgaccacggcgtgtgcaggtgtgaccagggataccagggcactgaa
tgccacccggaagccgcacttccttccacgattatgtcagattttgagaacccgagcagt
tgggactcagactggcaagaagttattgggggagaagttgtaaaacctgagcaaggctgt
ggggtcgtgtcttccgggtcttctctgtacttcagcaaggccgggaagaggcagctggtg
agctgggatctggacacgtcctgggtggacttcgtgcagttctacatccagataggcgga
gagagtgccgcgtgcaacaaacctgacagccgagaggagggcgtcctcctccaatacagc
aacaatggcggcatccagtggcacctgctggcagagatgtacttctcagacttcggcaag
cccagatttgtctacctggaactcccagctgctgcgaagaccccttgtaccaggttccgc
tggtggcagcctgtgttctcaggggaggactacgaccagtgggccgtcgatgacatcatc
attctgtcagagaagcagaagcaggttatcccagttgtcaacccaactttgccccagaac
ttctatgagaagccagctttcgactaccctattaaccagatgagcgtgtggctgatgttg
gccaatgaagggatggctaagaacgatagcttctgtgccactacgccttcagccatggtg
ttcggaaagtcagacggagaccgattcgcagtgacgcgagatctgaccctgaaacccgga
tatgtgctccagttcaagctaaacatcgggtgtgccagccagttcagcagcactgctccg
gtcctcctgcagtattcacatgatgctggcatgtcctggtttctggtgaaggaggggtgc
tttcctgcctccgcgggcaagggatgtgaagggaactcccgggaactcagtgagcctact
gtctattacactggagactttgaagaatggacaagagtcaccattgccatcccgaggtcc
cttgcgtccagcaagaccagattccgatggatccaagagagcagctctcagaagaatgta
cccccatttggcttagatggggtatacatatctgagccttgtcccagctactgcagtggt
catggagactgcatctcaggggtgtgtttctgtgacctggggtacacagctgcacaagga
acgtgtgtgtccaacatccccaaccacagtgagatgttcgataggttcgaggggaagcta
agcccactgtggtacaaaatatccggaggtcaggttgggacgggctgtggaaccctcagt
gacggcaggtccctctacttcaatggccttgggaaaagggaagccaggacggtcccgctg
gacaccaggaatatcagacttgttcagttttatatacaaattggaagtaaaacgtcaggt
attacgtgcatcaaaccacgggctagaaacgaggggcttgttgttcagtattcaaatgac
aatgggatactctggcacttgctgcgggagttggacttcctgtcgttcctggagccacag
atcatttctattgacctgcctcgggaagccaagacacctgccacggccttccggtggtgg
cagccgcaacacgggaagcactcagcccagtgggctttggatgatgttcttataggagtg
aatgacagctctcaaactggatttcaagataaatttgatggctccatagacttgcaagcc
aattggtatcgaatccagggaggccaagttgacattgactgcctctctatggacaccgcc
ctcatattcaccgaaaacataggcaaacctcgctatgccgagacctgggacttccatgtg
tcagcctcaagcttcttacagtttgacatgagcatgggctgcagcaagcctttcagcgcc
acccacagtgtacagctccagtactccctgaacaacggcaaggactggcatcctgtcacc
gaagagtgtgtccctcctaccattggctgcgtgcactacacagagagttcaacttacacg
tcggaaaggttccagaactggaggcgggtcaccgtctaccttccactcgccaccaattct
ccgaggactcggttcagatggattcaggccaactacaccgtgggagcagatgcctgggct
attgataatgtcctactggcctcaggctgcccttggctgtgctcaggacgaggcatttgt
gactcggggcgctgtgtgtgtgaccggggcttcggtggacccttctgtgttcctgtcgtt
cctctcccctccattctgaaagatgatttcaatgggaacttacatcctgacctttggcct
gaagtgtacggggcagagagggggaatctgaatggcgaaaccatcaaatccggaacatct
ctgatctttaaaggggagggactaagaatgcttatttcaagagacctggattgcaccaat
accatgtatgtccagttttctctccgatttatagcaaaaggtaccccggagaggtctcac
tccattctactgcaatcctccatcaatgggggggtcacctggcgcctgatggatgagttt
tacttccctcaaacaaccagcatacttttcattaatgtccccttgccgtacagcgcccaa
accaacgctacgagattcagactctggcagccgtacaacaacggtaagaaagaagaaatc
tggatcattgatgactttattattgatggggacaatctgaacaaccccgtgatgctgctg
gacacgttcgactttgggcccagggaagacaattggtttttctatccaggtggtaatatc
ggtctctactgcccgtattcttcaaagggggctcctgaggaggactcggccatggtgttc
gtttcaaacgaaatcggcgagcactccataaccacacgcgacctaagtgtgaacgagaac
accatcattcaattcgagatcaacgtcggatgctccactgatagctcgtctgctgatccg
gtcagactggaattttcaagggacttcggagccacctggcacctgctgctgccgctctgc
taccacagcagcagcctcgtcagttcgctctgctccactgagcaccaccccagcagcacc
tactacgcggggaccacccagggctggaggagggaggtcgtgcacttcgggaagctgcac
ctttgcggatctgtgcgtttccgttggtaccagggattctatcccgccggctctcagccg
gtcacgtgggccattgataatgtctacattggtccccagtgcgaagagatgtgctgtggg
catgggagctgcgtcaatggaaccaagtgtatatgtgatcccggctactccggtccaacc
tgcaaaataagcaccaaaaatcctgattttctcaaagacgattttgaaggtcaactggaa
tccgatcgattcttactaatgagcggtgggaaaccttctcgaaaatgtgggatcctttcc
agtgggaacaacctcttcttcaatgaggatggcttgcgcatgctggtaacacgggacctg
gatctatcgcatgctaggtttgtgcagttcttcatgaggctgggatgcggtaaaggtgtt
ccggaccccaggagccagcccgtgcttctccagtactccctcaatggcggcctctcctgg
agccttcttcaagagttcctcttcagcaactccagcaatgtgggcaggtacatcgccctg
gaaatgcccctgaaagcccgctctggttctacacgcctccgctggtggcagccgtccgaa
aatgggcacttctatagcccctgggtgatcgaccagattcttattggaggaaatatctct
ggtaatacagtcttagaagatgatttctcgactctggacagcagaaagtggctgcttcac
ccaggaggcaccaagatgcccgtgtgtggctccacaggcgatgccctggtcttcattgaa
aaggccagcacccgttacgtggtcaccacggacattgccgtgaatgaggactcattccta
cagatagatttcgctgcgtcctgctcagtcacagactcctgctatgccattgaactggag
tactccgtggatctcgggctgtcgtggcaccccctggtgagggactgcctgcctaccaac
gtcgagtgcagtcgctatcacctgcagcggatcctggtgtcagatactttcaacaaatgg
accagaatcaccctgcccctccctgcctacaccaggtctcaagccactcgtttccgctgg
caccagccagcgccttttgacaagcagcagacgtgggcgatagataacgtctatattggg
gatggttgcctcgacatgtgcagtggccacgggaggtgcatccagggaagctgtgtctgt
gatgagcagtggggaggcctgtactgtgatgaacctgagacctcccttcccacccagctc
aaagacaacttcaatcgagctccctccaaccagaactggctgactgtgaatggtgggaaa
ttgagtactgtgtgtggagccgtggcttcgggcctggctctccacttcagtgggggctgt
agccgattgttagtcactgtggatctgaacctcactaatgctgagtttatccaattttac
ttcatgtatggatgcctcattacgccgagtaaccgtaaccagggagtcctgctggagtac
tccgtcaatggaggcatcacctggaccctgctgatggagattttctatgaccagtacagc
aagcctgggttcgtgaatatccttctccctcctgatgctaaagagattggtactcgcttc
cgctggtggcagccacgacatgatggccttgaccagaatgactgggccattgacaatgtc
ctcatctcaggctctgctgaccagaggacggtcatgctggacacgttcagcagtgcccca
gtgccacagcatgagcgttcccctgcagatgctggccccgttgggagaattgcttttgat
atgttcatggaggacaaaacctcagtgaatgagaattggctcttccatgatgactgtaca
gtagaaagattctgtgattccccagacggtgtcatgctctgtggcagccatgacggacga
gaggtgtatgcagtgacgcacgacctgactcccactgagaactggatcatgcaattcaag
atctctgttggatgcaaagtgcctgaaaaaattgcccaaaatcaaattcatgtgcagttt
tcaactgactttggcgtgagctggagttatctagtccctcagtgcttacccgccgaccca
aagtgttctgggaccgtctctcaaccatctgtgttctttccaacgaaagggtggaaaagg
atcacctacccgcttcctgaaagcttaatggggaatcctgtaagatttaggttctaccaa
aagtactcagacgtgcagtgggcaattgacaacttctaccttggccctgggtgtttggac
aactgtggaggccacggggactgcctaaaggaacagtgcatctgtgacccaggctactcg
gggccacactgctacttaactcacaccctgaagaccttcctgaaggaacgctttgacagt
gaagagatcaagcctgacttatggatgtccttagaaggcgggagcacttgcacagagtgc
gggatcctcgccgagaacactgcactctacttcgggggctccacggtgaggcaagctatt
actcaggacttagacctcagaggtgccaaattcctgcagtactggggacgcatcggcagt
gagaacaacatgacgtcttgccatcggccggtctgccggaaggagggcgtgctgctggac
tactctaaggatggaggaatcacttggactttgcttcatgagatggatttccagaaatac
atttcggtgaggcacgactacatcctcctccctgagggggccctcaccaacaccactcga
cttcgctggtggcagccttttgtgatcagcaatgggctcgtggtttccggggtggagcgt
gcgcagtgggccctggacaacattctgattggtggagcagaaatcaatccaagccaactg
gtggacacttttgatgacgaaggctcttcccatgaagaaaactggagtttttaccccaat
gcagtaaggacagcagggttctgtggcaatccatccttccacctctactggccgaacaaa
aaaaaggacaagacccacaatgctctctcctcccgcgagctcatcatacaaccaggatac
atgatgcaattcaaaattgtggtgggttgtgaagccacttcctgtggtgaccttcattcc
gtcatgctggagtacactaaggacgcaagatctgactcctggcagcttgtgcagacccag
tgcctaccttcctcttccaatagtatcggctgctcccccttccagttccacgaggccacc
atttataacgctgtcaacagctcgagctggaagaggatcaccatccagctgccagaccac
gtctcctcaagtgccacacagttccgctggatccagaaaggagaagaaacagagaagcaa
agctgggccatcgaccacgtgtacattggagaggcttgtcccaggctctgcagcggacat
ggttactgcaccacaggggccgtctgcatctgtgatgaaagcttccaaggtgacgactgc
tctgtcttcagtcacgagcttcccagttacattaaagataattttgaatcggcaagagtc
accgaagccaactgggaaacaatccagggaggcgccatcggaagtggctgtgggcagctg
gcgccctatgcccacggagactcgctctacttcaatggctgtcagataaggcaagccgcc
accaagccgctggacctcactcgagcaagcaaaattatgtttgtcttgcaaattgggagc
acggcacagacagacagttgcaacagtgacctcagcggcccccacacggtggacaaggca
gtgctgctgcaatacagcgtcaacaacggcatcacatggcacgtcatcgctcagcaccag
ccgaaggacttcacacaagctcagcgggtgtcttacaacgttcccctggaagcacggatg
aaaggagttttgctgcgctggtggcagccacgacacaacggaacaggtcatgatcagtgg
gctttggaccatgtggaggtcgtcctagtaagcactcgcaaacaaaattacatgatgaat
ttttcacgacaacatgggctcaggcacttctacaaccgaaggcgaaggtcgcttaggcga
tacccatga

KEGG   Rattus norvegicus (rat): 25304
Entry
25304             CDS       T01003                                 

Gene name
Comp
Definition
(RefSeq) cartilage oligomeric matrix protein precursor
  KO
K04659  thrombospondin 2/3/4/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04145  Phagosome
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05144  Malaria
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    25304 (Comp)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25304 (Comp)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    25304 (Comp)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25304 (Comp)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25304 (Comp)
  09174 Infectious disease: parasitic
   05144 Malaria
    25304 (Comp)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    25304 (Comp)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    25304 (Comp)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    25304 (Comp)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    25304 (Comp)
Membrane trafficking [BR:rno04131]
 Endocytosis
  Phagocytosis
   Opsonins
    25304 (Comp)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   25304 (Comp)
  Exosomal proteins of colorectal cancer cells
   25304 (Comp)
  Exosomal proteins of bladder cancer cells
   25304 (Comp)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   25304 (Comp)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   25304 (Comp)
SSDB
Motif
Pfam: TSP_C TSP_3 EGF_CA EGF_3 COMP cEGF EGF_MSP1_1 hEGF
Other DBs
NCBI-GeneID: 25304
NCBI-ProteinID: NP_036966
RGD: 2378
Ensembl: ENSRNOG00000048472
Vega: OTTRNOG00000001988
UniProt: P35444
Structure
PDB: 
1VDF 1FBM

Position
16
AA seq 755 aa
MSPTACVLVLALAALRATGQGQIPLGGDLAPQMLRELQETNAALQDVRELLRQQVKEITF
LKNTVMECDACGMQPARTPGLSVRPVPLCAPGSCFPGVVCTETATGARCGPCPPGYTGNG
SHCTDVNECNAHPCFPRVRCINTSPGFHCEACPPGFSGPTHEGVGLTFAKTNKQVCTDIN
ECETGQHNCVPNSVCVNTRGSFQCGPCQPGFVGDQRSGCQRRGQHFCPDGSPSPCHEKAD
CILERDGSRSCVCAVGWAGNGLLCGRDTDLDGFPDEKLRCSERQCRKDNCVTVPNSGQED
VDRDGIGDACDPDADGDGVPNEQDNCPLVRNPDQRNSDKDKWGDACDNCRSQKNDDQKDT
DRDGQGDACDDDIDGDRIRNVADNCPRVPNFDQSDSDGDGVGDACDNCPQKDNPDQRDVD
HDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQQDSDHDGKGDACDDDDDNDGVPDSRD
NCRLVPNPGQEDNDRDGVGDACQGDFDADKVIDKIDVCPENAEVTLTDFRAFQTVVLDPE
GDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTATDDDYAGFIFG
YQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTAS
QVRLLWKDPRNVGWKDKTSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTAMRGGRL
GVFCFSQENIIWANLRYRCNDTIPEDYERHRLRRA
NT seq 2268 nt   +upstreamnt  +downstreamnt
atgagccccactgcctgcgttctagtgctcgccctggctgccttgcgggctaccggccag
ggccagatcccgctgggtggagacctagccccacagatgcttcgagaactccaggagact
aatgcggcgctgcaagacgtgagagagctcttgcgacagcaggtcaaggagatcaccttc
ctgaagaatacggtgatggaatgtgacgcttgcggaatgcagcccgcacgcacccccggt
ctgagcgtgcggccagtgccgctctgcgcacccggctcctgcttccctggcgtagtctgc
acggagacagctaccggcgcgcgctgcggcccctgccctccgggctacaccggcaacggc
tcgcactgcaccgacgttaatgagtgcaacgctcacccctgtttcccgcgcgtgcggtgc
atcaataccagccctggctttcactgcgaagcctgtccccctgggttcagcgggcccacc
cacgagggtgtggggctgaccttcgccaagaccaacaaacaagtttgcacagatattaat
gagtgtgagaccgggcagcacaattgcgttcccaactccgtgtgcgtcaacacccggggc
tccttccagtgcggtccctgccagcccggcttcgtgggcgaccagaggtcaggctgccag
cggcgtgggcaacacttctgccccgacgggtcacccagcccgtgccatgagaaagcagac
tgtattttggagcgcgacggctcaaggtcctgcgtgtgtgcggtcggctgggccggcaac
gggctcctgtgcggacgcgacacagacctggacggtttcccggacgagaagcttcgctgc
tcagagcgccagtgccgcaaggacaactgcgtgacggtgcccaattcagggcaggaggat
gtggaccgggacggcattggagatgcttgtgacccggatgcggacggggatggagtccct
aatgagcaagacaattgcccgctggttcgaaacccagaccagcgcaactcggataaagac
aagtggggagatgcctgcgacaactgccggtcccagaagaatgatgaccagaaagataca
gaccgggatggccagggcgatgcctgcgacgacgacatagatggcgatcgaatccgaaat
gtagctgacaactgtccccgggtgcccaactttgaccagagtgacagcgatggtgatggt
gttggggatgcctgtgacaattgtccccagaaagacaacccggaccagagggacgtggac
cacgactttgtgggtgatgcctgtgacagtgaccaagaccaggacggggatggacaccaa
gactcccgggacaactgccccacagtacccaacagtgcccagcaggactcagaccatgat
ggcaagggtgatgcctgtgatgatgacgacgacaatgacggagtccctgacagccgggac
aattgccgcttggtgcccaacccgggccaagaggacaatgaccgggatggcgtgggtgac
gcttgtcagggtgacttcgatgctgacaaggttatagacaagatcgatgtgtgccccgag
aacgccgaggtcactctcaccgacttcagggccttccaaacagttgtgctggaccccgag
ggtgatgcgcagatcgaccccaactgggtggtgctcaatcagggaatggagatcgttcag
accatgaacagtgaccctggcctggctgtgggttacacggcattcaacggtgtagatttt
gagggaacgttccatgtaaacaccgccaccgatgatgactacgctggcttcatcttcggc
tatcaagacagctcaagtttctatgtggtcatgtggaaacagatggagcagacgtactgg
caggccaatcctttccgggcagtggctgaaccagggattcagctcaaggctgtcaagtcc
tctacaggtcccggggaacagctccgaaatgcgttgtggcacacgggggacacagcatcc
caggtgcggctgctgtggaaggatcctcgaaatgtgggctggaaggataagacatcctac
cgctggttcctgcagcaccggcctcaagtcggctacatcagggtgcggttctatgagggt
cctgagctagtagctgacagcaacgtggtgctggacacagctatgcgtggtggccgcctg
ggtgtcttctgcttctcccaagagaatatcatctgggctaacctgcgctaccgttgcaat
gatacaatccctgaggactatgagcgtcaccggctgcggagggcctag

KEGG   Rattus norvegicus (rat): 25353
Entry
25353             CDS       T01003                                 

Gene name
Spp1
Definition
(RefSeq) osteopontin precursor
  KO
K06250  secreted phosphoprotein 1
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04371  Apelin signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04620  Toll-like receptor signaling pathway
rno04929  GnRH secretion
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04371 Apelin signaling pathway
    25353 (Spp1)
   04151 PI3K-Akt signaling pathway
    25353 (Spp1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25353 (Spp1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25353 (Spp1)
 09150 Organismal Systems
  09151 Immune system
   04620 Toll-like receptor signaling pathway
    25353 (Spp1)
  09152 Endocrine system
   04929 GnRH secretion
    25353 (Spp1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25353 (Spp1)
SSDB
Motif
Pfam: Osteopontin
Other DBs
NCBI-GeneID: 25353
NCBI-ProteinID: NP_037013
RGD: 3752
Ensembl: ENSRNOG00000043451
Vega: OTTRNOG00000000106
UniProt: P08721
Position
14
AA seq 317 aa
MRLAVVCFCLFGLASCLPVKVAEFGSSEEKAHYSKHSDAVATWLKPDPSQKQNLLAPQNS
VSSEETDDFKQETLPSNSNESHDHMDDDDDDDDDGDHAESEDSVNSDESDESHHSDESDE
SFTASTQADVLTPIAPTVDVPDGRGDSLAYGLRSKSRSFPVSDEQYPDATDEDLTSRMKS
QESDEAIKVIPVAQRLSVPSDQDSNGKTSHESSQLDEPSVETHSLEQSKEYKQRASHEST
EQSDAIDSAEKPDAIDSAERSDAIDSQASSKASLEHQSHEFHSHEDKLVLDPKSKEDDRY
LKFRISHELESSSSEVN
NT seq 954 nt   +upstreamnt  +downstreamnt
atgagactggcagtggtttgcttttgcctgttcggccttgcctcctgtctcccggtgaaa
gtggctgagtttggcagctcagaggagaaggcgcattacagcaaacactcagatgctgta
gccacttggctgaagcctgacccatctcagaagcagaatcttctagccccacagaattct
gtgtcctctgaagaaacggatgactttaagcaagaaactcttccaagcaactccaatgaa
agccatgaccacatggacgatgatgacgacgacgatgacgacggagaccatgcagagagc
gaggattctgtgaactcggatgaatctgacgaatctcaccattccgatgaatctgatgag
tccttcactgccagcacacaagcagacgttttgactccaatcgcccccacagtcgatgtc
cctgacggccgaggtgatagcttggcttacggactgaggtcaaagtccaggagtttccct
gtttctgatgaacagtatcccgatgccacagatgaggacctcacctcccgcatgaagagc
caggagtccgatgaggctatcaaggtcatcccagttgcccagcgtctgagcgtgccctct
gatcaggacagcaacgggaagaccagccatgagtcaagtcagctggatgaaccaagcgtg
gaaacacacagcctggagcagtccaaggagtataagcagagggccagccacgagagcact
gagcagtcggatgcgatcgatagtgccgagaagccggatgcaatcgatagtgcggagcgg
tcggatgctatcgacagtcaggcgagttccaaagccagcctggaacatcagagccacgag
tttcacagccatgaggacaagctagtcctagaccctaagagtaaggaagatgataggtat
ctgaaattccgcatttctcatgaattagagagttcatcttctgaggtcaattaa

KEGG   Rattus norvegicus (rat): 25412
Entry
25412             CDS       T01003                                 

Gene name
Col2a1
Definition
(RefSeq) collagen alpha-1(II) chain precursor
  KO
K19719  collagen type II alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    25412 (Col2a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25412 (Col2a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25412 (Col2a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    25412 (Col2a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25412 (Col2a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    25412 (Col2a1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   25412 (Col2a1)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 25412
NCBI-ProteinID: NP_037061
RGD: 2375
Ensembl: ENSRNOG00000058560
UniProt: P05539
Position
7
AA seq 1419 aa
MIRLGAPQSLVLLTLLIATVLQCQGQDARKLGPKGQKGEPGDIKDIIGPKGPPGPQGPAG
EQGPRGDRGDKGERGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGFD
EKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPIGPRGPPGPA
GKPGDDGEAGKPGKAGERGLPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVK
GESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFL
GAPGAKGEAGPTGARGPEGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGIA
GAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPAGEE
GKRGARGEPGGAGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDP
GRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPK
GANGEPGKAGEKGLAGAPGLRGLPGKDGETGAAGPPGPSGPAGERGEQGAPGPSGFQGLP
GPPGPPGEGGKQGDQGIPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTD
GPKGAAGPDGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLT
GPIGPPGPAGANGEKGEVGPPGPSGSTGARGAPGERGETGPPGPAGFAGPPGADGQPGAK
GDQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPP
GSNGNPGPAGPPGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSD
GPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLT
GPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALGAPGAPGPPGSPGPAGPTGKQGDR
GEAGAQGPMGPSGPAGARGIAGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPS
GDQGTSGPAGPSGPRGPPGPVGPSGKDGSNGIPGPIGPPGPRGRSGETGPAGPPGNPGPP
GPPGPPGPGIDMSAFAGLGQREKGPDPLQYMRADEADSTLRQHDVEVDATLKSLNNQIES
IRSPDGSRKNPARTCQDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGESCVYPNP
ATVPRKNWWSSKSKEKKHIWFGETMNGGFHFSYGDGNLAPNTANVQMTFLRLLSTEGSQN
ITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEMRAEGNSRFTYTALKDGCTKHTGKWGKT
IIEYRSQKTSRLPIVDIAPMDIGGPDQEFGVDIGPVCFL
NT seq 4260 nt   +upstreamnt  +downstreamnt
atgatccgcctcggggctccccagtcgctggtgctgctgacgctgctcatcgccacggtc
ctacaatgtcagggccaggatgcccgaaaattagggccaaaggggcagaaaggagaacct
ggagatatcaaagatatcataggacctaaaggacctcctggccctcagggacctgccggt
gaacaaggacccagaggtgaccgtggtgacaagggagagaggggtgcacctggaccccgt
ggcagagatggagaacctggtacccctggaaatcctggtccccctggccctccaggcccc
cctggtccccctggccttggtggaggaaactttgcagcccagatggctggaggatttgac
gagaaggctggtggtgcccagatgggagtcatgcaaggccccatgggccccatgggacct
cgtggacccccaggccccgccggtgcccccggccctcaaggatttcaaggcaatcctggt
gaacctggcgagcctggtgtctctggtcccattggtccccgaggtcctccaggtcctgct
ggaaaacctggtgatgatggtgaagctggaaagcccggaaaggccggagaaagaggcctc
cctggtcctcagggtgctcgtggattcccgggaacccccggtcttcctggtgtcaagggt
cacagaggttacccaggcctcgatggtgctaagggagaagctggtgctccaggtgtgaag
ggtgagagcggttcccctggtgagaacggatccccaggcccaatgggtccccgtggcctg
cctggcgagagaggacggactggccctgctggtgctgctggtgctcgaggcaacgacggc
cagccaggccctgctggacctccgggtcctgttggtcctgcaggtggtcctggcttcctt
ggtgctcctggtgccaagggcgaagctggtcccactggtgcccgtggtcctgaaggtgct
caaggttctcgtggcgaacccggcaatcccggatcccccgggcccgcaggcgcttctggt
aacccagggactgatggtattcccggagccaaaggatctgctggcgctcctggaattgct
ggtgcccctggcttccctgggccccgtggccctcccggtcctcaaggtgcaactggtcct
ctgggccccaaaggtcagacgggtgagcccggcatcgctggcttcaaaggtgaacaaggc
cccaagggagagactggacctgctgggccccagggagcccctggccctgctggtgaagaa
ggaaaacgaggtgctcgaggagagcccggtggcgctggacctatcggcccccctggagag
agaggtgctcctggcaaccgcggtttcccaggtcaagatggtctggcaggtcccaagggt
gcccctggagagcgagggcctagtggcctggctggtcctaagggtgccaatggtgacccg
ggtcgtcctggagaacctggtcttcctggagccaggggtcttactggccgccctggtgat
gctggtcctcaaggcaaagttggtccttctggagcccctggagaagacggtcgccctggt
cctcctggtcctcagggagctcgtgggcagcctggcgtcatgggtttccctggccccaaa
ggagccaatggcgagcctggcaaagctggtgagaaaggactggctggtgctcctggtctg
agaggtcttcctggcaaagatggtgagacaggagccgcaggaccccccggccccagcgga
cctgctggtgaacgaggcgagcagggtgctcctgggccatcagggttccagggacttcct
ggccctcctggtcccccgggtgaaggtggaaagcaaggtgaccagggtattcctggtgaa
gctggagcccctggcctcgttggtccccggggtgaacgaggtttcccaggtgaacgtggc
tctcccggtgctcagggcctccagggtccccgaggcctccctggcactcctggtactgat
ggtcccaaaggtgcggctggcccagatggtccccctggggctcagggccctccaggtcta
cagggaatgcctggtgagaggggagcagctggtattgctggacccaagggagacagaggt
gatgttggcgagaaaggcccagagggagctcctggaaaagatggtggccgaggtcttact
gggcccatcgggcccccaggaccagcaggtgccaatggcgagaagggagaagtcggacct
cctggtccttcaggaagtactggagctcgaggcgccccgggtgagcgcggagagactggg
ccacctggacctgctggattcgctggccctcctggtgctgatggccagcctggtgccaag
ggcgatcaaggagaagctggacagaaaggtgacgctggtgcccctggcccacaaggcccc
tcaggagctcccgggccacagggtcctactggagtgactggtcctaagggagcccggggc
gcccaaggcccaccgggagccaccggattccctggagctgctggccgagttggaccccca
ggttctaatggcaaccctgggcccgccggtccccctggtcctgctggaaaagatggtccc
aaaggtgctcgaggagacactggtgcccctggcagagctggtgaccctggacttcaagga
cccgcaggagctcctggagagaaaggcgaacctggagatgacggtccctctggttctgat
ggtcccccaggtccccaggggctggctggtcaaaggggcattgttggtctgcctggtcag
cgtggtgagagaggattcccaggccttcccggcccatcgggtgagcccggcaagcagggt
gcacctggtgcatctggagacagaggtcctcctggtcctgtggggcctcctggcttgaca
ggacctgcaggtgaacctggacgagagggcagccctggtgctgatggaccccctggcaga
gatggtgcggctggagtcaagggtgatcgtggtgagactggtgcacttggtgctcctgga
gctcctgggcccccaggctctcctggtcctgctggcccaactggcaaacaaggagacaga
ggagaggctggtgcacaaggtcctatgggcccctcaggacctgctggagcccgtggaatt
gctggccctcaaggcccccgaggtgacaaaggagaagctggagagcctggcgagagagga
ctgaaggggcaccgaggtttcactggactgcagggtctgcctggccccccgggtccttct
ggagatcagggtacttctggccctgctggtccttccggccctagaggtccacctggccct
gttggtccctctggcaaagatggctctaatggaatccctggccccatcgggcctccaggt
ccccgtggacgctcaggagaaactggccctgctggtcctcctggaaatcctggtccccct
ggccctccgggtcctcctggtcctggcatcgacatgtcagcctttgctggcttaggacag
agagagaagggccccgatcccctgcagtacatgcgggccgacgaggcagacagtaccttg
agacagcatgacgtcgaggtggacgccacgctcaagtcgctgaacaaccagatcgagagc
atccgcagccctgatggctcccgcaagaatcccgctcgcacctgccaggacctgaaactc
tgccacccagagtggaagagcggagactactggattgatcccaaccagggctgcaccttg
gacgccatgaaagtcttctgcaacatggagactggcgagtcttgcgtctaccccaaccca
gcgactgtgcctcggaagaactggtggagcagcaagagcaaggagaagaagcacatctgg
tttggagagaccatgaacggcggcttccacttcagctacggcgacggcaacctggctccc
aacaccgctaacgtccagatgactttcctccgtctactgtccactgagggctcccagaac
atcacctaccactgtaagaacagcattgcctacctggacgaagcagccggcaacctcaag
aaggccttgctcatccagggctccaatgatgtggagatgagggccgagggcaacagcagg
ttcacgtacactgccctgaaggatggctgcacgaaacacaccggtaagtggggcaagacc
atcatcgagtaccgatcacagaagacctcacgccttcccattgttgacattgcacccatg
gacatcggagggcctgatcaggaatttggtgtggacatagggcctgtctgtttcttgtaa

KEGG   Rattus norvegicus (rat): 25473
Entry
25473             CDS       T01003                                 

Gene name
Lamb2
Definition
(RefSeq) laminin subunit beta-2 precursor
  KO
K06243  laminin, beta 2
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    25473 (Lamb2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25473 (Lamb2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25473 (Lamb2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    25473 (Lamb2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    25473 (Lamb2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25473 (Lamb2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    25473 (Lamb2)
   05145 Toxoplasmosis
    25473 (Lamb2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C
Other DBs
NCBI-GeneID: 25473
NCBI-ProteinID: NP_037106
RGD: 2988
Ensembl: ENSRNOG00000047768
UniProt: P15800
Structure
PDB: 
5LF2

Position
8
AA seq 1801 aa
MEWASGKPGRGRQGQPVPWELRLGLLLSVLAATLAQVPSLDVPGCSRGSCYPATGDLLVG
RADRLTASSTCGLHSPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPNSHRIQNVVTSFAP
QRRTAWWQSENGVPMVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWRVYRY
FSYDCGADFPGIPLAPPRRWDDVVCESRYSEIEPSTEGEVIYRVLDPAIPIPDPYSSRIQ
NLLKITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVIRGNCFCYGHASQCAPAPG
APAHAEGMVHGACICKHNTRGLNCEQCQDFYQDLPWHPAEDGHTHACRKCECNGHSHSCH
FDMAVYLASGNVSGGVCDGCQHNTAGRHCELCRPFFYRDPTKDMRDPAACRPCDCDPMGS
QDGGRCDSHDDPVLGLVSGQCRCKEHVVGTRCQQCRDGFFGLSASNPRGCQRCQCNSRGT
VPGGTPCDSSSGTCFCKRLVTGDGCDRCLPGHWGLSHDLLGCRPCDCDVGGALDPQCDEA
TGQCPCRPHMIGRRCEQVQPGYFRPFLDHLTWEAEGAHGQVLEVVERLVTNRETPSWTGV
GFVRLREGQEVEFLVTSLPRAMDYDLLLRWEPQVPEQWAELELVVQRPGPVSAHSPCGHV
LPRDDRIQGMLHPNTRVLVFPRPVCLEPGLSYKLKLKLTGTGGRAHPETPYSGSGILIDS
LVLQPHVLMLEMFSGGDAAALERRTTFERYRCHEEGLMPSKTPLSEACVPLLISASSLVY
NGALPCQCDPQGSLSSECNPHGGQCRCKPGVVGRRCDACATGYYGFGPAGCQACQCSPDG
ALSALCEGTSGQCLCRTGAFGLRCDHCQRGQWGFPNCRPCVCNGRADECDAHTGACLGCR
DYTGGEHCERCIAGFHGDPRLPYGGQCRPCPCPEGPGSQRHFATSCHRDGYSQQIVCHCR
AGYTGLRCEACAPGHFGDPSKPGGRCQLCECSGNIDPTDPGACDPHTGQCLRCLHHTEGP
HCGHCKPGFHGQAARQSCHRCTCNLLGTDPQRCPSTDLCHCDPSTGQCPCLPHVQGLSCD
RCAPNFWNFTSGRGCQPCACHPSRARGPTCNEFTGQCHCHAGFGGRTCSECQELHWGDPG
LQCRACDCDPRGIDKPQCHRSTGHCSCRPGVSGVRCDQCARGFSGVFPACHPCHACFGDW
DRVVQDLAARTRRLEQWAQELQQTGVLGAFESSFLNLQGKLGMVQAIVAARNTSAASTAK
LVEATEGLRHEIGKTTERLTQLEAELTDVQDENFNANHALSGLERDGLALNLTLRQLDQH
LDILKHSNFLGAYDSIRHAHSQSTEAERRANASTFAIPSPVSNSADTRRRAEVLMGAQRE
NFNRQHLANQQALGRLSTHTHTLSLTGVNELVCGAPGDAPCATSPCGGAGCRDEDGQPRC
GGLGCSGAAATADLALGRARHTQAELQRALVEGGGILSRVSETRRQAEEAQQRAQAALDK
ANASRGQVEQANQELRELIQNVKDFLSQEGADPDSIEMVATRVLDISIPASPEQIQRLAS
EIAERVRSLADVDTILAHTMGDVRRAEQLLQDAQRARSRAEGERQKAETVQAALEEAQRA
QGAAQGAIRGAVVDTKNTEQTLQQVQERMAGTEQSLNSASERARQLHALLEALKLKRAGN
SLAASTAEETAGSAQSRAREAEKQLREQVGDQYQTVRALAERKAEGVLAAQARAEQLRDE
ARGLLQAAQDKLQRLQELEGTYEENERELEVKAAQLDGLEARMRSVLQAINLQVQIYNTC
Q
NT seq 5406 nt   +upstreamnt  +downstreamnt
atggagtgggcctcaggaaaaccagggaggggcaggcagggacagcctgtgccatgggaa
cttcgcttgggcctacttctaagtgtgctggctgccacattggcccaagtcccatccttg
gatgtacctggctgttctcgaggaagctgctatccagccaccggtgacctgttggtgggc
cgtgcagacagactgacagcctcctccacgtgtggcttgcatagccctcaaccctactgt
attgtcagtcacctgcaggatgaaaagaagtgcttcctgtgtgactcccgacgtcccttc
tctgctcgagacaacccaaatagtcaccggatccagaatgtagtcactagctttgcgcca
caacgccggacggcctggtggcagtcggagaacggtgttccgatggtcaccatccaactg
gacctggaagctgagtttcatttcacccacctcatcatgaccttcaagacatttcgtcct
gcggccatgctggtggagcgctctgcagacttcggccgcacgtggcgtgtctaccgatat
ttttcctacgactgcggggctgacttcccaggaatcccgctggccccgccgcgtcgctgg
gatgatgtagtgtgtgagtcccgctactcagaaattgagccatctactgaaggcgaggtc
atctatcgtgtgctggatcctgctattcctattccagacccctacagctcacggattcag
aacctgttgaagatcaccaacctccgtgtgaacctcacccggcttcacacactgggagac
aacttgcttgacccacggcgggagatccgggaaaaatattattatgctctctatgaactt
gtcatccgcgggaactgcttctgctatggacatgcctcacagtgtgcgcctgcaccaggg
gcaccggcccatgctgagggcatggtacatggggcctgtatttgcaagcacaatactcgt
gggctcaactgtgagcagtgtcaggatttctatcaggaccttccctggcatcctgctgag
gatggccatactcacgcctgtcggaagtgtgagtgcaacgggcatagtcatagctgccac
tttgacatggctgtgtacctggcatctggaaacgtaagtggaggtgtctgcgatggctgt
cagcacaacacggctgggcgccactgcgagctctgccggcccttcttctaccgtgaccca
accaaggacatgcgggatccagctgcgtgccgtccttgtgactgtgacccaatgggttct
caagacggtggccgctgtgattcccatgatgaccctgtgctaggtctggtctcaggccag
tgtcgctgcaaagaacacgtggttggcactcgctgccagcaatgccgggacggcttcttt
ggacttagtgccagcaaccctcgaggatgtcagcgttgtcagtgtaactcaaggggcaca
gtgcctgggggcaccccttgtgactccagtagtggaacctgtttctgcaagcgtctggtg
actggagacggctgtgaccgctgtctgcctggccactggggcctgagccatgacctgcta
ggctgccgcccctgtgactgtgatgtgggcggtgccttggatcctcagtgtgatgaagcc
acgggtcagtgcccgtgccgcccacacatgattgggcggcgctgtgaacaagtacagcct
ggctacttccggccttttctagaccatctaacctgggaggctgagggtgcccatgggcag
gtgcttgaggtggtagaacgtctggtgaccaaccgagagactccgtcctggactggtgta
ggctttgtgcgactgcgggaaggtcaggaagtggagttcctggtgacctctttgcctagg
gccatggactatgacctgctactgcgctgggagccccaggttcctgagcagtgggcagag
ctggaattggtggtgcagcgcccagggcctgtgtctgcacacagtccgtgtgggcacgtg
ctgccgagggatgaccgcattcaggggatgcttcacccaaacaccagggttttggtgttt
cccagacctgtctgccttgagcctggtctctcctacaagctgaagttgaaattgactgga
acagggggacgtgcccatcctgaaaccccatactctggatctggaatactcattgactcg
ctggtgctgcagccccatgtcttgatgctagagatgtttagtgggggggatgctgctgcc
ctggaacgccgtaccacctttgaacgctaccgctgccatgaggaaggtctgatgcccagc
aagacccccctatctgaggcctgcgttcccctcctcatcagtgcgtcctccctggtctac
aatggcgccttgccctgccagtgtgaccctcaaggctcactgagttccgaatgcaatcct
catggtggccagtgccggtgcaagcctggagtggttggacgacgctgtgatgcctgtgct
actggctactatggctttgggcccgcaggctgtcaagcctgccagtgtagtcccgacgga
gcactcagtgccctgtgtgaagggactagtggacagtgtctctgccgaaccggtgccttt
ggtcttcgctgtgaccactgtcaacgtggccagtggggtttccctaactgccggccgtgt
gtctgcaacgggcgtgcagatgagtgtgacgcccacacaggcgcttgcctgggctgccgt
gattacacagggggcgagcactgtgagaggtgcattgctggttttcatggggacccacgg
ctgccatatgggggccagtgccggccttgtccctgccctgaaggccccgggagccagcga
cactttgctacttcatgtcaccgggatggatattcccagcaaattgtgtgccactgtcga
gcaggctacacagggcttcggtgtgaggcttgtgcccccgggcactttggggacccatcc
aagccaggtggcaggtgccaactgtgtgagtgcagtgggaacattgaccccacggatccc
ggtgcctgtgatccccacacggggcaatgcttgcgctgtttacaccacacggaggggccc
cactgtggccattgcaagcctggcttccatgggcaagctgcccgacagagctgtcaccgc
tgtacctgcaaccttctgggcacagatccccagcggtgcccatctacagacctgtgccat
tgtgacccaagcactgggcagtgcccatgcctcccccatgtccaaggcctcagttgcgac
cgttgtgcccccaacttttggaactttaccagtggacgtggctgccagccttgtgcttgt
cacccaagccgggccagaggccctacctgcaatgagttcacagggcagtgccactgtcat
gctggctttggtgggaggacctgttctgagtgccaagagctccactggggagaccctgga
ctgcagtgccgcgcctgtgactgtgatcccagaggaatagacaaaccccaatgtcatcgt
tctactggtcactgtagctgccgcccaggcgtgtctggcgtgcgctgcgaccagtgtgct
cgtggcttctcgggtgtttttcctgcttgtcacccctgccatgcatgctttggagactgg
gatcgtgtggtacaggaccttgctgctcgtacgcggcgcctggagcagtgggctcaggaa
ttgcagcaaacgggtgtgctgggtgcctttgagagcagctttttaaacctgcaggggaag
ctgggaatggtgcaggccatcgtggctgcccgcaacacctcagctgcgtctactgcaaag
cttgtagaggccacagagggactgcgtcatgaaattgggaagaccactgaacgcctgact
cagttagaagcagagctcacagatgtacaggatgagaacttcaatgccaaccatgcactc
agtggtctggagagagacgggctcgcacttaatctcacactgaggcagcttgaccagcat
ctggatatcctcaaacactcaaatttcttaggtgcctatgacagcatccgacatgcccac
agtcagtctacagaagcagagcgccgggccaatgcctccacctttgcaatacccagccct
gtgagcaactcagcagatacccggcgtcgggcagaagtgctaatgggtgcccaaagagag
aacttcaaccgccagcacttggccaaccagcaggcacttggacggctgtccacacacacc
cataccctgagcctgacgggagtaaatgagctggtgtgtggggccccaggggacgcaccc
tgtgctaccagcccttgtgggggtgctggatgtcgggatgaggatgggcaaccccgctgt
ggtggcctcggttgcagtggagcagcagccacagcagatctagcactgggccgggctcgg
cacacacaggcagagctgcagagagcgctggtagaaggtggtggcatcctcagccgagtg
tctgagactcgtcggcaggcagaggaggcccagcaacgagcacaggcagccctggacaag
gctaatgcttctaggggccaagtggagcaggccaatcaggagcttcgagaacttatccag
aatgtgaaggacttcctcagccaggagggagctgatcccgacagtattgaaatggtagca
acacgggtgctagacatctccatcccagcctcgcctgagcagatccagcgcctagcaagt
gagattgcagaacgtgtccgaagcctggcggatgtggacacaatcctggcacataccatg
ggggatgtgcgtcgggctgaacagctactgcaagatgcacagcgggcacggagccgggct
gagggtgagagacagaaggcagagacagtccaagcggcactggaggaggcccagagggca
cagggagctgctcagggcgccatccggggagcagtggttgacacaaaaaacacagagcag
accctgcagcaggtccaggagaggatggcaggtacagagcagtctctgaactctgctagt
gagcgggctcggcagttacatgcccttctggaagccctgaaactgaaaagggcaggaaac
agcctggcagcatctacagctgaagagacagcaggcagtgcccagagccgtgccagggag
gctgagaaacagctacgggaacaagtaggtgaccagtaccaaactgtgagggcgttggct
gaacggaaggctgaaggtgttctggccgcacaagccagagcagaacaactgcgggatgag
gctcggggtctgttgcaggctgctcaggacaagttgcagcggctacaggaactggagggc
acatatgaagagaatgaacgcgaactggaggtcaaagcggctcagctggatgggctggag
gccaggatgcgcagtgtgcttcaggcaatcaacctgcaggtccagatctacaacacctgc
cagtga

KEGG   Rattus norvegicus (rat): 25567
Entry
25567             CDS       T01003                                 

Gene name
Tnr
Definition
(RefSeq) tenascin-R precursor
  KO
K06252  tenascin
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05165  Human papillomavirus infection
rno05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    25567 (Tnr)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25567 (Tnr)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25567 (Tnr)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    25567 (Tnr)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25567 (Tnr)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    25567 (Tnr)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   25567 (Tnr)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 Tissue_fac EGF Lzipper-MIP1
Other DBs
NCBI-GeneID: 25567
NCBI-ProteinID: NP_037177
RGD: 3886
Ensembl: ENSRNOG00000002468
Vega: OTTRNOG00000000165
UniProt: Q05546
Structure
PDB: 
1TDQ

Position
13
AA seq 1358 aa
MGIEGETVVLKNMLIGVNLILLGSMLKPSECRLEVTTERVQRQTVEEEGGASSYNTSSKE
QPMVFNHVYNINVPLESLCSSGLEASAEQDVSAEDDTLAEYTGQTSDHESQVTFTHKINL
PKKACPCASSAQVLQELLSRIEMLEREVSVLRDQCNTNCCQESAATGQLDYVPHCSGHGN
FSFESCGCICNEGWFGKNCSEPYCPLGCSSRGVCVDGQCICDSEYSGDDCSELRCPTDCS
SRGLCVDGECVCEEPYTGEDCRELRCPGDCSGKGQCANGTCLCQEGYAGEDCSQRRCLNA
CSGRGHCQEGLCICEEGYQGPDCSAVTPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALGGLQLQQRVPGDWSGVTITELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQG
LQFKTITETTVEVQWEPFSFSFDGWEISFTPKNNEGGVIAQLPSDVTSFNQTGLKPGEEY
IVNVVALKEQARSPPTSASVSTVIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGL
VGGEGGKTTFRLQPPLSQYSVQALRPGSRYEVSISAVRGTNESDASSTQFTTEIDAPKNL
RVGSRTATSLDLEWDNSEAEAQEYKVVYSTLAGEQYHEVLVPKGIGPTTKTTLTDLVPGT
EYGVGISAVMNSKQSIPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGISSEVTVPRDRTSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVLLDATKRHAVLMGLQPATEY
IVNLVAVHGTVTSEPIVGSITTGIDPPKNITISNVTKDSLTVSWSPPVAPFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITRLYPATEYEISLNSVRGREESERICTLVHTAMDSPMDLI
ATNITPTEALLQWKAPMGEVENYVIVLTHFAMAGETILVDGVSEEFQLVDLLPRTHYTVT
MYATSGPLVSGTIATNFSTLLDPPANLTASEVTRQSALISWQPPRAAIENYVLTYKSTDG
SRKELIVDAEDTWIRLEGLSENTDYTVLLQAAQEATRSSLTSTIFTTGGRVFSHPQDCAQ
HLMNGDTLSGVYTIFLNGELSHKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVG
FGNLEDEFWLGLDNIHRITAQGRYELRVDMRDGQEAVFAYYDKFAVEDSRSLYKLRIGGY
NGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYIHRLTAGRKRRALKF
NT seq 4077 nt   +upstreamnt  +downstreamnt
atggggatcgaaggggaaacagtagtcttgaagaacatgctcattggtgtaaacctgatc
ctgttgggatccatgctcaagccttctgaatgccggctggaagtgactacagaaagggtc
cagagacaaactgtggaggaggaaggaggggcttccagctacaacacatccagcaaggaa
cagcctatggtctttaaccatgtgtataacatcaacgtgccactcgaaagcctctgctcc
tcggggctggaggcctcagctgagcaggacgtgagtgccgaagatgatactctggcagaa
tacacaggccaaacctcagaccatgaaagccaggtcaccttcacccacaagatcaacctc
cccaaaaaagcctgcccatgtgcaagctctgcccaggtactgcaggaactgttgagccga
atcgagatgctggagagggaggtgtcagtgctgcgagaccagtgcaacaccaactgctgt
caggaaagtgctgccacaggacaactggactatgtccctcactgcagcggccatggcaac
tttagcttcgagtcctgtggctgcatctgcaatgaaggctggtttggcaagaactgctca
gagccctactgcccattgggctgctccagtcggggtgtatgtgtcgatggccagtgcatt
tgtgacagtgaatacagcggagatgactgttcagagctccggtgcccaacagactgcagt
tcccgagggctctgtgtggatggggaatgtgtctgtgaagagccctacacaggcgaggac
tgcagggagctgcgttgccctggggactgttcagggaaggggcaatgtgccaatggtacc
tgcctgtgccaagagggctatgctggtgaggactgcagtcagcgacggtgtctgaatgct
tgcagtgggcgaggtcactgccaggaggggctctgcatctgtgaggagggctaccagggc
cctgactgctcagcagttacccctccagaggacttgcgagtggctggtatcagcgacagg
tccattgagctggaatgggacgggccgatggcagtgacggaatatgtgatctcttaccag
ccgacggccctggggggccttcagctccagcagcgggtgcctggagattggagtggtgtc
accatcacggagctggagccaggtctcacctacaacatcagcgtctacgctgtcattagc
aacatcctcagccttcccatcactgccaaggtggccactcatctttctactcctcaaggg
ctacagttcaagacgatcacagagaccaccgtggaagtgcagtgggagcccttctctttc
tccttcgatgggtgggagatcagcttcactccaaagaacaatgaaggaggggtgatagct
cagctccccagcgatgttacgtcctttaaccaaacgggactgaaacctggggaggagtac
attgtgaatgttgtagcactaaaggaacaagcccggagccctccgacctctgccagcgtc
tccactgtcattgacgggcccacacagatcctggttcgagatgtctctgatactgtggcc
tttgtggaatggaccccacctcgagccaaagttgatttcattctattaaaatatggcttg
gtgggtggcgaaggtgggaagactaccttccggctgcagcctccccttagccagtactca
gtgcaagcccttagacctggctcccgctacgaggtgtccatcagcgcagtccgggggacc
aatgaaagtgatgcctcaagcacccagtttacaacagaaattgatgctcccaagaatttg
cgagtgggttcccggacagcaactagccttgacctcgaatgggataacagcgaggctgaa
gctcaggagtacaaggttgtgtacagcaccctagccggggagcagtaccatgaagtgctg
gtacccaaaggcattggtccaactaccaagactaccctcacagatctggttccaggcaca
gaatatggagttggaatatctgcggtaatgaactcgaaacaaagcattcccgctaccatg
aatgccaggactgagcttgacagtccccgagacctcatggtaacagcctcctcagagacc
tctatctctctcatctggacgaaggccagtggtcctattgatcactatcgaattactttt
actccatcttctgggatctcctcagaagtcactgtgcctagggataggacttcatataca
ctgacagatctagagcctggagcagaatacatcatctccattactgctgagaggggtcgg
cagcagagcctggagtctactgtggatgccttcacaggcttccgccctatctcccacttg
cacttttctcatgtgacctcctccagtgtcaacatcacctggagtgacccatctccccca
gcagacagactcattctgaactacagccccagggatgaagaggaggagatgatggaggtt
ctcttggatgccaccaagaggcacgccgtcctaatgggtctacagccagccactgaatat
atagtgaatctcgtagctgtccatgggacagtaacctctgaacccatagtgggttctatc
actacaggaattgatcctcccaaaaacatcacaattagcaacgtgactaaggactccctg
acagtttcctggagccctcctgttgcgccttttgattactaccgagtatcctatcgaccc
acccaagtgggacggctggacagctctgtcgtgcccaacactgtgacagagttcacaatc
accaggctctatccagctactgaatatgaaataagcctcaacagtgtacgcggcagggag
gagagtgaacgcatctgcaccctggtgcacacagccatggatagccccatggatctaatt
gctaccaacatcacgccaacagaagccctgctccagtggaaggcacccatgggtgaagtg
gaaaattatgtcattgtcctcacccactttgccatggctggagagaccatcctggttgat
ggggtcagtgaagaattccagcttgtagacctgcttcctaggacccactacactgtcacc
atgtatgctaccagtggacctctcgtgagtggcaccatcgccaccaacttctccaccctc
ctggaccctcccgccaacctgacagccagtgaagtcaccaggcaaagcgcactgatctcc
tggcagccacccagagctgcgattgaaaactatgtcttgacatacaagtccaccgatgga
agccgcaaggagctgattgtggatgctgaggacacctggatccgactggagggcctgtcg
gagaacacagactacacagtgctcctgcaggcagctcaggaggccacaaggagcagcctc
acctctactatctttaccacagggggccgggtgttctctcatcctcaagactgtgcccag
cacttgatgaatggagacactctgagtggagtttacaccatcttcctcaatggggagtta
agccacaagttgcaagtatactgcgatatgaccacagatgggggcggctggattgttttc
cagagacggcaaaatggccaaactgattttttccggaaatgggcagattaccgtgttggc
tttgggaatctggaggatgagttttggctagggctagacaacatccacaggataacagcc
cagggccgctatgagctgcgtgtggatatgcgggatggacaggaggccgtctttgcctac
tatgacaagttcgctgtggaggacagcagaagcctgtacaagctccgtataggaggctac
aatggcactgcaggagactcccttagctatcaccagggacgtcctttctccactgaggac
agagacaatgatgttgcagtcaccaactgtgccatgtcatacaagggtgcttggtggtat
aagaactgccaccggaccaacctcaacgggaagtatggggagtccaggcacagtcagggg
atcaactggtaccattggaaaggccatgaattctccatcccctttgtagaaatgaagatg
aggccctacatccatcgtctcacagccgggaggaaacggcgagccttgaaattctga

KEGG   Rattus norvegicus (rat): 25661
Entry
25661             CDS       T01003                                 

Gene name
Fn1
Definition
(RefSeq) fibronectin precursor
  KO
K05717  fibronectin 1
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04810  Regulation of actin cytoskeleton
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno05100  Bacterial invasion of epithelial cells
rno05135  Yersinia infection
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05205  Proteoglycans in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    25661 (Fn1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    25661 (Fn1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    25661 (Fn1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    25661 (Fn1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    25661 (Fn1)
   05205 Proteoglycans in cancer
    25661 (Fn1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    25661 (Fn1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    25661 (Fn1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    25661 (Fn1)
   05100 Bacterial invasion of epithelial cells
    25661 (Fn1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    25661 (Fn1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    25661 (Fn1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    25661 (Fn1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    25661 (Fn1)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    25661 (Fn1)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    25661 (Fn1)
Membrane trafficking [BR:rno04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    25661 (Fn1)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   25661 (Fn1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   25661 (Fn1)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Fibronectin (FN) domain-containing proteins
  Fibronectin type I domain-containing proteins
   25661 (Fn1)
SSDB
Motif
Pfam: fn3 fn1 fn2 Pur_ac_phosph_N NDNF
Other DBs
NCBI-GeneID: 25661
NCBI-ProteinID: NP_062016
RGD: 2624
Ensembl: ENSRNOG00000014288
UniProt: A0A096P6L8
Position
9
AA seq 2477 aa
MLRGPGPGRLLLLAVLCLGTSVRCTETGKSKRQAQQIVQPPSPVAVSQSKPGCFDNGKHY
QINQQWERTYLGNALVCTCYGGSRGFNCESKPEPEETCFDKYTGNTYKVGDTYERPKDSM
IWDCTCIGAGRGRISCTIANRCHEGGQSYKIGDKWRRPHETGGYMLECLCLGNGKGEWTC
KPIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGNGRITCTSRNRCNDQDTRTS
YRIGDTWSKKDNRGNLLQCVCTGNGRGEWKCERHVLQSASAGSGSFTDVRTAIYQPQTHP
QPAPYGHCVTDSGVVYSVGMQWLKSQGDKQMLCTCLGNGVSCQETAVTQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHAVLVQTRGGNSNGALC
HFPFLYNNRNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRI
GDQWDKQHDLGHMMRCTCVGNGRGEWACIPYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPIDQCQDSETRTFYQIGDSWEKFVHGVRYQCYCYGRGIGEWHCQ
PLQTYPGTTGPVQVIITETPSQPNSHPIQWNAPEPSHITKYILRWRPKTSTGRWKEATIP
GHLNSYTIKGLTPGVIYEGQLISIQQYGHQEVTRFDFTTSASTPVTSNTVTGETAPFSPV
VATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDLL
PGRKYIVNVYQISEEGKQSLILSTSQTTAPDAPPDPTVDQVDDTSIVVRWSRPQAPITGY
RIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVFIQQETTGV
PRSDDVPAPKDLQFVEVTDVKVTIMWTPPNSAVTGYRVDVLPVNLPGEHGQRLPVNRNTF
AEVTGLSPGVTYLFKVFAVHQGRESKPLTAQQTTKLDAPTNLQFVNETDRTVLVTWTPPR
ARIAGYRLTVGLTRGGQPKQYNVGPMASKYPLRNLQPGSEYTVTLMAVKGNQQSPKATGV
FTTLQPLRSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVVS
GLTPGVEYTYTIQVLRDGQERDAPIVNRVVTPLSPPTNLHLEANPDTGVLTVSWERSTTP
DITGYRITTTPTNGQQGTALEEVVHADQSSCTFENLNPGLEYNVSVYTVKDDKESAPISD
TVIPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSVG
YYTVTGLEPGIDYDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTWAPP
PSIELTNLLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYLVSVSSVYEQHESIPL
RGRQKTGLDSPTGFDSSDVTANSFTVHWVAPRAPITGYIIRHHAEHSAGRPRQDRVPPSR
NSITLTNLNPGTEYIVTIIAVNGREESPPLIGQQSTVSDVPRDLEVIASTPTSLLISWEP
PAVSVRYYRITYGETGGNSPVQEFTVPGSKSTATINNIKPGADYTITLYAVTGRGDSPAS
SKPVSINYQTEIDKPSQMQVTDVQDNSISVRWLPSTSPVTGYRVTTAPKNGLGPTKSQTV
SPDQTEMTIEGLQPTVEYVVSVYAQNRNGESQPLVQTAVTNIDRPKGLAFTDVDVDSIKI
AWESPQGQVSRYRVTYSSPEDGIHELFPAPDGDEDTAELHGLRPGSEYTVSVVALHGGME
SQPLIGVQSTAIPAPTNLKFTQVSPTTLTAQWTAPSVKLTGYRVRVTPKEKTGPMKEINL
SPDSTSVIVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETTI
TISWRTKTETITGFQVDAIPANGQTPVQRTISPDVRSYTITGLQPGTDYKIHLYTLNDNA
RSSPVVIDASTAIDAPSNLRFLTTTPNSLLVSWQAPRARITGYIIKYEKPGSPPREVVPR
PRPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPEI
LDVPSTVQKTPFVTNPGYDTENGIQLPGTSHQQPSVGQQMIFEEHGFRRTTPPTAATPVR
LRPRPYLPNVDEEVQIGHVPRGDVDYHLYPHVPGLNPNASTGQEALSQTTISWTPFQESS
EYIISCQPVGTDEEPLQFQVPGTSTSATLTGLTRGVTYNIIVEALHNQRRHKVREEVVTV
GNTVNEGLNQPTDDSCFDPYTVSHYAVGEEWERLSDSGFKLTCQCLGFGSGHFRCDSSKW
CHDNGVNYKIGEKWDRQGENGQRMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQK
EYLGAICSCTCFGGQRGWRCDNCRRPGAAEPSPDGTTGHTYNQYTQRYHQRTNTNVNCPI
ECFMPLDVQADRDDSRE
NT seq 7434 nt   +upstreamnt  +downstreamnt
atgctcaggggtccgggacccgggcggctgctgctgctagcagtcctgtgcctggggaca
tcggtgcgctgcaccgaaaccgggaagagcaagaggcaggctcagcaaatcgtgcagcct
ccgtccccggtggctgtcagtcagagcaagcctggctgttttgacaacgggaagcattat
cagataaatcagcagtgggaacggacctacctaggcaacgccctggtttgtacctgctat
ggaggaagcagaggttttaactgcgagagcaagcctgaacctgaagagacctgttttgac
aaatacactggaaacacttacaaagtgggtgacacttatgagcgccctaaagattccatg
atctgggactgtacctgcattggggctgggcgaggcaggatcagctgtaccattgcaaat
cgctgccatgaagggggtcagtcctacaagattggtgacaagtggaggaggccacatgag
actggtggctatatgttggagtgtttgtgtctggggaatggaaaaggggaatggacctgc
aagccaatagctgagaaatgttttgatcacgctgctgggacttcctacgtcgtgggggag
acctgggaaaagccctaccaaggctggatgatggtggactgtacttgtctgggcgaaggc
aatgggcgtatcacctgcacctcccggaacagatgcaatgatcaggacaccaggacgtcc
tacagaattggagacacatggagcaagaaggacaacagagggaacctgctccagtgtgtc
tgcacaggcaacggcagaggggagtggaagtgtgagcgacatgttctacagagtgcttca
gctggatctggctccttcacagatgtccgaacagctatttaccaaccccagacccacccc
cagcccgcaccgtacggccactgtgtcacagacagcggtgtggtctactctgtgggaatg
cagtggctgaagtctcaaggagacaagcagatgctgtgcacttgcctgggcaatggcgtc
agctgccaggagacagctgtgacccagacttacggtggcaactcaaacggggagccctgt
gttctcccgttcacctacaacggtaggaccttctactcctgcaccaccgaagggcggcaa
gacggacatctgtggtgtagcacaacttcaaattatgaacaagaccagaagtattctttc
tgcacagaccacgcggttttggttcagactcgaggtgggaattccaatggtgccttgtgc
cacttccccttcctgtacaacaaccggaattacaccgactgtacttctgagggtaggcgg
gacaacatgaaatggtgcggcaccacccagaactacgatgccgatcagaagtttggattc
tgcccaatggctgcccatgaggagatctgcacgaccaacgaaggggtcatgtatcgcatt
ggggaccagtgggataagcagcatgacctgggccacatgatgaggtgcacgtgtgttggg
aacggccgtggagaatgggcctgcatcccctactcccagctccgagatcagtgcatcgtt
gatgacattacttacaacgtcaacgacacgttccacaagcgtcacgaggagggacatatg
ctgaactgtacctgcttcggtcagggccggggcagatggaaatgtgaccccatcgaccag
tgccaagattcagagacccggacattttaccagattggtgactcctgggagaagtttgtg
catggtgtcagataccagtgttactgttacggccgtggcattggggagtggcactgccag
cctctgcagacctacccaggcacaactggacctgttcaagtaattatcacggagaccccc
agccagcccaattcccaccccatccagtggaatgccccggagccttcacacatcaccaag
tacattctcaggtggagacctaaaacctctacgggtcgctggaaggaagctaccattcca
ggccaccttaactcctataccatcaaaggcctgaccccaggtgtgatctacgagggacag
ctcatcagcatccagcagtacgggcaccaagaagtgactcgctttgacttcaccaccagc
gccagcacacctgtgaccagcaacacagtgactggagagactgcgcccttttctcctgtt
gtggccacttccgaatctgtcactgaaatcacagccagcagcttcgtggtctcctgggtc
tcagcttccgacacggtgtcaggattccgagtggagtacgaactgagcgaggaaggagat
gagcctcagtaccttgatcttccaagcacagccacttctgtgaacattcctgacctgctc
ccgggcagaaagtacatcgtcaacgtctatcagatatctgaagagggaaagcagagcttg
atcctgtctacatcacagactacagcacctgatgcgcctccagaccctactgtggaccag
gttgatgacacttccattgttgttcgatggagcagaccccaggcacctatcacagggtac
aggattgtctattcaccttcagtagaaggcagtagcacagaactcaaccttcctgaaacg
gccaactccgtcaccctcagcgacctgcagcccggtgttcagtacaacatcactatctat
gctgtggaggagaaccaggagagcacacccgttttcatccagcaggagactactggcgtc
ccacgatccgatgatgttcccgctccaaaggacctacagtttgtggaagtgaccgacgtg
aaagtcaccatcatgtggacacctcctaatagcgcagtgactggataccgtgtggatgtc
ctgcctgtcaacctgccaggggaacatgggcagaggctgcctgtcaacaggaacaccttt
gctgaagtcaccggactgtccccaggggtcacgtacctcttcaaagtctttgctgtgcat
cagggcagggaaagcaagcctctgacagcacaacagaccaccaaactcgatgctcccact
aacctccagtttgtcaatgaaacggacagaacagttctggtaacttggactccacctcga
gcccggatagcaggctaccgactgacagtgggcctcacccgaggaggccagcccaagcag
tacaatgtgggacccatggcttccaagtatcccctgagaaatctgcagcctgggtctgag
tacactgtgaccttgatggctgtgaaaggcaaccagcagagtcccaaagccaccggagtc
tttactaccctgcagcctctgcgctccattccaccttataacaccgaggtgacagagacc
acaatcgtgatcacctggacccccgctccaaggattggcttcaagctgggtgtacgacca
agccagggaggtgaagcaccccgagaagtgacttcagactcaggaagcatcgttgtgtct
ggcttgactccaggcgtggaatacacgtacaccatccaagtcctgagggacggccaggag
agagatgcaccaattgtcaaccgagtagtgacaccgctgtctcccccaaccaacttgcac
ctggaggccaatcctgacactggagtgcttaccgtctcctgggagaggagcaccacccca
gatattactggctacagaataaccaccacccccacaaacgggcagcagggaaccgctttg
gaagaagtggttcatgccgatcagagttcctgcacttttgaaaacctgaatcctggcctg
gagtacaatgtcagtgtttacactgtcaaagatgacaaggaaagtgcccctatctctgat
accgtcatcccagaggtgccccagctcactgacctaagctttgttgatataactgactca
agcatcggcctgaggtggaccccgctaaactcttccaccattatcgggtaccgaatcaca
gtagttgcggcaggagaagggatccccatttttgaagattttgtggactcctcagtagga
tactacacagttacagggctggaacccggcattgactatgacatcagcgttatcactctc
attaatggcggagagagtgcccctactacactgacacagcaaacggccgtccctcctccc
acggatctgcgattcaccaatatcggtccggacactatgcgggtcacttgggccccgcct
ccgtccattgagctaaccaacctcttggtgcgctactcacctgtgaagaacgaggaggat
gtggcagagctgtccatttcaccctcagacaacgccgtggtcctaacaaatctcctgcct
gggactgagtacttagtcagtgtctccagcgtgtacgaacagcatgagagcatccctctc
agaggaagacagaaaacaggtctggactccccaactggttttgattcttctgatgtcacc
gccaactcattcaccgtccactgggtggctcctcgggcccccatcaccggctacatcatc
cgccatcacgccgagcattctgccggaagacccaggcaagaccgagtgccgccctcaagg
aattctatcaccctcaccaaccttaatccgggcacggagtacattgtcaccatcattgct
gttaatggcagagaggagagccccccactgattggccagcaatccacggtttccgatgtc
ccgagagatctggaggtcatcgcttccacccccaccagcctgctcatcagttgggaaccc
cccgccgtctctgtgcgctattacagaatcacctatggagagacaggaggaaatagccct
gtccaggaattcactgtgcccggaagcaagtccaccgccaccatcaacaacattaaacca
ggagcagactacaccatcaccctgtatgctgtcactggccgtggggacagtccagccagc
agcaagccagtttccatcaattatcaaacagaaattgacaagccatcccagatgcaggtg
acggatgtccaggacaacagcatcagtgtcaggtggctgccttcaacttctcctgtgaca
ggttacagagtgaccaccgctcccaaaaatggcctaggaccaacaaaatctcaaactgtc
agtccagatcaaacagaaatgaccattgaaggtttgcaacccaccgtggagtatgtggtt
agtgtctatgctcagaaccggaacggagaaagccagcccctggttcagactgcagtgacc
aacattgaccgccctaaaggactggcattcactgatgtggatgtcgattccatcaaaatt
gcctgggaaagcccacaggggcaagtttccaggtacagggtgacctactcaagccctgag
gatggaatccatgagcttttccctgcgcctgatggtgacgaggacacggcagagctgcac
ggcctcaggccgggttctgagtacacagtcagtgtggttgccttgcacggtggcatggag
agccagcccctgattggagtccagtccacagccattcctgcgccaaccaatctgaagttc
actcaggtgtcacccaccaccttgactgcccagtggacagcacccagtgttaagctcact
ggctaccgagtgcgggtgaccccgaaggagaagacaggaccaatgaaagaaatcaacctt
tctccagacagcacctccgtgattgtgtcagggctcatggtggccaccaagtatgaagtc
agcgtctatgctctcaaggacacattgacaagcagaccagctcagggagtcgtcacgact
ctggagaatgtcagccctccaagaagggcccgtgtgaccgacgctacagaaactaccatc
actattagctggagaacgaagacagagacgatcactggcttccaagtcgatgccattcca
gccaatggccagaccccggttcagaggaccatcagcccggatgtcagaagctatactatt
acaggtttacagccaggcactgactacaagatccacctgtacacgctcaacgacaatgcc
cggagctctcctgtggtcattgatgcctccacggccattgatgccccatccaacctgcgg
ttcctgaccaccacacccaactccttgctggtatcatggcaggcaccccgtgccaggatt
actggctacattatcaagtatgagaagcctggatcccctcccagagaagtggtccctcgg
ccccgccctggtgtcacggaggccaccatcactggtctggagccaggaaccgagtacacc
atctatgtcatcgcactgaagaacaatcagaagagtgagcccctgattgggaggaaaaag
acagatgagcttccccaactggttacccttccacaccccaatcttcatggaccagagatc
ttggatgttccctccacagttcaaaagacccccttcgtcaccaaccctgggtatgacacc
gaaaatggtattcagcttcctggcacatcccaccaacaacccagtgttgggcaacaaatg
atctttgaggaacatggctttaggcgaaccacgccacccactgcggccacccccgtcagg
cttaggccaagaccatacctgccgaatgtagatgaggaggtccaaatcggtcatgttccc
aggggagacgtagactaccacctctatcctcatgttccggggctcaatccaaatgcctct
acaggacaagaagctctctctcagacaaccatctcttggacgccattccaggagagttct
gagtacatcatttcatgccaacctgttggcactgacgaagagcccttacagttccaagtt
cctggaacttctaccagtgcgactctgactggccttaccagaggggtcacctacaacatc
atagtggaggccctgcacaaccagaggaggcacaaggtccgagaagaggttgttactgta
ggcaacactgtcaacgaaggcctgaaccagcctacggatgactcatgctttgacccttac
acggtttcccattacgccgttggagaggaatgggagcggttatctgactctggctttaag
ctcacttgccagtgcttgggctttggcagtggtcatttcagatgcgattcatctaaatgg
tgccatgacaacggtgtcaactacaagatcggagagaagtgggatcgtcagggagaaaat
ggccagcggatgagctgcacatgtctcgggaatggaaagggagaattcaaatgcgatccc
catgaagcaacgtgttatgacgacgggaagacctaccacgtaggagaacagtggcagaaa
gagtatctcggagccatttgctcctgcacgtgtttcgggggccagcggggctggcgctgt
gacaactgccgcagacctggggctgctgaacccagtcccgatggtaccactggccacacc
tacaaccagtatacacagagataccatcagagaacgaacactaatgtaaattgcccaatt
gaatgcttcatgccgttggacgtgcaggctgacagagatgattccagagagtaa

KEGG   Rattus norvegicus (rat): 290905
Entry
290905            CDS       T01003                                 

Gene name
Col4a1
Definition
(RefSeq) collagen alpha-1(IV) chain precursor
  KO
K06237  collagen type IV alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    290905 (Col4a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    290905 (Col4a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    290905 (Col4a1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    290905 (Col4a1)
  09154 Digestive system
   04974 Protein digestion and absorption
    290905 (Col4a1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    290905 (Col4a1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    290905 (Col4a1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    290905 (Col4a1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    290905 (Col4a1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    290905 (Col4a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    290905 (Col4a1)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    290905 (Col4a1)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   290905 (Col4a1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   290905 (Col4a1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 290905
NCBI-ProteinID: NP_001128481
RGD: 1307148
Ensembl: ENSRNOG00000016281
UniProt: F1MA59
Position
16
AA seq 1669 aa
MGPRLSVWLLLPLAALLLHEERSRAAAKGDCGGSGCGKCDCHGVKGQKGERGFPGLQGVI
GFPGMQGPEGPHGPPGQKGDAGEPGLPGTKGTRGPPGAAGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEILGHVPGTLLKGERGFPGI
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGSSFQGPKGDKGEQGVSGPP
GVPGQAQVKEKGDFAPTGEKGQKGEPGFQGTPGYGEKGEPGKPGPRGKPGKDGEKGEKGS
PGFPGESGYPGLPGRQGPQGEKGEPGLPGPPGTVVGTRPLGEKGERGYPGSPGLRGEPGP
KGYPGSPGQPGPPGFAVPGQTGAPGFPGERGEKGERGSPGVSLPGPSGRDGAPGPPGPPG
PPGQPGHTNGIVECQPGPPGDQGPPGIPGQPGLTGEVGQKGQKGESCLACDTEGLRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGSPGLIGQPGAKGEPGEIFFDMRLK
GDKGDPGFPGQPGMPGRAGTPGRDGHPGLPGPKGSPGSIGLKGERGPPGGVGFPGSRGDI
GPPGPPGVGPIGPIGEKGQAGLPGGPGSPGLPGPKGEAGKVVPLPGPPGAAGLPGSPGFP
GPQGDRGFPGTPGRPGNPGEKGAVGQPGIGFPGLPGPKGVDGLPGEIGRPGSPGRPGFNG
LPGNPGPQGQKGEPGIGLPGLKGQPGLPGIPGTPGEKGSIGGPGVPGEQGLTGPPGLQGI
RGDPGPPGVQGPAGPPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGTPGVPGFPGSKGEMGVMGTPGQPGSPGPAGAPG
LPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGMPGSMEHVDMGSMKGQKGDQGEKGQIG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGLSGIPGSPGLPGPKGSVGGMGLPG
SPGEKGVPGIPGSQGVPGSPGEKGAKGEKGQSGLPGIGIPGRPGDKGDQGLAGFPGSPGE
KGEKGSAGTPGMPGSPGPRGSPGNIGHPGSPGLPGEKGDKGLPGLDGVPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEQGLPGRGFPGFPGTKGDKGSKGEVGFPGLAGSP
GIPGAKGEQGFMGPPGPQGQPGLPGTPGHPVEGPKGDRGPQGQPGLPGHPGPMGPPGFPG
IDGPKGDKGNPGWPGAPGAPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGQKG
LPGLQGVKGDQGDQGIPGPKGLQGPPGPPGPYDVIKGEPGLPGPEGPPGLKGLQGPPGPK
GQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPLCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPNGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggcttctgctgccgctcgccgcccttctgctccacgag
gagcgcagccgggccgctgcgaagggtgattgtggtggctctggctgtggaaaatgtgac
tgtcatggtgtgaagggacaaaagggagagagaggctttccagggttacaaggtgtcatt
ggatttcctgggatgcaaggacctgaggggccacatggaccaccaggacaaaagggtgat
gctggagaacctggacttcctggaacgaaagggacacgaggaccccctggagcagctggc
taccctgggaatcccggacttcctggtattcctggccaggatggtcctccaggtccccca
ggtatcccaggatgcaatggtacaaagggagagagagggccactcggtcctcctggcttg
cctggattcagcgggaatcctggaccaccggggttaccaggaatgaagggagatcccggt
gaaattctcggccatgttcctgggaccctgctgaaaggagagagaggatttcctggtatc
cctggggcgccgggctcaccagggttgccaggactccagggacctgtcggtcctccggga
tttactggaccaccgggtcctccaggccctcctggacctcctggagaaaaggggcagatg
ggatccagcttccaaggaccaaaaggcgacaagggagaacaaggggtcagtggtcccccg
ggagtacctggacaggcacaagttaaggaaaaaggagactttgccccaacaggagaaaag
ggtcagaaaggtgaacctggattccagggaacaccaggatatggagagaaaggtgaacct
ggcaagccaggacctcggggaaaacctggaaaagatggtgaaaaaggagaaaaggggagc
ccgggctttcctggtgaatccggatacccaggtcttccaggccggcagggccctcaggga
gagaagggtgaacctggacttccaggcccccctggaactgtggtaggcaccaggccattg
ggagagaaaggagagcgaggctacccgggatccccagggttaagaggagagcccggtcct
aaaggttaccctggatcgccaggccaaccaggccctccaggtttcgctgttccaggccag
actggtgctccgggcttcccaggtgaaagaggagaaaaaggtgaaaggggatctccaggt
gtgtctttaccaggaccaagtggaagagatggagccccagggcctcctggccctcctggc
ccccctgggcaaccaggccacacgaacggcattgtggagtgtcaacctggaccacctggg
gaccagggtcctccaggaattccaggacagccaggtttgacaggcgaagttggacagaaa
ggtcagaaaggagagagctgccttgcctgtgacacagaaggacttcgaggccccccaggg
ccacaggggcctccaggagagattggtttccctggacagccaggggctaaaggtgaccga
ggcctgccaggcagagatggtcttgaaggattgccgggtccacaaggttcaccggggctc
ataggccagcctggagctaagggagagcccggcgagatattttttgacatgcgactcaaa
ggtgacaaaggagacccaggttttccaggacagccagggatgccaggaagagcaggaact
ccgggaagagacggccacccaggactccccggacccaaaggctccccgggttcaatagga
ttgaagggagagagaggtccccctggaggagttggctttcctggtagtcgtggagacatt
ggccctcctggacccccaggagtcggtcccatcggtcccattggtgaaaaaggacaagca
ggccttcctgggggccccgggtccccaggtcttccaggtccaaagggtgaagcaggaaag
gttgtcccactccccggcccccctggagctgcaggacttccaggatcccctgggttccca
gggccacaaggtgaccgaggcttcccaggaaccccaggacggccaggcaacccaggagag
aagggtgctgtgggccaaccaggaattggatttcccgggcttcctggccccaaaggtgtt
gatggcttgcctggagagataggacgaccggggagtccaggtcgccctggatttaatggc
ttacctggcaacccaggaccgcaaggccaaaagggagaacctggcattgggcttccagga
ctcaaagggcagccaggtcttccaggtatccctggcacacccggagagaaaggcagcatc
gggggacctggcgttccaggagaacaggggttgactggccctcccggactccaggggatc
agaggtgacccaggacctcctggggttcaaggccctgcaggtccaccaggggttccagga
ataggaccacccggagctatgggccctcccggagggcaaggaccaccaggttcatcaggt
ccacctggagttaaaggagagaagggattccctggattcccaggactggatatgcctggc
cccaaaggagataaaggctctcaaggacttcctggcctcacaggacagtcagggctccct
ggcctccctggacagcaggggactcctggagttccagggttcccaggttctaaaggtgaa
atgggcgtcatgggaaccccgggacaaccaggctcaccaggaccagcaggcgccccaggg
ttacccggagagaaaggagaccatggctttccgggctcctcgggacccaggggcgaccct
ggcttcaaaggtgataaaggtgatgtcgggcttcctggcatgccaggatccatggagcac
gtggacatgggcagcatgaaggggcagaaaggagaccagggagagaaaggacaaattgga
cccactggtgataaaggttcccgaggagaccccggaacaccaggagtgcccgggaaggat
gggcaggcagggcaccctggacagccaggacctaaaggtgacccaggccttagtggaata
ccaggatcccctggacttcctggacccaaaggatcggttggaggaatgggcttgccaggt
tcgcccggagagaaaggcgttccgggcatccctggctcacagggtgtccctggctcacct
ggagagaaaggagctaagggagagaaagggcagtcaggtctccctggcattggaattcct
ggacggcctggtgacaagggagatcaggggcttgcaggtttcccaggcagccctggtgag
aagggagagaaaggcagtgccggaaccccagggatgccagggtccccaggcccaagaggc
tctccagggaacatcggccatccaggaagcccaggcctgcctggagaaaaaggagacaaa
ggcctcccaggactggatggtgttcccggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaagggagaacctggcagtgatggaatcccgggg
tcggcaggagagaagggtgaacaaggtttgccaggaagaggcttcccaggcttccccggc
accaaaggagacaaaggttccaagggtgaagtgggtttccctggccttgccggaagtcct
gggattcctggagccaaaggcgagcaaggattcatgggtcctcccggccctcaaggacag
ccgggcttacctggcactcctggccaccctgtggaggggcccaaaggagaccgaggacct
cagggtcaacccggcctgccagggcatccgggacctatggggccaccagggttccctgga
atcgatgggccaaaaggtgacaagggaaatccaggttggccaggagctcctggggctcca
ggccctaagggagacccaggattccaaggcatgccgggcattggcggctctccagggatc
acaggttcaaagggagatatgggaccgcctggcgttccaggattccaaggtcagaaaggt
cttcctggcttgcagggagtgaagggagatcagggagatcaaggcatacccggccctaaa
ggtctccaaggtccccctgggcccccaggtccctacgatgtcatcaaaggagaaccaggg
ctcccaggtcctgagggtcctcctggtctcaaaggacttcaaggaccaccaggtccaaaa
ggacagcaaggtgtgacaggctccgtgggcttgcctggacctccaggtgtccctgggttt
gatggtgcccctggccagaaaggagagactggaccatttggaccacctggtccgagaggg
tttcctggtccaccaggccccgatgggctgccaggatccatgggtcccccgggtacccca
tctgtggaccatggcttccttgtgaccaggcatagtcagacaacagatgacccgctgtgt
cccccagggaccaaaattctttaccatgggtactctctgctctatgtccaaggaaacgag
cgggcccatggtcaggacttgggtacggctggcagctgtctgcgcaagttcagcaccatg
cccttcctcttctgcaacatcaacaacgtctgcaacttcgcctccaggaacgactactcc
tactggctgtccactccggagcccatgcccatgtccatggcacctatctctggggacaac
atccggcccttcattagcaggtgtgcagtttgcgaagcgcccgccatggtgatggcggtg
cacagtcagaccatccagatcccgcagtgccccaacggttggtcctcactgtggattggc
tattcctttgtgatgcacaccagcgctggtgctgaaggttctggccaagcccttgcatct
cccgggtcctgtctggaagagtttagaagtgccccattcatcgagtgccatggcagagga
acgtgcaattactacgcgaatgcttacagcttttggctcgccaccatagagagaagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctgcgcacgcatgtcagc
cgctgccaagtgtgcatgagaagaacatag

KEGG   Rattus norvegicus (rat): 29169
Entry
29169             CDS       T01003                                 

Gene name
Vtn
Definition
(RefSeq) vitronectin precursor
  KO
K06251  vitronectin
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04610  Complement and coagulation cascades
rno05165  Human papillomavirus infection
rno05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    29169 (Vtn)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    29169 (Vtn)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    29169 (Vtn)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    29169 (Vtn)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    29169 (Vtn)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    29169 (Vtn)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    29169 (Vtn)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    29169 (Vtn)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   29169 (Vtn)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   29169 (Vtn)
SSDB
Motif
Pfam: Hemopexin Somatomedin_B
Other DBs
NCBI-GeneID: 29169
NCBI-ProteinID: NP_062029
RGD: 3967
Ensembl: ENSRNOG00000010031
UniProt: Q3KR94
Position
10
AA seq 478 aa
MASLRPFFILALLALVSLADQESCKGRCTQGFMASKKCQCDELCTYYQSCCVDYMEQCKP
QVTRGDVFTMPEDEYWSYDYPEETKNSTSTGVQSENTSLHFNLKPRAEETIKPTTPDPQE
QSNTQEPEVGQQGVAPRPDTTDEGTSEFPEEELCSGKPFDAFTDLKNGSLFAFRGEYCYE
LDETAVRPGYPKLIQDVWGIEGPIDAAFTRINCQGKTYLFKGSQYWRFEDGVLDPDYPRN
ISEGFSGIPDNVDAALALPAHSYSGRERVYFFKGKQYWEYEFQQQPSQEECEGSSLSAVF
EHFALLQRDSWENIFELLFWGRSSDGAKGPQFISRDWHGVPGKVDAAMAGRIYITGSTFR
SVQAKKQKSGRRSRKRYRSRRGRGHSRSRSRSMSSRRPSRSVWFSLLSSEESGLGTYNYD
YDMNWRIPATCEPIQSVYFFSGDKYYRVNLRTRRVDSVNPPYPRSIAQYWLGCPTSEK
NT seq 1437 nt   +upstreamnt  +downstreamnt
atggcatctctgaggccctttttcatactagccctgctggcattggtttctctggctgac
caagagtcatgcaagggccgctgcactcagggtttcatggccagcaagaagtgtcaatgt
gatgagctttgcacctactatcagagctgctgtgttgactacatggagcagtgcaagccc
caagtgacccggggggatgtgtttaccatgccagaagatgagtattggagctatgactac
ccggaggaaaccaagaacagtaccagcactggcgtgcaatccgagaacacctctctacac
ttcaacctaaagcctcgagccgaagagactataaagccaacaactcctgatcctcaggaa
cagtcaaacacccaagaacctgaagtggggcaacagggagtggccccaaggcctgacacc
actgatgaaggaacctctgagtttccagaagaagagctgtgtagtggaaagccctttgat
gccttcacagacctcaagaatggttccctttttgccttccgaggtgagtattgctatgag
ctagatgaaacagcagtgaggcctgggtaccccaaactcatccaagatgtctggggcatc
gagggccccatcgacgctgccttcactcgcatcaactgtcaggggaagacctacttgttc
aagggtagtcagtactggcgctttgaggatggtgtcctggaccctgattatccccgaaac
atctctgaaggcttcagtggcataccagacaacgttgacgcagccttagcccttcctgcc
cacagttacagtggccgggaaagggtctacttcttcaagggaaaacagtactgggagtat
gaatttcagcagcagcccagccaggaggaatgtgaaggaagttctctgtcagccgtgttt
gagcactttgccttgcttcagcgggacagctgggagaacatttttgaactcctcttctgg
ggcagatcctctgatggcgccaaaggaccccaattcatcagcagggattggcatggtgta
ccagggaaagtggacgctgctatggccggccgcatctacatcactggctccacattccgc
tctgtccaggccaaaaaacagaagtctgggcgtcgaagtcgaaaacgctaccgttcacgc
cgaggacgtggccatagccgcagccgcagtcgcagcatgagctctcgtcgcccatcgcgt
tcagtctggttctctttgctctccagcgaagagagtgggctaggaacctacaattacgat
tatgatatgaactggcgcatacctgccacctgcgagcccattcagagcgtctatttcttc
tctggagacaagtactaccgagttaaccttagaacacggcgagtggactctgtgaatcct
ccctacccacgctccattgctcagtattggctgggctgcccaacctctgagaagtag

KEGG   Rattus norvegicus (rat): 29195
Entry
29195             CDS       T01003                                 

Gene name
Chad
Definition
(RefSeq) chondroadherin precursor
  KO
K06248  chondroadherin
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    29195 (Chad)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    29195 (Chad)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    29195 (Chad)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    29195 (Chad)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:rno00535]
    29195 (Chad)
Proteoglycans [BR:rno00535]
 Extracellular matrix (ECM) proteoglycans
  Small leucine-rich proteoglycan (SLRP) family
   class IV
    29195 (Chad)
SSDB
Motif
Pfam: LRR_8 LRR_4 LRR_5 LRR_1 LRRNT LRR_9 LRRCT A-2_8-polyST
Other DBs
NCBI-GeneID: 29195
NCBI-ProteinID: NP_062037
RGD: 2336
Ensembl: ENSRNOG00000003304
UniProt: O70210
Position
10
AA seq 358 aa
MARVLLLSLVFLAILLPALAACPQNCHCHGDLQHVICDKVGLQKIPKVSETTKLLNLQRN
NFPVLAANSFRTVPNLVSLHLQHCNIREVAAGAFRGLKQLIYLYLSHNDIRVLRAGAFDD
LTELTYLYLDHNKVSELPRGLLSPLVNLFILQLNNNKIRELRAGAFQGAKDLRWLYLSEN
ALTSLHPGSLDDVENLAKFHLDRNQLSSYPSAALSKLRVVEELKLSHNPLKSIPDNAFQS
FGRYLETLWLDNTNLEKFSDAAFAGVTTLKHVHLENNRLNQLPSTFPFDNLETLTLTNNP
WKCTCQLRGLRRWLEAKTSRPDATCSSPAKFKGQRIRDTDALRSCKSPTKRSKKAGRH
NT seq 1077 nt   +upstreamnt  +downstreamnt
atggcccgagtactcttactcagtctggtcttccttgctatcctcctgcctgcgctggcc
gcctgcccccaaaactgtcactgccacggtgatctgcagcatgtcatctgtgacaaggtg
gggctgcagaagatccccaaggtatcagagactaccaaactgctcaacctgcagcgcaac
aacttcccggtgctggctgccaactcgttccggaccgtgccgaacctggtctcgctgcac
ctgcagcactgtaacatccgcgaggtggccgctggcgccttccgaggcctgaagcagctc
atctacctgtacctgtcccacaatgacatccgggtgctgcgagctggagccttcgatgac
ctgactgaactcacttacctctatctagaccacaacaaggtgtcagaactgccccggggg
ctgctctcccctctggtcaacctcttcatcttgcagctcaacaacaacaaaatccgagag
cttcgtgcaggagccttccagggtgccaaggacctgcgctggctctacctgtcagaaaat
gccctcacttccctgcaccctgggtccctggatgatgtggagaacctagccaagtttcac
ctggacaggaaccaactgtctagctacccctcagctgccctgagcaaacttcgggtggtg
gaggagctgaagctgtcccacaaccctctgaagagcatccccgacaatgccttccagtcc
tttggcagatacctggagaccctctggctggataacaccaacctggagaagttctcagat
gctgccttcgcgggcgtgaccacactgaaacatgtccatctggagaacaatcgcctgaac
caattgccctccaccttcccctttgacaacctggagaccctcactctcaccaacaaccca
tggaaatgcacatgccagctccgtggccttcgacggtggttggaagccaagacttctcga
ccagatgctacctgctcttcaccagccaagttcaagggtcagcgtattcgtgacacggat
gccctccgcagctgcaaatcccccaccaagaggtccaagaaagctggccgccattaa

KEGG   Rattus norvegicus (rat): 29220
Entry
29220             CDS       T01003                                 

Gene name
Thbs4
Definition
(RefSeq) thrombospondin-4 precursor
  KO
K04659  thrombospondin 2/3/4/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04145  Phagosome
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05144  Malaria
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    29220 (Thbs4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    29220 (Thbs4)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    29220 (Thbs4)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    29220 (Thbs4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    29220 (Thbs4)
  09174 Infectious disease: parasitic
   05144 Malaria
    29220 (Thbs4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    29220 (Thbs4)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    29220 (Thbs4)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    29220 (Thbs4)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    29220 (Thbs4)
Membrane trafficking [BR:rno04131]
 Endocytosis
  Phagocytosis
   Opsonins
    29220 (Thbs4)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   29220 (Thbs4)
  Exosomal proteins of colorectal cancer cells
   29220 (Thbs4)
  Exosomal proteins of bladder cancer cells
   29220 (Thbs4)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   29220 (Thbs4)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   29220 (Thbs4)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA EGF_3 EGF cEGF EGF_MSP1_1
Other DBs
NCBI-GeneID: 29220
NCBI-ProteinID: NP_058829
RGD: 62046
Ensembl: ENSRNOG00000012471
UniProt: A0A0G2K7L8
Position
2
AA seq 961 aa
MLAPRAAAFLLLHLVLQPWQRAGAQATPQVFDLLPSSSQRLNPAALQPVLTDPTLHELYV
ISTFKLQSKSSATIFGLYSSSDNSKYFEFTVMGRLNKAILRYLKDDGKIHLVVFNNLQLA
DGRRHRILLRLSNLQRGAGSVELYLDCVQVDSVNNLPRAFSGLTQNPQAIELRTFQRKPQ
DFLEELKLVVRGSLFQVASLQDCFLQQSEPLAATGTGDFNRQFLGQMTQLNQLLGEVKDL
LRQQVKETSFLRNTIAECQACGPLSFQSPTPNTLVPIAPPAPPTRPTRRCDSSPCFRGVR
CTDTRDGFQCGPCPDGYTGNGITCSDVDECKYHPCYPGVRCTNLAPGFRCDACPVGFTGP
MVQGVGINFAKTNKQVCTDVDECRNGACVLNSICINTLGSYRCGPCKPGYTGDQTRGCRT
ERSCRNPEQNPCSVHAQCIEERQGDVTCVCGVGWAGDGYVCGKDVDIDSYPDEELPCSAR
NCKKDNCKYVPNSGQEDADRDGIGDACDEDADGDGILNEQDNCVLTHNVDQRNTDKDIFG
DACDNCRGVLNNDQKDTDGDGKGDACDDDMDGDGIKNILDNCPRVPNRDQQDRDGDGVGD
ACDSCPDVSNPNQSDVDNDLVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIG
DECDDDDDNDGIPDLVPPGPDNCRLVPNPAQEDSNNDGVGDICEADFDQDKVIDRIDVCP
ENAEITLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVD
FEGTFHVNTQTDDDYAGFIFGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLKAVK
SKTGPGEHLRNSLWHTGDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYE
GSELVADSGVTIDTTMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQIQTFDRLD
N
NT seq 2886 nt   +upstreamnt  +downstreamnt
atgctagccccacgcgcggccgccttcctcctactgcacctggtcctgcagccgtggcag
cgggcgggcgcccaggccacaccccaggtctttgaccttctgccatcctccagccagagg
ctgaacccggccgccctgcagccagtcctgacagaccccaccctgcacgagctctatgtc
atctccaccttcaagctgcagagtaagagctcagccaccatctttggcctctactcctcg
agtgacaacagcaaatactttgaattcactgtgatgggacgcctgaacaaagccatcctc
cgttacctaaaggatgatgggaagattcatctggtggttttcaacaacctgcagctggcg
gatgggaggcgccaccggattctcctgagactcagcaatttgcagagaggggccggctcc
gtagaactctacttggactgtgtccaggttgattcagtgaacaacctccccagagccttc
tctggtctcacccagaacccacaggccattgaattgagaacattccagaggaagccccag
gacttcttggaagagctgaagctggtggtgaggggctctctgttccaggtggccagccta
caagactgtttcctgcagcagagtgagccgctggctgccacaggcacaggagactttaat
cgacagtttttgggtcaaatgacacaattaaaccagctccttggagaggtgaaagatctt
ctgaggcagcaggtcaaggaaacatcatttttgcgaaacactattgctgagtgccaggct
tgtggccctctcagcttccagtccccaaccccaaacacgctggtgcccatagctcctcca
gcacccccaacccgaccaacacgtcgctgtgactccagcccttgtttccgaggtgtccga
tgcacggacaccagggacggcttccaatgtggaccctgcccagatggctacacggggaac
gggatcacctgttctgacgtggatgagtgtaaataccatccctgctaccctggtgtgcgc
tgcacaaatttggctccgggtttcagatgtgatgcctgtccagtaggcttcacggggccc
atggtgcagggcgtggggatcaactttgccaaaacaaacaagcaggtgtgcactgatgtt
gacgagtgtcggaacggagcgtgtgttctgaactccatctgcattaacactctgggctct
taccgctgtgggccttgcaagccagggtacactggtgaccagacgaggggatgcaggacc
gaaaggagctgccggaacccagagcagaacccgtgcagtgtgcatgcgcagtgcattgag
gaacggcaaggggatgtgacgtgtgtgtgcggtgttggttgggccggcgacggctatgtc
tgtggaaaggatgtggacatcgacagttaccccgatgaagaactgccgtgctcagccagg
aactgcaagaaggataactgcaagtatgtgccaaactccggccaagaagacgcagacaga
gatggcattggagacgcctgtgatgaggatgcggatggagacgggatcttgaatgagcag
gacaactgtgtcctgactcacaatgtggaccagaggaacactgataaagatatctttggg
gatgcctgtgacaactgccggggcgtcctgaataatgaccagaaggacactgacggggat
gggaaaggagacgcctgtgatgacgacatggatggagacggaataaaaaacattttggac
aactgtccccgagttcccaaccgcgaccaacaggatcgggatggcgatggtgtcggagac
gcctgtgacagctgtcctgatgtcagtaaccctaaccagtcggatgtggataacgatctg
gttggggactcctgtgacaccaaccaagacagtgatggggatggccaccaggacagcaca
gacaactgccccaccgtcattaacagcgcccagctggacaccgacaaggacgggattggc
gatgaatgcgatgacgatgacgataatgacggcatcccagacctggttccccctggccca
gacaactgcaggcttgttcccaacccagcccaggaagatagcaacaatgatggagtagga
gatatctgtgaagctgacttcgaccaggacaaggtcattgatcgcattgatgtgtgtcct
gagaatgcggagatcaccctgactgacttcagggcctatcagactgtggtcctggaccct
gaaggggatgcccagatcgaccccaactgggtggtcctgaaccagggcatggagattgtg
cagaccatgaacagtgaccctggcctagcagtcgggtacacggcatttaatggagttgac
tttgaagggaccttccacgtaaacacccagacagatgacgactatgccggctttattttt
ggttaccaagacagttccagcttctatgtggtcatgtggaaacagacagagcagacatac
tggcaggctacgccattccgtgcagtagcagaacctggtattcagctcaaggccgtgaag
tctaaaactggcccaggggaacatctccgaaactccctgtggcacacaggggacaccagc
gaccaggtcaggctgctgtggaaggactccaggaatgtgggctggaaggacaaggtgtcc
taccgctggttcctgcagcataggccccaggtgggctacatcagggtgcggttttatgaa
ggctcagagttggtggctgactccggggtcactatagacaccaccatgcgcggaggccga
ctgggcgtgttctgcttctctcaggaaaacatcatctggtccaacctcaagtaccgctgt
aacgacaccataccagaggacttccaggagttccagatccagacctttgaccgcctggat
aattaa

KEGG   Rattus norvegicus (rat): 292406
Entry
292406            CDS       T01003                                 

Gene name
Thbs2
Definition
(RefSeq) thrombospondin-2 precursor
  KO
K04659  thrombospondin 2/3/4/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04145  Phagosome
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05144  Malaria
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    292406 (Thbs2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    292406 (Thbs2)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    292406 (Thbs2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    292406 (Thbs2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    292406 (Thbs2)
  09174 Infectious disease: parasitic
   05144 Malaria
    292406 (Thbs2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    292406 (Thbs2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    292406 (Thbs2)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    292406 (Thbs2)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    292406 (Thbs2)
Membrane trafficking [BR:rno04131]
 Endocytosis
  Phagocytosis
   Opsonins
    292406 (Thbs2)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   292406 (Thbs2)
  Exosomal proteins of colorectal cancer cells
   292406 (Thbs2)
  Exosomal proteins of bladder cancer cells
   292406 (Thbs2)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   292406 (Thbs2)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   292406 (Thbs2)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 TSP1_ADAMTS EGF_CA Laminin_G_3 EGF cEGF TSP1_CCN Laminin_G_2
Other DBs
NCBI-GeneID: 292406
NCBI-ProteinID: NP_001162609
RGD: 1310979
Ensembl: ENSRNOG00000010529
UniProt: D4A2G6
Position
1
AA seq 1172 aa
MLWALALLALGIGPRAYAGDHGEDTAFDLFSISNINRKTIGAKQFRGPDPGVPAYRFVRF
DYVPPVNTDDLNRIVKLARRKEGFFLTAQLKQDRKSRGTLLVLEGPGTSQRQFEIVSNGP
GDTLDLNYWVEGHQHTNFLEDVGLADSQWKNVTVQVASDTYSLYVGCDLIDSVTLEEPFY
EQLEADKSRMYVAKGASRESHFRGLLQNVHLVFADSVEDILSKKGCQHSQGAEVNTISEH
TETLHLSPHITTDLVVQGVEKAQEVCTHSCEELSNMMNELSGLHVMVNQLSKNLERVSSD
NQFLLELIGGPLKTRNMSACVQEGRIFAENETWVVDSCTTCTCKKFKTVCNQITCSPATC
ANPSLVEGECCPSCSHSADNDEGWSPWAEWTECSVTCGSGTQQRGRSCDVTSNTCLGPSI
QTRTCSLGKCDTRIRQNGGWSHWSPWSSCSVTCGVGNVTRIRLCNSPVPQMGGKNCKGSG
RETKACQRAPCPIDGRWSPWSPWSACTVTCAGGIRERTRVCNSPEPQYGGKDCVGDVTEH
QMCNKRSCPIDGCLSNPCFPGAKCNSFPDGSWSCGSCPVGFLGNGTHCEDLDECAVVADI
CFSINKASRCVNTNPGFHCLPCPPRYKGTQPFGIGLEDAKTEKQVCEPENPCKDKTHNCH
KHAECIYLGHFSDPMYKCECQTGYAGDGLICGEDSDLDGWPNSNLVCATNATYHCVKDNC
PKLPNSGQEDFDKDGIGDACDEDDDNDGVSDEKDNCPLLFNPRQLDYDKDEVGDRCDNCP
YVHNQAQIDTDNNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPL
MHNPDQMDQDNDLVGDQCDNNEDIDDDGHQNNQDNCPYISNSNQADHDNDGKGDACDSDD
DNDGVPDDRDNCRLVFNPDQEDSDGDGRGDICKDDFDNDNVPDIDDVCPENNAITETDFR
NFQMVPLDPKGTTQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDR
DDDYAGFVFGYQSSSRFYVVMWKQVTQTYWEDKPSRAYGYSGVSLKVVNSTTGTGEHLRN
ALWHTGNTEGQVRTLWHDPKNIGWKDYTAYRWHLIHRPKTGYMRVLVHEGKQVMADSGPI
YDQTYAGGRLGLFVFSQEMVYFSDLKYECRDA
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgctctgggcactggccctgctggctctgggcatagggccaagagcttacgctggtgac
cacggcgaggacacagcatttgaccttttcagcatcagcaacattaaccggaagaccatc
ggtgccaagcagttccgagggcccgaccccggtgtgcccgcctaccgttttgtccggttt
gactacgtccccccagtgaatacagacgacctcaacaggatcgttaagcttgcaaggaga
aaggagggcttcttcctcacagcccagctgaagcaggaccgcaagtctcggggaacactc
ctggtgttggaaggtcccggcacctcccagaggcagtttgagattgtgtccaatggccca
ggggataccttggacctcaactactgggtagaaggccaccagcataccaacttcctggag
gatgtgggtctggctgactcccagtggaagaatgtgactgtgcaggtggccagtgacacc
tatagcctgtatgtgggctgtgatcttattgacagtgtcaccctggaagaacccttctat
gagcagctggaagcagacaagagcaggatgtacgtggccaagggtgcatctcgagagagt
cacttcaggggcttgctgcagaatgtccatctcgtgtttgcagattctgtggaagatatt
ctaagcaagaaaggctgtcaacacagccagggagctgaagtcaacaccatcagtgaacat
acagagacactccacctgagccctcacatcaccacagatctggtggtccagggtgtggag
aaggcacaggaggtgtgtacgcactcctgtgaagagctgagcaacatgatgaacgagctg
tccggattgcacgtcatggtgaaccagctgagcaagaacctggagagagtgtctagcgat
aaccagttccttttggagctcattgggggccctctgaagacaagaaacatgtcggcctgt
gtgcaggagggacggatctttgcagaaaacgaaacctgggttgtggatagttgtaccacc
tgcacctgcaagaaatttaaaacagtctgcaatcaaatcacctgctcacctgcaacttgt
gccaacccatctttggtggaaggcgagtgctgtccatcctgttcccactctgcagacaac
gatgagggctggtctccgtgggcagagtggaccgagtgttctgtcacctgtggctctggg
acccagcagagaggccggtcttgtgatgtcaccagcaacacctgcctgggcccctccatt
cagacaaggacatgcagtttgggcaaatgtgatacgagaatccgtcagaatggaggctgg
agtcactggtcaccctggtcttcgtgctccgtgacttgtggagttggcaatgtcacccgc
atacgtctctgcaactcaccagtgccccaaatgggtggcaagaactgcaagggcagtggc
cgggaaaccaaggcctgccagcgtgctccatgcccaattgatggccgctggagcccctgg
tccccttggtcagcctgcacagttacctgtgctggagggatccgtgagcgcacacgtgtt
tgcaacagccctgagccccagtatggagggaaggattgtgtcggggatgtgacagaacac
caaatgtgcaacaagaggagctgccctattgatggatgcctatccaacccgtgtttccct
ggagccaagtgcaacagcttccccgatggctcctggtcctgtggttcctgcccagtgggc
tttctgggcaatggcacccactgtgaggacctggatgagtgtgctgtggtcgcagatatc
tgcttctcaattaacaaagcttcccgctgtgtcaacaccaaccctggcttccactgcctg
ccttgtccgccacgctacaaggggacccaacccttcgggattggcctggaggatgctaaa
acagaaaaacaagtatgtgagccagagaatccgtgcaaggacaagactcacaactgccac
aagcatgcagagtgcatctacctgggccacttcagtgaccccatgtacaagtgcgagtgc
cagactggctatgcaggtgatgggctcatctgcggggaggactcagacctggacggttgg
cccaacagcaacctggtgtgtgctaccaatgccacctaccactgcgtcaaggacaactgc
cccaaactgccaaattctgggcaggaggattttgataaggacggaatcggagatgcttgc
gacgaggacgatgataacgacggcgtgagcgatgagaaggacaattgcccacttctcttc
aatccccgccaattagactatgataaggatgaggttggagaccgctgtgacaattgcccc
tatgtgcacaaccaagcacagatcgacacggacaacaacggtgaaggggacgcctgctct
gtggatatcgacggggatgatgttttcaatgagcgagacaattgcccgtatgtctacaac
actgaccagagagacacagacggtgacggcgttggggaccactgtgacaactgtcctctg
atgcataacccagatcagatggatcaggacaatgatctcgttggagaccagtgtgataac
aatgaggacatagatgatgacggccaccagaacaaccaagacaactgcccatacatctcc
aactccaaccaggctgaccatgacaatgacggcaagggggatgcctgtgactccgatgat
gacaacgatggtgttccagatgacagggacaactgcaggcttgtgttcaaccccgaccag
gaagactcggatggcgacggccgaggtgatatttgtaaagatgactttgacaatgataat
gtcccagatattgatgacgtgtgccccgagaacaatgccatcaccgagacagacttcaga
aactttcagatggtccctctggatcccaaggggaccacacaaattgatcccaactgggta
attcgtcaccaaggcaaagagctggtgcagacagcaaactcggaccctggcatcgccgta
ggtttcgatgagtttgggtctgtggacttcagcggcactttctatgtcaacactgaccgg
gacgatgactacgccggcttcgtctttggctatcagtcaagcagccgcttctatgtggtg
atgtggaagcaggtgacccagacctactgggaagacaagcccagtcgggcttacggctac
tctggtgtgtcactcaaagtagtaaactccacgactggtactggcgagcacctgaggaat
gccctgtggcacacgggaaacacggaaggccaggttcgaactctatggcatgaccccaaa
aacattggctggaaggactacaccgcctacaggtggcatctgattcacaggcctaagaca
ggctacatgagagtcttagtgcatgaaggaaagcaggtcatggctgactcaggaccaatt
tatgaccaaacttatgctggtggacggctgggtctgtttgtcttctcccaagaaatggtc
tacttctcagacctcaagtatgagtgcagagatgcctag

KEGG   Rattus norvegicus (rat): 29393
Entry
29393             CDS       T01003                                 

Gene name
Col1a1
Definition
(RefSeq) collagen alpha-1(I) chain precursor
  KO
K06236  collagen type I alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04611  Platelet activation
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05205  Proteoglycans in cancer
rno05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    29393 (Col1a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    29393 (Col1a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    29393 (Col1a1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    29393 (Col1a1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    29393 (Col1a1)
  09154 Digestive system
   04974 Protein digestion and absorption
    29393 (Col1a1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    29393 (Col1a1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    29393 (Col1a1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    29393 (Col1a1)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    29393 (Col1a1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    29393 (Col1a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    29393 (Col1a1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   29393 (Col1a1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 29393
NCBI-ProteinID: NP_445756
RGD: 61817
Ensembl: ENSRNOG00000003897
UniProt: P02454
Structure
PDB: 
3HQV 3HR2

Position
10
AA seq 1453 aa
MFSFVDLRLLLLLGATALLTHGQEDIPEVSCIHNGLRVPNGETWKPDVCLICICHNGTAV
CDGVLCKEDLDCPNPQKREGECCPFCPEEYVSPDAEVIGVEGPKGDPGPQGPRGPVGPPG
QDGIPGQPGLPGPPGPPGPPGPPGLGGNFASQMSYGYDEKSAGVSVPGPMGPSGPRGLPG
PPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQG
ARGLPGTAGLPGMKGHRGFSGLDGAKGDTGPAGPKGEPGSPGENGAPGQMGPRGLPGERG
RPGPPGSAGARGNDGAVGAAGPPGPTGPTGPPGFPGAAGAKGEAGPQGARGSEGPQGVRG
EPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPSGAPGPKG
NSGEPGAPGNKGDTGAKGEPGPAGVQGPPGPAGEEGKRGARGEPGPSGLPGPPGERGGPG
SRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDG
KTGPPGPAGQDGRPGPAGPPGARGQAGVMGFPGPKGTAGEPGKAGERGVPGPPGAVGPAG
KDGEAGAQGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPG
PSGARGERGFPGERGVQGPPGPAGPRGNNGAPGNDGAKGDTGAPGAPGSQGAPGLQGMPG
ERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGETGPSGPAG
PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDTGVKGDAGPPGPAGPAGPPG
PIGNVGAPGPKGSRGAAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRG
ETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERG
FPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGSPGAEGSPGRDGAPG
AKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAGPQG
PRGDKGETGEQGDRGIKGHRGFSGLQGPPGSPGSPGEQGPSGASGPAGPRGPPGSAGSPG
KDGLNGLPGPIGPPGPRGRTGDSGPAGPPGPPGPPGPPGPPSGGYDFSFLPQPPQEKSQD
GGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKS
GEYWIDPNQGCNLDAIKVYCNMETGQTCVFPTQPSVPQKNWYISPNPKEKKHVWFGESMT
DGFQFEYGSEGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKSLLLQ
GSNEIELRGEGNSRFTYSTLVDGCTSHTGTWGKTVIEYKTTKTSRLPIIDVAPLDIGAPD
QEFGMDIGPACFV
NT seq 4362 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttaggggccactgccctcctgacg
catggccaagaagacatccctgaagtcagctgcatacacaatggcctaagggtccctaat
ggtgagacgtggaaacctgatgtatgcttgatctgtatctgccacaatggcacggctgtg
tgcgatggcgtgctatgcaaagaagacttggactgtcccaacccccaaaaacgggagggc
gagtgctgtcctttctgcccagaagaatatgtatcaccagacgcagaagtcataggagtc
gagggacccaagggagaccctggcccccaaggcccacggggacctgttggcccccctgga
caagatggcatccctggacagcctggacttcctggtcctcctggtccccccggccccccc
ggaccccctggtcttggaggaaactttgcttcccagatgtcctatggctatgatgagaaa
tcagctggagtttccgtgcctggccccatgggtccttctggtcctcgtggtctccctggc
ccccctggtgcacctggtcctcaaggtttccaaggcccccctggtgaacctggcgagcct
ggcgcttcaggtccaatgggtccccgaggtccccctggccctcctggcaagaacggagat
gatggggaagctggtaagcctggccgccctggtgagcgtggacctcctggacctcagggt
gctcgtggattgcctggaacagctggcctccccggaatgaagggacacagaggtttcagt
ggtttggatggtgccaaaggagatactggtcctgctggtcctaagggagagcctggcagt
cctggtgaaaatggagctcccggccagatgggtccccgaggtctgcctggtgagagaggt
cgccctggaccccctggcagtgctggtgctcgtggtaacgatggtgctgtcggtgcagct
gggccccccggtcccaccggccctactggccctcctggcttccctggtgcagctggtgct
aagggtgaagctggtccccagggagcccgaggctctgaaggtccccagggtgtgcgtggt
gagcccggaccccctggccctgctggtgctgctggtcctgctggaaaccctggtgctgat
ggacaacctggtgctaaaggtgccaatggtgctcctggtattgctggtgctcctggcttc
cctggtgcccgaggcccctctggacctcagggccccagcggcgctcctggtcccaaaggc
aacagtggtgaacctggtgcccctggcaacaaaggagacactggtgccaaaggagaaccc
ggccctgctggagttcaaggtccccctggccctgctggagaagaaggaaaacgaggagcc
cgtggtgagcctggacctagcggcctgcccggacctcctggcgagcgcggtggacctggt
agccgtggtttccctggtgctgatggtgttgctggccccaagggtcctgctggtgaacgt
ggttctcctggccctgctggtcccaaaggttctcctggtgaagctggtcgccctggtgaa
gctggtctccctggtgccaagggtctcactggcagtcctggcagccctggtcctgatggc
aaaaccggcccccctggtcccgctggtcaagatggtcgccctggacccgcaggtcctcct
ggagcccgtggacaggctggtgtgatgggattccctggacctaagggtactgctggagaa
cctggaaaggctggagaacgaggtgtccccggaccccctggcgctgttggtcccgctggc
aaagatggcgaagctggagctcaaggagcccccggccctgctggtcctgctggtgagaga
ggtgaacaaggtcccgctggctcccctggattccagggtcttcctggtcccgctggtcct
cctggtgaagcaggcaagcctggtgaacagggtgttcctggagaccttggtgcccctgga
ccctctggcgcaagaggcgagagaggtttccctggtgaacgtggtgtacaaggtccccca
ggtcctgctggtccccgaggaaacaatggtgcccccggcaacgatggtgccaagggtgat
actggtgcccccggagctcctggtagccagggtgcccccggtcttcagggaatgcctggt
gaacgtggtgcagctggtcttccaggtcctaagggtgacagaggtgatgctggtcccaaa
ggtgctgatggttctcctggcaaagatggcgtccgtggtctgactggtcccattggtcct
cctggccctgctggtgcccctggtgacaagggtgaaactggtcccagtggtcctgctggc
cccaccggagcccgtggtgcccccggagaccgtggtgagcctggtccccctggtcctgct
ggcttcgctggcccccctggtgctgatggccaacctggtgcgaaaggtgaacctggtgat
actggtgtgaaaggtgacgctggtcctcctggccctgctggtcccgctggaccccctggc
cccattggtaacgttggtgctcctggacccaaaggttctcgtggtgctgctggtccccct
ggtgctactggtttccctggtgctgctggtcgtgttggtccccctggtccctctggaaat
gctggaccccctggccctcccggtcccgttggcaaagaagggggcaaaggtccccgtggt
gagactggtcccgctggacgtcctggtgaagttggtcccccaggtccccctggccctgct
ggtgagaaaggatctcctggtgctgatggacctgctggctctcctggtacccctggacct
cagggtattgctggacagcgtggtgtggtcggtcttcccggtcagagaggagaaagaggc
ttccctggtcttcctggaccctctggtgaacccggcaaacaaggtccttctggagcaagt
ggtgaacgtggtccccctggccctatgggcccccctggattggctggcccccctggtgaa
tctggtcgtgagggatcccctggtgctgaaggctcccctggaagagatggtgctcctggt
gccaagggtgaccgtggtgagactggccctgctggcccccctggtgctcctggtgctcct
ggtgctcccggccctgttggtcctgctggcaagaatggcgaccgtggtgagactggtcct
gctggtcctgctggtcccattggccctgctggtgcccgtggtcctgctggaccccaaggc
ccccgtggtgacaagggtgagacaggcgaacaaggtgacagaggcataaagggtcatcgt
ggcttctctggtctccagggtcctcctggctctcctggctctcctggtgaacaaggcccc
tctggagcttctggtcctgcaggtccccggggtccccctggctctgctggttctcctggc
aaagatggactcaacggtctccctggccccattggtccccctggtcctcgaggtcgcact
ggcgatagtggtcctgctggtccccccggacctcctggaccccctggccctcccggtcct
cccagcggtggttatgacttcagcttcctgcctcagccacctcaagagaagtctcaagat
ggtggccgttactaccgggccgatgatgccaacgtggtccgtgaccgtgaccttgaggtg
gacactaccctcaagagcctgagccagcagattgagaacatccgcagccctgagggcagc
cgcaagaaccccgcccgcacatgccgtgacctcaagatgtgccactctgactggaagagc
ggagagtactggatcgaccctaaccaaggctgcaacctggatgccatcaaggtctactgc
aacatggagacaggtcagacctgtgtgttccccactcagccctctgtgcctcagaagaac
tggtacatcagcccaaaccccaaggagaagaagcatgtctggtttggagagagcatgacc
gatggattccagttcgagtatggaagcgaaggttccgatcctgccgatgtcgctatccag
ctgaccttcctgcgcctgatgtccaccgaggcctcccagaacatcacctatcactgcaag
aacagcgtagcctacatggaccaacagactggcaacctcaagaagtccctgctcctccag
ggctccaacgagatcgagctcaggggcgaaggcaacagtcgattcacctacagcacgctt
gtggatggctgcacgagtcacaccggaacttggggcaagacagtcatcgaatacaaaacc
accaagacctcccgcctgcccatcatcgatgtggctcccttggacattggtgccccagac
caggaattcggaatggacattggccctgcctgcttcgtgtaa

KEGG   Rattus norvegicus (rat): 294337
Entry
294337            CDS       T01003                                 

Gene name
Col6a1
Definition
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    294337 (Col6a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    294337 (Col6a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    294337 (Col6a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    294337 (Col6a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    294337 (Col6a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    294337 (Col6a1)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    294337 (Col6a1)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   294337 (Col6a1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   294337 (Col6a1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   294337 (Col6a1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 294337
NCBI-ProteinID: XP_215375
RGD: 1565398
Ensembl: ENSRNOG00000001249
Position
20
AA seq 1034 aa
MVTRRPAGNMRLAHTLLPLLLQACWVAAQDIQGSRAIAFQDCPVDLFFVLDTSESVALRL
KPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLMRMPSGRDE
LKASIDAVKYFGKGTYTDCAIKKGLEELLIGGSHLKENKYLIVVTDGHPLEGYKEPCGGL
EDAVNEAKHLGIKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGHSRDAEETISQTI
DTIVDMIKNNVEQVCCTFECQAARGPPGPRGDPGYEGERGKPGLPGEKGEAGDPGRPGDL
GPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGMKGETGYPGLPGCKGSPGFD
GIQGPPGPKGDAGAFGLKGEKGEAGAEGEAGRPGNSGPPGDEGEPGEPGPPGEKGEAGDE
GNAGPDGAPGERGGPGERGPRGTPGVRGPRGDPGEAGPQGDQGREGPVGIPGDPGESGPI
GPKGYRGDEGPPGPEGLRGAPGPVGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRG
PPGINGTKGYPGLKGDEGEAGDPGEDNNDVSPRGVKGAKGYRGPEGPQGPPGHVGPPGPD
ECEILDIIMKMCSCCECTCGPIDILFVLDSSESIGLQNFEIAKDFIIKVIDRLSKDELVK
FEPGQSHAGVVQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRD
RLLPPTQNNRIALVITDGRSDTQRDTTPLSVLCGSDIQVVSVGIKDVFGFVAGSDQLNVI
SCQGLSQSRPGISLVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPTDITILL
DSSASVGSHNFETTKVFAKRLAERFLSAGREDPTQVVRVAVVQYSGQGQQQPGRASLQFQ
QNYTVLASSVDSMDFINDATDVNDALSYVTRFYRENSSGATKKRVLLFSDGNSQGATAEA
IEKAVQEAQRGGIEIFVMVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFRVPNYQALLR
GVLYQTVSRKVALG
NT seq 3105 nt   +upstreamnt  +downstreamnt
atggtgactcgaaggccagccggaaacatgaggctggcccacactctgctgcccctgctg
ctacaggcctgctgggtggccgcacaggacatccagggctctagagcgattgccttccaa
gactgccctgtggatctgttcttcgtgctggacacctcggagagtgtggccttgaggctg
aaaccttatggggccctggtggacaaggtgaaatcctttaccaagcgtttcatcgacaac
ctgagagacaggtactaccgatgtgaccgcaacctggtttggaatgcgggcgcgctgcac
tacagtgacgaggtggagatcatcagagggctcatgcgcatgcccagtggccgtgacgag
ctcaaagccagcatagatgcggtcaagtacttcgggaaaggcacctacaccgactgtgcc
attaagaaggggctggaggagctgctcattgggggctcccacctaaaagagaacaagtac
ttgatcgtggtgactgacgggcatcccctcgagggctacaaggaaccatgtgggggtctg
gaagatgcagtgaacgaggccaaacacttgggcatcaaggtcttttctgtggccatcaca
cctgatcacctggagccacgtctaagcatcatcgccacagaccacacataccggcgcaat
ttcacggctgctgactgggggcatagtcgagatgcggaagagaccatcagccagaccatt
gacaccattgtggacatgattaaaaataatgtggagcaagtgtgttgtacttttgagtgc
caggctgccagaggacctccagggccccggggcgaccctgggtatgagggggaacgagga
aagccgggtcttcccggagagaagggagaagctggagaccctggacgacctggggatctt
ggaccagtcgggtaccagggtatgaagggagaaaagggaagccgtggagagaagggttcc
agaggcccgaaaggctacaagggcgagaaaggcaagcgcggcatcgatggggtggacggc
atgaagggagagacgggatacccaggactaccaggctgcaagggttcgccaggatttgat
ggcattcaaggacccccaggtcctaagggcgatgctggtgcctttgggctgaaaggagaa
aagggtgaagctggagcagaaggtgaggctgggagacctgggaactcagggccacccgga
gatgagggtgaaccgggagaacctggtccccccggagaaaaaggagaggccggtgatgag
ggaaatgctggcccagatggtgcccctggagagaggggcggccctggtgaaagaggacct
cgggggacccctggtgtgaggggcccaaggggagacccgggtgaagctggaccacaaggt
gaccaaggaagagaggggcctgttggcatccctggagatccgggtgagagtggccccatc
ggacctaaaggataccgaggtgatgagggtcctccaggccctgagggtctcagaggagcc
ccaggacccgttggaccccccggggacccaggactgatgggtgaaaggggtgaagatgga
ccaccaggaaatggcacggaaggcttccccggcttccctgggtatccaggcaacagaggc
cctcctgggataaatggcacgaagggctaccctggcctcaagggggatgagggtgaagcc
ggagacccaggagaggataacaatgatgtttcaccccgtggagtcaaaggggcaaaggga
taccggggccccgagggaccccaggggcctccaggacacgtgggaccacctgggccagat
gagtgtgagattctggatatcatcatgaaaatgtgctcctgctgtgagtgcacgtgtggt
cccatcgatattctcttcgtgctggatagctcagagagcattggcctacagaactttgag
attgccaaggacttcatcatcaaggtcattgaccggctgagcaaggacgagctagtcaaa
tttgagccagggcagtcccacgcaggcgtggtgcagtacagccacaaccagatgcaagaa
cacgtggacatgcggagccccaacgtccgcaacgcccaggacttcaaagaagccgtcaag
aaactgcagtggatggctggcggcacattcaccggagaagcgctgcagtacacccgggac
cggctcctcccacccacgcagaacaaccgaattgctctggtcatcacagacggacgctct
gacactcagcgggacactacacccctcagtgtgctctgcggctcagacattcaggtagtt
tctgtgggcatcaaggatgtgtttggctttgtcgcgggctccgaccagctcaacgtcatt
tcctgccaaggcttatcacaaagccggccgggtatctccctggtgaaggagaactatgca
gaacttctagatgacggctttttgaagaacataacagcccagatctgtatagataagaag
tgtccagattatacctgtccaatcacattctcctccccgaccgacatcaccattctgcta
gacagctcggccagtgtcggcagccacaacttcgaaaccaccaaggtcttcgccaagcgc
ctagccgagcggttcctgtcagcgggcagggaagatcccacccaggtcgtgcgggtggcc
gtggtacagtatagtggtcaaggccagcaacagccaggtcgagcgtctcttcagttccag
cagaactacacggtactggccagctctgtggacagcatggatttcatcaacgacgccaca
gatgtcaacgacgctctgagctatgtaactcgtttctaccgggaaaactcctcaggtgcc
accaagaagagagtgctgttgttctcagatggtaactcacagggggccacagcagaggcc
attgagaaggctgtgcaggaggcccagcgtgggggcattgagatctttgtgatggtggtg
ggaccccaggtgaatgaaccccacatccgtgtactcgtcaccggtaagaccgccgagtac
gacgtggccttcggcgagcgccacctattccgtgtaccgaactaccaggccctactacgt
ggtgtcctctaccagacagtctccaggaaggtggccctgggctag

KEGG   Rattus norvegicus (rat): 298941
Entry
298941            CDS       T01003                                 

Gene name
Lamb1
Definition
(RefSeq) laminin subunit beta-1 isoform X1
  KO
K05636  laminin, beta 1
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    298941 (Lamb1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    298941 (Lamb1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    298941 (Lamb1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    298941 (Lamb1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    298941 (Lamb1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    298941 (Lamb1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    298941 (Lamb1)
   05145 Toxoplasmosis
    298941 (Lamb1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C CorA
Other DBs
NCBI-GeneID: 298941
NCBI-ProteinID: XP_003750185
RGD: 1306311
Ensembl: ENSRNOG00000005678
Position
6
AA seq 1834 aa
MERPLSSLPTFAFSPLYLSRKEDTKKRRAACLPRPPSRAASPRLQGLDMGVLQVFAFGVL
ALWGTRVCAQEPEFSYGCAEGSCYPATGDLLIGRAQKLSVTSTCGLHKPEPYCIVSHLQE
DKKCFICDSRDPYHETLNPDSHLIENVVTTFAPNRLKIWWQSENGVENVTVQLDLEAEFH
FTHLIMTFKTFRPAAMLIERSSDFGKTWGVYRYFAYDCESSFPGISTGPMKKVDDIICDS
RYSDIEPSTEGEVIFRALDPAFKIEDPYSPRIQNLLKITNLRIKFVKLHTLGDNLLDSRM
EIREKYYYAVYDMVVRGNCFCYGHASECAPVDGFNEEVEGMVHGHCMCRHNTKGLNCELC
MDFYHDLPWRPAEGRNSNACKKCNCNEHSSSCHFDMAVFLATGNVSGGVCDNCQHNTMGR
NCEQCKPFYFQHPERDIRDPNLCEPCTCDPAGSENGGICDGYTDFSVGLIAGQCRCKLHV
EGERCDVCKEGFYDLSAEDPFGCKSCACNPLGTIPGGNPCDSETGYCYCKRLVTGQRCDQ
CLPQHWGLSNDLDGCRPCDCDLGGALNNSCSEDSGQCSCLPHMIGRQCNEVESGYYFTTL
DHYIYEAEEANLGPGVVVVERRYIQDRIPSWTGPGFVRVPEGAYLEFFIDNIPHSMEYEI
LIRYEPQLPDHWEKAVITVQRPGRIPTSSRCGNTVPDDDNQVVSLSPGSRYVVLPRPVCF
EKGMNYTVRLELPQYTASGSDVESPYTLIDSLVLMPYCKSLDIFTVGGSGDGEVTNSAWE
TFQRYRCLENSRSVVKTPMTDVCRNIIFSISALIHQAGLACECDPQGSLSSVCDPNGGQC
QCRPNVVGRTCNRCAPGTFGFGPNGCKPCDCHLQGSVSAFCDAVTGQCHCFQGIYARQCD
RCLPGFWGFPSCQPCQCNGHAEDCDTVTGECLSCQDYTTGHNCERCLAGYYGDPIIGSGD
HCRPCPCPDGPDSGRQFARSCYQDPVTLQLACVCDPGYIGSRCDGCASGYFGNPSDFGGS
CQPCQCHHNIDTTDPEACDKETGRCLKCLYHTEGDHCQFCQHGYYGDALRQDCRKCVCNY
LGTVREHCNGSDCHCDKTTGQCSCLPNVIGQNCDRCAPHTWQLASGTGCEPCNCNAVHSF
GPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPDVECRACDCDPRGIETPQCDQSTGQCV
CVEGVEGPRCDKCTRGYSGVFPDCTPCHQCFALWDVIIGELTNRTHKFLEKAKALKISGV
IGPYRETVDSVEKKVNEIRDILAQSPAAEPLKNIGILFEEAEKLTKDVTEKMAQVEVKLS
DTASRSNSTAGELDALQAEAGSLDKTVNELAEQLEFIKNSDIQGALDSITKYFQISLEAE
KRVNASTTDPNSTVEQSALTRDRVEDLMLERESQFKEKQEEQARLLDELAGKLQSLDLSA
AAEMTCGTHPGADCSESECGGPNCRTDEGEKKCGGPGCGGLVTVAHSAWQKAMDFDRDVL
NALAEVEQLSKMVSEAKVRADEAKQNAQDVLLKTNATREKVDRSNEDLRNLIKQIRDFLT
EDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVETLSQVEVILQQSAADIARAE
LLLEEAKRASKSATDVKVTADMVKEALEEAEKAQVAAEKAIKQADEDIQGTQNLLTSIES
ETAASEETLTNASQRISKLERNVEELKRKAAQNSGEADYIEKVVYSVKQSADEVKKTLDG
ELDEKYKKVESLIAQKTEESADARRKAELLQNEAKTLLAQANSKLQLLEDLERKYEDNQK
YLEDKAQELVRLEGEVRSLLKDISEKVAVYSTCL
NT seq 5505 nt   +upstreamnt  +downstreamnt
atggaaaggcccctctcctctctcccaacatttgccttttcccccctctacctctccaga
aaggaagacacgaagaaaagacgggcagcttgcctgccgcgtcctccttcccgtgccgcg
tcccctcgcctgcaaggactggacatgggggtgctccaggtgttcgcctttggtgtccta
gccctatggggcacccgagtgtgcgctcaggaaccagagttcagctatggctgcgcggaa
ggcagctgctaccccgccactggcgaccttctcatcggccgagctcaaaagctctcagtg
acctcgacatgcggactgcacaaaccagagccctactgtattgtcagccacctgcaggag
gacaagaaatgcttcatatgtgattcccgggatccttatcatgagaccctcaatcctgac
agccatctcattgaaaatgtggtcacaacgtttgctcctaaccgccttaagatttggtgg
caatcggaaaatggtgtggagaacgtaactgtccaactggacctggaagcagaattccac
ttcacgcatctcatcatgactttcaagacattccgtccggcggccatgctgatagagcgg
tcttctgactttgggaagacttggggcgtgtacagatactttgcctacgactgtgagagc
tcattccccggcatttcgactggccccatgaagaaggtggatgacatcatctgtgactct
cggtattccgacattgagccctcaacagaaggagaggtgatatttcgtgccttagatcct
gctttcaaaattgaagatccttatagtccaaggatacagaatctattaaaaatcaccaac
ttgagaatcaagttcgtgaaactgcacaccttgggagataaccttttggactccagaatg
gaaatcagagagaagtactattacgctgtttatgatatggtggttcgagggaactgcttc
tgctatggtcatgccagtgaatgtgcccccgtggatggattcaacgaagaagtggaagga
atggttcatgggcactgcatgtgcagacacaacactaaaggcttgaactgtgagctgtgc
atggatttctaccacgatttgccgtggagacctgccgaaggtcggaacagtaacgcctgc
aaaaagtgtaactgcaacgaacattccagctcgtgtcactttgacatggcggtcttcctg
gctaccggcaatgtcagcgggggagtgtgtgataactgtcagcacaacaccatggggcgc
aactgcgagcagtgcaaacccttctacttccagcaccctgagagggacatccgggacccc
aatctctgcgaaccatgtacctgtgatccagctggttctgagaatggagggatctgtgac
ggttacactgatttttctgtgggtctcattgctggtcagtgtcggtgcaaattgcacgtg
gagggagagcgctgcgacgtctgtaaagaaggcttctatgacctgagtgctgaggacccg
ttcggttgtaaatcgtgtgcttgcaatcctctgggaacaattcctggtgggaatccttgt
gactctgagaccggttactgctactgtaaacgtctggtgacaggacagcgctgtgaccag
tgcctgccgcagcactggggcttaagcaatgacttggacggatgtcgaccctgtgactgt
gaccttggaggggcattgaacaatagctgctccgaggactccggccagtgctcctgcctg
ccccacatgattgggcggcagtgtaacgaggtggagtctggctactacttcaccaccctg
gaccattacatctacgaagctgaggaagccaatcttgggcctggggtcgttgtggtggag
aggcggtacattcaggaccgcattccttcctggacaggacctggcttcgtccgggtgccc
gagggggcttacttggagtttttcattgacaacataccacattccatggagtatgaaatc
ctaattcgctatgaaccacagctgccggaccactgggagaaagctgtcatcaccgtacag
cggccggggaggattccaaccagcagccgatgtggtaacactgttcctgatgatgacaac
caggtggtgtccttgtcaccgggctcaagatacgttgtcctccctcgcccagtatgcttt
gaaaaggggatgaactacaccgtgcggttggagctgccccagtatacggcatctggcagt
gacgtggagagcccttatactctcattgactcgcttgttctcatgccctactgtaagtcg
ctggacatcttcactgttggaggctcaggagatggggaggtcaccaacagtgcctgggaa
accttccagcgctacaggtgtctggagaacagccggagtgtggtgaaaacacctatgaca
gatgtctgcagaaacatcatcttcagcatttctgccttgatccaccaggcaggcctcgct
tgtgaatgtgaccctcagggatctctgagttctgtgtgtgaccccaatggaggccagtgc
cagtgccgccctaacgtggtcggaagaacctgcaatcggtgtgcacctggtaccttcggg
tttggccccaacggatgcaaaccttgtgactgtcatctacaaggatctgtcagtgccttc
tgtgacgcagtcactggccagtgccactgtttccagggcatatatgctcggcagtgtgac
cgatgtctccctgggttttggggctttcccagctgccagccctgccagtgtaatggccat
gctgaagattgtgacacagtgaccggggagtgtctgagctgtcaggactacaccacgggc
cacaactgcgaaaggtgtctggctggttactatggtgatcccatcattgggtcaggagac
cactgtcgcccttgtccttgtcctgatggtcctgacagtggacgccagtttgccaggagc
tgttatcaagaccctgtcacgctccagcttgcatgtgtttgcgatcctgggtacattggc
tccagatgtgatggctgtgcctctgggtattttggcaatccctcggactttgggggttca
tgtcagccctgtcagtgtcaccacaacattgacactaccgacccagaagcctgtgacaag
gagactgggagatgtctcaagtgcctgtaccacacagagggggaccattgccagttttgc
cagcatgggtactacggcgatgctcttcgacaagactgtagaaagtgtgtctgcaattac
ctgggcacagtgagggagcattgtaacggctctgactgccattgcgacaaaaccaccggt
cagtgctcgtgtcttccgaatgtgatcgggcagaactgtgatcggtgtgcaccccacacc
tggcagctggccagcgggactggctgcgagccgtgcaattgcaatgctgtgcattccttc
gggccatcctgcaatgagttcacagggcagtgccagtgcatgccaggctttggaggccga
acctgcagcgagtgccaggagctcttctggggagaccccgatgtggagtgccgagcctgt
gactgtgatcccaggggcattgagacaccgcagtgtgaccagtccacgggccagtgtgtc
tgtgtggagggtgttgagggtccacgctgtgacaagtgcaccagagggtactccggggtc
ttccctgactgcacaccctgccaccagtgctttgctctctgggatgtgatcattggtgag
ctgaccaacaggacccacaaattcctggagaaagccaaggctttgaaaatcagtggtgtg
attggtccctaccgagagaccgtggactctgtggagaagaaagtcaatgaaataagagac
atcctggcccagagcccagcagcggaaccactgaaaaacattggcattctctttgaggag
gcagagaaattaaccaaagatgtcacagaaaagatggctcaagtagaagtgaaattatct
gacacagcttcacggagtaacagcacagccggagagctggacgcactgcaggcagaagca
gggagccttgacaagaccgtgaacgagctggcagaacagctagaatttatcaaaaactcc
gatattcagggtgccttagacagtatcaccaagtatttccagatctctcttgaggcagaa
aagagggtgaacgcctccaccacagaccccaacagcactgtggagcagtcagccctcacc
cgagacagagtggaagatctgatgttggagcgagagtctcagttcaaggaaaagcaggag
gaacaggcgcgccttctggatgaactggcgggcaaactgcaaagtctagacctgtcggct
gctgcagagatgacctgtggaacgcatccaggggctgactgttctgaaagtgaatgtggt
ggccccaactgcagaactgacgaaggagagaagaagtgtggagggcctggctgtggtggt
ctggtcactgtggcccatagtgcctggcagaaagccatggattttgaccgtgatgtcctg
aatgccctggctgaagtagaacagctctccaagatggtctctgaagcaaaagtgagagcg
gacgaggctaagcagaatgctcaggatgtcctgttgaaaacaaatgccaccagagagaaa
gtggacaggagcaatgaggacctgaggaacctcatcaagcaaatcagagactttctgacg
gaggacagcgctgatctggacagtattgaagccgttgcgaacgaagtactgaagatggaa
atgcctagcaccccacagcagttgcaaaacctaacagaagacatccgggaacgagttgaa
accctttcccaagtagaagttattttgcagcagagtgctgctgacattgccagagctgag
ctgttgctggaagaagctaagagagcaagcaaaagtgcaacagatgttaaggtcactgca
gacatggtaaaggaggcgttggaagaagcagaaaaggcccaggttgcagcagagaaggcg
attaaacaagctgatgaggatatccaggggacccagaacctgctaacatcgattgaatct
gagacagcagcgtctgaggaaaccctgaccaatgcctcccagcgcatcagcaagcttgag
aggaatgtggaagagcttaagcgtaaagctgcccagaactctggggaggcagactatatt
gaaaaagtcgtatatagtgtaaaacagagtgcagacgaggttaagaagactctagatggc
gaacttgatgaaaaatataagaaggtagagagtttaattgcccaaaaaactgaagagtca
gcagatgccaggaggaaagctgagctgctacaaaatgaagcaaaaacactcctggcccaa
gccaacagcaaacttcagctgttagaagacttagaaaggaagtatgaggacaatcaaaag
tacttagaagataaagctcaagaactggtgcgactggaaggagaggtgcgctcactcctt
aaggacataagtgagaaagttgccgtttacagcacctgcttataa

KEGG   Rattus norvegicus (rat): 304913
Entry
304913            CDS       T01003                                 

Gene name
Tnn
Definition
(RefSeq) tenascin-N precursor
  KO
K06252  tenascin
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05165  Human papillomavirus infection
rno05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    304913 (Tnn)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    304913 (Tnn)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    304913 (Tnn)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    304913 (Tnn)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    304913 (Tnn)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    304913 (Tnn)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   304913 (Tnn)
SSDB
Motif
Pfam: fn3 Fibrinogen_C DUF4998 Interfer-bind Pur_ac_phosph_N DUF4959 EGF_2 EGF_Tenascin Wtap Ax_dynein_light
Other DBs
NCBI-GeneID: 304913
NCBI-ProteinID: NP_001100659
RGD: 1306002
Ensembl: ENSRNOG00000002548
Vega: OTTRNOG00000000167
UniProt: D3ZK14
Position
13
AA seq 1562 aa
MGLWGMLAFPLGFLLASALLVASAPANPETPGCSHKDQQVTVSHTYKIDVPKSALVQVET
DPQSLSDDGTSLLVPGEDGEEQNIIFRHNIRLQTPQKNCELADSVQDLLARVKKLEEEMA
ELKEQCSTSRCCQGAADLSRHCSGHGTFFAETCSCHCDQGWEGADCEQPTCPGACSGHGR
CVDGQCVCDQPYVGVDCAYAACPQDCSGHGVCVRGVCQCHKDFTAEDCSEQRCPGDCSGN
GFCDTGECYCEMGFTGPDCSQVVAPQGLQLLKSTENSLLVSWEPSSEVDHYLLSYYPLGK
EQATKQVRVPKEQHTYDITGLLPGTKYIVTLRNVKKDISSSPQHLLATTDLAVVGTAWVN
EETETSLDVEWENPLTEVDYYKLRYGPLTGQEVTEVTVPKSRDPKSRYDITGLQPGTEYK
ITVVPIRGDLEGKPILLNGRTEIDGPTNVVTNQVTEDTASVSWDPVRADIDKYVVRYISH
DGETKEKAVPKDQSSTVLTGLKPGEAYKVFVWAERGNQGSKKADTKALTEIDSPENLVTD
RVTENSLSVSWDPVEAEIDRYVVSYTSVDGETKQVPVRKDQSSTVLTGLSPGVEYKVYVW
AEKGDRESKKANTKAPTDIDSPKNLVTDQVTENTLSVSWDPVQANIDRYMVSYTSADGET
REVPVPKEKSSTVLTGLRPGMEYKVHVWAQKGTQESRKANTKAPTDIDGPKNLVTDQVTE
TTLSVSWDPVEADIDRYMVRYTSPDGETKEVPVSKDKSSTVLRGLRPGVEYKVDVWAQKG
AQDSRKANTKAPTDIDSPKNLVTEQVTESTATVSWDPVEADIDRYVVRYTSVDGETREFL
VGKDQTSTILTGMKPGVEYQVDVWAQKGTRESRKTSTKAPTDIDGPKNLVTDQVTETTLS
VSWDPVEADIDRYMVRYTSPDGETKEVPVSKDKSSTVLRGLRPGVEYKVDVWAQKGAQDS
RKANTKAPTDIDSPKNLAINQVTETTLSVSWDPVQADIDRYVVRYTSADGESKEFLIGKE
QRSTVLTGLRPGVEYKVEVWAQKGARESKKANTKGHTDIDSPKNLVTNQVTENTATISWD
PVQADIDRYMVRYTSADGETREVPVRKEKSSTVLTGLRPGVEYTVQVWAQKGARESRKAK
TKAPTEIDSPKNLVTNRVTENTATISWDPVRATIDRYMVRYTSADGETKEVPVSKDQSST
ILTGLKPGMEYTIHVWAQKGARESKKADTKALTEIDPPKNFRPFGVTHSSGVLTWMPPSA
QIDGYILTYQLPNGILKEVELQRGQQRFELQDLEQGVTYPVSLVAFKGNQRSRSVSTTLS
TVDARFPHPSDCSQVQQNTNAAASGLYTIYLNGDASRPMQVYCDMDTDGGGWIVFQRRNT
GQLDFFKRWRSYVEGFGDPMKEFWLGLDKLHNLTTGTTTRYEVRADLQTANESAYAVYDF
FQVASSKERYKLSVGKYRGTAGDALTYHNGWKFTTFDRDNDIALSNCALTHHGGWWYKNC
HLANPNGKYGETKHSEGVNWEPWKGHEFSIPYVELKIRPFGYSRDRFSGRKKRSIGGKAR
VF
NT seq 4689 nt   +upstreamnt  +downstreamnt
atgggtctgtgggggatgcttgccttccccctgggattcctgcttgcttctgcgctcctg
gtggcttcagccccagccaacccagagactcctggctgcagccacaaagaccaacaggtc
accgttagccacacctacaagattgatgtgcccaagtctgctctggttcaagtagagacg
gacccacagtcactcagtgatgatgggacatcactcttggttcctggggaggatggggag
gagcagaacattatcttcaggcacaacatccgtcttcagacaccgcagaagaactgtgag
ttggccgacagtgtccaggacctgctagcccgggtgaaaaagctggaggaggagatggca
gagctgaaggagcagtgtagtaccagccgctgctgccagggagctgccgatctgagccgt
cactgcagtgggcatgggaccttcttcgctgagacctgcagctgccactgtgaccagggc
tgggagggcgctgactgtgagcagcccacctgtcctggggcttgcagcggccacgggcgg
tgcgtggatgggcaatgcgtgtgtgatcagccctatgtgggggtcgactgcgcctacgcc
gcctgtccccaggactgcagtgggcatggcgtgtgtgtgcggggtgtctgccagtgccac
aaggacttcacagctgaggactgcagcgagcagcgctgtcctggcgactgtagtggcaac
ggcttctgtgacactggcgagtgttactgcgagatgggctttactggccccgactgttcc
caggtggtggctcctcagggcctgcagttgctcaagagcaccgagaactccctgctggtg
agctgggagccctccagtgaggtggaccactacctgcttagctactaccccttggggaag
gagcaagctacaaaacaggtccgggtgcccaaggagcaacacacctacgacatcactggt
ttgctgcctggcaccaagtacatagtcaccctgcgtaacgtgaagaaggacatttccagc
agccctcagcatctccttgccaccacagatcttgctgtggtgggcacggcctgggtaaat
gaggagactgagacatctcttgatgtggagtgggagaaccctctgactgaggtggactat
tacaagcttcggtatggccccttaacagggcaggaagtgactgaggtcactgtgcccaag
agccgtgatcccaagagtagatatgacatcaccggtctgcagcctggaacggaatataaa
atcacagttgtgcccataagaggagatctggagggaaagccaattctcctcaatggcagg
acagaaattgatggaccaaccaatgtggtcacaaaccaagtgacagaagacacagcatct
gtttcctgggatccagtgagggctgacatagacaagtatgtggtgcgctatatctcccat
gatggggagaccaaggagaaggcagtacccaaggaccagagcagcaccgttctcacgggc
ctgaagccaggagaggcctacaaagtctttgtgtgggctgagagaggcaaccaaggcagc
aagaaagcagacaccaaggccctcacagaaattgacagcccagaaaacctggtgaccgac
cgtgtgacagagaacagtctctctgtctcgtgggacccggtggaggctgaaatagacagg
tacgtggtgagctacacttccgtggatggagagacaaagcaggttccagtgaggaaggac
caaagcagcaccgtcctcaccggcctgagtccaggtgtggagtacaaagtgtacgtgtgg
gcagagaaaggtgatcgagagagcaagaaggccaacaccaaggctcccacagacattgac
agccccaaaaacttggtgactgaccaggtgacagagaacactctcagtgtctcttgggac
cctgttcaggccaacattgacaggtatatggtgagctacacttctgctgatggagagacc
agggaggttccagtgcctaaggagaagagcagcactgttctgacaggcctgaggccaggc
atggagtacaaggttcacgtatgggcccagaaggggacccaggagagcaggaaggccaat
accaaggcccccacagatattgatggccccaaaaacctagtgactgaccaggtgacagag
accactcttagtgtctcctgggacccagtggaggctgacatcgatagatacatggtacgc
tacacgtctcctgatggagagaccaaggaggtccctgtgtcaaaggataagagcagcaca
gtcctgaggggcctgaggccaggtgtggagtacaaggttgatgtgtgggcccagaagggg
gcccaggacagccggaaggccaacaccaaggcccccacagatattgacagtcccaaaaac
ctcgtgactgagcaggtgacagagagcactgccaccgtgtcctgggacccagtagaggct
gacattgacaggtatgtggtgcgctacacctctgtcgatggagagaccagggaatttctc
gttgggaaggatcaaacaagcaccatcctgaccggcatgaaaccgggtgtggagtatcag
gtcgatgtgtgggcccagaaggggacacgggagagcaggaagaccagcaccaaggcccca
acagatattgatggccccaaaaacctagtgactgaccaggtgacagagaccactcttagt
gtctcctgggacccagtggaggctgacatcgataggtacatggtacgctacacgtctcct
gatggagagaccaaggaggtccctgtgtcaaaggataagagcagcacagtcctgaggggc
ctgaggccaggtgtggagtacaaggttgatgtgtgggcccagaagggggcccaggacagc
cgaaaggccaacaccaaggcccccacagatattgacagtcccaaaaacctggcgatcaac
caggtgacagagaccacgctcagtgtctcctgggacccagtacaggctgatatcgacagg
tatgtggtgcgctacacctctgctgatggagaatccaaagaattcttgattgggaaggaa
cagaggagcactgtcctgacaggcctgaggccaggcgtagagtacaaggttgaagtgtgg
gcccagaaaggagcccgggagagcaagaaagccaacaccaaaggccacacagacattgac
agtcccaaaaacctggtgaccaatcaggtgacagaaaatacagccaccatctcctgggac
ccagtgcaggctgatattgacaggtacatggtgcgctacacctctgctgatggagagacc
agggaggttccagtgaggaaggagaagagtagcacggtgctcacaggcctgaggccagga
gtggagtacacagtccaagtgtgggctcagaagggggcccgggagagcaggaaggccaaa
accaaggcccccacagaaattgacagccccaagaacttggtgaccaaccgagtgacagag
aatacagccaccatctcctgggacccagtacgagccaccattgacaggtacatggttcgc
tacacctctgcggatggagagactaaggaggttccagtgtcaaaggaccagagtagcacc
atcctgacaggtctgaagccaggcatggagtataccattcatgtgtgggcccagaagggg
gctcgggagagcaagaaggctgataccaaggccctaacagaaattgaccctcccaaaaat
ttccgtccatttggtgtaacacattccagtggggttttgacctggatgcccccctctgct
caaatcgatggctacattttgacctaccagcttccaaatggcatcttgaaggaggtggag
ctccaaagaggccagcagagatttgagttgcaagacctggaacagggcgtcacctatcct
gtttctctggttgcctttaaaggtaatcagcggagccggagtgtgtctaccaccctttct
acagtggacgcccgatttccacacccctcagactgcagtcaggttcagcagaacaccaat
gctgctgccagtggcctctacacgatctacctcaatggcgatgccagccggcccatgcag
gtttattgtgacatggacacggatggaggcggctggattgtctttcagagacgtaacact
gggcagctggatttcttcaagcgttggcggagttatgtagaaggctttggggaccccatg
aaggagttctggcttggacttgacaaactacataatctcaccactggcactaccactcgg
tatgaggtgagggcagatttacagactgccaatgaatccgcctacgctgtgtatgacttc
ttccaagtggcatccagcaaagagcggtacaagctgtcggttgggaaatacagaggcaca
gcaggggatgctcttacctaccacaatggttggaagttcacaacttttgacagagacaat
gatattgccctcagcaactgcgcgctgacgcatcatggcggctggtggtataagaactgc
catttggccaacccgaatggcaaatatggggagaccaagcacagtgagggggtgaactgg
gagccatggaaaggacatgagttctccattccttacgtggagctaaaaatccgccccttt
ggttacagcagagaccgtttctctggcaggaagaagcggtccataggagggaaggcaaga
gtgttctaa

KEGG   Rattus norvegicus (rat): 305078
Entry
305078            CDS       T01003                                 

Gene name
Lamb3
Definition
(RefSeq) laminin subunit beta-3 precursor
  KO
K06244  laminin, beta 3
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    305078 (Lamb3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    305078 (Lamb3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    305078 (Lamb3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    305078 (Lamb3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    305078 (Lamb3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    305078 (Lamb3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    305078 (Lamb3)
   05145 Toxoplasmosis
    305078 (Lamb3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF F5_F8_type_C Laminin_II BNR_6 Alanine_zipper
Other DBs
NCBI-GeneID: 305078
NCBI-ProteinID: NP_001094311
RGD: 1562668
Ensembl: ENSRNOG00000006025
UniProt: F1LPI5
Position
13
AA seq 1172 aa
MMTALFLLWLALPGLLCAQQPCSRGACYPPVGDLLIGRTQLLRASSTCGLTKPETYCTQY
GQWQMKCCKCDSRLPRNYNSHRVENVVSSSGPMRWWQSQNDVSPVSLQLDLDKRMQLQDI
MMDFKGLMPAGMLIERSSDFGKTWRVYQYLATDCASTFPQVHQGQPKNWQDVRCRPLSQR
PNGHLTGGKVQLNLMDLASAIPASQSKKIQELGDITNLRVNFTKLAPVPQRGSYPPSAYF
AVSQLRLQGSCFCHGHADRCAPNPGGPTGSTTAVQVNDVCVCQHNTAGPNCDRCAPFYNN
RPWRPAEGQDIHECQRCDCNGHSETCHFDPAVFAASQGTSGGVCDNCQDHTEGKNCERCQ
LHYFRNRRPGAPIHETCIPCECDPDGAVPGAPCDRLTGQCVCKDHVQGERCDLCKPGFTG
LTFANPQGCHPCDCSILGTRQDMPCEEETGRCLCLPNVVGPKCDQCAPSHWKLASGRGCE
PCACDPRNSLSSQCNQFTGQCPCREGFGGLTCSSAAIRQCPDRTHGDVATGCRACDCDFR
GTEGPGCDKTSGRCLCRPGLTGPRCDQCQRGYCDRYPVCVACHSCFQAYDTDLQEQAWRL
GSLRNTTEGLGTGTGLEDRGLASRLLDAKSKIEQIRQILGGTSVTERDVAQVANAILSIR
RTLQGLPLDLPLEEEMESFSGDLGNLDRSFNRLLLMYRSKKEQFEKLSSVDPSGAFRMLT
MAYEQSSRAAQQVSDSSSLLSQLRDSRREVEGLERQTGEGGAGGAQLMALRLEMASLPDL
TPTINKLCGGSRQTACTPGDCPGLLCPQDNGTACGSHCRGALPRARGALHMAGQVAEQLR
SFNTQLQQTRQMIRAAEEAASKVQSDAQRLETQVSTSRLQMEEDVRRTRLLIQQVRGFLT
DPHTDAATIQQVSEAVLALWLPTDSATVLRKMKEIQDIAARLPNVDLVLSQTKQDIARAR
RLQAEAEKARSRAHVVEGQVDDVVGNLRQGTVALQEAQDTMQGTGRSLRLIQERVTEVQQ
VLVPAERLVKGMKEQMSGFWARVKELRHQAQEEQAQAMKAQQLAEGASNQAMNAQEGFER
LKQRYTELKDRLGQSSMLGEQGNRILSIKAEAEELFGETMEMMDKMKDMESELLRGSQAI
MLRSADLSGLEKRVEQIRSYINGRVLYYATCK
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgatgacggcacttttcctcttgtggcttgccttgcctggcctcctgtgtgcccagcaa
ccctgctcccgaggggcctgttatccacctgttggggacttgctcattggaaggactcag
cttcttcgagcctcgtctacctgtggattgaccaagcctgagacctattgcacccaatat
ggacagtggcagatgaaatgctgcaagtgtgactccaggctgcctcgcaactacaacagt
caccgagtggagaacgttgtatcgtcttcaggccccatgcgctggtggcagtcccagaac
gatgtgagccctgtctctcttcaactggacctagacaagaggatgcagcttcaggacata
atgatggattttaagggtctcatgccagctggcatgttgattgagcgctcttctgacttt
ggcaagacatggagggtgtaccagtacctggcgacagactgtgccagcacctttccccag
gtccaccagggccagcccaagaactggcaggacgtccggtgccggcccttgtcccagagg
cctaatgggcatctgactgggggaaaggtccaacttaaccttatggatttggcatcagct
atccctgcatctcaaagtaaaaagattcaagaactaggagacatcacgaacttgagagtc
aacttcaccaagctggccccagtgccccagaggggctcctatccacccagcgcctacttt
gcagtatcacagctacgtctacaggggagttgcttctgtcatggacatgccgaccgctgt
gcccctaatcctggaggccctacaggttccaccaccgctgtgcaggtcaacgatgtctgt
gtctgccagcacaacacagctggccccaactgtgaccgctgcgcccccttctacaacaac
cggccatggagacctgcagagggacaggacatccatgagtgccaaaggtgtgactgcaat
gggcactcagagacctgccacttcgacccagctgtgtttgctgctagccaagggacaagt
ggaggcgtgtgtgacaactgccaggaccacacagagggcaagaactgtgagcgatgtcag
ctacactatttccggaatagacgaccaggcgctcccattcatgagacctgtattccctgc
gagtgtgatccagacggggcagtcccaggagctccctgtgaccgactgacgggccagtgc
gtgtgcaaagaccacgtgcaaggggagcgctgtgatctttgcaagcctggtttcacaggg
ctcaccttcgccaacccacagggctgccatccctgtgactgcagcatcttaggtacccgg
caagatatgccttgtgaggaggaaactggacgctgcttgtgtcttcccaatgtagtgggc
cccaaatgtgaccagtgcgccccttcccactggaagctggccagcggccggggctgtgag
ccctgtgcctgtgacccacgcaactccctcagctcccagtgcaaccagtttacagggcag
tgtccttgtcgggaagggtttggtggcctcacatgtagttctgcagccattcgtcagtgt
cccgaccggacccatggagacgtggccacagggtgccgagcctgtgactgtgacttcaga
ggaaccgagggtcctggctgtgacaaaacctcaggccgttgcctctgccgccccggcttg
acagggccccgctgcgaccagtgccaacgaggctactgcgaccgctacccagtgtgtgtg
gcctgccactcttgcttccaggcctatgacacagacctccaggagcaggcttggcgcctt
ggcagtctccgtaataccactgaaggcctgggcacagggacaggtctggaggaccgtggc
ctggcttctcggttgctagatgccaagagcaaaatcgagcagatcagacagattctgggt
ggcacctctgtcacagagcgggatgtggctcaagtggccaatgccatcctgtctatcagg
aggactcttcagggcctgcccctggacctacccttagaggaggagatggaatccttctct
ggagacctggggaatctagacagaagcttcaatcgcctcctccttatgtaccggagcaag
aaggagcagtttgagaagctaagcagtgtggacccttcaggagccttccgcatgctgact
atggcttatgagcagtcctcccgggctgcacagcaagtatccgacagttccagcctgctg
agccagctgagggacagtcgcagagaggtagaaggtctggagagacagactggagaggga
ggagctggaggagctcagctcatggccctgcgcctagagatggcttcactgcctgacttg
acacccaccatcaacaagctctgtggtggctctaggcagacagcctgcaccccaggagac
tgccctggattgctgtgtccccaggacaatggtacagcctgtggctctcactgcagggga
gccctgcccagagccagaggggccttacacatggcagggcaggtagctgagcagctacga
agcttcaatacccagctccagcagaccaggcaaatgatcagggcagctgaggaggcagca
tcaaaggtccaatcagatgcccagcgccttgagactcaggtgagtaccagccgcttacag
atggaggaagatgtccggcgcacacggcttctcatccagcaggtccggggcttcctcaca
gaccctcacacagatgcagccaccatccaacaggtgagcgaggcagtgctggctctctgg
ctgcccacagactcagctacagtgctacgcaagatgaaagagatccaggacattgctgcc
aggctccctaacgtggacttagtgctatcccagactaagcaggacattgcacgggcccgc
aggctccaggctgaggccgagaaggccagaagccgagcccacgttgtggaagggcaggtg
gatgacgtggtcgggaacctcaggcagggcaccgtggctctgcaggaggctcaggacact
atgcagggcactggccgctctcttcggctcatccaggaaagggttactgaggttcagcag
gtcctggtaccagctgaaaggctggtgaaaggcatgaaggaacagatgagtggattctgg
gcacgggtgaaggagctccgccatcaggcccaggaagagcaggcacaggcaatgaaggcc
cagcagcttgcagagggtgccagcaatcaggccatgaatgctcaggagggctttgagaga
cttaagcaaaggtatacagagttgaaagaccggctgggtcagagttctatgctgggtgag
caaggcaatcggatcttgagcatcaaggcggaggcagaggagctctttggggaaaccatg
gaaatgatggataaaatgaaagacatggagtcagagctgcttcgagggagtcaggccatc
atgcttcgttcagcagacctgtcggggctggagaagcgcgtggagcagatccgcagttac
atcaacgggcgtgtgctgtactatgccacctgcaagtga

KEGG   Rattus norvegicus (rat): 305104
Entry
305104            CDS       T01003                                 

Gene name
Col9a1
Definition
(RefSeq) collagen alpha-1(IX) chain
  KO
K08131  collagen type IX alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    305104 (Col9a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    305104 (Col9a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    305104 (Col9a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    305104 (Col9a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    305104 (Col9a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:rno00535]
    305104 (Col9a1)
Proteoglycans [BR:rno00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   305104 (Col9a1)
SSDB
Motif
Pfam: Collagen Laminin_G_3
Other DBs
NCBI-GeneID: 305104
NCBI-ProteinID: NP_001094312
RGD: 1309425
Ensembl: ENSRNOG00000012920
UniProt: F1LQ93
Position
9
AA seq 934 aa
MVTLINPREPTGKMKNFWKIPLFFFACSFLGPCVSATPKRRARFPANSVSHGGSELCPKI
RIGQDDLPGFDLISQFQIEKAASRRTIQRVVGSTALQVAYKLGSNVDFRIPTRHLYPSGL
PEEYSFLTTFRMTGSTLEKQWNIWQIQDSSGREQVGVKINGQTKSVAFSYKGLDGSLQTA
AFLNLQSLFDSRWHKLMIGVERTRATLFIDCIQIDSLPIKPRAQIDADGFAVLGKLVDNP
QVSVPFELQWMLIHCDPLRPRRETCHELPVRITTSQTTDERGPPGEQGPPGPPGPPGVPG
IDGIDGDRGPKGPPGPPGPPGDPGKPGAPGKPGTPGADGLTGPDGSPGSVGPRGQKGEPG
VPGSRGFPGRGIPGPPGPPGTTGLPGELGRVGPAGDPGKRGPPGPPGPPGPSGTIGFHDG
DPLCPNSCPPGRSGYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGA
QGLRGITGIVGDKGEKGARGFDGDPGPQGVPGAAGDQGQRGPPGETGPKGDRGIQGSRGI
PGSPGPKGDTGLPGVDGRDGIPGMPGTKGEAGKPGPPGDAGLQGLPGVPGIPGAKGVAGE
KGNTGPPGKPGQLGSSGKPGQQGPPGEVGPRGPRGLPGSRGPVGPEGSPGIPGKLGPLGS
PGLPGLPGPPGLPGMKGDRGVFGEPGPKGEQGASGEEGEAGVRGDLGDMGQPGPKGSVGN
PGEPGLRGPEGIRGLPGVEGPRGPPGPRGVQGEQGATGLPGIQGPPGRAPTDQHIKQVCM
RVVQEHFAEMAASLKRPDTGASGLPGRPGPPGPPGPPGENGFPGQMGIRGLPGIKGPPGA
LGLRGPKGDLGEKGERGPPGRGPKGLPGAIGLPGDPGPASYGKNGRDGEQGPPGVAGIPG
VPGPPGPPGPPGFCEPASCTLQAGQRAFSKGPDK
NT seq 2805 nt   +upstreamnt  +downstreamnt
atggtgactcttataaacccaagagaacccactgggaaaatgaagaacttctggaaaatt
ccacttttcttctttgcatgcagttttctgggaccctgtgtatctgcaactcctaagcgt
cgtgcaagatttcctgccaattctgtttctcatggtggaagtgaactgtgtcccaagatc
aggattggccaggatgacttgccaggatttgacctgatttctcagttccaaattgagaag
gctgcatctcgaaggactatccagagggtggtggggtccacagcgttacaagtggcttac
aagctgggaagtaatgtagacttcaggattccaacaaggcatttgtatcccagtggactg
cctgaagaatattcctttttaactactttccggatgactggaagcacacttgaaaagcaa
tggaacatttggcagatccaggattcctcagggagggagcaagttggcgtgaagataaat
ggccaaacaaagtctgtcgcgttttcatacaagggactggacgggagtctccaaacggca
gccttcttgaatttgcagtccttgtttgactcccgttggcataagctcatgattggcgtg
gagagaacaagagctactctttttattgactgcatccagatcgattccttacctataaag
ccgagagcccagatcgatgctgatggctttgctgtgctgggaaaacttgtggacaatcct
caggtttctgttccctttgaactccagtggatgctgattcactgtgaccccctgagaccc
cggagagagacgtgccatgaactgccagtcagaatcacaaccagccagaccaccgatgag
agaggtcctccgggtgagcaggggcctccggggcctcccgggcctcctggagttccgggc
atagatggtattgatggtgaccgaggtccaaagggtccccccggacctccgggtcctcct
ggagacccgggcaagccaggagcaccaggcaagccaggcactccaggagctgatggatta
acaggacctgatgggtccccgggctccgttggaccaaggggacaaaaaggagaacctggt
gtgcctgggtctcgtggatttccaggccgtggcattccaggaccccctggccctcctggg
accacaggacttcctggagaacttggccgagtaggccctgctggcgaccctgggaaaaga
ggaccacctggtcctcctggacccccaggacctagtggaacaattggatttcatgatgga
gacccattgtgccccaattcctgcccaccaggtcgctctggctatccaggcctaccaggc
atgagaggccacaaaggggcgaaaggagaaattggcgagccaggaagacaaggacacaag
ggtgaagagggtgaccagggggaactgggagaagttggagctcaaggacctccaggagct
caaggtctgagaggcatcactggcatagttggagacaaaggagaaaaaggtgctcgggga
ttcgatggagatcctgggcctcagggcgttccaggtgcagctggtgatcaagggcagcga
ggacctccaggagaaacaggtcctaagggagacaggggcattcaaggttcccgaggaatt
cctggatccccgggtccaaaaggagacacgggcttgccaggtgtcgatggtcgagacgga
atcccgggaatgcctggaacaaagggtgaagcaggaaagcctggacctcccggtgatgcg
ggattgcagggtttaccgggtgtacctgggattcctggtgcaaaaggtgtggccggcgaa
aagggtaacacgggtcctccggggaagcctggtcagttgggaagttcaggaaaaccaggc
caacaaggaccgccgggagaggttggacctcggggacccaggggccttccaggcagcaga
ggcccggtaggaccagaaggctctccaggcataccagggaaactgggacctcttggcagc
cctggccttcctggcttgcctgggccccctggacttcctggaatgaaaggagacaggggt
gtatttggtgaaccaggtcccaagggcgaacagggtgcctctggtgaagaaggtgaagca
ggagtaaggggtgaccttggagatatgggacaacctggcccaaagggatcagtgggtaac
cctggggagcccggcctgaggggacctgaaggaatcagagggcttcctggagtggaagga
ccaagaggaccacctggaccccgaggtgtgcaaggagaacagggtgccactggactgcct
ggtatccagggacctccgggcagagcgccgacggaccagcacatcaagcaggtttgcatg
agagttgtgcaagagcattttgcagaaatggcagctagcctcaagagaccagacacagga
gcttctggtcttcctgggaggcctggtcccccagggcctcccggccctccaggagagaat
ggtttccctggtcagatgggaatccgtggccttccaggcattaagggtccccctggtgct
cttggcttaaggggacctaaaggtgacttgggagaaaaaggagaacgtggtcctccagga
agaggtcctaagggtttgcctggagctatcggtctcccaggtgatccaggccctgccagc
tatgggaaaaatggccgagatggagagcaaggtcccccaggagtggcaggaattcctggt
gtgcctgggcccccaggtcctccaggccctcctggtttctgtgagccagcctcttgcacc
cttcaggcaggtcaaagagcatttagcaaagggcctgacaagtga

KEGG   Rattus norvegicus (rat): 306628
Entry
306628            CDS       T01003                                 

Gene name
Col4a2
Definition
(RefSeq) collagen alpha-2(IV) chain
  KO
K06237  collagen type IV alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    306628 (Col4a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    306628 (Col4a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    306628 (Col4a2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    306628 (Col4a2)
  09154 Digestive system
   04974 Protein digestion and absorption
    306628 (Col4a2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    306628 (Col4a2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    306628 (Col4a2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    306628 (Col4a2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    306628 (Col4a2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    306628 (Col4a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    306628 (Col4a2)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    306628 (Col4a2)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   306628 (Col4a2)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   306628 (Col4a2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 306628
NCBI-ProteinID: XP_038951185
RGD: 1308085
Ensembl: ENSRNOG00000023972
Position
16
AA seq 1707 aa
MDRVRFSASGLPLRGWLLLATVTVGLLAQSVLGGVKKSDVPCGGRDCSGGCQCFPEKGAR
GQPGEVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGPTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTRGDAGPQGPSGTGGFPGLPGPQGPKGQKGEPYALSKEDRD
KYRGEPGEPGSVGYQGPPGRPGPIGQMGPMGAPGRPGPPGPPGPKGQPGNRGLGFYGEKG
EKGDVGQPGPNGIPSDITLIGPTPSTYHPDMYKGEKGSQGEPGIPGITLKGEEGIMGFPG
TRGFPGLDGEKGVSGQKGSRGLDGFQGPSGPRGPKGERGELGPPGPPAYSPHPSLAKGAR
GDPGFQGAHGEPGSRGEPGDPGPVGPPGLSIGDEDSKRGLPGEMGPKGFSGEPGPSAYYP
GPPGADGKPGPQGLPGPAGPPGPDGFLFGLKGSEGRVGYPGPSGFPGGRGQKGWKGEAGD
CQCGQVIPGLPGLPGPKGFPGVNGEFGKKGDQGDPGLHGIPGFPGFKGAPGIAGAPGPKG
VKGDSRTITTKGERGQPGIPGVHGMKGDDGVPGRDGLDGFPGLPGPPGDGIKGPPGDAGL
PGTPGTKGFPGEVGPPGQGLPGPKGERGFPGDAGLPGPPGFPGPPGLPGTPGQADCDTGV
KRPIGGGQQVVIQPGCVEGPAGSPGQPGPPGPTGAKGIRGIPGFPGASGEQGLKGFPGDP
GREGFPGPPGFMGPRGSKGAPGLPGPDGPPGPIGLPGPAGPPGDRGIPGEVLGAQPGARG
DAGLPGQPGLKGFPGEIGAPGFRGSQGMPGMPGLKGQPGFPGPSGQPGLSGPPGQHGFPG
APGREGPLGLPGSPGLGGLPGDRGEPGEPGEPGPVGMKGVSGDRGDAGVSGERGHPGSPG
FKGMAGMPGIPGQKGDRGSPGMDGFQGMLGLKGRPGFPGIKGEAGFFGVPGLKGLPGEPG
VKGNRGDRGPPGPPPLILPGMKDIKGEKGDEGPMGLKGYLGLKGIQGMPGVPGLSGIPGL
PGRPGFIKGAKGDIGVPGTPGLPGFPGVSGPPGITGFPGFTGSRGEKGTPGVAGVFGETG
PTGDFGDIGDTVDLPGSPGLKGERGVTGIPGLKGLFGEKGAEGDVGFPGITGMAGAQGSP
GLKGQTGFPGLTGLQGPQGEPGRIGIPGDKGDFGWPGVPGRPGIPGIRGISGLHGLPGTK
GFPGSPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGAPGQKGEQGIPGERGPVGSPGLQ
GFPGISPPSNISGLPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGEK
GWVGDPGPQGQPGVHGLPGEKGPKGEQGFMGNTGPSGAVGDRGPKGPKGDQGFPGAPGSM
GSPGIPGIPQKIAVQPGTMGPQGRRGLPGALGEMGPQGPPGDPGFRGAPGKAGPQGRGGV
SAVPGFRGDQGPMGHQGPIGQEGEPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCP
VGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSY
WLSTTAPLPMMPVAEEEIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLWIGYSF
LMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYFANKYSFWLTTIPEQNFQ
STPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5124 nt   +upstreamnt  +downstreamnt
atggaccgagtgcggttctcagcctctggtcttcccctgagagggtggctgctgctggcg
actgtgacagtggggctcctggctcagagcgtcttggggggggtgaagaagtcggatgtg
ccctgcggagggagagactgcagtgggggctgccagtgcttcccggagaaaggagcacgg
ggtcagccaggggaagtgggcccccaggggtacaatgggcccccagggttgcaaggattc
ccaggactacagggccgcaaaggagacaagggtgaacggggagctcctggaccaactgga
ccaaaaggagatgtgggagcgagaggcgtctctggattccccggtgcagatggaattcct
gggcacccagggcaaggtggaccccgtggaagaccgggctacgatggttgcaatggaacc
agaggagacgccgggccacagggaccctctggaactgggggcttccctggcctccctggg
ccccaaggacccaaggggcagaaaggcgaaccttacgcactctcgaaagaggaccgtgac
aaatatcggggtgagcctggagagcctggctcggtcggctaccaggggcctcctggccgc
ccagggccaataggacagatgggtcccatgggagcacctggaagaccgggaccaccagga
ccccctggacccaaaggacaaccaggcaaccgaggactgggtttttacggagagaagggt
gaaaagggtgacgtaggacagccaggacccaatgggatcccatctgacatcacactcatt
gggcccacgccatcaacgtatcacccggatatgtacaagggtgaaaagggaagtcaagga
gagccagggatacccggcataaccttaaaaggcgaggaaggcatcatgggattcccagga
acacggggttttcctggccttgatggagaaaaaggagtctcaggacagaaaggaagcaga
ggcctggatggtttccaaggccccagtggaccccgaggacccaagggagaacggggagaa
ctaggacccccaggacctccagcctactcaccccatccatccctggcaaaaggtgccaga
ggtgacccaggattccaaggtgctcatggggagccaggaagccgaggcgagccaggagac
cctggacccgtggggcccccaggcctgtccattggagatgaagattcaaagagaggcctt
ccaggggagatgggacccaaaggcttctcgggagagccaggaccctcggcatactatccc
ggcccaccaggagctgatggaaaaccaggtccccaaggactccccgggcccgcaggccca
cctggaccagatggcttcctgtttggcctgaaaggatcagaagggagagtgggttacccc
gggccttcaggtttcccaggaggtcgagggcagaaaggatggaaaggtgaagctggagat
tgccagtgcggccaggtcatcccgggtctcccgggactgccgggacccaaaggttttcct
ggcgtcaatggggagtttggaaagaaaggtgaccaaggagacccaggcctgcacgggatc
cctgggttcccgggattcaagggagccccgggcatcgctggagctcctggacccaaagga
gtaaagggagactctagaacaatcactaccaaaggtgagcgaggacagccaggtatccct
ggtgtgcatgggatgaaaggtgatgacggtgtcccgggacgtgacgggctggatggattc
cctggcctgccgggtcctccgggcgatggcatcaaagggcctccaggggacgcaggtctc
cctggcacacctggaaccaagggcttcccaggagaagtgggtcctccaggacagggctta
cccggcccgaaaggtgaacgtggtttccctggagacgccggactgcctggacctccaggc
ttcccgggtcctccaggtctcccaggaacaccgggtcaggcagattgtgacacgggtgtg
aaaagacctattggaggcggccagcaagtggtcattcagccaggttgtgttgaaggccct
gcaggatcacctggccagccaggacccccaggtcccacaggtgccaaaggcatccgaggc
atcccaggattcccaggagcttctggagaacaagggctcaagggctttccaggagaccca
ggccgagaaggtttcccagggcccccagggttcatgggaccccgaggatccaaaggtgcc
ccaggcctccctggcccagatggacccccaggccccattggacttccagggccagctgga
ccccctggggacagaggaattccaggagaagtgctaggggcccagcctggggcccgggga
gatgctggactaccaggacagcctgggctaaaagggttcccaggagaaatcggagcacct
ggattcagaggaagccaagggatgcccgggatgccagggctcaaaggtcagccaggcttc
ccaggaccttcgggacagccaggactgtctgggcccccaggacaacatggattcccagga
gctcctggacgagaggggcctttggggctgccaggctcccctggccttggaggtttgcct
ggagacaggggtgagcctggtgagcctggtgagcctggccctgtgggcatgaaaggtgtg
tctggggacagaggcgacgccggtgtgtcaggcgagcgtggtcatcctggaagccctgga
tttaaaggaatggcggggatgcccggtatccctgggcagaaaggcgatcgagggtcacct
ggaatggacggcttccagggcatgttgggacttaagggaagacccggcttcccaggaatc
aagggagaggctggattcttcggagtccctggtttgaagggcctgcctggcgagccaggt
gttaaaggaaaccgaggggatcggggccccccaggaccacctcctctaatcctgccgggg
atgaaggacatcaagggagaaaagggagatgaagggccgatgggtctgaaaggctacttg
ggcttaaaaggcatccaaggaatgcctggggtccctgggctctctggaatccctgggcta
cctggaaggcctggcttcatcaaaggagccaagggagacatcggagtcccaggcacacca
ggtttaccaggatttcctggggtatctggcccccctggaattaccgggttcccaggattc
acaggaagccggggcgagaagggtactccaggagtagcaggggtttttggagagactggc
cccaccggggactttggtgatatcggagacactgtggacttaccaggaagcccaggcctg
aagggggaacggggagtcaccggaatcccaggtctcaagggactctttggagagaaagga
gctgagggtgacgttggatttcctgggataacaggcatggctggggctcaaggctctcct
ggacttaaaggacagacaggcttcccggggctcacagggctgcaagggccacagggagag
cccggacggatcggaatacctggtgacaagggtgatttcggctggccaggggtacctggt
agaccaggcatcccgggaatccgagggatcagcggattacatggcctgccaggcaccaaa
ggcttccctggatccccaggtgtcgatgcccatggtgacccagggttcccaggccccact
ggagacaggggtgaccggggagaggccaacacacttccaggccctgtaggagctccaggg
cagaaaggggagcagggaatcccaggggaacgtggcccagtcggaagcccaggacttcag
ggttttcctggtatttctccaccatccaacatctctgggttacctggtgatgtgggggca
ccaggaatatttggcttgcaaggctaccaaggccctccaggaccaccggggccaaatgca
cttcctggaatcaaaggagatgaggggagctcgggagctgcaggtttccccggagagaaa
ggatgggttggtgacccagggccccagggccagcctggtgttcacggtcttccaggggag
aaagggcccaagggtgagcaaggattcatgggcaacactgggccctctggggccgtgggt
gacagaggccccaaaggacccaaaggtgaccaaggattcccaggtgctcctggctctatg
ggatccccaggcatcccgggcatcccccagaagattgctgtccagcctgggaccatgggt
ccccagggcaggagaggccttcccggtgcactgggagagatggggcctcagggccctcct
ggagatccaggtttccgaggggcccccggcaaggccggtccccagggcagaggtggtgtg
tctgctgttccagggttccggggagaccaagggcccatgggacaccagggtccgattggc
caggaaggggagccaggccgtccagggagcccaggcctgccagggatgccaggccgcagc
gtaagcattggctacctcctggtgaagcacagccagacggaccaggaacccatgtgcccc
gtgggcatgaacaagctctggagtgggtacagcctgctgtatttcgagggccaggagaaa
gcacataatcaggacctggggctggcgggctcctgcctggcacgcttcagcaccatgccc
ttcctgtactgcaatccgggtgacgtctgctactacgccagccgcaacgacaagtcctac
tggctctccaccacggccccactgcccatgatgcccgtagctgaggaggaaatcaaaccc
tacatcagccgctgttcggtgtgtgaggctccggccgtggccattgctgtccacagccaa
gatgtctctataccccactgccccgctgggtggcggagtttgtggatcggatattcattc
ctcatgcacactgcagctggggacgaaggcggtggccagtcactggtgtcgccaggcagc
tgtctggaggacttccgtgcaacaccatttatcgagtgtaacgggggccgtgggacctgc
cattacttcgctaacaagtacagcttctggctgaccacaattcctgagcagaacttccag
agcacaccatccgctgacacgctcaaggccggcctcatccgcacccacatcagccgctgc
caagtgtgcatgaagaatctgtga

KEGG   Rattus norvegicus (rat): 307582
Entry
307582            CDS       T01003                                 

Gene name
Lama3
Definition
(RefSeq) laminin subunit alpha-3 precursor
  KO
K06240  laminin, alpha 3/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    307582 (Lama3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    307582 (Lama3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    307582 (Lama3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    307582 (Lama3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    307582 (Lama3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    307582 (Lama3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    307582 (Lama3)
   05145 Toxoplasmosis
    307582 (Lama3)
SSDB
Motif
Pfam: Laminin_I Laminin_G_2 Laminin_G_1 Laminin_II Laminin_G_3 Laminin_EGF SQS_PSY
Other DBs
NCBI-GeneID: 307582
NCBI-ProteinID: NP_775428
RGD: 628776
Position
18
AA seq 1725 aa
MPPTVRWSAWCTGWLWIFGAALGQCLGYGSEQQRVAFLQHPGQNHLQASYMELRPSQGCR
PGYYRDIKSFPAGRSVPCNCNGHSNRCQDGSGVCINCQHNTAGEHCERCKRGYYGSAIHG
SCRVCPCPHTNSFATGCAVDGGAVRCACKPGYTGAQCERCAPGYFGNPQKFGGSCQPCNC
NSNGQFGTCDPLTGDCVSQEPKDGSPAEECDDCDSCVMTLLNDLVPMGEELALVKSKLQG
LSVNTGSLEQIRHVEMQAKDLRNQLLGFRSAISSHGSQMDGLEKELSHLYQEFETLQEKA
QVNSRKAQTLYNNIDTTIQNAKELDMKIKNILTNVHILLKQIARPGGEGMDLPVGDWSRE
SAEAQRMLRELRGRDFKKHLQEAEAQKMEAQLLLNRIRTWLESHQVENNGLLKNIRDSLN
DYEAKLQDLRSVLQEAAAQGKQATGLNHENEGVLGAIQRQMKEMDSLKKYLTEHLATADA
SLLQTNSLLQRMDTSQKEYESLAAALNGARQELNDQVRELSRSGGKAPLVAEAEKHAQSL
QELAKQLEEIKRNTSGDESVRCVVDAATAYESILNAIRAAEDAAGKADSASESAFQTVIK
EDLPRRAKTLSSDSEELLNEAKMTRKRLQQEINPALNSLQQTLKTVSVQKDLLDANVTAV
RNDLRGIQRGDIDSVVSGAKSMVRKANGITSEVLDGLSPIQTDLGRIKDSYGSTRHEDFN
KALIDANNSVKKLTKKLPDLFVKIESINQQLLPLGNISDNVDRIRELITQARDAANKVAI
PMRFNGKSGVEVRLPNDLEDLKGYTSLSLFLQRPDLRENGGTEDMFVMYLGNKDASKDYI
GMAVVDGQLTCVYNLGDREAEVQIDQVLTESESQEAVMDRVKFQRIYQFAKLNYTKEATS
NKPKAPAVYDLEGGSSNTLLNLDPEDAVFYVGGYPPDFELPSRLRFPPYKGCIELDDLNE
NVLSLYNFKTTFNLNTTEVEPCRRRKEESDKNYFEGTGYARIPTQPNAPFPNFIQTIQTT
VDRGLLFFAENQDNFISLNIEDGNLMVRYKLNSEPPKEKGIRDTINDGKDHSILITIGKL
QKRMWINVNERSVRIEGEIFDFSTYYLGGIPIAIRERFNISTPAFQGCMKNLKKTSGVVR
LNDTVGVTKKCSEDWKLVRTASFSRGGQMSFTNLDVPSTDRFQLSFGFQTFQPSGTLLNH
QTRTSSLLVTLEDGHIELSTRDSNIPIFKSPGTYMDGLLHHVSVISDTSGLRLLIDDQVL
RRNQRLPSFSNAQQSLRLGGGHFEGCISNVLVQRFSQSPEVLDLASKSTKKDASLGGCSL
NKPPFLMLFKSPKRFNKGRIFNVNQLMQDAPQATRSTEAWQDGRSCLPPLNTKASHRALQ
FGDSPTSHLLLKLPQELLKPRSQFSLDIQTTSPKGLVFYAGTKDSFLALYVADGRVVFAL
GAGGKKLRLRSKERYHDGKWHTVVFGLNGGKARLVVDGLRAQEGSLPGNSTISPREQVYL
GLPLSRKPKSLPQHSFVGCLRDFQLNSKPLDSPSARFGVSPCLGGSLEKGIYFSQGGGHV
ILANSVSLGPELKLTFSIRPRSLTGVLIHVGSQSGQRLSVYMEAGKVTTSVSSDAGGSVT
SITPKQSLCDGQWHSVAVSIKQRILHLELDTDSSYTVAPLSFSPNSTRGSLHVGGVPDKL
KMLTLPVWNSFFGCLKNIQVNHVPVPITEATEVQGSVSLNGCPDH
NT seq 5178 nt   +upstreamnt  +downstreamnt
atgccgcctacagtgaggtggtcagcctggtgcacaggatggctgtggatctttggggca
gctctgggccagtgcctggggtatggctcagagcagcaaagggtagcatttcttcagcat
ccagggcaaaaccatctgcaagcaagttatatggagcttagacccagccagggctgtcgc
ccaggatactatcgagacatcaaaagcttccctgcgggaaggtctgttccctgcaattgc
aacggacattcaaatagatgccaagacggctcgggagtgtgcattaactgtcagcacaac
acagctggggagcactgtgagcgttgcaagaggggttactatggaagcgccatccatgga
tcctgcagggtttgcccctgtcctcacaccaacagctttgccactggctgtgctgtggat
ggaggagctgtgaggtgtgcctgcaaacccggatacacaggagcacagtgtgagaggtgt
gcaccaggatattttgggaacccccagaaatttggaggtagctgccaaccatgcaattgc
aacagtaatggccagtttggcacttgtgatcccctaactggagactgtgtaagccaagaa
cccaaagatggcagccctgcagaagaatgtgatgactgtgacagctgtgtgatgactctc
ctaaatgacttggtccccatgggtgaggaactcgccctggtgaaatcaaaacttcagggg
ctgagtgtgaacactggttctctggaacagatccggcatgtggagatgcaggccaaggac
ctgaggaaccagctgcttggcttccgttccgccatctccagtcacgggtcccaaatggac
ggcctggaaaaagaactcagtcatttgtaccaggaattcgaaactttgcaagaaaaggcg
caggtcaattccagaaaagcacaaacattatataacaacatcgatacgacaatccaaaac
gccaaagagttggacatgaagattaaaaacatacttacgaatgtgcacattctcctgaag
cagatcgctcggccaggtggagaaggaatggacttgccggtgggcgactggtccagggag
tcggcggaagctcagcgcatgttgcgggagctgcgaggccgagactttaaaaagcacctc
caagaagcagaggcccagaaaatggaagcccagctcttactgaaccgaatcaggacctgg
ctggaatcccaccaggtggagaacaatggactgctaaagaatattcgggattcattaaat
gattatgaagccaaacttcaggacctgcgttccgtgcttcaggaggcggcagcccaggga
aagcaggctacaggcctcaaccacgaaaatgagggggtcctaggagccatccagagacaa
atgaaggaaatggattccctgaagaagtacctcaccgagcacctggccacagcagacgct
tccctgctgcaaaccaacagtctactgcagcggatggacacgagccagaaggagtatgaa
agcttagctgctgctttaaacggagcaagacaggaactgaatgaccaagtgcgggaactc
tccagatccggaggcaaagcacccctggtggctgaggccgagaagcacgctcagtcttta
caggagctggcaaagcagctggaagagataaagagaaacaccagtggggatgagtcggtg
cgctgtgtcgtggacgctgccactgcctatgagagcatcctcaacgccatccgagcagca
gaggatgcagccggcaaggccgacagtgcctcagagtccgccttccagacagtgataaag
gaagatcttccgagaagagccaaaaccctgagttctgacagcgaggaactgttaaacgag
gccaagatgacacggaaaaggctacagcaagaaatcaatccagctctcaacagcctacag
caaaccctgaagactgtatcagttcagaaggacctgctagatgccaatgtcactgctgtc
cgtaatgaccttcgtgggatccagagaggtgatattgacagtgtggtgagtggagcgaag
agcatggtcaggaaagccaatgggataacgagcgaggtcctggacgggctcagccccatc
cagacggatttgggaaggattaaggacagctacgggagcacacggcatgaggacttcaac
aaagctctgattgacgccaataactcagtaaagaaattaaccaagaagttgcctgatctt
tttgtcaagattgaaagcatcaatcaacagttgctgcccctgggaaacatctctgacaat
gtagaccgaatccgagagctcattacgcaggccagagatgctgcgaacaaggttgcaatt
cccatgaggttcaatggtaaatctggtgttgaagtccgtctgccaaatgacctagaagac
ttgaagggatacacgtctctgtctttgttcctccaaagaccagacttaagagagaatgga
ggcactgaggacatgtttgtaatgtaccttggaaacaaggatgcctccaaggactacatc
ggcatggcggttgtagatggccagctgacgtgtgtctacaacctgggggaccgagaagct
gaagttcagatcgatcaggtcctgacggagagtgagtctcaggaggcagttatggaccgg
gtgaagttccagagaatatatcaatttgccaagcttaattacaccaaagaagccacgtcc
aataaacccaaagctcccgcggtctacgacctggagggtggcagtagcaacacgctcctt
aatttggatcccgaggacgctgtgttttatgtcggaggttacccaccggattttgaactt
cctagcagactgcggttccctccatacaaaggctgtatcgaactagatgacctcaatgaa
aacgttctaagcttgtacaatttcaagacaactttcaatctcaacaccacggaggtggag
ccttgtaggaggagaaaggaagagtcagacaaaaattactttgaaggtacaggctatgct
cgcatccctactcaaccaaatgctcccttcccaaacttcatacagaccatccagactact
gtggacagaggtttactgttcttcgcagaaaaccaggataacttcatatctctgaacata
gaagatggcaatctcatggtgagatacaaactaaattcagagccacccaaagagaaggga
attcgagacaccatcaacgatgggaaagatcattcgatcttaatcacaattggaaaacta
caaaaacgcatgtggataaatgtgaacgaacgcagtgtacgaatcgaaggggaaatattt
gatttcagcacatattatttgggcggaattccaattgcaatcagagaaaggtttaacatc
tcaacgcctgctttccaaggctgcatgaagaatctgaagaaaaccagtggggttgtcagg
ttgaatgatactgtgggtgtaaccaagaagtgctcagaagactggaagcttgtgcgaacc
gcctcgttctccagaggagggcagatgagctttacaaacttggacgtgccctcgactgac
cgcttccagctctcctttgggtttcagacctttcaacccagtggcacactgctcaatcat
cagacgcggacaagcagcctgctggtcaccctggaagatgggcacattgagttgagcact
agggacagcaacatcccaattttcaagtctccagggacctacatggacggtttactgcat
catgtatctgtaataagtgacacctcaggtctccgccttctcatcgatgaccaggtcctg
agaaggaaccagaggcttcctagcttctctaacgcccagcagtcgctccgccttggagga
ggtcatttcgagggttgtatcagcaatgttttagtccaaaggttttcacagagtccagaa
gtcctggatctggccagtaaatctaccaagaaggatgcatccctaggaggctgcagttta
aacaagccaccttttcttatgttgtttaaaagtcccaagagatttaacaagggccggatt
ttcaatgttaatcagctgatgcaagatgcacctcaggccacaaggagcacagaggcttgg
caagatgggaggtcctgcctaccacctctgaacaccaaggcctctcacagagccctgcag
tttggagacagccccaccagccacttgctactcaagcttccccaggaactgctgaaacct
aggtcacagttttctttagacatacagacaacttcccccaaaggactggtgttttacgca
ggcaccaaggactccttcctggctctttatgtcgcagatggccgtgttgtctttgctttg
ggggcaggagggaagaaactgagactcaggagcaaggagagataccatgacgggaagtgg
cacacggtggtgttcggactaaatggaggaaaggcacgcctggttgtggatgggctaagg
gcccaggaaggcagtttgcctggaaattctaccatcagccccagagaacaggtttaccta
gggttgccgctatcaagaaagccaaagagcctaccccagcacagttttgtggggtgcctg
agagatttccagttgaactcgaaacccctggattctccttctgcgaggtttggggtatct
ccctgcttgggtggctctttagagaaaggcatttatttctcccaaggaggaggccatgtg
atcctagccaattctgtgtccttggggccagagcttaagctcactttcagcattcgccca
cggagtctcactggggtcttaatacacgtcggaagtcaatctggacagcgcttaagtgtg
tacatggaggcaggaaaggtcacaacctctgtgagcagtgatgcaggaggaagtgtgaca
tcaattacaccgaagcagtctctgtgtgatggacagtggcactcggtggcagtctccatt
aaacagcgcatcctgcatctagaactggatacagacagtagctacacagtcgcaccactt
tccttctcaccaaacagcacccgagggtcactgcacgtcggaggtgtcccagacaaattg
aaaatgcttacactccctgtgtggaactcattttttggctgtctgaagaatattcaagtc
aaccatgtccctgtccccatcacagaagccacagaagtccaaggttctgtcagcctgaat
ggctgccctgaccactaa

KEGG   Rattus norvegicus (rat): 309368
Entry
309368            CDS       T01003                                 

Gene name
Lama2
Definition
(RefSeq) laminin subunit alpha-2 isoform X1
  KO
K05637  laminin, alpha 1/2
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
rno05410  Hypertrophic cardiomyopathy
rno05412  Arrhythmogenic right ventricular cardiomyopathy
rno05414  Dilated cardiomyopathy
rno05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    309368 (Lama2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    309368 (Lama2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    309368 (Lama2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    309368 (Lama2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    309368 (Lama2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    309368 (Lama2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    309368 (Lama2)
   05145 Toxoplasmosis
    309368 (Lama2)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    309368 (Lama2)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    309368 (Lama2)
   05414 Dilated cardiomyopathy
    309368 (Lama2)
   05416 Viral myocarditis
    309368 (Lama2)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_B Laminin_II Laminin_G_3 Arfaptin Ribosomal_S15
Other DBs
NCBI-GeneID: 309368
NCBI-ProteinID: XP_038954896
RGD: 1308889
Ensembl: ENSRNOG00000011134
Position
1
AA seq 3121 aa
MPAAGGILLLLLLGILEGSQTQRPQQRRQSPAHQQRGLFPAVLNLASNALITTNATCGEK
GPEMYCKLVEHVPGQPVRNPQCRICNQNSSHPYQRHPITNAIDGKNTWWQSPSIKNGVEY
HYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDEEYKPWQYHAVTDTECLTLYN
IYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARYI
RLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDPA
TNKSRCECEHNTCGESCDRCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDETVAS
RNLSLNIHGKYIGGGVCINCTHNTAGINCETCIDGFFRPKGVSPSYPRPCQPCHCDPAGS
LSEVCIKDEKHTRRGLKPGSCYCKPGFGGMNCDRCVRGYRGYPDCQPCNCSGLGSTNQDP
CIGPCSCKENVEGEDCSHCKFGFFNLQEENQKGCEECFCSGVSDRCQSSYWTYGNIQDMH
GWYLTDLSGRIQVAPQLDNPDAPQQISVSNSEARRSLPDSYYWSAPAPYLRNKLPAVGGQ
LSFTISYDLEEEEDDTEKILQLMIILEGNNLRISTAYKEVHLEPSEEHVEELSLKEESFT
IHGTNLPVSRKDFMILLTNLERVLIQTTYNLGMDVIFRLSSVNLESAVPYPTDRSIASAV
EVCQCPPGYSGSSCEACWPRHRRVNGTIFGGLCEPCQCFGHAEACNDVTGECLSCKDHTG
GPYCNECLPGFYGDPTRGTSEDCQPCACPLNIPSNNFSPTCHLDRSLGLICDECPVGYTG
PRCERCAEGYFGQPSIPGGSCQPCLCNDNLDFSIPGSCDSLSGSCLICKPGTTGRFCELC
ADGYFGDAVDAKNCQPCRCNVNGSFSEICHRKTGQCECRPNVQGQRCDECKPETFGLQLG
RGCIPCNCNSFGSKSFDCEASGQCWCQPGVAGKKCDRCAHGYFNFQEGGCTACDCSHLGN
NCNPKTGQCICPPNTIGEKCSECLPNTWGHSIVTGCKVCNCSTVGSLSSQCNINTGQCSC
HPKFSGMKCSECNRGHWNYPLCSLCDCFLPGTDATTCDSETRKCSCSDQTGQCTCKVNVE
GVHCDRCRPGKFGLEAKNPLGCSSCYCFGVTSQCSEAKGLVRTWVTLRDEQTILPLVDEA
LQHTTTKGIAFQNPEIVAKMDEVRQDLHLEPFYWKLPEQFEGKKLMAYGGKLKYTIYFEA
RDETGFATYKPQVIIRGGTPTHARIITRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRIS
RTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEGAEPGQVSAASPLANLIERCD
CPPGYSGLSCETCAPGFYRVHSEPGGRTPGPTLGTCVPCQCHGHSSQCDPETSVCQNCQH
HTAGDFCERCALGYYGIVRGLPNDCQPCACPLISPSNNFSPSCVLEGLEDYRCTACPRGY
EGQYCERCAPGYAGSPSSPGGSCQECECDPYGSLPVPCDPVTGLCTCRPGATGRKCDGCE
HGHAREGAECVFCGDECTGLLLGDLAHLEQMAMNINLTGPLPAPYKILYGLENATQELKH
LLSPQRAPERLLQLAEGNMNTLVMEMNELLTRATKVTADGEQTGQDAERTNSRAQSLEEF
IKGLVQHAEAVNEKAIKLNETLGSQDRTAERNLQELQKEIDRMLKELRSKDLQTQKEVAE
DELVAAEGLLKRVNKLFGEPRAQNEEMEKDLREKLAEYQNKLDDAWDLLREATKKTKDAD
RLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLAGEITSVIDYVEDIKT
KLPPMSEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKNISFNA
TAAFRAYSNIKDYIDEAEKVAREAKELAHEATKLATSPQGLLKEDAKGSLQKSFRILNEA
KKLANDVKENHNDLSGLKTRLETADLRNSGLLGALNDTMDKLSAIPNDTAAKLQAVKEKA
REANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKIIADAGATVRN
LEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYKPE
IKKGSYNNIVVHVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPD
LTIDDSYWYRIEASRTGRNGSISVRALDGPKASMVPSTHHSVSPPGYTILDVDANAMLFV
GGLTGKIKKADAVRVTTFTGCMGETYFDNKPIGLWNFREKEGDCKGCTVSPQVEDSEGTI
QFDGEGYALVSRPIRWYPNISTVMFKFRTFSSNALLMYLATRDLKDFMSVELSDGHVKVS
YDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDSNQEENIATSSSGNNFGLD
LKADDKIYFGGLPTLRNLSMKARPEVNVKKYSGCLKDIEISRTPYNILSSPNYVGVTKGC
SLENVYTVSFPKPGFVELAAVSIDVGTEINLSFSTRNESGIILLGSGGTLTPPRRKRRQT
TQAYYAIFLNKGRLEVHLSSGTRTMRKIVIKPEPNLFHDGREHSVHVERTRGVFTVQVDE
NRRHMQNLTEEQPIEVKKLFVGGAPPEFQPSPLRNIPAFQGCVWNLVINSIPMDFAQPIA
FKNADIGRCAYQKPREEEDDAVPAEVTVQPQPVPTPAFPFPAPTMVHGPCVAESEPAFLS
GSKQFGLSRNSHIALAFDDTKVKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRNG
FPYFSYDLGSGDTSTMIPTRINDGQWHKIKITRVKQEGILSVDDASNQTISPKKADILDV
VGILYVGGLPINYTTRRIGPVTYSLDGCVRNLYMEQAPVDLDQPTSSFHVGTCFANAEKG
TYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRPTGVLLGVSSQKMDGMGIEMIDEKLMFHV
DNGAGRFTAVYDAGSPGHMCDGRWHKVTAKKIKNRLELVVDGNQVDAQSPNAASTSADTN
DPVFVGGFPDGLNQFGLTTNVRFRGCIRSLKLTKGTGKPLEVNFAKALELRGVQPVSCPT
T
NT seq 9366 nt   +upstreamnt  +downstreamnt
atgcctgcggccggcgggatcctcctgctcctgctcttggggatacttgaaggctcccag
actcagcggccacagcagcggcgacagtctccagctcatcaacagagaggtttatttcct
gctgtcctgaatcttgcttcgaatgcgctcatcacaaccaatgctacatgtggggaaaaa
ggacctgagatgtactgcaagttggtggaacatgtccctgggcagcctgtcaggaatccc
cagtgtcgaatctgcaatcagaacagcagccatccataccagaggcacccgatcacgaac
gctattgatggcaagaacacttggtggcagagccctagtatcaagaacggagtggaatac
cattatgtgacgattaccctggatttacagcaggtgttccagattgcctatgtaattgtg
aaagcagccaattcccctcggcctggaaactggattttggagcgttccctggatgacgag
gagtacaaaccctggcagtatcatgcagtgacagacacggagtgcctgaccctctacaat
atctatccccgcactgggccaccatcctatgccaaagacgatgaggtcatctgcacttca
ttttattccaagatccaccctttagaaaatggagagattcacatttctttgatcaatggg
agaccaagtgctgacgacccctcccctgaactgctggaattcacctccgctcgctatatt
cgcctgaggtttcagcggatccgcaccttgaatgctgacttgatgatgtttgctcacaaa
gaccccagagaaatcgaccccattgtcacgcgaagatattactattctgtcaaggatatt
tcagtcggcgggatgtgcatctgttatggtcatgcccgggcttgtccacttgatcctgca
accaataaatcgcgctgtgagtgtgaacataacacctgtggggagagctgtgacaggtgc
tgtccaggattccatcagaagccatggagagctgggaccttcttaaccaagactgaatgt
gaagcttgcaattgtcacggaaaagctgaagaatgctactatgatgaaactgttgccagc
agaaatctaagtctaaatatacatgggaaatacattggagggggtgtttgcatcaactgc
acacacaacactgctggaataaattgtgagacgtgtattgatggattcttcagaccaaaa
ggggtgtcaccaagttatccaagaccgtgccagccatgtcactgcgatccagctggctcc
cttagtgaagtctgtatcaaggatgagaaacacacccggcgagggttgaaacctgggtcc
tgttactgcaaacctggctttggaggcatgaactgtgatcgctgtgtcaggggttaccgt
ggttatccagactgtcaaccctgtaactgcagtggcttggggagcacaaaccaggaccct
tgtattgggccctgtagctgtaaggagaacgttgaaggtgaagactgcagccattgcaaa
tttggcttcttcaacttgcaagaggaaaatcagaaaggctgtgaggagtgtttctgttca
ggggtgtcagacagatgtcagagctcctactggacctatgggaatattcaagacatgcac
ggctggtacctcacagacctctctggccgcattcaggtggctccccagcttgataaccct
gacgcaccccagcagatcagcgtcagtaattctgaggcccggcgatccctgccagacagt
tactactggagtgccccagcgccatatctgagaaacaaactcccagcagttggaggacag
ttgtcatttaccatctcatatgacctcgaagaggaggaagatgatacagaaaaaatcctt
cagctgatgattatcttagagggaaataacttgagaatcagcacagcctataaggaggta
cacttagagccatctgaagaacatgttgaggaattatcactcaaggaagagtcctttacc
atacatggaacaaatctgcccgtcagtagaaaggacttcatgatcttgctcacaaatttg
gagagagtcctcatccaaaccacatacaacttagggatggacgtcatcttcaggctgagc
tctgtgaatcttgaatctgccgtcccctatcctactgatagaagtatcgcgtctgccgtg
gaagtttgccagtgtccacccgggtacagtggcagctcttgtgaagcgtgttggcctcgg
caccgaagagttaatggcactattttcggtggcctctgtgaaccatgtcagtgctttggt
cacgcggaagcctgtaatgacgtcacaggagaatgtctgagctgtaaggatcacactggt
ggaccatactgcaacgaatgcctccctggtttctatggtgatcctactcgaggaacctct
gaagactgccagccctgtgcctgtccactcaatatcccatcaaataactttagtccaaca
tgccatttagaccggagtctgggattgatctgtgacgagtgtcctgtcgggtacacagga
ccacgctgtgagaggtgtgcggaaggctattttggacaaccttccatacctggaggatca
tgtcagccatgcctgtgcaatgacaaccttgacttctccatccctggcagctgtgacagc
ctgtctggctcctgtctgatttgtaagccaggtacaacaggccggttctgtgagctctgt
gctgacgggtattttggagatgcagttgatgcaaagaactgtcaaccgtgccgttgtaac
gtcaatggctccttctcagagatttgtcacaggaaaacgggacaatgtgagtgcagaccc
aatgtgcaggggcagcgctgtgacgagtgtaagcctgaaacctttggcctgcaactggga
aggggatgtattccctgcaactgcaattctttcggctctaaatcctttgactgtgaagcg
agtggacaatgctggtgccagcccggagtagcagggaagaaatgtgaccgttgtgcccac
ggctacttcaacttccaagaaggaggctgcacagcttgtgactgttctcaccttggcaac
aattgtaacccaaaaactggtcaatgcatttgccctcccaatacaattggagaaaagtgt
tctgagtgcctacctaacacttggggccacagcattgtcactggctgtaaggtttgtaac
tgcagcacagtggggtccttgtcttcacaatgcaacataaacacaggccagtgcagctgt
catccaaaattctctggtatgaaatgctcagagtgcaaccgaggtcactggaactatcct
ctctgcagtctctgtgactgctttcttccagggacagatgccacgacttgtgattcagag
actaggaaatgctcctgtagcgatcaaaccggacagtgcacctgtaaggtgaacgtggaa
ggtgtccactgtgacaggtgccggcctggcaaatttggactagaggccaagaatccactt
ggctgcagcagctgctactgcttcggagtcaccagtcaatgctctgaagcaaagggactg
gtccgtacgtgggtgactctgcgtgatgaacagaccattctgcctctggtggatgaggcc
ctgcagcacacaactaccaaaggcattgctttccagaacccagagattgttgcaaagatg
gatgaagtcaggcaagatctccatttggaacctttctactggaaactcccagaacaattt
gaagggaaaaagttgatggcctatgggggcaaactcaagtataccatctattttgaggcc
cgggatgagacaggctttgccacatataaacctcaagttatcattcgaggtggaactcct
actcatgctagaattattaccaggcacatggctgcccctctcattggccagttgacacgg
catgaaattgaaatgacagagaaagaatggaaatattatggtgatgatcctcgaatcagt
agaactgtgacccgggaagacttcttggatatactgtatgatattcactatattctgatc
aaggctacttatggcaatgttgtgagacaaagccgcatttctgaaatctccatggaagga
gctgaaccaggacaggtgtcagcggcaagcccactggcaaacctgatagaaagatgtgac
tgccctcctggctattccggcttgtcttgtgagacgtgtgcaccaggattttaccgagtg
cattctgaaccaggtggccgaactcccggaccaaccttaggaacctgtgttccctgtcag
tgtcatggacacagtagccagtgtgatcctgagacctcagtatgccagaattgtcagcat
cacaccgccggtgacttctgtgaacggtgtgccctcggctactatggaattgtcagggga
ctgccaaatgactgccaaccatgcgcttgtcctctgatttcgcccagcaacaatttcagc
ccgtcttgtgtactagaaggtctggaagattatcgctgcacagcctgcccaagggggtat
gaaggacagtactgtgaaaggtgtgccccaggctacgctggcagcccaagcagccctgga
ggctcctgccaagaatgtgagtgtgacccttatggctccctgccagttccctgtgacccg
gtcacgggactctgcacgtgccgtcctggagccacaggaaggaagtgtgatggctgcgag
cacgggcatgcgcgcgagggcgcagagtgtgtcttttgtggagatgagtgtacaggcctt
cttcttggtgacctggctcatctagagcagatggccatgaacatcaacctcaccggccca
ctgcctgctccgtataaaattctgtatggtcttgaaaatgcaactcaggagctcaagcac
ctgctctcgccgcaacgggcaccagagaggctccttcagttggcagagggcaacatgaac
acactcgtgatggaaatgaatgagcttctgaccagagcaaccaaagtgacagcagatggt
gagcaaacgggacaagacgctgagaggaccaacagcagagcacagtccttggaagaattc
attaaagggcttgtccagcacgctgaagctgtaaatgaaaaagctataaaactaaatgag
accttaggaagtcaagacaggacagcagagagaaacttacaagagcttcagaaggagatt
gaccggatgctgaaagaacttcgaagtaaagatcttcagacacagaaggaagttgctgag
gatgaactggtggcagcagaaggccttctgaagagagtaaacaagttgtttggagagccc
agagcccagaatgaagaaatggaaaaggatctccgtgagaaactggcagagtaccagaac
aagcttgatgatgcatgggatctattgagagaagccaccaagaaaactaaagatgctgat
cgcttgtctgctgccaaccaaaagaatatgaccatcttggagacaaagaaggaggctatt
gaaggtagcaagcgacaaatagagaacactttaaaggaaggcaatgacatacttgatgaa
gccaaccgactcgcgggcgaaattacctcagtcatagattatgttgaagacattaaaact
aagttgccaccaatgtctgaggagctgagtgacaaaatagatgacctcgcccaggaaata
aaggacagaaggcttgctgagaaggtgttccaggctgagagccacgctgctcagctgaac
gactcgtctgctgtgcttgatggaatcctggatgaggctaagaacatctccttcaatgcc
acggcggccttcagagcttacagtaatattaaagactacatcgatgaagctgagaaagtg
gccagagaagccaaagagcttgctcacgaggctacaaaactggcaacaagtccccagggc
ttgttaaaggaagatgccaaaggctcccttcagaaaagtttcaggatcctcaacgaagcc
aagaagctggcaaatgatgtgaaagaaaatcacaatgatctaagtggcctgaaaaccagg
ttagaaactgctgaccttagaaacagtggactcctaggagctttaaatgacaccatggac
aagttatcagccattccgaatgacacggctgctaaactgcaggctgtcaaagagaaagcc
agagaagccaatgacaccgcaaaagctgtcctggcccaggttaaagatctgcatcagaac
ctagatggcctgaagcaaaactacaataaactagcagacagcgtggccaaaacgaatgct
gtggtgaaagatccttccaaaaacaaaatcattgcagacgcaggtgctactgtgagaaat
ctagaacaggaagctgatcggctaatagacaaactcaagcccatcaaggaacttgaggac
aacctaaagaaaaacatttcggaaataaaggaactgatcaaccaagcccggaaacaagct
aactctatcaaagtatctgtgtcttcgggaggtgactgtattcgaacatacaagccagaa
atcaagaaaggaagctacaataatatcgttgtccatgtcaagaccgctgttgctgacaac
ctccttttctaccttgggagtgcaaaatttattgactttctggctatagaaatgcgtaaa
gggaaagtcagcttcctctgggatgttggctctggagttggccgagtagagtatccggac
ttgaccatcgacgactcatattggtaccgtattgaagcatcaagaacggggagaaatgga
tctatttctgtgagagctttggatgggcccaaagccagtatggtgcccagcacccaccat
tcagtatctcctcctgggtacactatcctagatgtggatgcaaatgcaatgctgtttgtt
ggcggcctgactggaaaaattaagaaggctgatgctgtacgtgtgactaccttcaccggt
tgtatgggagaaacatactttgacaacaagcctataggtttatggaacttccgggagaaa
gaaggtgactgcaaaggatgtactgtcagcccgcaagtggaagacagtgaagggactatt
cagtttgatggagaagggtatgcgttggtgagccgtcccatccggtggtaccccaacatc
tccacagtcatgttcaagttccggacattttcatcaaatgctctcctgatgtatcttgcc
acacgagacctgaaagatttcatgagtgtagagctcagtgatggacacgtgaaagtcagc
tatgacctgggttcaggaatggcttccgttgtcagcaatcaaaaccataatgacgggaaa
tggaaatcattcacgctgtcaaggattcagaaacaagcgaacatatcaattgtagacatt
gattctaaccaggaggagaatatagctacttcatcttctggaaacaactttggtcttgac
ttgaaagcagatgacaaaatatattttggtggcctgccaactctgagaaacttgagtatg
aaagcaaggccagaagtcaatgtgaagaaatactccggctgcctcaaagacattgaaatt
tcaaggactccttacaatatcctcagcagccccaattatgttggtgtgaccaaaggctgt
tcactggagaatgtttacacagttagtttccccaagcctggttttgtagagcttgctgct
gtgtccattgatgtcggaacggaaatcaacctgtccttcagtaccaggaacgagtctggg
atcattctcttgggaagtggagggacactcacaccacccaggagaaaacggagacaaact
acacaggcttattatgccatattcctcaacaagggccgtctggaagtgcatctctcctcg
gggacacgaacgatgaggaaaattgtcatcaaacccgagccaaatttgtttcatgatggg
agagaacattctgtccacgtagaaagaaccagaggagtcttcactgttcaagttgatgaa
aacagaagacatatgcaaaacctgacagaggagcagcccatcgaagtgaaaaagctcttt
gttgggggtgctcctcctgaattccagccctccccactcagaaatatcccagcttttcaa
ggctgtgtgtggaaccttgttattaactccatccccatggactttgcacagcctatagct
ttcaaaaatgctgatattggccgctgtgcctatcaaaagccccgggaagaagaagacgat
gccgttccagctgaagttactgtccagcctcagccagtgcctacccctgccttccctttc
ccggcccccaccatggtacatggcccgtgtgttgcagaatcagaaccagcttttctgtcg
gggagcaagcagttcgggctttccagaaacagccacattgctctcgcttttgatgacacc
aaagttaaaaaccgcctcaccattgagctggaagtgcgaactgaggctgaatcaggctta
ctcttttacatggctcggatcaatcatgcagattttgctactgttcagctgaggaatggc
ttcccctacttcagttatgatctggggagcggggacacaagcaccatgatccccacccga
atcaacgatggccagtggcacaagattaagattacgagagtgaagcaggagggaattctt
tctgtagatgacgcctccaaccaaaccatcagtcccaagaaagctgacatcctggatgtt
gtggggattctgtatgttggtggcttgccgatcaactataccacacgcagaattggtcca
gtaacctacagcctagatggctgtgttaggaatctttacatggaacaagcccctgttgat
ctggaccagcctacctccagttttcacgtcgggacatgctttgccaatgcagagaagggg
acttactttgatggaactggttttgctaaagcagttggtggcttcaaagttggattggac
cttcttgtagaatttgaattccgtaccacaagacccactggagtcctcctgggagtcagc
agtcagaagatggatggaatgggtattgaaatgattgacgagaagcttatgttccatgtg
gataatggcgccggccgattcactgcggtctacgatgctgggagcccaggccatatgtgc
gatggacgatggcataaagtcactgccaagaagatcaaaaaccgccttgagctggtggta
gatgggaaccaggtggatgcccagagcccaaatgcagcctccacatcagcagatacaaac
gaccctgtttttgttggcggtttcccagatggcctcaatcagtttggcctgaccaccaac
gttaggttccgaggctgcatccgatctctgaagctcaccaaagggacaggcaagccgctg
gaggttaattttgccaaggctctggaactgaggggtgttcaacctgtatcatgcccaact
acctaa

KEGG   Rattus norvegicus (rat): 309816
Entry
309816            CDS       T01003                                 

Gene name
Lama4
Definition
(RefSeq) laminin subunit alpha-4 precursor
  KO
K06241  laminin, alpha 4
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05143  African trypanosomiasis
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    309816 (Lama4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    309816 (Lama4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    309816 (Lama4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    309816 (Lama4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    309816 (Lama4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    309816 (Lama4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    309816 (Lama4)
   05145 Toxoplasmosis
    309816 (Lama4)
   05143 African trypanosomiasis
    309816 (Lama4)
SSDB
Motif
Pfam: Laminin_G_2 Laminin_G_1 Laminin_I Laminin_II Laminin_EGF Laminin_G_3 Sec34
Other DBs
NCBI-GeneID: 309816
NCBI-ProteinID: NP_001296376
RGD: 1560062
Ensembl: ENSRNOG00000000599
Position
20
AA seq 1817 aa
MAWNTAWCSVLALWLLWGAVCSNAASGDGNAFPFDIEGSAVVGRQDPSETSDSRVTMGRL
PPAAERCDAGFFRTSSGECSPCDCNGNSHECLDGSGFCLHCQRNTTGEHCERCLAGYIGD
SIRGAPRFCQPCPCPLPHLANFAESCFRKNGAVRCMCKENYAGPNCERCAPGYYGNPLLI
GSTCKKCDCSGNSDPNLIFEDCDEITGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAV
CNCGGGPCDSVTGECLEEGFELPTGCDKCVWDLTDDLRLAALSIEESKSGLLSVSSGAAA
HRHVTDMNSTIHLLKTRLSERENQYTLRKIQINNSENTLRSLLSDVEELDEKGSRASRKG
TLVEKESTDTIDQATQLVEQAHSMRDKIQEINSKMLYYGENQELGPEEIAEKLVLAQKML
DEIRSRQPFLTHRELVDEEADEVQELLSQAENWQRLHNDTRSLLPVVLEQLDDYNAKLSD
LQESINQALDHVRDAEDMNRAITSKQRDHEKQHERVKEQMEAVGTSLSLSADSLITPRLT
LEELDEIIKNASGIYAEIDGAKNELQGKLSNLSNLSHDLVQEAVDHAYNLQQEANELSRN
LHSSDMNGLVQKALDASNVYENIANYVSEANETAELALNITDRIYDAVSGIDTQIVYHKD
ESDNLLNQARELQAKADSSSDEAVADTSRRVGGALWRKGALRDKLTEAVKQLQAAERGDA
HQRLGQTKLIIEEANKTTMAVQQVTTPMANNLSNWSQNLQTFDSSAYNTAVNSARDAVRN
LTEVVPRLLGQLRTVEQKRPASNVSASIQRVRELIAQTRSVASKIQVSMLFDGHSAVEVH
PKVSVDDLKAFTSISLYMKPPPKPAEPTGAWVEDQFVLYLGSKTAKKEYMGLAIKNDNLV
YVYNLGTKDVEILLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGE
FAGDDSLLDLTPEDTVFYVGGVPANFKLPASLNLPSYSGCLELATLNNDVISLYNFKHIY
NMDPAKSVPCARDKLAFTQSRAASYFFDGSSYAVVRDITRRGKFGQVTRFDIEIRTPADN
GLVLLMVNGSMFFSLEMRNGYLHVFYDFGFSNGPVHLEDTLKKAQINDAKYHEISIIYHN
DKKMILVVDRRHVKSTDNEKKKIPFTDIYIGGAPQEVLQSRTLRAHLPLDINFRGCMKGF
QFQKKDFNLLEQKETLGVGYGCPEDSLISRRAYFNGQSFIASTQKISFFDGFEGGFNFRT
LQPNGLLFYYASGSDDVFSISLDNGTVVMDVKGIKVKSADKQYHDGLSHFVVTTVSATRY
ELVVDKSRLGSKNPTKGKAEPTQTTEKKFYFGGSPISPQYANFTGCISNAYFTRLDRDVE
VEDFQRYSEKVHTSLYECPIESSPLFLLHKKGKNSSKPKTNKQGEKSKDAPSWDPIGLKF
LERKASRDSHCHLSSSPRAIEHAYQYGGTANSRQEFEHIQGDFGEKSQFAIRLKTRSSHG
MIFYVSDQEENDFMTLFLAHGRLVFMFNVGHKKLKIRSQEKYSDGLWHDVIFIREKSSGR
LVIDGLRVLEERLPPSDAAWKIKGPIYLGGVAPGRAVKNVQINSVYSFSGCLGNLQLNGA
SIASASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLV
HGHSVNGEYLNVHMKSGQVIVKVNNGVRDFSTSVTPKQNLCDGRWHRITVIRDSNVVQLD
VDSEVNHVVGPLNPKPVDHREPVFVGGVPESLLTPRLTPSKPFTGCIRHFVIDGRPVSFS
KAALVSGAVSINSCPTA
NT seq 5454 nt   +upstreamnt  +downstreamnt
atggcttggaacacagcctggtgctcagtcctggccctgtggctcctctggggtgctgtc
tgctccaacgcggcgtcaggggacggcaatgcgtttccctttgacatcgaggggagcgca
gtggtcggcaggcaagacccatctgagaccagcgactcccgtgtgacaatgggacgcctg
ccgccagctgctgagagatgtgacgccggcttctttcgcacgtcatcaggagaatgttcg
ccctgtgactgtaacggcaattcccacgagtgtttggatggctccggattctgtctgcac
tgccagcggaacacaacaggggagcactgtgaaaggtgtctggctggctatattggagac
tccatcagaggcgcaccccgattctgccagccgtgcccctgtcccttacctcacctggcc
aactttgcagaatcctgcttcaggaaaaacggagctgttcggtgtatgtgtaaagaaaac
tatgctgggcctaactgtgaaagatgtgctcctggttactatggaaaccccctactcatc
ggaagcacctgtaagaaatgtgactgcagtggcaattcggatcccaacctcatctttgaa
gactgtgatgaaatcactggccagtgcaggaactgcttacgaaataccaccgggttcaag
tgtgaacgctgtgctcctggctattacggggatgccaggatcgccaagaactgtgcagtg
tgcaactgtgggggcggcccatgtgacagcgtaaccggagagtgcttggaagaaggattt
gaactccctacaggctgtgataagtgtgtctgggacttgaccgacgacctgcgattagca
gcgctctccatcgaagagagcaaatccgggctgctgagcgtgtcctccggcgctgcggcc
catagacacgtgaccgacatgaactctaccatccacctcctcaaaacaaggttgtcagaa
agagaaaaccagtataccctgagaaagatccagataaacaactcagagaacacactgaga
agcctcctgtctgacgtagaggaactggatgagaagggaagccgagcctcaaggaagggg
acgctggttgaaaaggaaagcacggatacaattgaccaggcaactcagctcgtcgagcaa
gcccacagcatgagggacaaaatccaagaaatcaatagcaagatgctttactatggagag
aatcaggaacttggccctgaggaaattgcggagaagctggtgttggcccagaaaatgctg
gatgagatccgaagccggcagccattcctcacccatcgggagcttgtggatgaggaggca
gatgaagtccaggaattgctgagccaagctgagaactggcagagactgcacaacgacacc
cgctctttactccctgtggtgttggagcagctggatgactacaatgctaagctgtcagac
ctccaggaatccattaaccaggcccttgaccatgtcagggatgcagaagacatgaacaga
gccatcacctcgaagcagcgggaccatgagaaacaacacgagagagtgaaggaacagatg
gaggccgtgggcacctctctgagtttgtctgcagactctctcatcacacctcggctcact
cttgaggaacttgatgagataataaagaatgcatctggaatttatgcagagatagatgga
gccaaaaacgaactacaggggaaactgtccaacctgagtaacctcagtcatgatttggtc
caagaagctgtggaccacgcgtacaacctccaacaggaagccaacgagctaagcagaaat
ttgcacagttcagacatgaatgggctggtgcagaaggctttggacgcatccaatgtctat
gaaaatatcgccaactatgtcagtgaggccaacgaaacagcagaacttgctctgaatatt
actgatcgaatttatgacgccgtgagtggaattgacactcagatcgtttaccataaggat
gaaagcgacaaccttctcaatcaagccagagagctgcaagcaaaggcggattctagcagt
gacgaggccgtggctgacaccagcaggcgcgtgggaggagccctgtggaggaagggcgcc
ctcagagacaaactgaccgaggctgtcaagcaactccaggcagcagagagaggggatgcc
caccagcgcctgggccagactaagctcatcattgaggaagctaacaagacgacaatggcc
gtccaacaggtcaccacaccgatggctaacaacctcagcaactggtcacagaaccttcag
acttttgactcttctgcctacaacactgcagtgaactccgctcgggacgcagtgagaaac
ctcactgaggttgttcctcggcttctgggtcagctccgtactgtggagcagaagcggcct
gcaagcaatgtttcggccagcatccagagggtccgagagctcatcgctcagaccaggagt
gttgccagcaagatccaggtttccatgctgttcgatggccactcggccgtcgaagttcac
cccaaagtcagtgtggatgacctgaaggccttcacctccatcagcttgtacatgaagcct
cctcccaagccggcggagcccaccggggcctgggtagaagatcagtttgtcctctacctc
ggaagcaaaaccgccaaaaaagagtacatgggtctggcaatcaaaaatgataacctggtc
tacgtttacaatttggggaccaaagatgtggagattctcctggattccaagcccgtgagc
tcctggcctgcttactttagtattgtcaagattgagagggtagggaaacatggaaaggtg
ttcttgacagtccccagtctcagcagcacagcagaagaaaagtttattaagaagggggag
tttgcaggagacgactccttgctggatctgacccctgaggatactgtgttttacgttggc
ggggttcctgcaaacttcaagctcccggccagcttaaacctgcccagctactcaggctgc
ctggagctggccactctgaataatgatgtaatcagcctgtacaacttcaagcacatctac
aacatggacccagctaagtcggtgccctgtgccagggataaactggctttcactcagagc
agggctgccagctacttcttcgatggctccagttacgctgtggtgagggacatcaccagg
agagggaaatttggccaggtgactcgctttgacatagagattcgaacaccagctgacaat
ggcctcgtgctcctgatggtcaatggaagtatgtttttcagcctggaaatgcgcaatggc
tacctgcatgtgttctacgactttggattcagcaacggacccgtgcatcttgaagacaca
ttgaaaaaggcccagatcaacgatgcgaaataccatgagatctcaatcatttaccacaat
gacaagaaaatgattttggtggtggacagacggcatgttaagagcacagacaatgagaag
aagaagataccattcacagacatctacattggaggtgcgccccaagaagtcttacagtcc
aggaccctaagagcacatcttcccctagacatcaactttaggggatgcatgaagggtttc
cagttccaaaagaaagatttcaatttactggagcagaaagaaaccttaggagttggttat
ggatgcccagaggactccctgatctctcgcagagcatatttcaacgggcaaagttttatt
gcttcaactcagaaaatatctttcttcgatggcttcgaaggaggcttcaatttccgaacg
ttacagccaaatgggttactgttctactatgcctctgggtccgacgatgtgttttccatt
tccctggacaatggcactgtcgtcatggatgtaaagggcatcaaggtaaagtcagcagac
aagcagtaccatgatgggctgtcccacttcgtcgtcaccaccgtctcagccacaaggtat
gaactagtagtagacaaaagccgacttgggagtaagaatcctacgaaagggaaggcagag
ccgacgcaaacaaccgagaagaagttctacttcggtggctcgcccatcagccctcagtat
gctaatttcactggatgtataagcaatgcctactttaccaggttggatagagatgtggaa
gttgaagatttccagcggtattctgaaaaggtgcacacttcactctatgagtgtcccatc
gagtcgtcacctctgtttctccttcacaaaaaaggaaagaattcctcaaagcctaaaaca
aacaaacagggagagaaaagtaaggacgcaccttcgtgggatcctattggtctgaagttt
ctggaacggaaagcctcaagagactcccactgccacctctccagcagccccagagcgata
gagcatgcctatcaatatggcggcacggccaacagccgccaagagtttgaacacatacaa
ggagattttggtgaaaaatcccagtttgccattcgtctgaagacccgttcctcacatggg
atgattttctatgtctcagaccaagaagagaatgatttcatgaccctgttcttggcccac
ggtcgcttggtctttatgtttaatgtcggccacaagaaactgaagattagaagccaggag
aagtacagtgatgggttgtggcatgatgtgatatttattcgggaaaagagcagtggccga
ttggtcattgatggtctgcgagtcctcgaagaaaggctcccccctagtgacgctgcctgg
aaaatcaagggtcccatttatctgggaggagtggctcccggaagggctgtgaaaaatgtc
cagattaactcagtctacagtttcagtggctgccttggcaatctccagctcaatggagcc
tccatcgcctctgcctcccaaacgtttagcgtgaccccttgctttgaagggccaatggag
acggggacttatttctccacagaaggaggctatgtggttctagatgagtctttcaacatt
gggctaaagtttgaaatcgcatttgaagtccgtccccggagcagttccgggacccttgtc
catggccacagtgtcaacggggaatacctgaacgtacacatgaaaagcggacaggtcata
gtgaaagtcaacaatggcgtcagagacttttctacctcagtcacccccaagcagaatctc
tgcgatggcagatggcacagaattacagttattagagactccaacgtggttcagttggat
gtagattcagaggtgaaccatgtagttggacccctgaacccaaagccggttgatcacagg
gagcctgtgtttgtcggaggggttccagagtccttgctgacgccccgtttgactcccagc
aaacccttcacaggctgcatccgtcactttgtgatcgatgggcgtccggtgagcttcagt
aaagcagccctggtcagtggtgctgtgagcattaactcctgccccacagcctga

KEGG   Rattus norvegicus (rat): 311862
Entry
311862            CDS       T01003                                 

Gene name
Lamc3
Definition
(RefSeq) laminin subunit gamma-3 precursor
  KO
K06247  laminin, gamma 3
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    311862 (Lamc3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    311862 (Lamc3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    311862 (Lamc3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    311862 (Lamc3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    311862 (Lamc3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    311862 (Lamc3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    311862 (Lamc3)
   05145 Toxoplasmosis
    311862 (Lamc3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B XhlA DUF4477
Other DBs
NCBI-GeneID: 311862
NCBI-ProteinID: NP_001101300
RGD: 1309399
Ensembl: ENSRNOG00000059507
Position
3
AA seq 1580 aa
MAVSGILSRLATVVCVALVMLKAHCATGAGMGSCYDGAGRAQRCLPEFENAAFGRRAEAS
HTCGQPPEDFCPHVGAPGAGPQCQRCDDADPRRRHDASYLTDFHSPDDSTWWQSPSMAFG
VQYPTSVNLTLHLGKAYEITYVRLKFHTSRPESFAIYKRTHTGGPWEPYQYYSASCQKTY
GHPEGHYLRPGEDERVAFCTSEFSDISPLNGGNVAFSTLEGRPSAYNFEESPVLQEWVTS
TDLLISLDRLNTFGDDIFKDPRVLQSYYYAVSDFSVGGRCKCNGHASACGPNEAGQLACH
CQHNTTGVDCERCLPFFQDRPWARGTAEDANECLPCNCSGHSEECTFDRELYRSTGHGGH
CQRCRDHTAGPHCEHCEKNYYRWDPKTPCQPCDCHPAGSLSLQCDNSGTCPCKLTVTGWK
CDRCLPGFHSLSEGGCRPCTCNVAGSLGTCDPRSGNCPCKENVEGGLCDRCRPGTFNLQP
HNPAGCSSCFCYGHSKVCAPASGFQEHHIRSDFRHGADGWQARSTEVSERPLQWSQSGIL
LGLRGGEELSAPEKFLGDQRLSYGQPLILTLQVPPGGSPPPMQLRLEGAGLSLALKPSNL
PSSQDTRQPGRVQLQFLLQETSEEAEPPLPAFHFQRLLSNLTTLSIWTSGQGLGHSGHVL
LCEVHLTSARPQRGLAPPASWVETCLCPQGYTGQFCEFCALGYKREIPHGGPYTNCIPCT
CNQHGTCDPNTGICLCGHHTEGPSCERCMPGFYGNAFSGHADDCQPCPCPGQSACTTIPE
SRDVVCTHCPPGQRGRRCESCEDGFFGDPLGLSGAPQPCRRCQCSRNVDLNAVGNCDPHS
GRCLRCLHNTTGAHCEHCQEGFYGSALATRPADKCAPCSCDPRGSSSQKTCNPATGQCAC
LPYVTGRDCSRCSPGFYDLQPGRGCQSCKCHPVGSLENKCHPKTGQCPCRPGVTGQACDR
CQLGFFGFSIKGCRDCRCSPLGAASPQCHENSTCVCRPGFVGYKCDRCQDNFFLVDGDTG
CQECPTCYALVKEEAAKLKTRLMLMEGWLQGSECGSPWGPLDILQGEAPRGDVYQGHHLF
QEARGTFLEQMVGLEESVKATWEQLQVLRGNAHCAQAGAQKTCNQLAELGETLRSSEEEV
LRAASALSFLANLQEGSSAPTNWGRLASEAYILTRSHRDTATKIEATVERALLASNTSYE
LLWKLLKGSVASEAQQELEERYQEVQAAQTALGIAVAEALPKAEKALATVKQVVGAAAPR
LGSMVTPEAMDSQARYLGRRVKALEQKLQQNQAGQAVGALQVEAGRALRKVEPFLQLRNK
TTAALTQVSSAVQAAKVTVIGAKTLLADLEGMKLRFPLPKEQAALKRKAGSIRTRLLKDT
KRKTKQAERMLGNAASLSSNSQKRSKEAELMSKENAKLARAFLREGKQNYRHASRLASQT
QATLRQASRLVLTSEARKQELEEAKQVASGLSTVERQIRESRISLEKDTKVLSELLAKLG
SLGTHQAPAQTLNETQQALESLRLQLDSHGSLHHKLRQLEEESARQELQIQSFENDLAEI
RADKHNLETILSSLPENCAS
NT seq 4743 nt   +upstreamnt  +downstreamnt
atggctgtgtccgggatcctgtcccgcctggcaacggtggtatgtgtggcgttggtgatg
ctgaaggcgcactgcgcgacgggtgcgggcatgggctcttgctacgacggtgcgggacgc
gcacagcgctgtctgcctgagttcgagaacgcggcgttcggccgacgcgccgaggcctcg
cacacgtgcggacagcccccagaggacttctgtccgcacgtgggggcaccaggagctggg
ccacagtgccagcgctgtgatgatgctgacccccggcgacgccacgacgcctcctacctc
acagacttccacagccctgatgacagcacctggtggcaaagtccgtccatggccttcggg
gtgcagtatcccacttcggttaacctgaccctgcatctagggaaggcctatgagatcacc
tatgtgaggctaaagttccacaccagccgccccgagagctttgccatctacaagcgcacg
cacacgggtggcccctgggagccctaccaatactacagtgcctcctgccagaaaacctat
ggccatcctgagggccactacctgcgacctggtgaagatgagagggtggctttctgcacc
tctgagttcagcgacatctcccccctgaatgggggcaacgtggccttctccactctggag
ggccgtcccagtgcctacaactttgaggagagccctgtgctgcaggagtgggttaccagc
accgatctcctgatctctctggatcgtctcaacacgtttggggatgacatcttcaaggac
cccagagtgctccagtcttactactacgctgtgtctgacttctctgtgggtggcaggtgc
aaatgcaatggtcacgccagtgcgtgcgggcccaacgaggctggtcagctggcttgtcac
tgccagcacaacaccacgggagtggactgtgagcgttgtctacccttcttccaggaccgt
ccgtgggcccgcggcaccgcagaggatgccaacgagtgtctgccctgcaactgtagtggg
cactctgaggagtgcacgtttgacagggagctctatcggagcacgggccatggtgggcac
tgccagcgctgccgagaccacacagctgggccacattgtgagcattgtgagaagaactat
tacagatgggacccgaagacaccatgccaaccctgtgactgccacccagcaggctccctg
agtctccagtgtgacaactcgggcacctgtccctgcaagctgacggtgactggctggaag
tgtgatcgctgcctgcctggattccactcactcagtgagggcggctgcagaccctgtacc
tgcaatgtcgctggcagcttgggcacctgcgatccccgaagtggaaactgcccctgcaaa
gagaatgtggaaggcggtctgtgtgacagatgccgccctgggacttttaacctgcagccc
cataacccagccggctgcagtagctgcttctgttacggccactccaaggtgtgtgcccct
gcttcagggttccaggaacaccacatccgctcagacttccgccatggagctgatggctgg
caggccagaagcacagaggtgtcagagcgtcctctgcaatggagccagagcgggatcctc
ctgggcctgcgaggaggggaggaactctcagccccagagaagttcctgggagaccagaga
ctcagctatggacagccactcatactgaccctccaagtaccccccgggggctccccacct
cctatgcagctgagactggagggggcaggtttatctctggctctgaaaccctccaatcta
cccagctctcaggacaccaggcagccaggacgagttcagctccagttcctcctgcaggag
acttctgaagaggcagagcctccactgcccgccttccacttccagcgcctgctttccaat
ctgaccactctgagcatctggaccagtggccaaggcctgggccactctggccacgtgctc
ttgtgtgaagtccatctcacatcggcccggcctcagcgtgggcttgccccaccagcctct
tgggtggagacctgcttatgtccccagggatacacgggccagttctgtgaattctgtgct
ctgggatacaaaagagaaatacctcatgggggtccctataccaactgcatcccctgcacc
tgcaaccagcatggaacctgtgaccccaacacagggatctgcctgtgtggccaccacacg
gagggtccatcctgtgagcgatgcatgccaggtttctacggtaacgccttctcaggccac
gctgatgattgccagccctgtccatgccctggccaatcagcctgcacgaccatcccagag
agtagagatgtggtgtgcacacactgcccccctggtcagagaggacgacgctgtgagagc
tgtgaagatggcttcttcggagatcccctaggactctctggagctccccagccctgccgc
cgatgccagtgcagcaggaacgtggatctcaatgctgtgggcaactgcgatcctcattct
ggccgctgtctgcgctgtctgcacaacacaacaggggcccactgcgagcactgtcaggag
ggtttctatgggagtgccttggccacgaggcctgcggacaaatgtgctccctgcagctgt
gacccgaggggctcaagcagtcagaagacctgcaacccagcgactggccagtgtgcctgc
ctgccttacgtcaccgggagggattgcagccgctgcagccctggcttctacgacctccag
cctgggaggggctgccagagctgcaaatgtcacccagtgggctccttggagaacaagtgc
caccccaagactggccagtgcccctgtcgacccggtgtcaccggccaagcctgtgacaga
tgccagctgggtttctttggcttctccatcaagggctgccgagactgtaggtgttcccca
ctgggtgctgcctcaccccagtgccatgagaacagcacgtgtgtgtgccggcctggcttc
gtgggctataaatgtgaccgctgccaggacaatttcttcctcgtggatggtgacacaggc
tgccaagagtgccccacctgctatgctctagtgaaggaggaggcagccaagctgaagacc
aggctgatgctgatggaggggtggcttcaggggtccgagtgtggcagcccctggggacca
ctagacattctgcagggagaagcccctcggggggatgtctaccaaggtcaccacctattt
caagaggcccgggggaccttcctggagcagatggtgggcctggaggagtctgtgaaggcc
acttgggagcagctgcaggtgctgagagggaacgcacactgtgcccaggctggagctcag
aagacctgcaaccagctggcagagctgggggagacgctgcgttcctcagaggaggaggtc
ctgcgtgcagcctcagctctctcatttctggcaaatcttcaggaaggatccagcgcaccc
accaattggggtcgcctggcatcagaggcctacatcctcaccaggagccacagggacaca
gccaccaagatcgaagctaccgtggaaagggccctgctcgcctccaacaccagctatgag
ctcctgtggaagctgttgaaaggcagtgtggcctcagaggcccagcaggaactggaggag
aggtaccaggaggtgcaggcagctcagactgccctgggcatagctgtggcagaggcgctg
cccaaagctgaaaaggcactagccacggtgaagcaagtcgttggtgccgcagccccacgt
ctgggctcgatggtcacccctgaagcaatggactcccaagctagatacctaggccggaga
gtgaaagccctggagcagaagctgcagcagaaccaggcaggccaagcggtgggagccctg
caggtagaggctgggagagccttgaggaaggtggagccctttctgcagctacgaaacaag
accacagctgccctgacacaggtttcctcagctgtccaagctgctaaggtgaccgtcata
ggagcgaagaccctgctagctgacctagagggaatgaagctaaggttccctctacccaag
gagcaggcagcgctgaagaggaaagcaggcagcatcaggaccaggctcttgaaggacacg
aagagaaagaccaagcaggcagagaggatgctgggaaatgctgcctcgctctcctccaac
tcccagaagagaagcaaagaagcagaactgatgtccaaggagaatgccaagcttgccaga
gctttcctgagggaagggaagcagaactaccgtcatgccagccgactcgccagccagacc
caggccacactccgccaggcctctcgcctggtgctgacctcagaagcacggaaacaggag
ctggaagaagctaaacaggtggcctctgggctgagcactgtggagcgccagatccgagag
tctcggatctccttggagaaggacaccaaggtcctgtcagagctgctcgcgaaactgggg
tccctgggtacccaccaagcccctgctcagaccctgaacgagacccagcaggcactagaa
agcttgaggctgcagctggattcacatggatccttgcatcacaaactaaggcagctggag
gaagagtctgctcgacaggagctgcagatccagagctttgagaacgacctcgctgagatc
cgcgctgacaagcacaacttggagaccattctaagcagtctaccagagaactgtgccagc
tag

KEGG   Rattus norvegicus (rat): 315979
Entry
315979            CDS       T01003                                 

Gene name
Col6a6
Definition
(RefSeq) collagen alpha-6(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    315979 (Col6a6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    315979 (Col6a6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    315979 (Col6a6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    315979 (Col6a6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    315979 (Col6a6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    315979 (Col6a6)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    315979 (Col6a6)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   315979 (Col6a6)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   315979 (Col6a6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   315979 (Col6a6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Integrin_beta
Other DBs
NCBI-GeneID: 315979
NCBI-ProteinID: XP_038938536
RGD: 1309172
Ensembl: ENSRNOG00000023007
Position
8
AA seq 2264 aa
MLLILFLMMICLHVCANQDSGPEYADVVFLVDSSDHLGMKSFPLVKTFIQKLVSSLPVEA
NKYRVALAQYSDALHNEFHLGAFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA
PTNGRDKKQFPPILVVLASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAMATSQ
FHFNLRTARDLSMFAPNMTQIIKDVTQYREAMTDDIIVEACQGPSVADVVFLLDMAINGS
QENLDHLKAFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTEVLQRIQDLSP
HVGQAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHRASDDNVTKAAVNLRREGVT
VFTMGVEGANPEQLEKIASYPAEQFTSKLSNFSELATHNQTFLKKLRNQITHTVSVFSER
TETLKSACVDTEEADIYLLIDGSGNTQPTDFHEMKIFLSEVVDMFNIAPHKVRVGAVQYA
DTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQRGNKVPCHL
VVLTNGMSQDSVLGPAHKLREENIRVHAIGVKEANQTQLREIAGDEKRVYYVHEFDALRD
IRNQVVQEICAEEACRDMKADIMFLVDSSGSIGLENFSKMKVFMKNLVSKSQIGADRVQI
GVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGARPN
VRKFLILITDGEAQDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKPEMVFYVENF
DILQHIEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDFMIGLVKKADV
GKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTE
ARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANTWELLAMAG
SSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKE
FLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGD
ALRKVKHYFRPDTGSRINAGTPQVLLVLTDGRSQDEVAQAAEELRHKGVDIYSVGIGDVD
DQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNICTSGGESNCFVDVVVGFDISSQRRGQ
TLLEGQPWMGTYLQDLLRAISSLNGVSCEVGTETQVSIAFQVTNAMERYPSKFEIYSENI
LSSLQGVTVNGPSRLNPNLLSSLWDTFQNKSAARGKVVLLFSDGLDDGVEKLEQKSDELR
KEGLNALITIAVDGAADSSDLADLLYIEFGKGFEYRTQFTIGMGNLGSHLSKQLINVAER
TCCCLLCKCTGWDGAMGDPGSAGEKGPPGFKGSEGYLGEEGIAGERGALGPMGEQGTKGC
LGAKGPKGARGLSGEEGEVGEDGLDGLNGEQGDSGIPGRRGERGDAGSQGSPGKRGASGD
RGPKGLRGDPGTPGRDNSIQGPKGLKGDPGRQGRRGWPGSPGTPGSRRKMVVHGRRGHTG
PQGNPGITGPDGLEGSPGLKGPQGPRGEVGEKGEKGGFGMKGPQGPPGPKGRAGNQGHWG
SQGSKGEPGDLGEKGAAGFPGPRGLQGDDGSPGYGSIGRKGTKGQEGFPGESGPKGDIGD
PGDPGEAGPKGARGKTVSAGIPGEPGSPGEPGPPGRKGMKGARGLASFSTCDLIQYVREH
SPVRHGKPECPVHPTELVFVLDQSRDVTEQDFERMKGMMASLVRDIKVRDASCPVGARVA
ILAYNSHTRHLIRFSDAYRKDQLLREIKALPYEKSSDSREIGKAMRFISRNVFKRTLPGA
HVRRIATFFSSGQSADVQSITTAAMEFSALDIIPVVIAFSNVPSVKRAFSIDDTGTFQVL
VVPSRPDQVPALERLQRCTFCYDLCKPDASCDQAKPPIQSYLDAAFLLDSSWHVGGAEFE
DMRDLLEALLDHFEIASEPETSVTGDRVALLSHAPVDFLPNTQRSPVRTEFNLTTYSSKH
LMKRHVEQAVQQLNGDAFLGHALRWALDNVFLNTPNLRRNKVIFVISAGETSHLDAETLK
KESLRAKCHGYALFVFSLGPDWDDKELEDLASHPLDQHLIQLGRIHKPDHGYGVKFVKSF
INSIRHGINKYPPVNFKAKCSRLSSAAPKLSPRQFRSFVPGPQKALLKDHTLEAAKLFQD
KKRLSSILRSGRAALSSHSRSTRYSFKQGREATKATSKLGKRSD
NT seq 6795 nt   +upstreamnt  +downstreamnt
atgctgctgattttgttcctgatgatgatttgtttacacgtttgtgcgaaccaagattct
ggccccgagtacgcagatgtggtttttctggtggacagttccgatcacctagggatgaag
tcctttcctcttgtgaaaactttcatccaaaagctagtcagcagcctccccgtagaggcc
aacaagtaccgtgtggccctggcccagtacagtgatgctctccacaatgagttccaccta
ggcgccttcaagaacaggaaccccatgctgaaccacctcaagaagaacttcgggttcatc
ggcggctccctgaagattgggaacgccctgcaggaggctcacaggacctatttctctgct
cccacaaacggaagagacaagaaacagttccccccaatcctggtggtgctggcttcagcg
gagtccgaggatgacgtagaagaggcttcgaaggccctgcgggaagatggggtgaaaatc
atctctgtgggggtgcagaaggcttctgaggaagacctgaaggccatggccacctctcag
tttcatttcaacctcaggactgccagagacctcagcatgtttgccccaaacatgacacag
atcatcaaggatgtgactcagtacagggaagcaatgactgatgacattattgtagaagct
tgccaaggcccatctgtggccgatgtggtattcctgttggatatggccatcaacggcagc
caagagaacctagatcatcttaaagcattcctgggggaaagtgtctctgcccttgacata
aaggaaaactgcatgagggttggcctggtggcctatagcaacgaaacaagggtgatcagt
tctctgagcatgggtgtcaacaagacggaggtcttacagcgcatacaggatctgtcccct
cacgtagggcaggcctacacaggagctgccctcagaaagacgaggaaggaagtcttcagt
gcacagaggggcagtcggaagaaccaaggggtgcctcagattgctgtgctggtgacccac
agagcttcagatgacaacgtgaccaaggcagctgtcaacctccggcgggagggagtgacc
gtctttaccatgggcgtagagggggctaacccagaacagctggagaagatcgcatcctac
cctgccgagcagttcacctccaaactgagcaacttctctgagctggccacccacaaccag
acgttcctgaagaagctgaggaaccaaatcacacacacagtctctgtcttctcagaacgg
actgagaccctcaaatcagcctgtgtggacacagaggaagcggatatctatctgctcatt
gatggttcagggaacacgcagcccacagacttccatgagatgaagatcttcctgtccgag
gtggtggacatgttcaacatcgctccccacaaggtgcgagtcggggccgtgcagtacgcc
gacacctgggacctggaatttgagatctctaagtatactaacaagcctgacttgggaaag
gccatcgacaacatcaggcagatgggtggaaacaccaatacgggtgcagccctgaacttc
accctgacgctgttgcaaagagcaaagaagcaacggggaaacaaagtgccatgtcacctg
gttgtcctgacaaacggcatgtcccaggacagcgtcctgggtcccgcgcataagctgagg
gaggaaaacatccgtgtgcatgcgatcggcgtcaaggaagccaaccaaacacagctgcgg
gagatagcgggagatgaaaagcgagtttattacgtgcatgaattcgatgccttgagggac
ataaggaaccaagtggttcaggagatctgtgctgaagaagcctgcagagatatgaaagcg
gacatcatgtttctggtggacagctctggcagcattggactcgaaaacttcagcaagatg
aaggtgtttatgaagaacctggtgagcaaatcccagatcggggctgaccgggtgcaaatt
ggtgtggtccagttcagccacgaaaacagggaggagtttcagctcaacacattcatgtct
caaaatgacattgccaacgccatcgaccaaatggctcacattggagagacaaccttgaca
ggcagtgcgctgacctttgtgtctcagtacttcagtcctgaaaagggggcccggcccaat
gttaggaagttcctcattctcatcacggatggcgaggctcaggacatagtaaaggaccca
gcagtagcccttcgaaaagacggtgtgattatctattctgtgggagtgttcggctccaat
gtcacccagctcgaggagatcagtgggaagccggagatggttttctatgttgagaatttt
gacatcctacagcatattgaagatgatctcgttctgggaatctgcagtccccgtgaagaa
tgcaagcggattgaagttttggatgtagtgtttgtcatcgacagctccggcagcattgac
tatcaagaatataacatcatgaaggacttcatgattggcttggtgaaaaaagctgacgtg
ggcaagaatcaggtccggtttggagctctgaaatatgctgatgacccagaggtgctgttt
tacctggatgaactgggcacaaagctggaggtgatttcagtgctccagaatgaccagccc
atgggtggtaacacttacactgctgaggccctggctttctccgatcacatgttcaccgaa
gcccggggcagccgtctgcacaagggagtcccccaagtcctcattgtgattactgatgga
gaatctcatgatgcagagaagctcaatgccaccgccaaggccctgagggacaaaggcatt
ctggtcctagctgtggggattgctggcgccaatacctgggagctcttggccatggctggg
tcgagtgacaagtactactttgtagagaccttcggaggcctgaaggggatattttccgat
gtgtcagccagtgtctgtaactcttcaaaagtcgattgcgaaattgaaaaagttgacctc
gtattcctcatggatggttcaaacagcatccatccggatgacttccagaagatgaaggag
tttttggcatcggtcgttcaagacttcgatgtcagcctcaacagagtccgcataggagtg
gcacagttcagcgacagctacaggtcagagtttctgctggggacattcacaggggaaaag
gagatatccacacagattgaggccatccagcagatctttggttacacccacattggagat
gcactcaggaaggtgaagcattactttcggccagacacgggcagcaggatcaacgccggt
acccctcaggtgctgctggtcctcacagatggccggtcccaagatgaggtggcccaggcg
gctgaggagctgagacacaagggtgtggacatctactcggtgggcatcggggacgtggat
gaccagcagctgatccagataaccgggacagcggagaaaaaactgacggtgcataacttc
gacgagctaaaaaaggtgaagaaaaggatcgttcgtaacatctgtacctcaggtggtgag
agcaattgtttcgtggatgtggtggttggatttgatatctcaagtcagcggagagggcag
actttgctcgaaggtcagccttggatggggacctacctccaagacctcttacgtgccatc
agctccctcaatggggtaagctgtgaggtgggcacagagactcaggtgagcatagctttt
caagtgacgaatgctatggagagatacccttccaagtttgagatctacagtgagaacata
ctgagcagcctgcagggtgtgacagtgaacggcccatctcgcctcaacccaaacctgctg
agttctctgtgggatacgtttcagaacaagtccgctgctcgagggaaggtggtccttctg
ttttcagatggattggatgacggcgttgaaaaacttgaacagaaatctgatgaactcaga
aaggaaggcctgaatgccctcataaccatcgctgtggacggagctgccgattccagtgac
ctggctgaccttctctacattgaatttgggaaaggatttgagtacagaacacagttcaca
atcggaatggggaaccttgggagtcacctgtcaaagcaactaatcaatgttgcagagagg
acctgctgctgtttgctctgcaagtgcacaggatgggacggcgccatgggggatccgggc
tcagcaggggaaaagggacccccaggatttaaaggcagtgaaggctacctgggagaagaa
ggcattgctggagaaagaggagccttgggaccaatgggagagcaaggtacaaagggatgc
ttaggtgccaaagggcccaagggagcccgaggactcagcggagaagagggtgaagttggg
gaagatgggcttgacggattaaatggagaacagggtgacagtgggattcctggaagaaga
ggagaaaggggtgatgcgggatctcagggaagcccaggcaagagaggggcttctggtgac
cgtggaccgaagggcctacgaggagatccgggaactcctggacgtgataatagcatacag
ggacccaagggcttgaaaggagaccctggaagacaaggcagaagaggctggccaggctct
cctgggacaccaggctcaagaagaaagatggtagttcatggccgaagaggacatacaggc
ccacagggaaatccaggcatcacaggcccagatggacttgaaggttcgcccggacttaag
ggcccgcagggtccaagaggagaggtcggtgagaaaggagaaaaaggaggcttcggaatg
aaaggtccccaggggcctccaggacccaaaggacgagctgggaaccaaggccattgggga
agccaaggaagcaaaggagaacctggagacctgggggaaaagggagctgctggctttcca
gggcctcggggcctgcagggtgacgacggcagcccaggatatggtagcattggccgaaaa
gggacaaaggggcaagaaggattccccggagagagtggaccaaagggtgatattggggac
cctggcgatccaggagaggctggtcccaagggagccagaggcaaaacggtatctgctggg
attccaggagagccaggatcccccggggagccaggaccccctggaaggaagggcatgaaa
ggagcccgaggactggcttcattttccacatgtgacctcattcagtatgtgcgggaacac
agtcctgtcagacacggaaaacctgagtgcccagtgcatccaactgagttggtgtttgtc
ctggaccagtcccgggatgtcacagaacaggactttgagcggatgaaggggatgatggcc
tccttagttagggacatcaaagttagggatgccagctgccccgtgggtgcccgcgttgcc
atcctagcctacaactcccacaccaggcacctcatccgcttctcggatgcctacaggaag
gaccagctgctcagggaaattaaagctcttccttacgagaaatcctccgacagcagggag
attggcaaagccatgaggtttatctccaggaatgtcttcaagagaaccctcccaggggct
cacgtgagaagaatcgccacgttctttagcagtggccagtcagccgatgtccagtccatt
accacagctgccatggagttcagtgccctggacatcataccagtggtgattgcattcagc
aacgtgccctcggtcaagcgtgccttttcgattgatgacactggtacattccaagtcctt
gtggttccctccaggcctgaccaagtgccagcgttagagagactccagcggtgcactttc
tgctatgacctatgcaagccagatgcttcttgtgaccaagccaaaccccccattcagtcc
tacctggatgctgcattcctgctggatagctcctggcatgtgggaggtgcagaatttgaa
gacatgagagacctcctggaggccctgctagatcactttgaaatcgcttcagagccagag
acgtctgtcactggagacagggtggccctactgagtcatgctcccgttgacttcctaccc
aacacccagaggagtcccgttagaactgagttcaacctaaccacctatagcagtaagcac
ctcatgaagagacacgtggagcaagcagtccagcagctgaatggagatgctttccttggt
catgccctcaggtgggcactggacaatgtctttctaaacacgcccaatctgagaaggaac
aaagtcatatttgtgatatctgctggggaaaccagccacctggatgcggaaaccttaaag
aaagagtccctgcgagccaaatgtcatgggtatgccctctttgtgttttccttgggccct
gactgggatgacaaagaactggaagacctggccagccacccacttgaccagcacttgatc
cagcttggccgaattcataaacctgaccacggatatggtgtaaagtttgtgaagtccttt
ataaactccatcaggcatgggataaacaagtatccaccggtcaactttaaagccaagtgc
agcagactcagctcggcggctccgaagctgtccccacgccagtttcgaagctttgttccg
gggccacagaaagctctcctcaaagaccacacactggaggcagcaaagctctttcaagat
aaaaaacgcctttcaagtatcctgagaagtggcagggctgctctttcaagtcattccaga
agcacacgctactcctttaaacaggggagagaagcgacaaaagctacttctaaactcggt
aaacgaagtgactga

KEGG   Rattus norvegicus (rat): 315981
Entry
315981            CDS       T01003                                 

Gene name
Col6a4
Definition
(RefSeq) collagen alpha-4(VI) chain precursor
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    315981 (Col6a4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    315981 (Col6a4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    315981 (Col6a4)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    315981 (Col6a4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    315981 (Col6a4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    315981 (Col6a4)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    315981 (Col6a4)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   315981 (Col6a4)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   315981 (Col6a4)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   315981 (Col6a4)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3
Other DBs
NCBI-GeneID: 315981
NCBI-ProteinID: NP_001258111
RGD: 1564060
Ensembl: ENSRNOG00000031782
Position
8
AA seq 2114 aa
MGTWKTLWLIISFAAGLGFIKSQRTVCREASVGDIVFLVHTNIDPQHAQSVRNFLYILAN
SLQVGGDNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFKPGGNRMGQSLQFILE
HHFPEGAGSRASQGVPQVALVMSTGVAEDHFREPAEALKREGILLYAIGVKDAAQAELRE
IASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTLAKVVQHTEQGSPACTEAFLADIVFLV
DSSTSIGLQNFQKVKNFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQSSLKSVVLE
QIRNLPYNMGSTNTGSALEFIRANYLTEMSGSRAKDGVPQIVILVTDGESNDEVQDAADQ
LKRDGVFVYVVGINIQDVQELQKIASEPFEEFLFTTENFNILQELSGSLSQAVCSTVERE
MKKCLAPIPTDLVFLVEEFTSAGQPNFQQVIKLLKTTVHSLNIHPDIVRVSLVFYSEKPQ
LKFSLNTFQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFLPEKGSRSIWGVQQIA
VIIMESPSLDNVSTPASHLRRTGVTIYAVGIQSASESKDLEKIATYPPGKHTIHLESFLQ
LDVVRKILNKKLCPETLGKMPPVGSMPPGAEMPPGAEMPPGGKMPPLTGCTPQSMQKVCT
HVEKADIYFLIDGSGSISPKDFTEMKEFMKDVIKMFHIGPDGVRFGVVQYSDKIISQFLL
TQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQYLIVITDGQSSDPV
ADAAQGLRDTGINIYAIGVRDANTTELEEIANNRVFFTDDFHFLKSIHQEVVRDICSFEN
CRSQKADIIFLIDGSESISSEDFEKIKDFVKRMVNQSNIGADKIQIGLLQFSSTPREEFT
LKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQGGRPGVHQYLIVITDGDS
HDDIVSPAKALRDRNIIIFAIGVGKIQRAQLLAITNDQDKVYHEENFESLQNLEKEILYE
VCTSQGCSMDLSVGVDTSTSSGRAQQELGRFLPGLLQQLALLSNISCEAPGQMEPRFRYV
VPGSSSRPVFDSGFEKYNDETVQKFLVQQGSANNRLNVDFLKSLGETAIHSSSANVKVLL
VFTDGLDEDLERLRRTSELLRSGGLSGLLVIGLGGAHKLEELQELEFGRGFAYRQPLSSS
LPSLPSVLLKQLDTIVEKACCYVCAKCYGDDGVRGDPGSLGKEGERGLDGLPGHPGEDGD
HGPRGPRGLPGLRGEEGCPGVRGPKGARGFSGEKGNPGDEGVEGLDGEQGDRGAAGSSGE
KGSRGSQGLTGLPGPAGSRGEPGLRGDPGDPGIDNFIQGPKGEKGRRGRQGSSGFHGLLG
ETGSVGPWGSLGRRGLPGLKGVLGEAGELGYQGEPGYPGPQGPRGRQGPPGSFGQKGDAG
AQGNPGPPGPNGSRGTDGPRGLKGELGHTGERGPRGQQGSRGQPGRFGLDGYGHPGRKGK
KGEPGFPGYPGLQGEDGNPGRGGEKGAKGIRGKRGSSGFPGFAGTPGDQGPPGKMGIKGP
KGLADRTPCEIVDFVRGNCPCSIGISRCPAFPTEVVFALDMSNDVSQSDFERMRNILLSL
LMKLEMSESNCPTGARVAIVSYNMRTDYLVRLSDHRGKATLLQAVRKIPLERSSGFRNLG
ATMKFVARHVFKRVRSGFLVRKVAVFFQVGGNYDATSVNTAMLELHAANIVTAVVTFTED
HNLPDALLVDGLNTFHLFTWETEHQQDVERLVSCTLCYDKCRPAPGCRVESPRPQELDMD
LVFLVDSSQGVSRDIYLGALRLVDSVLKDLEVAAQPGTSWHGARVALLTHTTPGFWPGVG
QDPVLEYFHLTSHGHRTQMQRQIQEAASGLLQGAPALGHALEWTLEKVLLAAILPRRSQV
LFAIVASETSIWDRETLRALSLEAKCKGITLFVLAVGPGVGAQELTELANVASAPSEQHL
LRLEGVSEAEVAYASRFTEAFLNLLKSGINQYPPPELNEECGGPNRGDTLLQFFTSAKRL
SKPRFGTSAATGYDSGAPKPQDSFLRGKRKIRTLSVAQQETLGSLGRDRADAEETPAKGR
RSGAAHGEIPNLVL
NT seq 6345 nt   +upstreamnt  +downstreamnt
atggggacttggaagacattgtggttgatcatctcctttgcagctggtttgggcttcatc
aagtcacaaaggactgtctgcagggaggcctctgtgggagacatcgtgtttctggtccac
accaacatcgacccccagcatgcccagagtgtgcggaacttcttgtatatattggcaaac
agtttacaagtcggcggagataacatccgtgtgagtctggccctgtacggtgatacaccc
accactgagttcctgctttcggcctatccccgcaaaggtgatgtgttgaagcacatccgt
ggacttcagtttaagcccggaggcaacaggatgggccagtccctgcagttcatcctagag
catcacttcccggaaggagccgggagccgggcaagccaaggggtgcctcaggtggccctg
gtgatgagcactggtgtagctgaggaccactttcgtgaacctgccgaagctctcaagaga
gagggaatcttgctatacgccattggagtcaaagacgcagctcaggccgaactcagggag
attgcgagtagtcccaaggataacttcaccttttttgttcctaacttctctggcttgcct
ggcctcgcccagaagctgcggccagagctttgcactacactagcaaaggtagtgcaacat
actgagcagggatctccagcctgcacagaagcattcctggctgatattgtgtttttagta
gacagctcaactagcattggactccagaacttccagaaagtaaaaaatttcctccactcc
attgtctcggggcttgacgtcagaagtgatcaggtgcaagtggggctggtccagtacagt
gacaacatctatccagcctttcagctgaagcagtcctctctaaagagcgtggtcctggaa
cagatccggaatctgccctataacatggggagtacaaataccggaagtgccctagagttc
atcagggccaactacttgaccgagatgagtggcagccgtgccaaggatggggttcctcag
atagttattctggtgacagatggggagtccaatgatgaggttcaggatgcggctgaccag
ctgaaaagagacggggtctttgtgtatgtggtagggatcaatattcaagatgtccaagag
ttgcaaaagatagccagcgagccatttgaggagtttctcttcaccacagaaaacttcaac
atcttgcaggagctctcaggaagcctcagtcaggctgtgtgttccacagtggagagagag
atgaaaaaatgtctggctcccattccaactgacctcgtgttcctcgttgaggaatttacc
agtgccgggcaacccaattttcagcaagtcatcaagttattgaagaccactgtccactct
ctcaacatccaccctgatatcgtgagggtcagcttggtcttttacagtgagaaaccacag
ctgaagttttctctgaatacatttcaaaatgctgcccaggtcttaaccagtttggaccag
ttaaccttcagagcaagacgaggaaggacgaaggcaggggctgctttggatttcctgagg
aaagaagtttttcttccggagaagggcagccggtccatttggggtgtacagcagatagct
gtgatcattatggaaagtccatctctagacaatgtgtctacaccggcttcccacctccgc
aggaccggggtcaccatctatgcagtgggcatccagtctgcctctgagagtaaggacctg
gagaagattgccacataccctcctgggaagcacaccattcacctagagtcctttctgcag
ctcgatgttgtgagaaaaatacttaataagaagctctgtcctgagactttgggcaaaatg
cctcctgtgggcagcatgcctcctggggccgaaatgcctcctggggccgaaatgcctcct
gggggcaaaatgcctcctctgacgggttgtactccacagagcatgcaaaaagtttgcacg
catgtggagaaggcagatatttacttccttattgatgggtctggcagcatcagtccaaag
gacttcaccgagatgaaggagttcatgaaggacgtgatcaaaatgtttcacattggaccc
gacggggtacgatttggagtcgttcagtactcagacaaaattataagccagtttttactc
acccagtacaccagcatggagaagctggggacagccattggcaacatccagcagggagga
ggtggtaccaccaccggtgaggccttaagcaaaatggccctggtcttcagaaacactgcc
agaaccaatgttgcccaatacctcatcgtcatcactgatggtcagtcttcagacccagtg
gctgatgctgcgcagggactgagggacactggaatcaatatctatgccattggagtaaga
gatgctaataccaccgagcttgaggaaatagcgaacaacagggtatttttcacagatgat
tttcacttcttgaaaagcatccaccaggaagtcgttcgggatatctgctccttcgagaat
tgtaggagccagaaagctgacatcatcttcttgattgatggctcagaatccatctcttct
gaagactttgaaaagataaaggacttcgtgaagaggatggtgaatcagtctaatatcggg
gctgataaaattcagattggccttctgcagttcagctccactccccgggaagaattcacg
cttaagaacaattactcctcaaaggacgagatgtgtagagccatctcgaatgttacacag
ataaatagtggtaccgagactgggaaagccttgaatttcactctgcccttttttgacatt
tcacaaggagggagacccggagttcatcagtatttgattgtgatcaccgacggggactcc
catgatgatatagtttccccagccaaagccctcagggacagaaacataattatttttgct
attggggtgggaaaaattcaaagagcccagctcttggcgattactaatgaccaggacaaa
gtataccacgaagaaaattttgagtccctgcagaacctggaaaaggaaattctttatgag
gtctgtacttcccaaggctgcagcatggatctgtctgtaggagttgatacctcaacttct
tcagggcgagctcagcaggaacttggaaggtttctgcccgggctgctgcaacagctggcc
ttgctctctaacatcagctgcgaggctcctggccagatggagccgaggttccgctacgtg
gttccaggatccagcagtcggcctgtctttgattcgggctttgagaaatacaacgatgag
acggtccagaagttcttggtgcaacagggttctgcgaacaaccgactgaatgtagacttt
ctgaagtccctgggagagactgccatccactcgtcttctgctaacgtaaaggtcctttta
gtgtttacagacggacttgatgaagacctagagaggctgaggagaacctcggagctcctt
cgcagtggagggctctccggactcctggtcatcggcctgggaggcgcacacaagttagaa
gagctgcaggagctggagtttggcagagggtttgcgtacaggcagcctctgagctccagc
ctgccatccctcccaagtgtcttactaaagcaacttgacacaattgtggagaaggcatgc
tgttacgtgtgtgccaaatgttatggagatgatggggtcagaggtgaccctgggagtctc
gggaaggagggagagagaggtttggacgggctccccggccatcctggtgaagatggagac
catggaccaagaggcccccgtggtcttcctggacttcgaggtgaggaaggttgtccgggt
gtgagaggacctaagggagcaagaggattttcaggagagaagggcaaccctggtgatgaa
ggtgttgagggcttagatggagaacagggcgatcgcggggccgcggggtcatctggagag
aaaggaagcagaggcagtcagggcttgacaggtctgcctggaccagctggatctcgtggg
gagcctgggctgaggggagatcctggggatcctggaattgataatttcattcaaggcccc
aagggagaaaagggaaggcgtggacgtcagggaagttctggtttccacggacttctgggg
gaaactggcagtgtgggaccttgggggtcacttggaagacgtggcttgccagggttgaag
ggtgtgcttggagaggctggtgagctgggctaccaaggagaacctgggtatccaggccca
cagggacccagaggaaggcaaggacctccaggatcattcggtcaaaaaggggatgcgggt
gcccaggggaatcctgggcctccagggccaaatggctcaagaggaacagatggtcccaga
ggactcaagggagaactcggtcacacaggagaaagaggtccacggggtcagcaagggtca
agaggacagcctggtcgctttggtctagacggctacggtcatcctggaagaaaaggcaag
aagggtgaacctggattccctggctatcctggcttgcaaggagaagatggtaatccaggc
cgtggaggagagaagggggccaagggaatccgagggaagaggggcagttctggatttcct
gggtttgctggaactccaggtgaccaaggcccaccaggaaaaatgggcatcaagggccca
aaaggcttggcagataggacgccctgtgaaattgttgattttgtgcgaggaaactgccct
tgttcgataggtatttccagatgcccagcattcccgacggaagtggtctttgccttggac
atgtccaacgacgtctcccagtcagactttgagaggatgagaaacattttattatctctg
ttgatgaagctggaaatgagtgagagcaactgtcccactggtgctcgagtggccattgtt
tcctacaacatgaggacagattatctagtgcgcttgtcagaccatcggggaaaagctacc
cttctacaggctgtcaggaaaatccccctggaaaggtcatctgggttccgaaaccttggg
gccaccatgaagtttgtggctagacatgtattcaaacgtgtacgctcagggtttttggtg
aggaaagtagccgtgttcttccaggtgggtggaaattatgacgcaacctctgtcaacaca
gccatgctggagctccatgcagcgaacattgtcacagcagtagtgaccttcacagaggac
cacaacctcccagatgccctgctggtggatggactcaacacctttcacttgtttacctgg
gagacggagcaccagcaggatgtggaacgcctggtcagctgtactctctgctacgacaag
tgccgaccagcccccgggtgccgggtggaatctcccaggcctcaggagttggacatggac
ttggtcttcctggtggacagctcccagggtgtcagcagggacatataccttggggctttg
aggctagtggattcagtgctcaaggacctggaggtggctgcacagcctggcacatcctgg
cacggggcacgcgtggccctgttgacacacaccaccccaggcttttggcctggtgtgggt
caagaccctgtgcttgagtactttcacctgacctcccacggccaccggacacagatgcaa
aggcagattcaggaggctgcaagtggtttgctacagggagcccctgccctgggccacgcc
ctggagtggacgctggaaaaagtgcttctggcagccatattgcctcgaaggtcacaggtc
ctctttgccattgtggccagtgagacaagcatctgggacagagagacgctaagggctctg
tccctggaggccaagtgcaagggcatcacactatttgtgttggctgtgggcccaggagta
ggagcccaggagctgactgagctggccaatgtggccagtgcaccctcagagcagcacctg
ttacgcctggagggtgtctcagaggcagaggtagcctatgccagcagattcacggaggcc
ttcctgaaccttctaaaaagtggaataaaccagtatccacccccagagctcaatgaagag
tgtggaggcccaaaccgtggggacaccttgctacaattcttcacttctgccaagaggctg
tccaagccccggtttggcacgtctgctgccactgggtatgactcaggagcacccaagcca
caagactcctttctgagggggaagaggaagatcaggacactgtccgtagctcagcaagaa
acgctgggaagcctcggaagggacagggcagatgctgaagaaacaccagcgaaaggaaga
cgttcgggcgctgcccatggtgagataccaaaccttgttctctaa

KEGG   Rattus norvegicus (rat): 316758
Entry
316758            CDS       T01003                                 

Gene name
Lama1
Definition
(RefSeq) laminin subunit alpha-1 precursor
  KO
K05637  laminin, alpha 1/2
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05145  Toxoplasmosis
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
rno05410  Hypertrophic cardiomyopathy
rno05412  Arrhythmogenic right ventricular cardiomyopathy
rno05414  Dilated cardiomyopathy
rno05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    316758 (Lama1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    316758 (Lama1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    316758 (Lama1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    316758 (Lama1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    316758 (Lama1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    316758 (Lama1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    316758 (Lama1)
   05145 Toxoplasmosis
    316758 (Lama1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    316758 (Lama1)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    316758 (Lama1)
   05414 Dilated cardiomyopathy
    316758 (Lama1)
   05416 Viral myocarditis
    316758 (Lama1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_B Laminin_N Laminin_II Laminin_G_3 Flexi_CP_N DEDD_Tnp_IS110
Other DBs
NCBI-GeneID: 316758
NCBI-ProteinID: NP_001101707
RGD: 1307207
Ensembl: ENSRNOG00000017237
UniProt: D4A409
Position
9
AA seq 3083 aa
MRGSGTGVALLVLLASVLWVTVRSQQRGLFPAILNLATNAHISANATCGEKGPEMFCKLV
EHVPGRPVRHAQCRVCDGNSTNPRERHPITHAIDGTNNWWQSPSIQNGREYHWVTVTLDL
RQVFQVAYVIIKAANAPRPGNWILERSVDGVKFRPWQYYAVSDTECLTRYKITPRRGPPT
YRADNEVICTSYYSKLVPLEHGEIHTSLINGRPSADDPSPQLLEFTSARYIRLRLQRIRT
LNADLMTLSHRDLRDLDPIVTRRYYYSIKDISVGGMCICSGHASSCPWDEEAKQLQCQCE
HNTCGESCDRCCPGFHQQPWRPGTISSSNQCEECNCHNKAKDCYYDNNVAKERKSLNTAG
QYSGGGVCINCLQNTTGINCETCIDQYYRPHKVSPYEDQPCRPCDCDPVGSLSSVCIKDD
LHADLANGKWPGQCPCRKGYAGDKCDRCQFGYRGFPNCVPCDCSTVGSVNEDPCTEPCLC
KKNVEGENCDRCKPGFYNLKERNPEGCSECFCFGVSDVCDSLTWSVSQVTNMSGWLVTDL
ISPNKIRSQQDALGRHRQISINNTAVMHRFTSPYYWAAPEAYLGNKLTAFGGFLKYTVSY
DIPVETVDSNLMSHADVIIKGNGLTISTRAEGLSLQPYEEYFNVVRLVPENFRDFNTRRE
IDRDQLMTVLANVTHLLIRANYNSAKMALYRLDSVSLDIASPNAIDLVVAADVEHCECPQ
GYTGTSCEACLPGYYRVDGILFGGICQPCECHGHSSECDIHGICSGCTHNTTGDHCEQCL
PGFYGTPSRGTPGDCQPCACPLSIDSNNFSPTCHLTDREEVVCDQCAPGYSGAWCERCAD
GYYGNPTVPGGTCVPCNCSGNVDPLEAGHCDSVTGECLKCLWNTDGAHCERCADGFYGDA
VTAKNCRACNCHENGSLSGICHLETGLCDCKPYVTGQQCDQCLPGYYGLDTGLGCVPCNC
SGEGSISDNCTEEGQCHCVPGVSGKQCDQCSHGFYAFQNGGCTPCDCAHTQNNCDPDSGE
CLCPPHTHGLKCEQCEEAFWGLDPEQGCQACNCSVVGSTSPQCDVLSGQCSCKEGFGGQS
CHQCSLGYRSFPDCVPCDCDLRGTLADTCDLEQGLCSCTEDSGTCSCKENVLGPQCDKCR
AGTFALRADNPLGCSPCFCFGLSQLCSELEGYVRAPITLASDQPILHVVSQSNLKGTTEG
VHFQPPDTLLDAEAVRQHIYAEPFYWRLPKQFQGDQLLAYGGKLQYTVAFYSTLGTGTSN
YEPQVLIKGGRTRKHIIYMDAPAPENGVRQDYEVGMKEEFWKYFNSVSEKHVTHSDFMSV
LSNIEYILIKASYGQGLQQSRIANISMEVGRKAVEPAPEGKVALQLELCVCPPGTAGHSC
QDCAPGYHRGKLPESSGRGPRPLLAPCVPCNCNNHSDVCDPETGKCLNCRDHTAGDHCEL
CTAGYYGKVIGLPGDCTPCTCPHHPPFSFSPTCVLEGDSGFWCDACLPGYEGQYCERCSA
GYHGNPRAAGGSCQKCDCNPQGSVHSDCDRASGQCVCKPGATGLHCEECLPRHILMESHC
VSCDDECVGALLNDLDSIGDAVLSLNLAGVSLAPYGTLENLENTTKYFQGYLLEENAKKV
QAEIQLGGIEEQTENLQKELARVLRSHQQVNTAMERTSNRSQALATFLEQLHRNIKEITE
KVATLNQTTGEDFQPPVSALQSLHQNISSLLALIKKRNFTEMRQNATLELKAAKDLLSRI
QKRFQKPQEKLKALKEASSLLSNHIADLQAAEELLREAGSKTQESSLLLLLVKANLKDFR
EKKLHVQEEQNLTSKLIAQGREWVDAARTHAAAAQDTLTQLEHHRDELLLWASKIRSHVD
DLVMQMSKRRARDLVHRAEQHASELQSAAEALDRDLENVRNVSLNATSAVHVHTNIQTLT
EEAESLAADAHKTANKTSLISESLAPRGKAVLQRSSRFVKESVSTRKKQQGITLKLDELK
NLTSQFQERVDNITRQANDSLTVLRESPGGMREKSRKVKELAVAANETAARTLEDMLGLS
LRVFNTSEDLSRVNATVQETKDLLHNSTMTTILAGRKMRDMEMQANLLFDRLKPLKMLEE
NLSRNLSEIKLLISRARKQAASIKVAVSADRDCIRAYQPQISSTNYNTLLLNVKTQEPDN
LLFYLGSSSSSDFLAVEMRRGKAAFLWDLGSGSTRLEFPDVSINNDKWHSIYITRFGNMG
SLSVKEASAAEDPPVRTSKSPGLANVLDINNSTLMFVGGLGGQIKKSPAVKVTHFKGCMG
EAFLNGKSIGLWNYIEREGKCNGCFGSSQNEDASFHFDGSGYAVVEKALRPTVTQIIILF
STFSPNGLLFYLASNGTKDFLSIELLRGRVKVMVDLGSGPLTLMTDRRYNNGTWYKIAFQ
RNRKQGLLAVFDAYDTSDKETKQGETPGAASDLNRLEKDLIYVGGLPHSKAVRKGVSSRS
YVGCIKNLEISRSTFDLLRNSYGVRKGCTLEPIQSVSFLKGGYMEMPPKSLSPESSLLAT
FATKNSSGVILAALGKDAEKAGASQAHVPFFSILLIEGRIEVHINSGDGTSLRKALLHAP
TGSYSDGQEHSISVVRNRRVITVQLDENSPVEMKLGPLTEGRTINISNLYIGGLPEGKGT
PMIRMRTSFHGCIKNVVIDAQLLDFTRAAGSEQVELDTCLLAEEPTQGLHREHGELPPEP
PTLPEPELCAVDTAPGYVAGAHQFGLSQNSHLVLPFNQSDVRKRLQVQLNIRTFASSGLI
YYVAHQNQMDYAVLQLHEGRLNFMFDLGKGRTKVSHPALLSDGKWHTVKTEYIKRKAFMT
VDGQESPSVTVVGNATTLDVERKLYLGGLPAHYRARSIGTITHSIPACIGDVTVNSQQLD
KDRPASAFAVDRCYAVAQEGTFFEGSGYAALVKEGYKVRLDLNITLEFRTSSKNGVLLGI
SSAKVDAIGLEIVDGKVSFHVNNGAGRITATYKPRATRTLCDGKWHTLHAHKSRHRIVLT
VDGDAVRAESPHTHSTSADTNDPIYVGGYPAHVKQNCLSSRASFRGCVRNLELSRGSQVQ
SLDLSRAFDLHGVFPHSCPGPEP
NT seq 9252 nt   +upstreamnt  +downstreamnt
atgcgcggcagcggtacgggagtcgcgctcttggtgctcctggcctcggtgctctgggtc
accgtgcggagccagcagcgaggcttgttccctgccattctcaacctggccaccaatgcc
cacatcagtgccaatgctacctgtggagagaaggggcctgagatgttctgcaagctcgtg
gagcacgtgccgggccggcctgttcgacacgcccagtgccgggtctgtgatggtaacagt
acaaatcctagagagcgccacccgataacccacgcaatagacggcaccaataactggtgg
caaagccccagcattcagaatgggagagagtatcactgggtcaccgtcaccctggactta
cggcaggtctttcaagttgcgtatgtcatcattaaagctgccaatgcccctcggcctgga
aactggattttggagcgctccgtggatggtgtcaagtttagaccctggcagtactatgca
gtcagtgacacagagtgtctgacgcgctacaaaataactccacggcggggacctcccacc
tacagagctgacaacgaagtgatctgcacctcgtattattctaaactggtgccacttgaa
catggagagatccacacatcactcatcaatggcaggcccagtgctgatgacccctcgccc
cagttgctggaattcacctcagcacggtacatacgccttcgtctgcagcgcatcagaaca
ctcaatgcagacctcatgacccttagccaccgggacctcagagacctcgacccgattgtc
acaagacgttattactactcgataaaagacatctccgttggaggcatgtgcatttgctcc
ggccatgctagcagctgcccctgggatgaagaagcaaagcaacttcagtgtcagtgtgaa
cataatacatgtggtgagagctgtgacaggtgctgtcccggcttccatcagcagccctgg
aggcccggaaccatttcctccagtaaccagtgtgaagaatgtaactgtcacaacaaagcc
aaagattgttactatgacaacaatgttgcaaaggagaggaaaagccttaacactgccggg
cagtacagcggaggaggagtttgtatcaactgcttgcagaacaccacagggatcaactgt
gaaacctgcattgaccagtactacagacctcacaaagtatctccgtatgaggaccagcct
tgccgtccctgtgactgtgacccggtggggtctctgagttctgtctgtatcaaggatgac
ctccacgccgacttagccaacgggaagtggccgggtcagtgtccatgtaggaaaggttat
gcaggagacaaatgtgaccgctgccagtttggctaccggggtttcccgaattgcgtcccc
tgtgactgcagcaccgttggcagcgtgaacgaggacccatgcacagagccatgtctttgt
aagaaaaatgttgagggagagaattgtgatcgctgcaaaccaggattctataacttgaag
gaaagaaaccccgagggctgctcggagtgcttctgctttggtgtctccgatgtctgtgac
agcctcacatggtctgtcagtcaggtgaccaatatgtcagggtggttggtcaccgacttg
atcagcccaaataagatccggtctcagcaggatgccctgggcaggcaccgtcagatcagc
atcaacaacacggcggtcatgcacagatttacttccccttattactgggcagctcctgag
gcctaccttggaaacaagctgacagcatttggtgggttcctaaaatacacagtgtcttac
gacatccctgtggagacggtggacagtaatctcatgtctcatgctgacgtcatcatcaag
ggaaatgggctcactataagcacacgggctgagggcctgtccttgcaaccctacgaggaa
tacttcaatgtggttagacttgtgcctgagaacttccgggactttaataccagaagggag
attgaccgtgaccagctgatgactgtcctggccaatgtgactcatctcttgattagagcc
aactataattctgctaaaatggcgctgtatagattggattctgtctctctggacatagca
agccctaatgctatagacctggtagttgctgctgatgtggagcactgcgaatgtccccaa
ggctacacagggacctcctgtgaggcctgcctccccggctattatcgagtagatgggata
ctctttggaggaatctgtcagccctgcgaatgccacggccactcatctgagtgtgacatt
catggaatttgctctgggtgtacacacaacaccacaggggatcactgtgagcagtgcctg
cctggcttctatgggacaccttcacgtgggaccccaggggactgccagccttgtgcctgc
cctctctccattgactccaacaatttcagccctacctgccacctcaccgacagagaggaa
gtggtctgcgaccagtgtgccccaggttactcaggagcctggtgtgagagatgtgcagat
ggttactatggaaacccaaccgtgccagggggaacctgtgtaccatgcaactgcagtggc
aatgttgatcccctggaggctggccactgtgactctgtcaccggggaatgcctgaagtgc
ttgtggaacacagacggtgcccactgtgagaggtgtgcagatggcttctatggagacgca
gtgactgccaaaaactgccgagcctgtaactgccatgaaaatggctccctttctggcatc
tgccatctggaaactggactgtgtgattgcaaaccttacgtgacaggacagcagtgcgac
cagtgcctgcctggctattatgggttggacacgggacttggctgtgtgccctgtaactgc
agcggggaaggctccatatctgacaactgcacggaggaaggccagtgtcactgtgtacca
ggtgtatctggaaaacagtgtgaccagtgttcacacggcttctatgcattccagaatggc
ggctgtacaccctgtgactgtgctcatactcagaataactgtgaccccgattctggagag
tgtctctgcccacctcacactcacggtctgaagtgcgagcaatgtgaagaggcattctgg
gggctggacccggagcaggggtgccaggcttgcaattgtagtgtggtgggctccacaagt
ccccagtgtgatgttctctctggccaatgttcctgcaaagaagggtttggcgggcagagc
tgccatcagtgttccttgggctacaggagttttcctgactgtgttccctgtgactgtgac
ctgagagggacactggctgacacctgtgacctggaacagggtctctgcagctgcacagag
gacagcggcacctgctcctgcaaggagaatgtcctgggaccccagtgcgataaatgccga
gctggcacctttgccttgcgagctgacaaccccctaggctgcagcccctgtttctgcttt
ggtctatcacagctctgctcggagttggagggttatgtgcgggctcctataaccctagcc
tccgatcagcccatcctgcatgtggtttcacagagcaacctcaagggcacaactgaaggg
gtgcatttccagcctccggacaccttgctggatgcagaggctgtccgccagcacatctat
gcagagccattttactggcggctaccgaagcagttccagggagatcagctcttggcctat
ggtgggaaactgcagtataccgtggctttctactccaccctcggcaccggtacatccaat
tatgagcctcaagtcctcatcaaaggaggtcggaccaggaagcacatcatttatatggat
gccccagcaccagagaacggagtgagacaggactatgaagtggggatgaaagaggagttt
tggaaatattttaactctgtgtctgagaaacatgtcacacactccgattttatgtctgtt
ctcagcaatattgagtatatcctcatcaaagcatcatacggccaaggactgcagcaaagc
agaattgccaacatctccatggaggttggcaggaaggctgttgagccggcccctgagggc
aaggtggcactgcagttggagctctgtgtctgtcctcctggcaccgctggacactcctgt
caggactgtgctcctgggtaccacagagggaagctcccagaaagcagtggtaggggaccc
cgccctctgttggctccctgcgtgccctgcaactgcaacaaccacagtgatgtctgtgac
cccgaaactggaaagtgtctgaactgcagggaccacacagccggcgaccactgcgagctg
tgcactgctggctactatggaaaagtgattgggttgcctggagactgtaccccatgcacc
tgtcctcatcaccctcctttcagtttcagccccacttgtgtcttggaaggtgacagtggt
ttctggtgtgatgcctgtctccccggctatgagggacagtactgcgaaaggtgctctgcg
ggctatcacggcaaccctcgagcagcgggtggcagctgccagaagtgtgactgcaacccc
caaggctctgtccacagtgactgtgaccgtgcatcggggcagtgtgtctgcaagccagga
gctacgggactccactgtgaggaatgcctgccgagacacatcctgatggagagccactgt
gtgtcttgtgatgatgaatgtgtgggtgctttgctgaacgacctggattctattggtgat
gctgtcctgtctttgaacctcgcgggtgtttcccttgctccgtacggaactctggaaaat
ctggaaaatacaactaaatatttccaggggtatttgctagaggaaaacgccaagaaggtg
caggcagaaatccagcttggaggtatcgaagaacagacagaaaatctgcaaaaggagctc
gccagagtgttgagaagccatcagcaggtgaacacagcaatggaaagaacctccaatagg
agtcaggccctggccacgttcctcgagcagcttcacagaaacatcaaagaaatcacggag
aaggtggcaacattgaatcagaccacaggtgaagatttccagccgcccgtttctgccctt
cagagcctgcaccagaacatttcgtctttgctggcacttataaagaaaaggaatttcaca
gagatgcggcagaatgccacccttgagctcaaggccgccaaagatttattgtcacggatt
cagaaaaggtttcagaagcctcaggaaaagttgaaggcattgaaggaggccagcagcctc
ctctcaaaccacatcgctgacctgcaggctgcagaggagctcctcagagaagctgggagc
aagacccaggaaagcagcctcctgctgctccttgtcaaggccaacctgaaagacttcagg
gagaaaaagctgcatgttcaagaggaacaaaatctgacctcaaagctcattgcccaagga
agagaatgggtagatgctgccaggactcatgcagctgctgcccaagacaccctaacacag
ctggagcatcaccgagacgaactcctattgtgggccagtaaaatcaggagccacgtagat
gacctcgtcatgcagatgtccaaacgaagagcccgagaccttgtccacagggcggagcag
cacgcctctgagctgcagagtgcggcagaagcattggacagagaccttgaaaatgttaga
aacgtgtctttgaacgccaccagcgcagtgcatgtccacaccaacatccaaacactgacg
gaggaagcagagagtctggctgcagatgctcacaagacggcgaataagacaagcttgatc
tcagaatccctggctcctcgggggaaagcagtccttcagcgttcttccagatttgtaaaa
gaaagcgtcagtaccaggaagaagcagcaaggcattacgttgaagctggatgagttgaaa
aacttgacgagtcagtttcaggagcgtgtggacaacattacaagacaggccaacgactcc
ctcacagtactcagagaaagcccgggaggtatgagagagaagagtaggaaagtcaaagag
ctggcagtggcagccaatgagactgcggcgaggacactggaggacatgctgggtttgagc
ctaagggtcttcaacacatccgaggacttgtccagggtgaatgccacagtccaggagaca
aaggaccttctgcataactccacaatgaccactatcttggctggaaggaaaatgagagac
atggaaatgcaggccaaccttctgttcgatcgattgaagcctttgaaaatgctggaagaa
aacctgagcagaaacctgtcggagatcaagctgctcatcagccgggcccggaagcaagca
gcttcgatcaaagtagccgtgtctgcggacagagactgcattcgggcctaccagcctcag
atttcttctaccaattacaacaccttgttactgaacgttaagacccaggagccggacaac
cttctcttctaccttggcagcagcagcagctccgactttctcgcagtggagatgcggcgt
gggaaggcagcctttctctgggacctgggctctggctctaccaggttggaattcccagat
gtctccatcaacaacgacaaatggcatagcatctacataaccaggtttggaaacatgggg
tccctgagtgtaaaggaagcaagtgcagctgaggacccaccggtcaggacaagcaaatct
cctggactggcgaatgttctggacataaacaattcgacgctaatgttcgttggagggctt
ggaggtcaaatcaagaaatccccggctgtgaaggttactcattttaagggctgcatggga
gaggccttcttgaatggcaaatccattggcctgtggaattacatcgaaagagaggggaag
tgcaatggctgctttggaagctcccagaacgaagacgcttccttccattttgatggaagc
gggtacgcagtggtggagaaggctctccgtcccaccgtgacacagataataattctcttc
agcaccttctccccgaatgggcttcttttctacctggcttcaaacggcactaaggacttc
ctgtccatcgaactgttgcgtggcagggttaaagtgatggtggacctgggctcaggaccc
ctcactcttatgacggacaggcggtataacaacggaacctggtataaaatcgccttccag
cggaaccggaagcaaggactgctagctgtcttcgatgcatatgacaccagtgacaaggag
acaaagcaaggagaaactccaggagccgcttccgacctcaatcggctggagaaagacctg
atttacgtgggcggattacctcactctaaggctgtgaggaaaggggtcagcagcagaagc
tatgtgggctgtatcaagaacctggagatatccaggtccacctttgatttgctgagaaat
tcctatggagtgagaaaaggctgcacgctggagcctatccagagtgtgagcttcctgaaa
ggcggctacatggagatgccacccaagtctctgtcgccagagtcatccctgctggccaca
ttcgccaccaagaacagcagtggagtcatcctggctgccctgggcaaggatgcggagaag
gctggggcgtctcaggcacatgtgcccttcttttccatcctgctgattgaaggccgcatc
gaagtgcacatcaactctggggatgggaccagtctaaggaaggccctcctgcacgctccc
accggctcttacagtgatggacaagaacattccatctccgtggttaggaatcggagagtt
atcaccgtacagctggacgagaacagccctgtagaaatgaagttgggtccgttaacagaa
ggaaggacgatcaacatatccaacctgtacatagggggacttcctgagggcaagggaacc
ccgatgatcaggatgcggacatcgttccacgggtgtattaaaaacgtggtcatcgatgct
caacttttggactttacccgtgcggctggttctgaacaagtagagctggacacatgcttg
ctggcagaagaacccacacagggcctgcacagagaacatggggaactccctccagaaccc
ccaacgctaccagagcctgaactctgtgcagtagacacggctcctgggtatgtggcagga
gctcaccagtttggcctctcgcagaacagccacttggtgctcccttttaatcagtctgac
gtccggaagaggctccaggtgcagctgaacatccggacatttgcctccagtgggcttatt
tactacgtggctcatcagaaccaaatggactacgccgtgctccagctccacgagggccgc
ctgaacttcatgtttgaccttggcaagggccgaaccaaggtctctcaccctgccctgctc
agcgacggcaagtggcacaccgtcaagacggagtacattaaaagaaaggccttcatgact
gttgacggccaagagtcccccagtgtcactgtggtgggcaatgcaaccacgctggatgtg
gaaaggaagctgtaccttggaggccttcctgctcactacagggccaggagcatcgggact
atcacccacagcatccctgcttgtatcggggacgtgacggttaatagccagcagctggat
aaagacaggccggcctccgcctttgcagtggacaggtgctatgcggtggcccaggaagga
actttctttgaaggaagtggatatgcagctcttgtgaaggaaggctacaaagttcgactg
gatttaaatatcacactggagttccgaacttcctctaagaatggcgtcctcctgggaatc
agcagtgccaaagtggatgccattggtctagagattgtagacggcaaggtctcatttcac
gtcaacaacggtgctggaaggataacagccacctacaaacccagagccaccagaactctc
tgtgatgggaaatggcacacgctccatgcccacaaaagcaggcaccgaatcgtcctgact
gtggatggggacgccgttagggctgaaagccctcacacccactccacctcggcagacact
aacgaccccatttatgtgggtggctaccccgcccacgtcaaacagaactgcctgagcagc
cgggcctcgttccggggctgtgtgaggaatctcgagctgagcagaggctcacaggtgcaa
tccttggacttgagcagagcctttgacctccacggagttttccctcactcctgcccgggg
cctgagccctga

KEGG   Rattus norvegicus (rat): 361821
Entry
361821            CDS       T01003                                 

Gene name
Col6a2
Definition
(RefSeq) collagen alpha-2(VI) chain precursor
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    361821 (Col6a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    361821 (Col6a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    361821 (Col6a2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    361821 (Col6a2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    361821 (Col6a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    361821 (Col6a2)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    361821 (Col6a2)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   361821 (Col6a2)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   361821 (Col6a2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   361821 (Col6a2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 Ins134_P3_kin_N
Other DBs
NCBI-GeneID: 361821
NCBI-ProteinID: NP_001094211
RGD: 1305585
Ensembl: ENSRNOG00000001254
UniProt: F1LNH3 Q5EB88
Position
20
AA seq 1027 aa
MTTTTKMLQGSFSVLLLGGLLGVLHAQQQEAISPDISTTDRNNNCPEKADCPVNVYFVLD
TSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLEQVALSWRYGGLHFSDQVEVFSPPG
SDRASFTKSLQGIRSFRRGTFTDCALANMTQQIRQHVGRGVVNFAVVITDGHVTGNPCGG
IKMQAERAREEGIRLFAVAPNRNLNEQGLRDIANTPHELYRNNYATMRPDSTEIDQDTIN
RIIKVMKHEAYGECYKVSCLEIPGPHGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEG
PIGFPGPKGVPGFKGEKGEFGSDGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGSRG
PDGYPGEAGSPGEQGDQGAKGDSGRPGRRGPPGNPGDKGSKGYRGNSGAPGSPGVKGGKG
GPGPRGPKGEPGRRGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEIGSKGAKGDRGLPG
PRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGTPGEKGEPGP
PGPEGGRGDFGLKGAPGRKGEKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPGDPGLTEC
DVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDP
KSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYN
QLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHETHESENL
YSIACDKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPELPCQTELYVAQCTQRPV
DIVFLLDGSERLGEQNFYKARRFVEEVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF
PLTYNVTTIHEALERTTYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGVT
GNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSLAQPSFFD
RFIRWIC
NT seq 3084 nt   +upstreamnt  +downstreamnt
atgacgactactaccaagatgctccagggttctttttctgtgctcctgcttgggggactc
ttgggggtcctccatgcccagcagcaggaagccatctcacctgatatctctactactgac
aggaacaacaactgtccagagaaggctgactgcccagtcaacgtgtacttcgtgttggac
acctcggagagcgtggccatgcagtcccccacagacagcctgctttatcacatgcagcag
ttcgtaccacaatttatcagccagctgcagaacgagttctacctggagcaggtggccctg
agctggcgctatggtggcctacacttctcggaccaggtggaggtgttcagccccccaggc
agcgaccgggcctccttcactaagagcctacaaggcatccgctccttccgcaggggcacc
ttcactgactgtgccttggccaacatgacccagcagatccggcagcacgtaggccggggt
gtagtcaactttgccgtggtcatcaccgatggccacgtcacaggcaatccctgtgggggc
atcaaaatgcaggctgagcgtgcccgcgaggagggcatccggctctttgccgtggccccc
aataggaacctaaatgaacaaggcctgcgggatatcgccaacaccccacatgagctatac
cgaaacaactatgccaccatgcgacctgactctactgagattgaccaggacaccatcaac
cgcatcatcaaggtcatgaaacatgaagcctacggagagtgctacaaggtgagctgcctg
gagattccgggaccccacggacccaagggttaccgaggacagaagggcgccaagggcaac
atgggcgagccaggagaacctggacagaaaggacgacagggagaccctggcattgaaggc
cccattggattcccaggacccaagggtgtgcctggcttcaagggagagaagggtgaattc
ggatcagatggtcggaagggagctccgggtctagcaggcaagaacggaacagatggacag
aagggcaaactgggccgtattgggcctcctggttgcaagggagacccgggaagtcggggc
cccgatggataccccggagaagctggaagcccgggagagcaaggagaccaaggtgccaag
ggggactctggccgcccaggacgcaggggaccaccaggaaatcctggagacaaaggaagc
aagggatatcgaggcaacagtggagctcccggaagtccaggagtgaaaggaggcaaggga
gggcctggcccccgtggaccaaaaggagagcctggacgcaggggagatcctgggaccaag
ggcggcccgggcagtgatggtccaaagggggagaagggagaccctggtcctgaggggcca
cgaggtctggctggagaaatcggcagtaaaggagccaagggagacagaggtttgcctgga
cccagaggcccccagggagctcttggggagccagggaagcagggatctcggggagaccct
ggggatgctgggcctcgaggagattcaggacagccaggacccaagggagatcctggaagg
cctggattcagctacccaggaccccgagggacacccggtgaaaaaggcgagcccggccca
ccaggccctgagggaggccgaggagactttggtctgaaaggagcacccggacggaaggga
gaaaaaggagagccagctgatcctggtccccctggtgaacctggccctcgggggccaaga
ggaatcccaggacctgagggagaacccggccctccaggagatcctggtctcacggaatgt
gacgtcatgacctatgtgagggagacctgtgggtgctgtgactgtgagaagcgctgcggt
gccctggatgtggtcttcgtcatcgacagctctgagagtattggctacaccaacttcacc
ttggagaagaactttgtcatcaatgtggtcaacaggctgggcgccattgccaaggacccc
aagtctgaaacgggtacacgtgtgggtgtggtacagtacagccatgagggcacctttgag
gccatccggctggacgacgagcgggtcaactccctgtccagtttcaaggaggccgtcaaa
aaccttgagtggattgccggcggcacttggacgccctctgccctcaagttcgcctataat
cagctcatcaaagagagccggcgccagaagacccgggtgttcgcagtggtcatcacagat
gggcgccatgaccctcgagatgatgacctcaatcttcgggcactgtgtgaccgagacgtc
accgtgacagccatcggcatcggtgacatgttccacgagacccacgagagcgagaacctc
tactccatcgcctgtgacaagccccagcaggtgcgcaacatgacgctgttctctgacctg
gtggccgagaaattcatcgatgacatggaagacgtcctttgtccggacccccagatcgtg
tgtccagaacttccctgccaaacagagctctatgtggcccagtgcacacaacggcccgtg
gacattgtcttcctgctggatggctcggagcggctgggcgagcagaacttctacaaggca
cggcgcttcgtggaggaggtgtcacggcgcctgactctggcacgaagggatgacgaccca
ctcaatgcccgcatggccctgttgcagtatggaagccagaaccagcaacaggtggccttc
ccgttgacctacaatgtgaccaccatccacgaggccctggagagaaccacctacctcaat
tctttctcgcacgtgggcacaggcatcgtgcacgccatcaacaacgtggtgcggggggca
cggggtggggcccggcgtcacgcggagctgtccttcgtcttcctcacggacggcgtcacg
ggcaacgacagcctggaggagtccgtacactccatgcgtaagcagaatgtggtgcccacc
gtggtcgctgtgggtggtgatgtggacatggatgtacttaccaagatcagcctgggtgac
agggcggccatcttccgggagaaagacttcgacagtctagcccagcccagcttcttcgac
aggttcatccgctggatctgttag

KEGG   Rattus norvegicus (rat): 362285
Entry
362285            CDS       T01003                                 

Gene name
Col9a3
Definition
(RefSeq) collagen type IX alpha 3 chain precursor
  KO
K08131  collagen type IX alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    362285 (Col9a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    362285 (Col9a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    362285 (Col9a3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    362285 (Col9a3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    362285 (Col9a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:rno00535]
    362285 (Col9a3)
Proteoglycans [BR:rno00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   362285 (Col9a3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 362285
NCBI-ProteinID: NP_001102081
RGD: 1310255
Ensembl: ENSRNOG00000009531
Position
3
AA seq 680 aa
MTGAPALALLLLGQLLTATSAQKVGPRGPPGPQGPPGKPGKDGIDGEAGPPGLPGPPGPK
GTSGKPGKPGEAGLPGLPGVDGLTGRDGPAGPKGAPGERGSLGPPGPPGLGGKGLPGPPG
EAGVSGLPGGIGLRGPPGPSGLPGLPGPPGPPGPPGNPGVLPEGATDLQCPAICPPGPPG
PPGMPGFKGPTGYKGEQGEVGKDGEKGNPGPPGPPGIPGTVGLQGPRGLQGLPGPLGPPG
DRGPIGFRGPPGIPGAPGKAGDRGDRGPEGFRGPKGDLGRPGPKGIPGVAGPGGEPGMPG
KDGKDGVPGLDGEKGEAGRTGAPGEKGPNGLPGLPGRAGSKGEKGEVGRTGELGEAGPSG
EPGIPGDVGVPGERGEAGHRGSVGALGPQGPPGAPGIRGFQGQKGSIGDPGLPGPQGLRG
DVGDRGPGGATGPKGDQGIAGSDGLPGDKGELGPNGPVGQKGESGSRGELGPKGIQGPNG
TSGVQGVPGPPGPLGFQGVQGVPGITGKPGVPGKEASEQRIRELCGGMISEQIAQLAAHL
RKPLAPGSIGRPGPAGPPGPPGPPGSIGHPGARGPPGYRGPTGELGDPGPRGSQGDRGDK
GAIGVGLDGPAGDQGYRGPQGVPGISKDGRDGAHGEPGLPGDPGLPGAAGAQGTPGICDT
SACQGAVLGGGGEKSGPRSS
NT seq 2043 nt   +upstreamnt  +downstreamnt
atgaccggagcgcctgccctagccctgctgctgctggggcagctcctgacggccacctcc
gcgcagaaagtgggacctcgaggcccccctggtccccaagggcctcctggaaaacccggc
aaggatggcatcgatggagaagctggccctccaggtctgcctggccctccaggacccaaa
gggacctcagggaagccggggaagccgggagaggcaggactgccaggactgcctggtgta
gatggtctgacagggagagatggacccgcaggacccaaaggtgccccgggagaacgggga
agtctaggacctccagggccaccaggacttggaggcaaaggcctccctggacctcctgga
gaggcaggagtgagtggcctcccaggtgggattggtctacgtggccccccgggaccctct
ggacttccaggcctgcctggccccccaggacctcctggacctcctggaaaccctggagtc
ctccctgaaggtgctactgatctgcagtgtcctgccatctgccctccaggccctccaggg
cccccaggaatgccggggttcaaggggcctactggctacaaaggggaacaaggagaagtc
ggcaaagatggtgagaagggtaatcctggcccccctggacctccaggaatcccaggcact
gtggggctacagggcccaagaggattgcaaggacttccagggccacttggaccccctggg
gaccggggtcccattgggtttcggggccccccagggatcccaggagcacctgggaaagcg
ggtgacaggggtgataggggaccagaagggttccgtggccctaagggtgacctgggcagg
cctggtcccaaaggaatccctggagtggctgggccaggcggagaaccaggcatgccaggc
aaggatggcaaggacggtgtgccgggacttgacggtgagaagggagaggctggtcgcact
ggtgccccaggagaaaaaggccccaatgggctgccggggctccctggacgagcagggtcc
aaaggcgagaagggagaagtgggtagaactggggagctgggcgaggctggcccctctgga
gaaccaggtattcctggagacgttggtgttccgggggagcgtggcgaggctggtcacagg
ggctcagtgggggccctcggcccacaaggtcctcctggggctcctggcattcgtggcttc
cagggccaaaaaggcagcataggagaccctggccttccaggcccccaaggcctacgggga
gatgtgggtgaccggggtccaggaggtgccacaggtcctaagggagaccagggcatcgcg
ggttctgatggtcttccaggggacaaaggagagctgggtcctaatggccccgttggacaa
aaaggagagtccggcagccgaggggaactgggccccaaaggcatccagggccccaatggt
accagtggggtccagggtgtgcctggtcccccaggtccactgggcttccaaggcgtgcag
ggtgtcccaggcatcacggggaaacctggagttccgggcaaggaagccagtgaacaacgc
atcagggagctgtgtggggggatgatcagtgagcaaattgcacagctggctgcacacctg
aggaagcccttagcaccaggctccattggcaggcctggtccagctggccccccaggccct
ccaggccccccaggctccattggccaccctggcgctcggggtcctcctggataccgtggt
cccactggggaactaggagatcctggaccaagaggaagccagggtgacagaggagacaag
ggagcaataggtgtagggttggatggacctgctggggaccagggctaccgagggcctcaa
ggtgtgcctggcatcagcaaagatggtcgagatggtgctcatggtgagcctggccttcct
ggtgatcctggccttcctggagctgctggtgctcaaggaaccccggggatttgcgacacc
tcagcctgccaaggagctgtgttagggggaggtggggaaaagtcaggtcctcggagctca
taa

KEGG   Rattus norvegicus (rat): 362584
Entry
362584            CDS       T01003                                 

Gene name
Col9a2
Definition
(RefSeq) collagen alpha-2(IX) chain precursor
  KO
K08131  collagen type IX alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    362584 (Col9a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    362584 (Col9a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    362584 (Col9a2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    362584 (Col9a2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    362584 (Col9a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:rno00535]
    362584 (Col9a2)
Proteoglycans [BR:rno00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   362584 (Col9a2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 362584
NCBI-ProteinID: NP_001102145
RGD: 1307029
Ensembl: ENSRNOG00000011502
UniProt: D3ZNT5
Position
5
AA seq 688 aa
MTALPAPRSLLLFLQVLWLALAQIRGPPGEPGLPGPPGPPGVPGSDGIDGDKGPPGKVGP
PGSKGEPGKPGPDGPDGKPGIDGLMGAKGEPGPMGTPGIKGQPGLPGPPGLPGPGFAGPP
GPPGPVGLPGEIGTPGPKGDPGPEGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPAG
VKGPQGLQGVKGHPGKRGILGDPGRQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAG
PKGEMGPRGYKGMVGSIGAAGPPGEEGPRGPPGRAGEKGDVGSQGARGPQGITGPKGTTG
PPGIDGKDGTPGIPGMKGSAGQVGRPGSPGHQGLAGVPGQPGTKGGPGDKGEPGQQGFPG
ISGPPGKEGEPGPRGETGPQGIMGEKGDQGERGPVGQPGPQGRQGPKGEQGSPGIPGPQG
LPGIKGDKGSPGKTGPRGGVGDPGVAGLPGEKGEKGLSGEPGLKGQQGVRGEPGYPGPSG
DAGAPGVQGYPGLPGPRGLVGDRGVPGQPGRQGVVGRAASDQHIVDVVLKMIQEQLAEVA
VSAKREALGATGMVGLPGPPGPPGYPGKQGPNGHPGPRGIPGIVGAVGQIGNTGPKGKRG
EKGDQGEMGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGRPGPVGLP
GFCEPAACLGASAYTSARLTEPGSIKGP
NT seq 2067 nt   +upstreamnt  +downstreamnt
atgaccgccttgccagcgccccgcagcctcctccttttcctccaggtgctgtggctcgcc
ctggctcagatcagaggtccaccaggggagccgggtcttccagggccaccagggccacca
ggagtgcctggttcggatggcatcgacggtgacaaggggcctccaggtaaagttggtcct
ccgggatccaaaggagagcctggcaaaccggggccagatggtccagatgggaagcctggc
attgatggtttaatgggagccaagggagagcctggccccatgggaacccctggaatcaag
ggccagcctgggctcccagggccccctggcctgccgggccctggttttgctggacctccg
ggaccacctggacctgttggcctccctggtgagattggaaccccaggccccaagggggat
ccagggccagagggaccatcagggcccccaggaccccctgggaaaccaggccgaccagga
accatccagggcctggaagggagtgcggatttcttgtgtccaaccaactgtcctgctggt
gtgaaaggtccccagggtctgcaaggagtgaagggccatccaggcaaacgggggattctg
ggtgatcctggtcgccaggggaagccgggccccaagggagatgtgggtgcctctggagaa
caaggtatccctggaccaccgggtccccaaggcatcaggggctacccaggcatggcagga
cccaagggagagatgggtcctcgtggatataaaggcatggtgggctccatcggggctgct
gggccaccgggtgaagagggcccaagggggccaccaggccgagcgggtgagaagggggat
gtggggagccaaggtgcccgaggaccccaggggataacaggcccaaagggaacaactggt
ccaccaggcattgatggcaaggatgggaccccgggcatacctggcatgaagggcagtgca
ggacaggtgggacggccaggaagcccaggccaccagggcttagcgggtgtgccaggtcag
cctggaacaaaaggaggccccggagacaagggtgaacccggccagcagggcttcccggga
atctctggtccccctgggaaagaaggggagccagggcctcgaggagaaactggtccacag
ggcatcatgggagagaagggtgaccagggtgagagagggccagtggggcagccaggccct
caaggacgacagggccccaagggagagcagggatctccaggaattccaggaccccaaggc
ttgccaggcatcaaaggagataagggttccccagggaagaccgggccccgaggtggagtg
ggtgacccgggggtggccggcctcccaggagagaaaggagagaagggcctgtcaggcgag
ccagggcttaagggacagcaaggagttcgtggagagccgggctaccctggccctagcgga
gatgctggtgccccaggagtgcagggctatcctgggcttcccgggccccgaggactggtg
ggagatcgaggcgtgccaggacaaccggggagacagggtgtggtgggccgagctgccagt
gaccagcacatcgtggatgtggtgctgaagatgattcaagagcaacttgcagaggtagct
gtgagtgccaagcgagaagccctgggtgcaacagggatggtgggccttccaggacctcct
gggccccctggatatccaggcaaacagggacccaatgggcaccctggcccccgaggcatt
cctggcattgtgggagctgtgggtcagattggcaacactggacccaagggaaagcgtgga
gagaagggagatcaaggagaaatgggacgtggccatcccgggatgcctgggccaccaggg
atcccaggtcttcctggccggcctggccaggccatcaacggcaaggatggagacagaggg
tccccaggggccccaggagaggctgggcggcctgggcggccgggcccagtggggctccca
ggcttttgtgagcctgctgcctgcttgggagcctcagcctatacctctgctcgcctcacg
gagcctgggtctatcaaggggccatga

KEGG   Rattus norvegicus (rat): 363265
Entry
363265            CDS       T01003                                 

Gene name
Col4a3
Definition
(RefSeq) collagen alpha-3(IV) chain precursor
  KO
K06237  collagen type IV alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    363265 (Col4a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    363265 (Col4a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    363265 (Col4a3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    363265 (Col4a3)
  09154 Digestive system
   04974 Protein digestion and absorption
    363265 (Col4a3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    363265 (Col4a3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    363265 (Col4a3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    363265 (Col4a3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    363265 (Col4a3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    363265 (Col4a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    363265 (Col4a3)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    363265 (Col4a3)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   363265 (Col4a3)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   363265 (Col4a3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 363265
NCBI-ProteinID: NP_001129231
RGD: 71085
Ensembl: ENSRNOG00000015365
Position
9
AA seq 1670 aa
MHSETAPRFLALLLPILLLLLAAPPVVSKGCVCEGKGKCLCWGTKGEKGEIGFPGPPGFP
GQKGFPGPEGLPGPQGPKGSPGLPGLTGPKGIRGITGLPGFAGPPGLPGIPGYPGPPGLA
GLPGCNGSKGEQGFPGIPGTPGYAGLPGPDGLKGQKGKPAQGEDGEFNGKGDPGPPGAPG
FQGLPGPPGFPGPAGPPGPPGFFGFPGAMGPRGPKGRMGDSTIGQEGEKGVKGLTGPPGL
PGPVIFTLRHPYRKSDFQGQKGDEGERGEPGPPGPSGPPGDSYGSEKGAPGEPGPRGKPG
KDGAPGFPGTEGAKGTRGFPGLRGEAGIKGWKGDIGPPGFPGPVKCYDAHLQKGDKGMPG
PPGPKGVRGPRGPSGPPGVPGSPGPSRPGLRGPVGWPGLKGSKGERGPPGIDTVGPPGSL
GCPGSPGPPGPPGPPGRPGDTVFQPGPPGDHGAPGDIGPPGVPGVDGPKGEPGQPCTECH
CIPGPPGVPGVPGLDGVKGIPGGRGAPGVKGNPGSPGNAGLPGFAGFPGDQGHPGLKGDK
GDTPLPWGQVGDPGDPGHRGLPGRKGFDGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGA
PGSPGPPGPAGPAGPPGYGPQGEPGPKGAQGVPGALGPPGEAGLKGESSASIPVLGPPGP
PGPPGQAGPRGLPGLPGPVGTCDPGHPGPDGEPGIPEVGFPGARGPKGDQGFPGTIGLPG
YPGETGRPGYPGEMGVPGAKGEPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGPPGT
PGKDGFDGPPGDPGQSGPPGAKGPPGRCIPGPRGTQGLPGLNGLKGQPGRRGDTGPKGDP
GIPGMDRSGVPGERGPPGTPGLPGEMGPPGQKGYPGPPGFPGLPGEKGEVGIMGYPGTTG
LPGLPGKPGSQGQRGNLGIPGVKGERGRPGVKGERGEKGKPGPPHAPHLKGDKGEPGLKG
FVGNPGEKGNRGNPGLPGPKGLEGVPGLPGSPGPRGDTGSSGDPGRPGPQGLPGSMGNMG
VPGPKGRKGTSGFPGVAGRPGLPGIPGPQGDKGEPGYSEGASPGPPGPKGDPGLPGDKGK
KGERGLPGPPGHSGPAGPDGAPGSPGSPGHPGRPGPDGDSGLKGQKGFPGPPGSTGPPGP
PGLPGLPGPMGMRGDQGQDGIPGPPGEKGETGLLGAHPGQKGSPGVPGVKGDRGVPGLSG
LPGRKGTMGDVGPQGPPGTTGLPGPPGLPGTIVPGPKGNRGLPGLRGNPGEPGPPGPPGP
VGEGIKGDKGFMGLPGSRGLPGMVGDTGAPGQPGAPGIPGLPGVRGDPGFPGFPGVKGEK
GNPGFLGSIGHPGPVGPKGPPGPQGKPGTLKVISLPGSPGPPGAPGQPGVKGDPGPLGPP
GIPGPCGPRGQPGKDGKPGAPGPPGVKGSKGSKGEQGPPGLDGLPGLKGKPGDRGTPANG
TRMRGFIFTRHSQTTANPSCPEGTQPLYSGFSLLFVQGNEHAHGQDLGTLGSCLQRFTTM
PFLFCNVDNVCNFASRNDYSYWLSTPAPMPMDMAPITGRALEPYVSRCTVCEGPAMAIAV
HSQTTAIPPCPQGWVSLWKGFSFVMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGDLEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgcactcagagactgctccaaggttcttggcgctcctgctgcctatcctgttgctgctc
ctggctgccccgcctgtggttagcaagggctgtgtctgcgaaggcaaaggaaaatgcctc
tgctggggaactaaaggggagaagggggagatcggcttccctgggccccctggatttcct
ggccagaaaggattcccaggccctgaaggcttgcctggaccacagggacccaagggctcc
ccaggacttccaggactcactggccccaaaggcatcaggggaataactggattaccaggg
tttgcaggtcctcctggacttccaggcatcccaggctaccctgggcctcctgggctggct
ggtttaccaggatgcaatggctctaagggtgaacaaggattcccaggcattcctggcaca
ccaggctatgcagggctcccaggtcctgatggcttaaaaggacaaaagggcaagcctgct
caaggagaagacggagaattcaatggaaaaggtgaccctggtcctccgggggctccaggc
ttccagggtttaccaggacccccaggtttcccagggcctgccggcccaccagggccccca
ggattttttggtttcccaggagcgatgggacccagaggacccaaaggtcgcatgggcgat
agcacgataggacaagaaggagaaaagggtgtgaaaggattaacaggacctcctgggctg
ccaggaccagtgatttttacactccgacacccatacagaaaatcggactttcagggacag
aaaggcgatgagggagagaggggtgaacctggacctcctggaccctcgggaccacctgga
gactcctatggatcagaaaagggtgctcctggagagcctggtcctcggggaaaacctgga
aaagacggtgctcccggcttccctggcactgagggagccaagggaaccagaggctttcct
gggttgaggggtgaagccggcattaagggatggaaaggagacattggccccccaggattt
cctggtccagtaaaatgttatgatgcacatctgcaaaagggagataaaggaatgccaggc
ccgcctgggcccaaaggagtccgtggcccacggggtcccagtggtccccctggagttcct
ggaagtcctggaccgtcaaggcctggcctcagaggacctgtcggatggccaggcttgaaa
gggagtaaaggggagaggggaccccctgggatagacaccgtgggccctcctggatccctg
ggatgtcctggatcaccaggtccaccaggcccaccaggacctccaggacgtccaggtgac
actgtttttcagccaggtccacctggtgaccatggagcaccaggcgatataggacctcca
ggagttccaggagtcgatggacccaaaggagaaccaggccaaccttgtactgagtgtcac
tgcatcccagggcccccaggagtcccaggagttccaggattagatggtgtcaaaggaatc
ccaggaggacgaggtgcgcctggtgtgaagggaaacccagggtccccaggaaatgcaggt
ctcccaggatttgcaggattcccaggcgaccaagggcatccaggacttaaaggggacaaa
ggggacacacctctaccctggggacaagtgggcgatccaggtgatcctggacacagaggc
ctgccagggagaaagggcttcgatggaagtcctggaggtccaggagcaaaagggccacga
ggacctaggggcgaaccggctctgagtggaaggaaaggggaccagggacctccaggagct
cctggatcccctggacccccaggacctgcaggaccagctggaccaccgggctatggaccc
caaggagagccaggtccaaagggagcccaaggagtccctggagccctgggaccaccagga
gaagccggtcttaaaggagaatccagtgcatcaataccagttctaggtcccccaggccct
ccagggccccctggtcaggctggccccagaggtctacctggtttgcctggacctgtggga
acatgtgatcctggtcatcctggacctgatggtgaaccaggaatcccagaagttggattc
ccaggggccagaggacctaagggagatcaaggtttcccaggaacaataggattacctggt
tatcctggggaaacgggaagacctggctatcccggggagatgggcgtcccaggagccaag
ggagaaccatcagtaggcaggcctggggagccaggaaaaccaggctttccaggagaaaga
ggcaattctggggaaaatggagatattggactccctggacttccaggcccccctggaact
ccaggaaaagacgggtttgacgggcctccaggagacccagggcagtctggacctccagga
gcaaaaggacccccagggaggtgcataccaggacccaggggcacccaaggacttccaggt
ttaaatggattgaaaggacaaccaggtagaagaggtgataccgggccaaaaggagaccct
ggcatcccagggatggacagatctggagtccctggagagcgaggaccaccaggaacacca
ggtcttccgggtgagatgggaccgcctggccaaaaaggatatccaggacctccaggattc
ccagggttaccaggagagaaaggagaggttggaatcatgggctatcctggaaccactggc
cttccggggcttcctgggaaaccaggctcacaggggcagaggggtaacctcggaatccca
ggagtaaaaggggaaagaggacgcccgggagtcaaaggcgaacgaggcgagaaggggaaa
cctgggcctccccatgcaccacacttaaagggggacaaaggagagcctggacttaaaggg
ttcgtaggaaatccaggtgaaaagggaaacagaggcaatccagggttaccaggccccaaa
ggcctcgagggagtgcctgggctaccaggatctccaggccctagaggagatacggggagc
agtggggatcctggaagaccaggaccacaaggcttgccaggaagcatggggaacatgggg
gtgccaggccctaaagggagaaaaggaacttcaggatttccaggtgtagctggaagacca
ggcctgccagggatccctggcccccagggagataagggggagccaggttattcagaaggt
gcaagcccaggaccaccaggaccgaagggtgatccaggattgccaggtgacaaagggaag
aaaggagagagaggactacctgggccacctggacattcggggcctgctggacctgacgga
gcccctgggagtcctgggagtcctggccacccaggaagaccaggtcctgatggtgattcc
ggtcttaaaggacagaaaggtttccccggccctccaggaagcactggccctccaggccct
ccaggactcccaggacttcctggaccaatgggtatgagaggcgaccaaggacaagatgga
attcctggtcctccgggagaaaaaggagaaacaggcttgctgggggcccatccaggccaa
aaagggagccctggtgtaccaggtgtcaaaggagacaggggagttccaggcttgtctggc
cttcctggcaggaaggggaccatgggggatgttggacctcaaggacccccaggcaccact
ggactcccaggtccaccaggtctgcctgggaccattgtccctggcccaaaaggaaacaga
ggtctccccggcttacgaggaaatccaggtgagccaggtccccctggacctccaggacct
gtcggagaaggcataaagggtgacaaaggatttatgggcctccctggctccagaggtctg
cctggaatggtaggggacacgggtgcaccaggtcaaccgggagcaccaggcatccccggt
cttcccggtgtcagaggtgatccaggattccccgggtttccaggtgtaaaaggagagaag
ggtaatcccggatttcttgggtcaattggacatccaggaccagttgggccaaaaggacca
ccaggtccacaaggaaaacctggcacacttaaggtcatctcccttccaggaagcccaggg
cctcctggtgcacctggacagccaggagtgaaaggagatcctggacccctgggaccacca
ggaattccagggccctgtgggccaagaggtcaaccaggcaaggacggaaaaccaggagct
ccaggaccaccgggagtaaagggcagcaagggctcgaaaggagaacaaggcccacctggt
ctggatggattgccaggtttaaagggaaaacctggagacagagggacacctgccaacggg
acaagaatgagaggcttcatcttcactcgacacagtcaaaccacagccaatccttcatgc
cctgaaggaacacagccactctacagtgggttttcccttcttttcgtacaaggaaatgag
cacgcccatggacaagaccttggtactctgggcagctgcctacagcgattcaccacaatg
ccattcttattctgtaacgtcgataacgtgtgtaactttgcatcacgaaatgactattca
tactggctgtcgacaccagccccgatgcccatggacatggccccaattactggcagagct
ctcgaaccctacgttagcagatgcaccgtctgcgaaggtcctgcaatggctatagctgtt
cacagtcaaactactgctatccccccatgtccccagggctgggtttctctctggaaaggt
ttttccttcgttatgtttacaagtgcaggctctgagggtgctggacaagcacttgcctcg
cccggctcctgcctggaggaattccgagccagtccgtttatagaatgccatggacgaggg
acctgcaactactactcaaactcctacagcttctggctggcttcactaaatccagaaaga
atgttcagaaaacctattccatcaactgtgaaagctggagacttggagaaaatcataagc
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Rattus norvegicus (rat): 363457
Entry
363457            CDS       T01003                                 

Gene name
Col4a5
Definition
(RefSeq) collagen alpha-5(IV) chain isoform X2
  KO
K06237  collagen type IV alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    363457 (Col4a5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    363457 (Col4a5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    363457 (Col4a5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    363457 (Col4a5)
  09154 Digestive system
   04974 Protein digestion and absorption
    363457 (Col4a5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    363457 (Col4a5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    363457 (Col4a5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    363457 (Col4a5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    363457 (Col4a5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    363457 (Col4a5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    363457 (Col4a5)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    363457 (Col4a5)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   363457 (Col4a5)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   363457 (Col4a5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 363457
NCBI-ProteinID: XP_006257374
RGD: 1565499
Ensembl: ENSRNOG00000018951
Position
X
AA seq 1688 aa
MQVPGVCLAAGWFLLALCLWGQPGEAAACHGCSSGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNQGYPGPPG
IQGPPGPKGVSGPSGPPGPPGLMGPPGPPGPPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDLDFQKGDQGVPGDRGPPGPPGIPGPPGPPGGVKGEKGEPGEPGKRGKP
GKDGETGQPGIPGLPGDPGYPGEPGRHGEKGQKGDTGPPGPPGFVIPTPGTGVTIGAKGN
IGQPGLPGEKGERGFPGIQGPPGFPGPPGTAVMGPPGPPGYPGERGQKGDEGPPGICIPG
SPGLDGQPGAPGFPGPPGPPGPQLPPSDEICKAGPPGPPGSPGDQGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGIPGEKGEKGQAGTTGPKGLPGIPGPPGAPGF
PGSKGDPGDVLTLPGMKGEKGELGSPGAPGLPGLPGSPGKDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNMGPTGPVGFGPPGPVGEKGIQGVAGNPGQPGLPGPKGDPGQTIS
QPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGPPGPPGPKGFPGI
PGPPGPPGAPGRMGPEGPPGLPGFPGPKGEPGFGLPGPPGPPGFPGFKGTPGPKGDRGLP
GPPGPPGHTGLDGLPGPKGDAGPNGQPGPIGPPGLPGIGLQGPPGPPGTPGPIGQPGLHG
IPGEKGDPGPPGLDVPGLPGERGSPGLPGAPGLIGPPGSPGIPGRAGVPGFPGVKGEMGM
MGPPGPPGPLGTPGRSGAPGLKGDDGMQGQPGLPGPVGEKGGKGEPGLPGPPGPMRPDFL
GSKGEKGDPGLPGIPGLSGPKGYQGLPGDPGQPGLSGQPGSPGLPGPKGTPGLPGPPGLT
GPPGPKGNIGDMGFPGPQGVDGPPGPPGGPGQPGSPGLPGQKGSKGEPGVSGIGLPGLPG
PKGEPGLPGYPGNPGIKGSMGEAGLPGLPGSPGAKGQPGLPGFPGTPGLPGPKGMNGPPG
NPGLPGEPGPVGGGGRPGPPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGL
SGQKGDGGAPGLPGNPGLPGSKGEPGFQGFPGMPGPPGPPGSPGPSLEGPKGNPGPQGPP
GRPGPPGPPGTEGPRGPPGNGGIKGERGNPGPPGQPGLPGLKGDQGPPGLPGNPGRPGLN
GMKGDPGLPGVPGFPGMKGPIGVPGSTGPDGEPGLTGPPGPPGLPGPSGQSIVIKGDPGP
PGIPGQPGLKGLPGLPGPQGLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGSRGL
DGPPGPDGLQGPPGPPGTTSVAHGFLITRHSQTTDAPQCPQGTVHIYDGFSLLYVQGNKR
AHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMNMEPLKGQSI
QPFISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASP
GSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDMSDMFSKPQSETLKAGDLRTRISR
CQVCMKRT
NT seq 5067 nt   +upstreamnt  +downstreamnt
atgcaagtgcctggagtgtgcctggccgccggctggttcttactggccctgtgtctgtgg
gggcagccgggagaggctgcggcctgccatgggtgttcttctggatcaaaatgtgactgc
agtggtataaaaggagaaaagggagaacggggctttccaggcttagaaggccatccaggt
ttaccaggatttccaggtccagaagggcctccaggacctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggccttcctgggatgccaggccacgatggggctccaggacctcaaggt
atccctggatgcaatggaaccaagggagaacgcggatttccaggcagtccaggctttccc
ggtttacaaggacctccaggaccacctgggatcccaggtatgaagggggagccaggcagt
ataatcatgtcatctctgccagggccaaagggtaatcaaggatacccaggtcctcctgga
atacaaggtccacctggtcccaagggtgtatcagggccaagtggtcctccaggaccacca
ggtttaatgggccctcctggcccaccaggcccaccgggaccaaaggggaatatgggctta
aatttccagggacccaaaggcgaaaaaggtgaacaaggtcttcagggtccacctggtcca
cctgggcagatcagtgaacagaaaagaccaattgacctagattttcagaaaggcgatcag
ggagttcccggtgatagagggcctcctggacctccgggaatacctggtcctcccggtcct
cctggtggtgtgaaaggtgaaaagggtgagcccggagagccaggcaaaagaggtaaacca
ggcaaagatggagagactggtcaaccaggaatcccaggtttgcctggtgatcctggttac
cctggtgaaccaggaaggcatggtgaaaagggccaaaagggtgatactggcccacctgga
ccacctggatttgtaattcctacacctggaactggtgtgactattggagcaaaaggaaat
attgggcaacctggtttgcctggagaaaaaggagagcgaggatttcctggcatacaaggt
ccacctggctttcctggacctccagggacagctgtcatgggtccccctggtcctcctgga
tatcctggtgaaaggggccagaaaggtgatgaaggtccccctggaatttgtattcctgga
tctcctggacttgatggacagcctggagctcctggctttccaggacctcctggtcctcct
ggcccccagttacccccgagtgatgaaatctgtaaagcaggccctccagggcccccagga
tctccaggtgaccaaggactccaaggagagcaaggagtaaaaggtgacaaaggtgatact
tgcttcaactgtattggaactggcatttcagggcctccaggccaaccaggtttaccaggt
ctcccaggtcctccaggatctcttggaatccctggagagaagggcgagaaaggacaagct
gggacaactggtccaaaaggattgccaggcattcctggacctccaggtgctccaggcttt
ccagggtctaagggtgaccctggtgatgttctcactcttccgggaatgaaaggtgaaaaa
ggagaattgggttcccctggagctccagggcttcctggtctacctggtagtcctggaaag
gatggtttgccaggacttcctggccccaaaggagagcccggtggaatcacttttaagggt
gaaaggggtcctcctgggaacccaggtttaccaggtctcccaggaaatatgggacctaca
ggccctgttggttttgggcctccaggccccgtaggtgaaaaaggcatacaaggtgtggcg
ggaaatccaggccagccaggattaccaggtcctaaaggtgatccaggccagaccatcagc
cagccaggaaagcctggcttgccagggaatccaggcagagatggtgaagtgggtcttcca
ggtgatcctggacttccaggccaaccaggcctaccaggaatacctggtagcaaaggagaa
ccaggtatccctgggattgggcctccaggaccacctggtcccaaaggttttccaggaatt
cctggacctccaggacctcctggggcccctggaagaatgggcccagaagggcctcctggg
ttaccgggctttccaggacccaagggagagccaggatttgggttacctgggccacctggg
ccaccaggattcccaggtttcaaaggaacacctggtccaaaaggtgatcgtggtttacca
ggacctccaggtcctccaggacacactggcttggatgggctacctggaccaaaaggtgat
gctggaccaaatggacaacctggcccaatagggccaccagggctgccaggaattggtctt
cagggaccaccaggaccaccagggactccagggccaataggccaacctggcttacatgga
ataccaggagagaagggagatccaggacctcctgggcttgatgttccaggactccctgga
gagagaggtagtccagggctccctggagcccctggtcttataggacctccaggatcacct
ggtatccctggaagagcaggtgtccctggatttccaggtgtcaaaggcgaaatgggtatg
atgggacctccaggcccaccgggacctttgggtactcctggcaggagtggtgctcctggt
cttaaaggtgatgacggcatgcaaggtcagccaggtctccctggccctgtaggagaaaag
ggtggtaagggagaacctggccttccaggtcctcctggaccaatgcgtccagatttcctg
ggatcaaaaggagaaaagggtgatcctggcttaccaggtattcctggactttccggacca
aaaggctaccagggattacctggagacccagggcaacctgggctgagtggacagccggga
tcaccaggactaccaggtcccaaaggtactcctggtctccctggtccaccaggacttacc
ggacctcctggacctaaaggaaacattggcgacatgggttttcctggccctcaaggtgta
gatgggcctcctggacctcctggaggtcctggacaacctggctctcccgggttgcctggc
cagaaaggaagtaaaggagaacctggagtttcaggcattggtctccctggtcttcctggt
ccaaagggtgaacctggtctgcctggatatccaggaaacccaggtatcaaaggttctatg
ggcgaggctggtctgcctggcttaccaggaagccctggagcaaaaggtcaacccggcctg
cccggattcccaggaacaccagggcttcctggaccaaagggtatgaatggtcctcctggt
aaccctggccttccaggagaacctggtcctgtaggtggtggagggcgtcctggaccacca
ggacctcctggtgaaaaaggcaagccaggtcaagatggcattcctggaccagcaggacaa
aagggagaaccaggtcaaccaggctttggaatcccaggacctcctggactcccaggactt
tctggtcaaaagggtgatggaggagcccctggccttccaggaaatcctggccttccaggt
tcaaagggtgaaccaggctttcagggattccctggtatgccaggtcccccaggtccccct
ggttctcctggcccatctttggaaggtcccaaaggaaaccctgggcctcaaggccctcct
gggagaccaggtcctccaggtccacctggtacagaaggtcctcgtggcccccctggaaat
ggaggtattaaaggagagaggggaaatccaggtccacctgggcaaccaggcttacctggc
ttgaaaggagatcaaggaccaccaggccttccgggtaaccctggccgaccaggtcttaat
ggaatgaaaggagatcctggtctccctggtgttccaggattcccaggaatgaaaggaccc
attggagtaccaggctcaactggccctgatggggaaccaggtcttacggggcccccaggt
cctcctgggttgcctggtccttcaggacagagtattgtaatcaaaggagatcctggacct
ccaggaattcctggacaacctggattaaaaggtctaccaggacttccaggccctcaaggt
ctaccaggtccaattggccctccaggagatccaggacgcaatggactcccaggctttgat
ggtgcaggagggcgcaaaggagacccaggcctgccggggcagccaggtagccgtggattg
gatggtcccccaggaccagatggattgcaaggacctccagggcccccgggaaccacctct
gttgcccatggattcctcatcacacgtcacagtcagacaacagatgcaccacaatgccca
cagggaacagtccatatctatgatggcttttctctcttatatgtacaaggaaataaaaga
gcccatggtcaagacttggggacagctggcagctgccttcgtcgcttcagtaccatgcct
ttcatgttctgcaacattaacaacgtttgtaactttgcttcaagaaatgactattcttac
tggctttccaccccagagcccatgccaatgaacatggaacccctgaagggccagagcatc
cagccattcattagtcgatgtgcagtatgtgaggcaccagctgtggtgattgcagttcac
agccagactattcagattcctcactgtcctcaggggtgggactctctgtggattggctat
tcattcatgatgcatacaagtgcaggagcagaaggctctggccaagccctagcctctcct
ggctcctgtttggaagaatttcgttcagctcccttcatcgaatgtcatggacgagggaca
tgcaattactatgccaattcttacagcttttggctggccactgttgatatgtcagacatg
ttcagtaaacctcagtcagagacactgaaagcaggcgacttgaggacacgtattagccga
tgtcaagtatgcatgaaaaggacataa

KEGG   Rattus norvegicus (rat): 363458
Entry
363458            CDS       T01003                                 

Gene name
Col4a6
Definition
(RefSeq) collagen alpha-6(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    363458 (Col4a6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    363458 (Col4a6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    363458 (Col4a6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    363458 (Col4a6)
  09154 Digestive system
   04974 Protein digestion and absorption
    363458 (Col4a6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    363458 (Col4a6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    363458 (Col4a6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    363458 (Col4a6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    363458 (Col4a6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    363458 (Col4a6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    363458 (Col4a6)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    363458 (Col4a6)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   363458 (Col4a6)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   363458 (Col4a6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 363458
NCBI-ProteinID: XP_038956452
RGD: 1589724
Ensembl: ENSRNOG00000056772
Position
X
AA seq 1690 aa
MHPGLWLFLVSLCLTKELAEAGQKSYRQPCGGQDCSGACACSPEKGARGHTGPIGTQGPA
GPEGFAGPTGLSGLKGERGSPGRLGPYGPKGDKGPIGVPGFVGISGIPGHPGQPGPRGPP
GLDGCNGTQGAVGFPGTDGYPGVLGPPGLPGHKGAKGEPASFQGSIIGMKGDPGLPGLHG
ITGPSGSPGSPGDAGPIGPPGFQGPPGPPGLPGPDGNMGLGFQGEKGIKGDVGLPGPAGP
PPSTGELEFMGFPKGEKGSKGEPGPPGFPGRSGLPGVPELGSIGEKGERGILGLPGPRGP
VGSEGIQGYPGKQGKKGTSGFPGTNGFPGIKGEKGSIGVRGPDSFTDAEGTVISGFPGDP
GVPGLPGLRGDEGIQGQRGPAGTAGLPSLTGLPGALGPQGSPGLKGDRGNSGRTTFGEAG
LPGRVGLPGLPGLPGPSGPPGRTFVSGPLLSIEPGLPGLQGEQGLKGHQGIKGVKGDSGF
CACEGGAPNIGPHGESGLPGIQGPIGLRGIKGTRGDPGSRGASGPTGTPGLFGPRGQTGL
KGKKGEPTVSRGSKMAGDKGDPGPQGIPGLAGAPGKDGIPGLPGFPGTQGDDGSGFPGER
GLPGLPGEKGHDGPTGPRGIGLPGLPGPRGLPGDKGVDGLPGQQGLRGAKGVTLPCIIPG
SYGPSGFPGAPGFPGSKGARGLPGIPGKPGTHGSKGEPGSPGLIHLPGFPGFPGARGEKG
LPGFPGLLGKHGYPGKAGSPGVPGSKGVAGDIFGAENGASGEQGLQGLPGDKGFPGDSGL
PGPKGLSGKSGMLGPKGERGNPGTSGPPGQPGPSGSTDPFGIKGTSGLPGAPGLPGISGH
PGKKGQRGDIGHPGSTGKRGLPGIKGLPGPQGLAGFLGSPGLSGVTGLPGIPGQKGEKGS
SGPVGFPGLPGLPGLPGADGLKGFSGSFGKVGQPGQAGTPGEKGDRGDPGPVGISSPRPP
MLNLWFKGEKGSQGSAGSDGFPGPRGDKGEPGIPGLPGAPGAPGQSNTIKGLSGSPGSPG
SMGRRGLPGLKGSLGIAGFPGIPGKSGSQGLTGTPGPLGATGIPGLKGDQGPTLGISGSP
GPKGQPGELGFKGVKGKDGLVGDRGYPGNKGDSGKVGSAGDPGFPGSPGLKGISGMNGDP
GFPGSSGHVGSIGRPGPSGLIGPKGFPGLPGLHGLNGLPGTKGTHGTPGASITGVPGPAG
LPGPKGEKGMPGIVIGDPGKQGLRGQKGDQGSPGLQGPAGTPGASGISLPSVIAGQPGDP
GQPGLDGERGRPGLPGPPGPPGPSSDQGDPGDSGFPGIPGLQGLKGNQGLPGFSGLSGDL
GLKGMRGEPGLMGTPGKIGPPGDPGFPGMKGKAGPRGFSGPQGAPGHTPIAEARHVPPGP
LGLPGIDGIPGLTGDPGSQGSVGLQGSKGLPGIPGKDGPSGLPGPSGILGDPGLPGLQGP
PGFEGAPGNQGPIGQPGMPGHSVRVGYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQ
EKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVGETQI
PQYISRCSVCEAPSQAIAVHSQDITVPQCPLGWHSLWIGYSFLMHTAAGTEGGGQSLVSP
GSCLEDFRANPFIECSGARGTCHYFANKYSFWLTTVEERGQFQEQPVSENLKTGQLHTRV
SRCQVCMKTP
NT seq 5073 nt   +upstreamnt  +downstreamnt
atgcaccctggattgtggctgttcctggtttccttatgcctgaccaaagaactggcagaa
gcgggacagaagtcctatagacagccatgtgggggccaagactgcagtggagcctgtgca
tgctctcctgagaaaggagcaagagggcacactgggccaattggaactcaaggtccagcc
ggtcctgaagggtttgctggtccaactggtttatcagggttgaaaggagaaagaggctcc
ccaggccgtctgggaccatatggacccaaaggagataagggtcccattggagttcctggc
tttgttggcatcagtgggattccgggacacccaggccaaccaggtccaagaggtccacct
ggcctagatggatgtaatggaactcaaggagctgttggatttccaggcactgatggctac
cctggggttttaggaccacctgggctgcctggtcacaaaggtgcaaaaggagaacctgct
tcttttcaaggcagcatcataggaatgaagggggatcctgggctgcctggactgcatgga
atcactggtccatcaggatctcctggatctcctggagatgcaggacccataggaccacca
ggtttccaaggtccaccaggtccccctggacttcctggtcctgatggaaatatggggtta
ggtttccaaggagaaaaaggaatcaagggggatgttggcctccctggccctgcaggacca
cctccatctactggggaactagaattcatgggattccccaaaggggagaaaggttccaag
ggtgaaccagggcctccaggtttcccaggcaggagtggccttccaggtgtcccggaactt
ggatctattggagaaaaaggagaaagggggatccttggattgccaggacctaggggtccc
gtgggttcagaaggaatacaaggatatccagggaaacagggaaagaaggggacttcaggt
tttcctgggactaacgggttcccaggaattaagggtgaaaagggaagcattggtgtgcga
ggcccagacagtttcactgatgcagaaggcactgtgatctcaggttttcctggagaccct
ggtgtgccaggcctcccaggccttagaggagatgaaggcatccagggtcaacgtggccct
gctggcactgctggccttccatcgttaacaggtcttccaggtgccctagggcctcaggga
tctccaggcctgaaaggagaccgagggaactcaggacgtaccacctttggagaagctggc
ctacctggcagagttggtttgccaggtttaccaggcctgcccggcccatcaggcccacct
ggtcgcacatttgtgtctggacccctgcttagcatagagcctgggttacctggcctccaa
ggagaacaaggcctaaaaggacatcaaggcatcaaaggagtaaaaggagactctggcttt
tgtgcttgtgaaggtggtgcccccaacattggaccacatggggaatcaggcctgcctggg
atacaaggtcccattggtctacgtggtattaaagggacaagaggcgatccaggctctagg
ggcgcatctggccccacagggacaccagggctatttggacctagaggtcagactggcctc
aaaggaaagaaaggagaaccaactgtcagtagaggatcaaaaatggcaggggacaaaggt
gaccctggtcctcaggggatcccaggtttggcaggagctccaggcaaggatggaatacca
ggtttaccaggcttcccaggcactcagggagacgatgggtctggcttcccaggtgaaagg
gggttaccaggacttcctggtgaaaagggccatgatggtccaactggaccacgaggaatt
gggctgccaggacttcctgggccccgtggacttcctggagataaaggagtggatgggtta
ccagggcaacaaggcctccgtggagctaaaggagtcaccttgccttgtattatccctggg
tcatacggtccatcaggatttcctggagctcccggattcccaggctctaagggagctcgg
ggcctccctgggattccaggcaagcctggcactcacggaagcaaaggagagcctggaagt
ccagggttaatccatctcccaggatttccaggattccctggagctcgtggggagaaggga
ctgcctggctttcctgggcttctgggaaaacatggctacccgggaaaagctggcagtcca
ggggtaccaggttccaagggagttgctggtgacatctttggtgctgaaaacggtgcttcg
ggggaacaaggtctacaaggattgccaggggacaaaggatttcctggagactctggcctt
ccaggacccaagggtttgagcgggaaatctggtatgctaggccccaagggtgagcgaggg
aaccctggaacatcaggaccaccaggacagccgggaccctcagggtctactgatccattt
ggtatcaagggcacatctggactcccaggagcaccaggccttccaggcatttcagggcac
cctggaaagaagggccaaagaggagacataggtcaccctgggtcaactggaaaacgaggt
ctccctggaataaaaggccttcctggtcctcaagggttagctggcttcctggggagccca
ggtttgtcaggagtcactgggttgccaggcatcccaggccaaaagggtgaaaaggggtcc
tctggaccagtagggtttcctgggttacctggtcttcctggtcttcctggagctgatgga
ttaaagggattttctgggtcatttggaaaggtgggacaacctggacaggctggtactcct
ggagaaaaaggagacaggggagatccagggccagttggcatatctagtccaagacctcca
atgctaaacctttggttcaaaggagaaaaaggatctcaaggttctgctggatcagatgga
tttcctgggcccagaggtgacaaaggagagcctggcatcccagggttaccaggtgcacct
ggagctcctggacagtccaataccatcaaaggactcagtgggagtccaggttcccctgga
tctatgggaagaaggggattacctggcctgaaggggtctctggggattgcaggtttccca
ggaataccaggaaaaagtggttcacagggcctcactgggactcctggccccctgggagca
actggtatcccaggtttaaaaggagatcaaggcccaacacttggcatttctggaagccca
ggacccaagggacagcctggcgaattaggttttaaaggtgtcaaaggaaaagatggacta
gtcggtgatagaggttacccaggaaacaaaggtgacagtggaaaagttggtagtgctgga
gatcctggcttccctggctccccaggacttaaagggatttcaggtatgaatggagaccca
ggcttcccaggttcctctggccatgtgggatcgatcggccgccctggaccctctggctta
ataggacctaaaggattccctggacttcctggtttacatggactgaatggacttccaggc
accaagggtactcacggcactccaggagccagtatcactggtgttcctgggccagctggc
ctgcctggtcccaaaggggaaaagggaatgccaggaattgtcatcggagatccagggaaa
caaggcttaaggggacaaaaaggtgaccaaggttccccgggtcttcaaggtcctgctggt
actcctggagcctcaggcatctccttgccctcagtcatagcaggacagcctggtgaccct
gggcagccaggcctagatggagaacgaggccgcccaggtctcccaggccctccaggtccc
cctggaccatcctcagatcaaggcgatcctggagactctggcttccctggaattccaggc
cttcaagggctcaagggaaaccagggacttccgggtttctctgggctctctggagatcta
gggctaaaaggcatgagaggggaacctggcctcatggggactcctggcaagattgggcca
ccaggagacccaggatttcctggaatgaaggggaaggcggggccaagaggcttttctggc
ccccagggtgctccaggacacacaccaattgctgaagctcggcacgttcctccaggacct
ttgggtctgccaggcattgatggcatcccaggcctcacaggagaccctggatctcaaggc
tctgtgggtctgcaaggttccaaaggcctacctggcatccctggcaaggatggccctagt
gggcttccaggcccatctggaatccttggtgatcctggtctccctggactacaaggacct
ccaggatttgaaggagctccaggaaatcagggccccattgggcagcctgggatgcctgga
cacagtgtgagagtgggctacacattagtaaaacacagtcagtcagaacacgtgccacca
tgccccattggcatgagccagctatgggtgggttacagcttgctgttcgtggaagggcag
gagaaagcccataaccaggatctgggatttgctggctcctgcctgccccgattcagcacc
atgcccttcatctactgcaacatcaatgaagtgtgccactatgccagacgcaatgataaa
tcctactggctctccactactgcccctattcccatgatgcctgtgggcgagacccagatt
ccccagtatatcagtcgctgttctgtatgtgaggcaccctcacaagccattgctgtgcac
agccaggatatcaccgttccacagtgccctttgggctggcacagcctctggattggatac
tccttcctcatgcacactgctgctgggactgagggtggaggccaatccctagtctcccct
ggctcctgtcttgaggacttccgggccaatcctttcatcgagtgcagcggtgcccgtggt
acctgccactactttgccaacaagtacagtttctggctgacaacagtggaggagagaggg
cagtttcaggaacagcctgtgtccgagaatctgaaaactgggcagcttcacaccagggtg
agccgctgccaggtgtgtatgaaaaccccgtag

KEGG   Rattus norvegicus (rat): 367313
Entry
367313            CDS       T01003                                 

Gene name
Col6a3
Definition
(RefSeq) collagen alpha-3(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    367313 (Col6a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    367313 (Col6a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    367313 (Col6a3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    367313 (Col6a3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    367313 (Col6a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    367313 (Col6a3)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    367313 (Col6a3)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   367313 (Col6a3)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   367313 (Col6a3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   367313 (Col6a3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3
Other DBs
NCBI-GeneID: 367313
NCBI-ProteinID: XP_038940655
RGD: 1308704
Ensembl: ENSRNOG00000019648
Position
9
AA seq 3316 aa
MRKHRHLPLVAVFSLLLSGITMTHAQQQSDVKNGAAADILFLVDSSWSAGKDRFLLVQEF
LSDVVESLSVGDNDFHFALVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTGKG
LEYIIHSHLTEASGSRAADGVPQVIVVLTDGQSEEDGFALPSAELKSADVNVFAVGVEDA
DERTLREIASEPLSMHVFNLENVTSLHDMVGNLVSCIHSSVNPERAGDKESLKDITAQDS
ADIIFLIDGSQNTGKANFDVIRDFLVNVLERLSVGNQQVRVGVVQYSDEPRTMFSLDSYP
SKAAVLDAVKRLSFAGGELANIGQALDFVVENHFTRTGGSRVEEGVPQVLVLISAGPSSD
EIRDAVVALKQGSVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDLQEQLLPYLV
GVAQRHFVLQPPAIVTQVMEVNKRDIVFLVDGSSSLGQANFNAIRDFISRVIQRLEIGQD
LVQVSVAQYADTVKPEFYLNTYSTKKDAITTVRKMKALEGSALYTGSALDFVRNNLFTSS
AGHRAAEGVPKLLVLITGGKSLDDVSQAAQELQKGSIMALVIGSRAADEDELKEIAFDSS
LVFIPAEFRPAPLQSMLPSLMAPLRTLTGTTEVHVNKRDIIFLLDGSDNVGKNNFPYVRD
FVTNLVNSLDVGSDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAG
SALSYVHANHFTEAGGSRIREHVPQLLLLLMAGPSEDVYLQAANALVRSGVLTFCVGTNQ
ADKAELERIAFNPSLVYLMDDFSALPALPQQLIQPLTTYVSGGVQEVPLSQPEVKRDILF
LFDGSVNVMGQFPAVRDFLYRIIEELDVKPDGTRVAIAQFSDDVRVESRFNEHQTKAEIL
NLVKKMKLKTGKGLNLGLALDHALRTIFVRSAGSRIEDNVQQFLVLLVAGRSTDAVAGPA
SSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQSEIVGLLKAKQTSD
TVSGEKDVVFLIDGSEGVRSGFPLLKEFVQRVVESLDVGPNQVRVALVQYSDRTRPEFYL
NSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRIEEGVPQLLIVLTAE
RSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVIS
ERVIQLNREQLSSLTPVLMPSPSAGPGNKKDVVFLIDGSQNAGAEFQHIRTLIERLVEYL
DIGFDTTRVAVIQFSEDTKMEFPLNAHFSKDEVQNAVRRLRPKGGRQVYIGNALEYVLKN
IFQRPLGSRIEEGVPQFLVLISSGKSADEVDDSAVELKQFGVAPLTVARHTDQEELVKIS
LSPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADIVFLID
SSDAMKPDGIAHIRDFVSRIVRRLNIGPNKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEA
IRRLRFKGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQV
ISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQP
PEVEFPSSRPEKKKADIVFLLDGSINFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQY
NSDPTDEFFLRDFSTKRQIIDAINKVIYKGGRHANTRVGIEHLLKNHFVSEAGSRLDERV
PQIAFVITGGKSVEDAQDVSLALTQKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQ
ELSELSETVLETLHDAMHETLCPGVTDVSKACNLEVILGFDGSRDQNVFVSQKGLESKVD
TILNRISQIQRISCSSNQLPTVRVSVVANTPSGPVEAFDFAEYQPDLFEKFRTMRSQHPY
VLTADTLKLYQNKFRQSSPDTVKVVIHFTDGADGDMADLYRASEELRQAGVQALILVGLE
RVANLERLMQVEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGERGDR
GPIGSIGPKGISGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFPGEK
GELGEIGLDGLDGEEGDKGLPGSSGEKGSPGRRGDKGPKGDKGERGDVGIRGDPGDSGRD
SQQRGPKGETGDIGPMGLPGRDGIPGSPGDPGKDGGFGRRGPAGAKGNRGGPGQPGFEGE
QGTRGSQGPPGPIGPPGLIGEQGIPGPRGGGGTAGAPGERGRTGPLGRKGEPGEPGPKGG
TGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGTPGEPGADGPPGPKGIRGRR
GNSGPPGVVGQKGDPGYPGPSGHRGNRGDSVDQCALIQSIRDKCPCCYGPLECPVFPTEL
AFALDTSEGVTQDTFSRMREVLLSIVGDLTIAESNCPRGARVAVVTYNNEVTTEIRFADS
KKKSALLDSIQNLQVALTSKQPSLETAMSFVARNTFKRVRNGFLMRKVAIFFSNQPTKPS
LQLREAVLKLSDAGITPLFLTSQEDRELINALQINNTAVGHALVLPARRDLRDFLKNVLT
CHVCLDICNIDPSCGFGNWRPSFRDRRAAGSDVDIDLAFILDSSEATTLFQFNEMKKYIS
YMVRQLDLSPDPKASQHFARVAVVQQATYESVDNTSVPPVKVEFSLTDYGAKEKLLDFLS
RRMTQLQGTMDLGNAIEYTIENIFESAPNPRDLKIVVLMLTGEIPREQLEEAQRAILQAK
CKGYFFVVLGIGRKVNVKEVYSFASEPNDVFFKFMDKSTELNEEPLMRFGRLLPSFVGSG
NAFYLPPDLRKQCDWFQGDQPVKNGVKFGHRQINTPHTVNSSLTSKVVTTMKPVTTTKPT
SIVNLPPAKPAPVRPAPAQPVLAKPDPAKPVSAKSVPPQPVHAQPDPAQPVHVQSASAQT
ASAKPAPAKPAPPQTAAAKPAPPQTAAAKPAPPQTAAAKPAPPQTAAAKPVPVKPAPAQP
VHVQSASAQTASAKPVPAKPAPPQTAAAKPVPVKPAVPAQPAPAQPVHAQPAPAQPVLTK
PAAMKPASANKPVAAKPVATNTATVRPASAVKPAAASKPAATRPLPAAVRPVATKPEAPR
PQAKPAATKPATTKPMARVSREVQVSEVTENSARLHWERPEPSSSYFYDLTVTSAHDQSL
VLRQNLTVTDRVIGGLLAGQLYHVVVVSYLQSQVRAIYQGSFNTKKTQPPVLQAAHRASS
STINLMVNTEPLFLTKTDICKLPKDAGTCEDFKLLWHYDLESKSCKRFWYGGCGGNENRF
HSQEECEKMCSPELSV
NT seq 9951 nt   +upstreamnt  +downstreamnt
atgaggaaacaccggcatctgcccctagtggctgtctttagccttctcctctctggcatc
accatgactcatgcccaacaacaatcagatgtcaaaaacggtgcggcggctgacatattg
tttctagtggattcctcttggagtgctgggaaggaccgtttcctgcttgttcaagagttt
ctgtctgatgttgtagagtctttatctgtgggagacaatgattttcattttgctctggtc
cggctcaacggaaacccacatactgagttcctgttaaatgcataccactctaaacaagaa
gtcctttctcatattttgaacatgtcttacattggggaaagcaatcaaactggaaaagga
ttagaatatataattcacagccacctcactgaggcttctggaagccgggccgctgacgga
gtccctcaggttatcgtagtattaactgatgggcaatcggaggaggacggcttcgctctg
ccctcagcggaacttaagtctgcggacgttaacgtgtttgcagttggagttgaggatgca
gatgaaagaaccttaagggaaatagcaagcgaaccgctcagtatgcatgttttcaaccta
gagaacgttacctcacttcatgacatggtagggaacttagtgtcctgtatacactcctct
gtgaatccagaaagggctggggacaaggaaagccttaaagacatcacagcacaagactca
gctgacattattttccttattgatggatcacagaacaccggaaaggccaatttcgatgtc
attcgagacttccttgtaaatgtccttgagagactttcagttggaaatcagcaggtccga
gtgggggtggtacagtatagtgacgagcccagaaccatgttctccttggatagctacccc
tccaaggctgcagttctggatgctgtcaaaaggctcagctttgctggtggggagttggcc
aatataggccaggcccttgactttgtagtagagaatcacttcactcggacaggtggcagc
cgtgtggaggaaggggtcccccaggtgctggtcctcataagtgctgggccttctagtgat
gagattcgagatgcagtggtagctctgaagcagggtagtgtgttctcatttggcctcgga
gcccaggctgcctccagggtagagcttcagcacatagctaccgatgacaacttggtgttt
actgtcccggaattccgtagctttggggacctccaggagcaattactgccataccttgtt
ggtgtggcccaaaggcactttgtcttgcaaccaccagctattgtcacacaagtcatggaa
gtcaacaagagggacatagtcttcctggtggatggttcctccagcctgggacaggctaac
ttcaatgccatccgagacttcattagcagagtcatccagaggctggaaattggtcaggac
ctggttcaggtatcagtggctcagtatgcagacaccgtcaagcccgagttctatttaaat
acctactctaccaagaaggatgctataaccactgtgcggaagatgaaggccctggaaggc
tcagccctgtacacggggtctgctctggactttgttcgaaacaacctcttcactagctca
gccggccacagggccgctgagggggttccgaagctcctggtgctaatcacaggtggtaag
tcactcgatgatgtcagccaggctgcccaggagctgcagaaaggcagcatcatggccttg
gtcattggcagcagggccgccgatgaagacgagctgaaagagatcgctttcgactcctcc
ctggtcttcattcctgctgagttccgccctgctcccttacaaagcatgctgcccagcctg
atggcacccctcaggaccctcaccggaaccacagaagttcacgtaaacaaacgggatatc
atcttccttttggatggctccgacaacgttgggaaaaacaattttccttatgtgcgtgac
tttgtaaccaacctagttaacagcctcgatgttggaagtgacaatatccgtgttggcttg
gtgcaatttagtgacactccggtcacggagttctctctagacacgtatcagaccaagtca
gagctgcttgctcacctgaggcgcttgcagctcaaggggggcaccggcttgaatgcaggc
tctgccttaagctatgtccatgccaatcacttcacggaggcaggcggcagcaggattcgt
gaacatgtacctcagttgctgctcctgctcatggccgggccttctgaggatgtctatttg
caagctgccaatgccctggtgcgctcaggggtgctgaccttctgtgtggggaccaatcag
gcggataaggctgagttagagcgtattgcttttaaccccagcctggtatacctcatggat
gatttcagcgccctgccagctttgcctcaacagctgattcaacccctaaccacatatgtt
agtggaggtgtacaggaagttccactgtcccagccagaggtcaagcgggacattcttttc
ctctttgatggctcagtcaatgtcatgggccagttccctgctgtcagagactttctctac
aggatcattgaggagctcgatgtgaagccagatgggacccgggtagcaatagcgcagttc
agtgatgacgtcagggtggagtcccgtttcaatgagcaccagacgaaggctgaaatcctt
aacctcgtgaagaagatgaagttaaagacaggcaaaggtctcaacctgggcttggccctg
gatcacgcccttaggaccatctttgtgaggtcggctggcagccgcatcgaggacaacgtg
cagcagttcctggtgctcctggtggctggaaggtcgacagatgctgtggccgggcctgca
agctctctgaagcagagaggggtgatacctttcatcttccaagccaagaatgccaatccc
agtgagctggagcagattgtgccatcccccgcattcatcctggctgcagagtcgctccct
aagatcggggacctccagtcagagattgtgggccttctgaaggctaaacagacttcagac
actgtatcaggtgaaaaggatgtggtgtttctgattgatggctccgagggggtccggagt
ggcttccccctgctaaaggagtttgtgcagagggtcgtggagagcctggatgtgggtccc
aaccaggtgcgtgtggcactggtgcagtacagtgaccggaccaggccggagttctatctg
aattcccacatggaccagcagggtgtcatcagcgccatccgcagactgacactgctgggc
ggcccaacccccaacacaggggcggcactggactttgtgttaaggaatatcctgaccagt
tctactgggagcaggatagaagaaggtgttcctcagctcctgatcgtcctcacggcggag
cggtcaggggatgatgtgagaggcccctcggtggtcctgaagcagggaggggctgtgccc
attggtattggcattggaaatgctgacatctctgaaatgcagaccatctccttcatcccc
gacttcgctgtggccatccccaccttccgggagcttgggacaatacagcaggttatctct
gagagggtgatccagcttaatcgtgaacagctgagcagtttgacacctgttttgatgccc
tcaccaagtgcaggtcctggtaacaagaaagacgtggtctttctcatcgatggatcccaa
aacgctggcgcagagttccagcacatccgtaccctgattgagaggctagtcgagtacctg
gacataggcttcgacaccacccgggtagccgtcatccagttcagtgaggacaccaagatg
gagtttcctctgaacgcccacttcagcaaggacgaagtacagaacgctgtgcgcaggctg
cggcccaagggtgggaggcaagtctacattggaaacgctctggagtatgtgctgaagaac
atcttccagagaccgcttgggagccggatagaagagggtgttccacagttcctagtcctc
atttcgtctgggaagtctgctgatgaggtggacgactcagctgtggaactcaagcagttt
ggcgtggcccctctcaccgtagccagacacacagaccaggaagagctggtgaagatctcc
ctgagtcctgagtacgtgttctccgtgagcaccttcagggagctgcccagactggagcag
aggctgctgacgcccatcaccactctaacctcacagcagatccagcagatcctggccagt
acacgatatcccccttcagtggctgaaagtgatgctgcggacatcgtctttctgatcgac
agctccgatgccatgaagcctgacggtattgctcatatccgtgactttgtcagcaggatt
gtccgcagactcaacatcggccccaataaagtgaggattggggtcgtgcagttcagcaac
gatgtcttccctgagttctacctgaagacccacaagtcacagaacagtgtgcttgaagcc
atccggcgtctgaggttcaaaggagggtctcccctgaacactggcagagccctggagttt
gtggcaagaaacctctttgtgaagtctgctgggagccggatagaagatggggtccctcaa
cacctggtcttgttcctgggtgggaagtctcaggacgatgtgtctaggcatgcccaagtc
ataagctcctcggggatcatgagtttaggaataggagaccggaacatcgataggacagac
ttgcaaaccatcaccaacgaccccaggctggtcttcacagttcgggaattcagagagctt
cccaacatagaagagagggtcatgctttcctttggaccctctggacctacccctcagcct
cctgaggtagaattcccttcttcccggccagagaagaagaaggccgacatcgtgttcctg
ctggatggttccatcaactttcggagggacagtttccaggaagtgctccgttttgcatct
gaaatcgtggacacggtctacgaagatggtgactccatcagagtggggctggtccagtac
aactcagaccccacggatgaattcttcctgagggacttttctaccaaaaggcagattatc
gatgccatcaacaaggtcatctacaaaggagggaggcacgccaacacgagggtgggcatt
gagcacctgctgaagaatcactttgtgtccgaggctgggagccgcctggacgagcgggtc
cctcagatcgccttcgtgatcacgggaggaaagtcagtggaggatgctcaggatgtgagc
ctggctctcacccagaaaggtgtcaaggtgtttgctgtcggcgtgaggaacatcgactct
gaggaggtggggaagatagcctccaacagtgccacagccttccgggtaggcagcgtacag
gagctgtcggaactgagtgagacggtgctcgagactctgcatgacgcgatgcacgagacc
ctctgtcctggtgtgaccgatgtttccaaagcctgtaatctggaagtcattctggggttc
gatggatctcgagatcagaacgtgtttgtgagtcagaagggtctcgagtccaaggtggac
accatcttgaatagaatcagccagatacaaaggatcagctgcagcagtaaccagctgccc
actgtgcgggtgtctgtggtggccaacacgccctctgggccagtggaagcctttgacttt
gccgagtatcagccagacctgttcgagaagttccgtaccatgcgcagccagcacccatac
gtcctcaccgctgacacgctgaagctgtaccagaacaaattcaggcaatcgtcacctgac
actgtgaaggtcgtcattcacttcacggatggagctgatggagacatggccgacttatac
cgagcatctgaggagctccgtcaagcaggtgtccaagccctgatcctggtgggtcttgaa
cgtgtggctaacctggagcggctgatgcaagtggagttcgggcggggcttcatgtacgac
aggcccctgaggctcaacttgcttgacttggactacgaattggcggagcagcttgataat
attgctgagaaagcttgctgtggggttccatgcaagtgctctggagagaggggagacaga
gggcccatcggcagcatcggaccaaagggtatatccggggaagatggctaccgaggctat
ccaggcgatgaaggtggacccggtgagcgaggtcccccgggtgtgaatggcacacaaggt
ttccagggctgccccggccagagaggagtgaagggttctcgaggattcccaggcgagaag
ggtgaattgggagaaattggcttggacggtctggatggtgaagagggagacaaggggctg
cctggttcatctggtgaaaaggggagtcctggaagaaggggtgacaaaggacccaaagga
gacaaaggcgagagaggagatgtggggattcgaggtgacccgggtgactcaggacgggac
agccagcagagaggacccaaaggagagacgggcgacattggccccatgggtctcccaggc
agagacggtatccccggcagtccaggagaccctgggaaagacggtggctttggccgaagg
gggcctgcaggagctaagggcaacagaggcggtcctggccagccaggctttgaaggagag
cagggtaccagaggttcacagggcccacctggtcccattggtcccccaggcctgattggg
gaacaaggaattcccggacctcggggaggcggaggtacggctggggctcctggagaacgt
ggaagaactggtccgctggggaggaagggtgagccaggagagccagggccgaagggaggc
actggaaatcgtggacctcgaggggagacgggagatgacgggagagatggggttggcagt
gaaggacgcagaggcaaaaaaggagagagaggcttccctgggtatccgggaccaaagggt
acccctggtgagccaggggcagatggaccaccaggacccaaaggcatcagaggtcgaagg
ggaaattcgggacctccaggagtagttgggcagaagggagaccctggctacccaggacca
tctggtcacaggggcaaccgaggagactccgttgatcaatgtgccctcatccaaagcatc
agagacaaatgcccttgctgctatgggcccctggagtgccctgtattcccaacagaactc
gcctttgccctggatacctctgagggggtaactcaggacaccttcagccggatgcgagaa
gtgcttctgagcattgtgggagatctgaccattgcagagagcaactgtccgaggggtgcc
cgagtggctgtggtcacctataacaatgaggtgaccacagagatccggtttgctgactcc
aagaagaagtctgccctcttagacagtattcagaatctccaagtggccttgacatctaag
caaccaagcctggagaccgcaatgtcattcgtggccaggaataccttcaaacgtgtgagg
aatggatttctgatgaggaaagtggccattttcttcagcaatcagcccacaaaaccatcc
cttcagctccgagaggctgtgctcaagctttcagatgcagggatcacacccttgttcctt
acaagccaggaggaccgagagctcatcaatgccttgcagatcaataacacagcagtagga
catgccctggttctgcctgcaaggagggatctcagagacttcctgaagaacgtcctcacc
tgccacgtttgcttggacatttgcaatattgacccatcttgtggattcggcaactggagg
ccttccttcagggaccggagggcagcaggcagcgatgtggacatagacttggctttcatc
ttggacagttcagaggctaccactctgttccagttcaatgagatgaagaagtacataagc
tatatggtcagacagctggacctgagcccagaccccaaggcttcccagcactttgccagg
gtggctgttgtgcagcaggccacctatgagtccgtggacaacaccagtgtgccaccagtg
aaggtggagttctcactgacagactatggtgccaaggaaaagctgctagacttccttagc
cggaggatgacccaacttcagggaaccatggacctgggcaatgccattgaatacactata
gagaacatctttgaaagtgctccaaacccacgggacctcaaaattgtggtgctgatgctg
accggggagataccaagggagcagctggaggaggcccagagagccatcctccaggccaag
tgcaagggttacttcttcgtggttctgggcattggcaggaaggtgaacgtcaaggaggtc
tacagttttgccagtgagcccaatgatgtcttcttcaaattcatggacaaatcgactgag
ctcaacgaggagcctctgatgcgctttggaaggctgctcccttcctttgtcggcagtgga
aatgcattttacctgcctccagatctcaggaagcagtgtgattggttccaaggggaccaa
ccagtgaagaatggcgtgaagtttggtcacagacagataaacactccacatactgttaat
tccagtcttacatccaaagttgtgaccacaatgaagccagtgacaacaaccaaaccaaca
tcaattgtaaatctgccacctgccaagccagccccagttaggccagcccctgcccagcca
gttcttgctaagccagaccctgccaaaccagtttctgctaagtcagtgcctcctcagcca
gtccatgctcagccagaccctgctcaaccagtccatgttcaatcagcctcagctcagaca
gcttctgcaaaaccagcccctgctaagccagcccctcctcagacagctgctgctaagcca
gcccctcctcagacagctgctgctaagccagcccctcctcagacagctgctgctaagcca
gctcctcctcagacagctgctgctaaaccagtccctgttaagccagcccctgctcaacca
gtccatgttcaatcagcctcagctcagacagcttctgctaaaccagtccctgctaaacca
gcccctcctcagacagctgctgctaagccagtccctgttaagccagcagtccctgctcag
ccagcccctgctcaaccagtgcatgctcagccagccccagctcagccagtccttactaaa
ccagccgctatgaagccagcctctgccaacaagcctgtggcagccaaacctgtggccaca
aacacagccacagtcagacctgcatcagcagtgaagccagcagcagcatcgaagcccgca
gcaacgagacccctccctgctgctgtaaggccagttgctacgaaaccagaggcccccagg
ccacaggccaaaccagctgccaccaagccagccactactaagcccatggcgagagtgtcc
cgagaagtccaggtctctgaagtcaccgaaaacagtgccagactccactgggagcggcct
gaaccttccagttcctatttttatgacctcactgtaacctcggcccatgaccagtccttg
gttctgagacaaaacctcactgtcactgaccgtgtcatcggaggcctgctggctggacag
ctgtaccatgtcgtagtggtctcctacctgcagtctcaagtcagagccatctaccaaggg
agtttcaatacgaagaaaactcagcctccagttcttcaggcagcgcaccgagcatctagt
tcaaccattaacctgatggtgaacacagagccactgtttctcactaaaacagatatctgt
aagctgcccaaagatgctgggacttgtgaggacttcaaattactatggcactatgactta
gagagcaaaagttgcaagagattctggtatggaggctgcgggggcaacgagaacagattc
cactcacaggaagaatgtgaaaagatgtgtagtcctgagttatcagtctga

KEGG   Rattus norvegicus (rat): 445442
Entry
445442            CDS       T01003                                 

Gene name
Thbs1
Definition
(RefSeq) thrombospondin-1 precursor
  KO
K16857  thrombospondin 1
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04015  Rap1 signaling pathway
rno04115  p53 signaling pathway
rno04145  Phagosome
rno04151  PI3K-Akt signaling pathway
rno04350  TGF-beta signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05144  Malaria
rno05165  Human papillomavirus infection
rno05205  Proteoglycans in cancer
rno05206  MicroRNAs in cancer
rno05219  Bladder cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04015 Rap1 signaling pathway
    445442 (Thbs1)
   04350 TGF-beta signaling pathway
    445442 (Thbs1)
   04151 PI3K-Akt signaling pathway
    445442 (Thbs1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    445442 (Thbs1)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    445442 (Thbs1)
  09143 Cell growth and death
   04115 p53 signaling pathway
    445442 (Thbs1)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    445442 (Thbs1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    445442 (Thbs1)
   05205 Proteoglycans in cancer
    445442 (Thbs1)
  09162 Cancer: specific types
   05219 Bladder cancer
    445442 (Thbs1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    445442 (Thbs1)
  09174 Infectious disease: parasitic
   05144 Malaria
    445442 (Thbs1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    445442 (Thbs1)
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    445442 (Thbs1)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    445442 (Thbs1)
Membrane trafficking [BR:rno04131]
 Endocytosis
  Phagocytosis
   Opsonins
    445442 (Thbs1)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   445442 (Thbs1)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   445442 (Thbs1)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC cEGF EGF_3 EGF_CA TSP1_ADAMTS Laminin_G_3 Laminin_G_2 TSP1_CCN EGF Laminin_G_1
Other DBs
NCBI-GeneID: 445442
NCBI-ProteinID: NP_001013080
RGD: 1588455
UniProt: Q71SA3
Position
3
AA seq 1170 aa
MELLRGLGVLFLLHVCGSNRIPESGGDNGVFDIFELIGGARKVPGRRLVKGQDLSSPAFR
IENANLIPPVPDDKFQDLLDAVWADKGFIFLASLRQMKKTRGTLLAVERKDNSGQIFSVV
SNGKAGTLDLSLSLPGKQQVVSVEEALLATGQWKSITLFVQEDRAQLYIDCDKMESAELD
VPIQSIFTRDLANVARLRVAKGDVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTNVLL
TLDNNVVNGSSPAIRTNYIGHKTKDLQAICGLSCDELSSMVLELRGLRTIVTTLQDSIRK
VTEENRELASELRRPPLCFHNGVQYRNNEEWTVDSCTECHCQNSVTICKKVSCPIMPCSN
ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSATCGNGIQQRGRSCDSLNNRCEGSSVQT
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE
TKACKKDACPINGGWSPWSLWDICSVTCGGGVQRRSRLCNNPTPQFGGKDCVGDVTENQV
CNKQDCPIDGCLSNPCFAGAKCTSYPDGSWKCGACPPGYSGNGIQCKDVDECKEVPDACF
NHNGEHRCKNTDPGYNCLPCPPRFTGSQPFGRGVEHAMANKQVCKPRNPCTDGTHDCNKN
AKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPN
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN
HNPDQADTDNNGEGDACAVDIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN
DGIPDDRDNCRLVPNPDQKDSDGDGRGDACKDDFDHDNVPDIDDICPENVDISETDFRRF
QMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDD
DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD
KTYAGGRLGLFVFSQEMVFFSDMKYECRDS
NT seq 3513 nt   +upstreamnt  +downstreamnt
atggagctcctcaggggactaggcgtcctgttcctgttgcatgtgtgtggaagcaaccgc
attccagagtctgggggagacaatggtgtgttcgacatttttgaactcattggaggtgcc
cgcaaggttccgggtcgccgactggtgaagggccaagatctatccagccccgccttccgg
atcgagaatgccaacctgatcccccctgtgccagatgacaagttccaagacctactggac
gctgtgtgggccgacaaaggcttcatcttcctggcttccttgaggcagatgaagaagact
cggggcacactcctggccgtggaacggaaagacaattctggccagatcttcagtgtggtc
tccaatggcaaagccggcaccctcgatctgagtctgagcctgcccgggaagcagcaagtg
gtgtcagtggaggaagctctcctggccactggccagtggaagagcatcacgctgtttgtc
caagaagacagggcccagctctacattgactgtgacaagatggagagcgcagagctggat
gttcccatccagagcatcttcaccagggatttggccaatgtcgccaggctccgagtcgca
aagggagatgtcaatgacaattttcagggggtgctgcagaatgtgaggtttgtctttgga
accaccccagaagacattctcaggaacaaaggctgctccagttctaccaacgtccttctc
acccttgacaacaacgtggtgaacggttccagccctgccatccgcaccaactacatcggc
cacaaaacaaaggacctccaagccatctgtggcctctcctgtgacgaactatccagcatg
gtcctggaactgaggggcctgcgtaccatcgtgaccacgctacaggacagcattcgaaaa
gtgacggaagagaacagagagctggctagcgagctgaggcggcctcccctctgcttccac
aatggagtccaatacaggaacaacgaggagtggactgtagatagttgcacagagtgtcac
tgccagaactcagttaccatctgcaaaaaggtgtcctgtcccatcatgccctgctccaat
gccacagttcctgatggtgaatgctgcccacggtgctggcccagtgactctgctgacgat
ggctggtctccctggtctgagtggacctcttgctctgccacctgtggcaatgggattcag
caacgtggtcgctcctgtgacagcctcaacaacagatgtgagggctcttcagtacagacg
aggacctgccacattcaggagtgcgacaaaagatttaaacaggatggtggttggagtcac
tggtccccgtggtcgtcctgttctgtgacctgtggtgacggtgtgatcacgaggatccga
ctctgcaactcccccagcccccagatgaacgggaagccctgtgaaggtgaagctcgggag
accaaagcctgcaagaaggacgcctgcccaatcaatggaggctggagtccctggtcacta
tgggacatctgctctgtcacctgtggaggaggagtacagagacgtagccgactctgcaac
aaccccacaccccagtttggaggcaaagactgtgttggtgatgtgacggaaaatcaagtt
tgcaacaagcaggactgtccgattgatggatgcctgtccaatccctgctttgctggtgcc
aagtgtacgagctatcctgatggtagctggaaatgtggtgcgtgtcctcctggctacagt
ggaaatggtatccagtgcaaagacgtcgacgagtgcaaagaagtgcctgatgcttgcttc
aatcacaacggagaacatcggtgcaagaacacagatcctggctacaactgcctgccctgc
ccaccacgattcaccggctcacagcccttcggcagaggtgtcgaacacgctatggccaac
aaacaggtgtgcaaaccccgaaacccctgcaccgacgggacacacgactgcaacaagaac
gccaagtgcaactacctgggtcactacagcgaccccatgtaccgctgtgagtgcaagcct
ggctacgcaggcaatggcatcatttgcggagaggacacagaccttgacggctggcctaat
gaaaacctggtgtgtgtggccaacgcaacttaccactgcaaaaaggacaactgccccaat
cttcccaattcggggcaggaagactatgacaaggatgggattggcgatgcctgtgatgac
gatgatgacaacgacaagattcctgatgacagggacaactgtccattccattacaaccca
gcccagtatgactatgacagagatgatgtgggagaccgctgtgacaattgcccctacaac
cacaaccctgaccaggcagacacagacaacaatggggagggagatgcctgcgctgtggac
atcgatggggatggaatcctcaatgaacgagacaactgccagtatgtttacaacgtggat
cagagggacacggacatggatggggttggagaccagtgtgacaactgccccctggaacac
aatccagaccagctggactctgactcggaccgcataggggacacctgtgacaacaatcag
gacatcgatgaggatggccatcagaacaacctggacaactgtccctatgtgcccaatgcc
aaccaggccgaccacgataaagatggtaaaggagacgcctgtgaccatgacgatgacaac
gacggcatccctgatgacagagacaactgcaggctggtgcccaatcctgaccagaaggac
tctgatggtgatggccgaggcgatgcctgcaaagacgactttgaccatgacaatgtgcca
gacattgatgacatctgtcctgagaatgttgacatcagtgaaaccgatttccgccgattc
cagatgattcctctagatcccaaaggaacctcccaaaatgaccctaactgggttgtccgc
catcagggcaaagaacttgtccagactgtaaactgtgaccctggacttgctgtaggttat
gatgagtttaatgccgtggacttcagtggtaccttcttcatcaacactgagagggatgac
gactatgctggctttgttttcggctaccagtccagcagccgcttctacgttgtgatgtgg
aaacaagtcacccagtcctactgggacaccaaccccacaagggctcagggatactcaggc
ctgtctgtaaaggttgtaaactccaccactggccccggcgagcacctgcggaatgcactg
tggcacacaggaaacacccctggccaggtgcgcaccctgtggcatgaccctcgtcacatt
ggctggaaagatttcactgcatacagatggcgtctcagccacaggccaaagaccggtttt
atcagagtggtgatgtatgaaggaaagaaaatcatggctgactcaggacccatctatgac
aaaacctacgctggcggtagactaggcctgttcgtcttctctcaagaaatggtgttcttc
tcggacatgaaatacgagtgccgagactcctaa

KEGG   Rattus norvegicus (rat): 501047
Entry
501047            CDS       T01003                                 

Gene name
Col6a5
Definition
(RefSeq) collagen alpha-5(VI) chain isoform X2
  KO
K06238  collagen type VI alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04974  Protein digestion and absorption
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    501047 (Col6a5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    501047 (Col6a5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    501047 (Col6a5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    501047 (Col6a5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    501047 (Col6a5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    501047 (Col6a5)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    501047 (Col6a5)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   501047 (Col6a5)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   501047 (Col6a5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   501047 (Col6a5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Copine vWA-TerF-like
Other DBs
NCBI-GeneID: 501047
NCBI-ProteinID: XP_008764816
RGD: 1565804
Ensembl: ENSRNOG00000010663
Position
8
AA seq 2635 aa
MKLQLIVFVLILWTETLSDQSPGPGPKYADVVFLVDSSDYLGIKSFPFVRTFLNKIISSL
PVEANKYRVALAQYSDALHNEFHLGAFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRT
YFSAPTNGRDKKQFPPILVVLASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAM
ATPQFHFNLRTARDLSLFAPDMAQIIQDVTQYREGTTVAVVTDVAPTTPEPLITPAALTT
PANNVDETVPFLASCQKDSLADLIFLVDESVGTTQNLRDLQNFLENVTSSVDVKDNCMRL
GLMSFSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGAAIEHMRTEGFSESSGSRK
AQGVPQIAVLVTHRASDDAVREAALDLRLQGVTMFAMSVQGANNTQLEDIVSYPSRQSIS
THSSYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKTGCVDTKEADIYFLIDGSSSIR
RKEFEQIQIFMSSVVDMFPIGPNNVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQ
LKGGTFTGKALDFILPIIKKGKSERIHEVPCYLIVLTDGKSNDSVLEPANRLRAEHITIH
AVGIGEANKTQLQQIAGKDERVSFGQNFDSLKYIKNEIVHRICSDKGCEDMKADIMFLVD
SSGSIGLTNFEKMKTFMKNLVGKIEIGADRSQVGVVQFSDYNKEEFQLNKYSTREEVYAA
IDGMSPINRNTLTGSALTFVNEYFDISKGGRPQVRKFLILLTDGEAQDEVGGPAMALRSK
SVTIFSVGVYGANRTQLEEISGEGSLVFHVENFDHLKTIESKLIFRVCALHDCKRIELLD
IVFVLDHSGSIGPREQESMINLTIHLVEKADVGRDRVQFGALMYSNNPEILFYLNTYSSR
SAITEHLKRPRDTRGDTYTAKALQHANILFMEEHGSRLKQNVRQLMIVITDGESHDRDKL
NDTARELRDKGITIFAVGVGRANQDELETMAGKKENTIHVDNFDKLRDIYLPLQEILCNN
SQETCNLPEADVIFLCDGSDMVSDSDFVAMTTFLSDLIDNFDIESQRLKIGMAQYGSRYQ
EIIELESSLNKTQWKSQVHSVAKSKGLPRIDFALKHVSDMFDPSVGGRGNAGVPQTLVVI
TSSYPHYDVSRAVKVLRELGICILALGIGNVYKGQLLPITGNSEKIITFKDFNTLKNVDV
KKRMVREICQSCGKANCFVDVVVGFDISTHRQGQSLFQGHPRLESYLPGILEDITSIRGV
SCGAGADTQVSLAFKVNSEQEFPAKFQIYQKAAFNSLLQVTVRGPTHLNAQFLQSLWDVF
EERSSSRGQVLLIFSDGLQGESITLLEQQSDRLREAGLDALLVVSLNTFGHHEFSSFEFG
KGFDYRTQLTIGMIDLSKKLSQYLGTIAERACCCTFCKCTGIPGPHGTRGLQASKGSSGL
KGSRGHRGEDGDPGRRGDTGLQGDRGVMGCPGIRGQKGVKGFSGAQGEHGEDGLDGLDGE
EGFYGLRGGKGQKGDPGSQGHPGSRGAPGEDGEKGFPGDPGNPGQDSDIKGQKGAKGERG
RQGRPGQKGIDGSPNSRGSRGMQGRRGPPGPSGQAGDPGPQGIQGPEGLQGPQGSSGDRG
GKGDKGSEGLQGPQGSPGPVGPRGDIGRPGFGGRKGEPGVPGGPGPLGPPGHRGEQGDYG
IPAYGQTGRKGVKGPTGFPGDPGQKGDAGDPGIPGGPGPKGFKGLTLSQGLKGGSGLPGS
QGPPGRRGPKGTAGQPIYSPCELIQFLRNNSPCWKEKCPVYPTELVFALDQSFGTSERRF
NETRDTIASIIDDLNIRENNCPVGARVVVVSYDSDISYLIRGSDYHSKKHLLQLLSQIKY
QVPQKIRDIGSAMRFVARNVFKRTSLGTNTRRVAVFFSNGQAASRASILTATMEFSALDI
SLAVFAHNERVFLDEAFGFDDTGTFQVIPVPPVGDYEPLEKLRRCTLCYDKCFPNTCAEE
PFFPENSYMDVAFLLDNSRNIASDDFQAVKGLVSSVIDGFHITSNPSTSESGDRVALLSY
SPSENSRRKGSVKTEFGFTTFDSESIMKNYIHTSLQQLNGDAAIGLALEWAMEGLFLGTP
NPRKHKVIIVISAGENHEEKEFVKTVALRAKCQGYIIFVVSLGSTRRDEMEELASYPLDH
HLIQLGRMDKPELSYVVKFLKPFVYSVRRGFNQYPPPQMLEDDCRLVDSERGDVLHRRSL
RLTAKLHEVSASKNSPAGQELKARKDLSVVLEDHGGDHLVYIPTHMVEPHKLASHYGNDQ
ESVRMASLTSEHENHSRRELSLAQEPGDASPQEYYMDVAFLIDASQQIGHNAFTEVRAFI
TSVLDYFHITPAPLTSVLGDRVAVLTYSPPGYLPNTEECPVYLEFDLVTYNTVHQMKRHL
QESLKQLNGDVFLGHALQWTIDNVFVGTPNLRKNKVIFIVTAGETNPLDKEVLRNASLRA
KCQGYSIFVFSFGPVHNDLELEELASHPLEHHLVRLGRVHKPDLDYIIKFIKPFVHSIRR
AINKYPSRDLQAKCANLTFPGLENAGTEDSILLIPEVYQIESGESELSGYSGSQEQPFFL
LGKGNDNGSEGTADLMQHLYLLFSSGELMVNDEEEPCSEETSAPANGKQDGEDAR
NT seq 7908 nt   +upstreamnt  +downstreamnt
atgaagctccagctaattgtgtttgtcctaatcctttggactgagaccctgtcagaccag
agcccagggccaggacccaagtatgcagatgtggtctttctggtagacagctcggattac
ctgggaattaagtccttcccattcgtgaggacttttctcaacaagataatcagcagcctc
cccgtagaggccaacaagtaccgtgtggccctggcccagtacagtgatgctctccacaat
gagttccacctaggcgccttcaagaacaggaaccccatgctgaaccacctcaagaagaac
ttcgggttcatcggtggctccctgaagattgggaacgccctgcaggaggctcacaggacc
tatttctctgctcccacaaacggaagagacaagaaacagttccccccaatcctggtggtg
ctggcttcagcggagtccgaggatgacgtggaagaggcttcgaaggccctgcgggaagat
ggggtgaaaatcatctctgtgggggtgcagaaggcttctgaggaagacctgaaggccatg
gccacccctcagttccatttcaacctcaggactgccagagacctcagcctgtttgcacca
gacatggcacagatcatccaggatgtgactcagtacagggaaggaacaacggtggccgta
gtcacagatgtagccccgaccacacccgaacctctaatcacacctgcagctctaaccaca
cctgcaaacaatgtcgatgagacagttcccttccttgcatcctgccagaaagactccctt
gccgaccttatattcttggtggacgagtccgtggggaccacacaaaatctgagggatctg
cagaacttcctggagaatgtcaccagctctgtggatgtgaaggacaattgcatgaggctt
gggctgatgagcttcagtgacaaggctcggactgtgtcttctctgaagtcaagcaccagc
cagtctgacttccagcagcaaatccagaagctctcgctccggacgggggcatctaatgtt
ggggccgccattgagcacatgaggacagaaggcttctcagagtccagtggcagcagaaag
gcacagggggtcccccagattgctgtgctggtcacccacagagcgtcagatgatgcggtg
cgggaggctgcgctggacctgcgcttacagggtgtaactatgtttgcaatgagcgtccaa
ggggccaacaacacacagctagaagacatagtgtcatacccttccagacagagcatttcc
acacacagctcctacaagcacttggaatcatacagtgggaacttcctgaagaagatccac
aatgaaatctggacccaggtgtctacccatgctgaacaaatggagctggataaaactggc
tgtgtggacacaaaggaggccgatatctatttccttattgatgggtcgtccagcatccgc
aggaaagagtttgagcaaatccagatattcatgtcctccgtggtagacatgtttcccatt
ggccccaacaacgtccgggttggagttgtacagtactcacacaggaacgaagtggaattt
cctgtctctcaatacacgaagggcatcgaattgaggaaggctgtttggaacattaagcaa
ctcaaaggtggtacattcactgggaaagccctggatttcatcctgccaataatcaagaag
ggaaagtcagagaggatccacgaagtcccctgctacctcatcgtgctgactgatgggaaa
tcaaacgacagcgttctagaacctgccaacagactgagggccgagcacatcaccattcac
gccgttgggattggcgaggcgaacaagacacagctgcaacaaattgctgggaaagacgaa
agggttagctttgggcagaattttgattctctgaagtacataaaaaacgaaattgtccac
cgcatctgctctgataaaggatgtgaagacatgaaggccgacatcatgttcttggtggac
agttctggaagcatagggcttacaaattttgagaaaatgaaaactttcatgaaaaacctg
gtggggaagattgagatcggcgccgacagatcccaagtcggtgtcgttcagttcagtgac
tacaataaagaagagttccagcttaataaatactccacacgtgaggaagtttatgccgcc
atagatggaatgtctcccatcaacagaaacactttgaccggaagtgcactaaccttcgta
aatgaatactttgacatttccaagggggggcgtccccaggtcagaaagttcctcatcctc
ctcacagatggagaagcccaggatgaagtggggggtccggccatggcgctcaggagcaaa
agtgtgactatcttctctgtgggtgtgtacggtgctaacaggactcagttggaggagatc
agtggggagggaagcctggttttccatgttgagaactttgatcatctgaagaccatagag
agtaaactcatctttcgagtgtgtgctctgcatgattgcaagaggattgaactcttagac
attgtgtttgtgctggaccactcaggcagcatcggtccacgggagcaagaaagcatgatt
aacctcaccatccacctggtggagaaggcagacgttggcagggacagagttcagttcgga
gctctcatgtattccaacaaccccgagattcttttctaccttaacacttactcgagcaga
tcggccatcaccgagcacctgaagaggcccagggacaccagaggggatacctacactgcc
aaggcccttcagcacgccaacatcctgttcatggaagagcatggcagccgcctgaagcag
aacgtgagacagctgatgattgtcatcaccgatggcgagtcccacgaccgagataaactc
aacgacacagcacgggagctcagagacaagggaatcaccatctttgcagttggcgtaggc
cgtgccaaccaggatgaactggagaccatggcagggaagaaagaaaatactatccatgtg
gacaatttcgacaaactgagagacatttacttgcctctacaagagattctgtgcaataat
tcacaagagacttgtaaccttccagaagctgatgtgattttcctttgtgatggctctgac
atggtgtctgattcagattttgttgccatgacaactttcttgtcagacttaattgataat
tttgacattgagtctcaaagactgaaaatcgggatggctcagtacgggagccggtatcag
gaaattattgagttggaaagctctctgaacaaaactcagtggaaatctcaggtccacagt
gtggctaagagcaaggggcttccacgaattgactttgctcttaagcatgtgagtgacatg
ttcgacccttcggttggcgggagaggaaacgctggtgtccctcagactttggttgttata
acatctagttatccgcactatgatgtgtcaagagcagtgaaggtcctgagagaacttggc
atctgtatcctggctttgggaataggaaatgtttataagggacagcttctgcccatcaca
ggcaattcagaaaaaataattacgttcaaagactttaatacactaaagaatgtggacgtg
aagaaaagaatggtccgtgaaatttgccagagctgcgggaaagccaattgctttgtggat
gtagtggtcgggttcgacatttccactcaccgtcagggccagtctttgttccagggccat
ccccgcttggaatcgtacctcccaggcatcttggaggacatcacctctatcagaggggtg
agctgtggtgcaggagcagacacacaggtgagcttggcctttaaggtgaacagtgagcaa
gagttcccagccaagttccagatctatcagaaagcagcgttcaacagcttgttgcaagtc
accgtccgtggaccaactcatctgaatgcacagttcttacagtcactctgggatgtgttt
gaggaaagatcttcatccaggggccaggtcctgctcatcttctcagacggtctccagggt
gaaagcatcactttgctggaacagcagtcagacaggctcagagaagcaggacttgatgcc
ctgttggtggtgtccctgaacacatttggtcaccatgagttttccagctttgaatttgga
aaaggattcgactacaggactcagctgaccattggaatgatagatctgagcaagaagctc
tcacagtatctggggaccatcgcagaacgggcttgctgctgtacattctgtaaatgtaca
gggattccaggacctcatggaacccggggactccaagcctcgaagggttcttcaggtctg
aaaggcagcagaggacacaggggagaggatggagaccctggaagacgaggagacactggg
cttcaaggagacagaggagtcatgggatgcccggggattcgaggtcaaaagggagtcaaa
ggattttcgggagctcagggagaacacggagaggacgggcttgacggactggacggggaa
gaaggattttacggacttcgtgggggaaaaggacaaaaaggtgaccctggctctcagggc
cacccaggttccagaggcgctcctggggaagatggcgagaagggcttcccaggggatccc
ggtaacccaggacaagacagcgacatcaaaggccagaagggcgccaaaggagaacgagga
agacaagggagacctggacagaaggggatagatggcagtcctaactccagaggaagcagg
ggaatgcaaggccggagggggcccccaggtccttcagggcaagcaggagacccgggacct
caaggtatacagggacctgaaggactacaaggcccacaggggtcaagtggagatcgtggc
gggaaaggagacaaaggaagtgaagggcttcagggacctcagggttctcctgggccagta
ggacccagaggggacatcggaagacctggatttggggggagaaagggggaacctggagtt
cctggaggcccaggaccactggggccccctggacatcgaggagaacagggagattatggc
atcccagcctatggtcagacaggacgaaaaggagttaagggcccaacaggattccctggg
gacccagggcagaagggcgatgctggagatcctggaattcctggggggcctggacccaaa
ggattcaagggactgactctctctcaaggcttgaaaggtgggagtggacttccgggctct
caaggcccccctggtcggagaggtcctaaaggcacagcagggcaacctatatattccccc
tgtgaactgatccagttcctgaggaacaatagtccttgttggaaagaaaagtgtcccgtg
tacccaacagagctggtatttgccttggaccagtcctttggtacttcagagaggagattt
aatgagacaagggacaccattgcgtcaattatcgatgacctcaacatcagggaaaacaac
tgtcccgtgggagcacgggttgtcgtggtttcctatgactcagacatcagctacctcatc
cgtgggtctgactaccatagcaagaagcatctccttcagcttctttcccagataaaatac
caagtcccccagaagatcagggacattggcagtgcaatgaggtttgtggcccgcaatgtt
ttcaagcggacatctctgggaaccaacaccaggagagttgctgtgttttttagcaatggc
caggcagccagtagagcatccatcctcacggccaccatggagttcagtgccctggatatc
agccttgcggtctttgctcataatgaaagagttttccttgatgaggcttttgggtttgat
gacactgggacatttcaggtgattccagttcctccagttggagattacgagccactggaa
aagcttcgacgctgcacactttgctatgacaagtgttttccaaatacttgtgcagaagag
cctttctttcctgagaattcctacatggatgttgcattcctcttagacaattctcgaaac
atagccagtgatgacttccaagctgtgaaagggctggtgagctcagtgattgacggcttc
cacatcacttcaaacccttcaacctctgagtctggcgacagggttgctttgctgagctat
tctccctcagagaattccagaaggaagggcagcgtgaagacagagtttgggtttacaacc
tttgacagcgaatcaatcatgaagaactacatccacacttccctccaacagctcaatgga
gatgccgccattggtcttgccctggagtgggccatggagggtctcttcctgggaaccccc
aatccaagaaagcacaaggtcatcattgtgatttcagctggagaaaaccacgaggagaag
gaatttgtgaagacggtggctttgagggccaaatgtcaaggctatatcatatttgtggtt
tctctgggctccacacgaagagatgagatggaggaactagctagttacccactcgatcac
catctgatacagcttgggagaatggacaagccagagctgagttatgttgtgaaatttctg
aagccctttgtatactcggtcagacgcggatttaatcagtacccgccgccacagatgctg
gaggacgactgcaggcttgttgattcagagaggggggacgtgctgcacagaagaagcctc
agattgacggccaagctgcatgaggtttctgcgagcaagaacagtccggctggtcaagag
ctgaaagccaggaaagacttgtctgtagtgttggaggatcacggaggtgaccatttggtt
tacattccaacccatatggtagagccacataagttagcgagtcactatggaaatgaccag
gagtctgttagaatggcaagcctcacttccgaacatgaaaatcatagcagaagagaactg
agtcttgctcaagagcctggagatgcttctcctcaagaatattacatggatgtggctttc
ctcatagatgcttctcaacaaataggacataatgcgtttacagaagtgagagcttttata
acttcagtactagattactttcacatcaccccagctccactgacctctgtcttaggagac
agagtggctgtcctgacctattctcctccaggctatctgccaaacacggaagaatgccca
gtctacctggaattcgatttggtcacttacaacactgtacaccaaatgaagcgtcatctc
caagagtctcttaagcaactcaatggagatgtttttcttgggcatgcccttcagtggaca
attgacaatgtctttgtaggaacacccaatctgaggaaaaacaaagttatctttatagta
actgcaggggaaaccaacccgttagacaaagaagttttaaggaatgcatctctgagagcc
aagtgccaaggctactccatatttgtattttccttcggtcctgtacacaatgacttggag
ttagaagaattagcaagtcaccccctggagcatcacttagtccggcttggtcgggtacac
aagccagatttggactatatcatcaagttcatcaagccatttgttcattcaatcagacgt
gctatcaacaaatatcctagcagagacctgcaagccaagtgtgctaatctcactttccct
ggcctggagaatgctggcacagaagacagtatattactcatccctgaggtatatcaaatc
gagtcaggagagagtgagctgtctggatactccggttctcaggagcagcctttcttcctc
ctagggaagggtaatgataacgggtcagaaggcactgctgatctcatgcagcacctgtac
ctgctcttctcctctggagaactcatggtgaatgatgaagaagagccgtgttctgaagag
acctcagctccagcgaacggcaagcaagatggtgaagatgcgagatga

KEGG   Rattus norvegicus (rat): 681309
Entry
681309            CDS       T01003                                 

Gene name
Thbs3
Definition
(RefSeq) thrombospondin-3 precursor
  KO
K04659  thrombospondin 2/3/4/5
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04145  Phagosome
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno05144  Malaria
rno05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    681309 (Thbs3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    681309 (Thbs3)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    681309 (Thbs3)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    681309 (Thbs3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    681309 (Thbs3)
  09174 Infectious disease: parasitic
   05144 Malaria
    681309 (Thbs3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:rno04131]
    681309 (Thbs3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:rno04147]
    681309 (Thbs3)
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    681309 (Thbs3)
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    681309 (Thbs3)
Membrane trafficking [BR:rno04131]
 Endocytosis
  Phagocytosis
   Opsonins
    681309 (Thbs3)
Exosome [BR:rno04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   681309 (Thbs3)
  Exosomal proteins of colorectal cancer cells
   681309 (Thbs3)
  Exosomal proteins of bladder cancer cells
   681309 (Thbs3)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   681309 (Thbs3)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   681309 (Thbs3)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF EGF_MSP1_1
Other DBs
NCBI-GeneID: 681309
NCBI-ProteinID: NP_001292387
RGD: 1596780
Ensembl: ENSRNOG00000059903
UniProt: A0A0G2JZH3
Position
2
AA seq 955 aa
MEKPELWVALALLLCSYTCGSQDLQVIDLLTVSESRQMAAVAEKVRTALLTAGDIYLLST
FRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGKVHAVNLQQAGLADGR
THTALLRLRGPSRPSPGLQLYVDCKLGDQHAGLPALAPIPPAEVSGLEIRTGQKAYLRMQ
GFVESMKIILGGSMARVGALSECPFQGDESIHNAVTSALQSILGEQTKALVTQLTLFNQI
LVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPSPCFRGVDCMEVYEYPGYRCG
PCPPGLRGNGTHCEDIDECAHADPCFLGSSCINTMPGFHCEACPQGYKGTRVSGVGIDYA
RASKQVCSDIDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCTPARTCHSP
AHSPCHIHAHCLFERNGAVSCQCNVGWAGNGNVCGPDTDIDGYPDQALPCMDNNKHCKQD
NCLLTPNSGQEDADNDGVGDQCDDDADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACDN
CPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDTCDSC
PEMSNPTQTDADSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECDS
DDDNDGVPDYVPPGPDNCRLVPNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAEV
TLTDFRAYQTVILDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTF
HVNTVTDDDYAGFLFSYQDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSISGP
GEHLRNALWHTGHTPDQVRLLWTDPRNVGWRDKTSYRWRLLHRPQVGYIRVKLYEGPQLV
ADSGVIIDTSMRGGRLGVFCFSQENIIWSNLQYRCNDTVPEDFEPFRRQLLQGRV
NT seq 2868 nt   +upstreamnt  +downstreamnt
atggagaagccggaactttgggttgccctggctctcctcctttgctcttacacatgtggc
agtcaggaccttcaggtaattgacctgctgacggtgagtgagtccaggcagatggcagct
gtagcagagaaggtccggacagctttactcactgctggggacatctacctcttgtccact
ttccgcctccctccaaagcaaggtggtgtcctctttggcctctactctcgccaagacaac
acacgatggctggaggcctctgttgttggcaagatcaacaaagtgctggtgcggtaccag
cgggaagatggcaaagtgcacgcagtgaaccttcagcaagcaggcttggctgacgggcgc
acgcacaccgctctcctgcgactccgaggaccttctcgacccagccctggcctgcaactg
tatgtggactgtaaactgggtgaccaacacgctggcctcccagccctagcgcccattcct
ccagcggaggtcagcggactggagattagaacaggacaaaaggcttacctgaggatgcag
ggctttgtggaatcgatgaagattattttgggtggatcgatggctcgggtgggagccctg
agcgaatgtccgttccaaggagatgaatccattcacaatgcagtgaccagcgcgctgcag
tccattctaggagagcagaccaaggcactggtcactcagctcacactcttcaaccagatc
ctagtggagctccgggatgacatccgggaccaggtgaaggaaatgtccctcatccggaac
accatcatggagtgtcaggtgtgcggtttccacgagcagcgttcacactgcagccccagc
ccctgcttcagaggtgtggactgcatggaagtgtatgagtacccaggctaccgctgtggg
ccctgcccacctggcctgcgcggcaatggcacccactgtgaagacatcgacgagtgtgct
cacgctgacccctgtttcctggggtccagctgcatcaacaccatgcctggcttccactgc
gaggcctgtcctcaagggtacaagggcacccgagtgtctggagtgggcattgactatgct
agagccagcaaacaggtctgcagtgacattgacgaatgtaacgacggtaacaatggaggc
tgtgacccaaattccatctgcaccaacactgtggggtctttcaaatgtggtccctgccgc
ctgggtttcctggggaaccagagccagggctgcactccagccaggacgtgccacagcccc
gcccacagcccctgccacatccacgctcattgtctttttgaacgcaacggtgcagtgtca
tgccagtgtaacgtgggctgggcagggaacgggaacgtgtgtggacctgacacggacata
gacggctatccagaccaggcgcttccctgcatggacaacaacaaacactgcaagcaggac
aattgccttttgacacccaactctgggcaagaagatgctgataatgatggtgtcggggac
cagtgtgatgacgacgctgatggggatggtatcaagaatgttgaggacaactgccgactg
ttccccaacaaggaccagcagaattcagacacagattcatttggtgacgcctgtgacaac
tgtcccaatgttcccaacaatgaccagaaggacacggatggcaatggagaaggagacgcc
tgtgacaacgatgtggacggggacggcatccccaatggcttggacaattgccctaaagta
cccaacccactgcagacagaccgggatgaagatggagtcggagacacttgcgacagctgc
cctgagatgagcaatcctacccagacagatgcagacagtgacctggtaggggatgtctgt
gacaccaacgaagacagcgatggagacgggcaccaggacaccaaggacaactgtcctcag
ctgcccaacagctctcagctggactcggacaacgacgggcttggagatgagtgtgacagc
gacgatgacaatgacggggtgcccgattacgtgcctcccgggcctgataactgtcgcctg
gtacccaatcctaaccagaaggactcagacggcaacggtgttggtgatgtctgtgaggat
gactttgacaatgatgcagtggttgaccctctggatgtgtgcccagaaagtgcagaagtt
accctcacggacttccgggcctatcagaccgtgattctggatccggagggtgacgctcag
atcgatccaaactgggtagtgctcaaccagggcatggaaattgttcagactatgaacagt
gacccaggtcttgcagttggatacacagccttcaatggtgtagactttgaaggcactttc
cacgtgaacacagtcactgatgatgactatgcaggcttcctcttcagttaccaggacagt
ggccgcttctatgtggtcatgtggaaacagacggagcagacctactggcaggccacgcct
ttccgagcagtggcccaaccagggctgcagctcaaggcagtgacatcaatatctggccca
ggtgagcacctccggaatgctctgtggcacactggccacacccctgatcaggtacgactg
ctgtggactgatccacggaatgtgggttggcgagacaagacatcctatagatggcggctg
ctccaccggccccaagtcggctacattcgggtgaagctctatgagggtccccagctagtg
gcggactctggggtgattattgatacgtccatgcgaggggggcgtcttggtgtattctgc
ttctcccaagaaaacatcatttggtctaatctccagtaccgatgcaatgacacagtgcct
gaggactttgagccattcaggaggcaactgctccagggaagggtgtga

KEGG   Rattus norvegicus (rat): 84352
Entry
84352             CDS       T01003                                 

Gene name
Col1a2
Definition
(RefSeq) collagen alpha-2(I) chain precursor
  KO
K06236  collagen type I alpha
Organism
rno  Rattus norvegicus (rat)
Pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04512  ECM-receptor interaction
rno04611  Platelet activation
rno04926  Relaxin signaling pathway
rno04933  AGE-RAGE signaling pathway in diabetic complications
rno04974  Protein digestion and absorption
rno05146  Amoebiasis
rno05165  Human papillomavirus infection
rno05205  Proteoglycans in cancer
rno05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    84352 (Col1a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    84352 (Col1a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    84352 (Col1a2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    84352 (Col1a2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    84352 (Col1a2)
  09154 Digestive system
   04974 Protein digestion and absorption
    84352 (Col1a2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    84352 (Col1a2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    84352 (Col1a2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    84352 (Col1a2)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    84352 (Col1a2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    84352 (Col1a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:rno00536]
    84352 (Col1a2)
Glycosaminoglycan binding proteins [BR:rno00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   84352 (Col1a2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 84352
NCBI-ProteinID: NP_445808
RGD: 621351
Ensembl: ENSRNOG00000011292
UniProt: P02466
Structure
PDB: 
3HQV 3HR2

Position
4
AA seq 1372 aa
MLSFVDTRTLLLLAVTSCLATCQSLQMGSVRKGPTGDRGPRGQRGPAGPRGRDGVDGPVG
PPGPPGAPGPPGPPGPPGLTGNFAAQYSDKGVSAGPGPMGLMGPRGPPGAVGAPGPQGFQ
GPAGEPGEPGQTGPAGSRGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLP
GFKGIRGHNGLDGLKGQPGAQGVKGEPGAPGENGTPGQAGARGLPGERGRVGAPGPAGAR
GSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGEAGLPGLSGPV
GPPGNPGANGLTGAKGATGLPGVAGAPGLPGPRGIPGPVGAAGATGPRGLVGEPGPAGSK
GETGNKGEPGSAGAQGPPGPSGEEGKRGSPGEPGSAGPAGPPGLRGSPGSRGLPGADGRA
GVMGPPGNRGSTGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNVGPAGKEGPVGLPGID
GRPGPIGPAGPRGEAGNIGFPGPKGPSGDPGKPGEKGHPGLAGARGAPGPDGNNGAQGPP
GPQGVQGGKGEQGPAGPPGFQGLPGPSGTAGEVGKPGERGLPGEFGLPGPAGPRGERGPP
GESGAAGPSGPIGSRGPSGAPGPDGNKGEAGAVGAPGSAGASGPGGLPGERGAAGIPGGK
GEKGETGLRGEIGNPGRDGARGAPGAIGAPGPAGASGDRGEAGAAGPSGPAGPRGSPGER
GEVGPAGPNGFAGPAGSAGQPGAKGEKGTKGPKGENGIVGPTGPVGAAGPSGPNGPPGPA
GSRGDGGPPGMTGFPGAAGRTGPPGPSGITGPPGPPGAAGKEGIRGPRGDQGPVGRTGEI
GASGPPGFAGEKGPSGEPGTTGPPGTAGPQGLLGAPGILGLPGSRGERGLPGIAGALGEP
GPLGIAGPPGARGPPGAVGSPGVNGAPGEAGRDGNPGSDGPPGRDGQPGHKGERGYPGNI
GPTGAAGAPGPHGSVGPAGKHGNRGEPGPAGSVGPVGAVGPRGPSGPQGIRGDKGEPGDK
GARGLPGLKGHNGLQGLPGLAGLHGDQGAPGPVGPAGPRGPAGPSGPIGKDGRSGHPGPV
GPAGVRGSQGSQGPAGPPGPPGPPGPPGVSGGGYDFGFEGDFYRADQPRSQPSLRPKDYE
VDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWKSDYYWIDPNQGCTMDAIKVY
CDFSTGETCIQAQPVNTPAKNAYSRAQANKHVWLGETINGGSQFEYNAEGVSSKEMATQL
AFMRLLANRASQNITYHCKNSIAYLDEETGRLNKAVILQGSNDVELVAEGNSRFTYTVLV
DGCSKKTNEWDKTIIEYKTNKPSRLPFLDIAPLDIGGTNQEFRVEVGPVCFK
NT seq 4119 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcgaactctgttgctgcttgcagtaacgtcgtgcctagca
acatgccaatctttacaaatgggatctgtacggaagggccccactggagacagaggaccg
cgtggacaaaggggcccagcaggtccccgaggcagagatggtgttgatggtcccgttggc
cctcctggtccccctggtgcccctggcccccctggtccccctggcccccctggtcttact
gggaactttgctgctcagtattctgacaaaggagtttcagctggccccggaccaatgggc
ttaatgggacccagaggccctcctggtgctgtcggagcccctggccctcaaggtttccaa
ggacctgctggtgagcctggtgagcccggacaaacgggtcctgcaggttcccgtgggcca
gctggccctcctggcaaggccggtgaagatggtcaccctggaaaacctggaagacctggc
gagagaggagtcgttgggccacagggtgctcgtggtttccctgggactcctggacttcct
ggcttcaaaggcattcgaggacacaacggtctggatggattgaaaggacagcctggtgcc
cagggtgtgaagggtgaacccggtgcccctggagagaatggaaccccaggtcaagcagga
gcccgagggcttcctggtgagagagggcgtgtcggagctcctggtccagctggtgcccgt
ggcagtgatggcagtgttggccccgtcggccctgctggtcccattgggtctgctggtcct
cctggtttcccaggtgctcctggtcccaagggtgaacttgggcctgttggtaaccctggc
cctgctggtcctgctggtccccgtggtgaagcgggtcttccgggtctttctggccccgtt
ggacctcctggcaaccctggagcaaatggcctcactggtgccaagggagccactggactc
cctggtgttgctggggctcctggtctccccggtccccgtggtattcctggccccgttggt
gctgctggtgctactggtcccagaggacttgttggtgaacctggtcctgctggctccaaa
ggagagactggtaacaagggtgagcctggctctgctggagcccaaggtcctcctggtccc
agcggtgaagaaggaaagagagggtcccctggagaacctggatctgctggtcctgcaggg
cctccagggctgagaggcagccctggttctcgaggtcttcctggagctgatggcagagct
ggtgtgatgggccctcctggcaatcgtggttcaactggccctgctggagtccgaggtcct
aatggagatgccggtcgccctggagaacctggtctcatgggacccagaggtcttcccggg
tctcctggaaatgttggcccagctggtaaagaaggccctgtgggtctccctggcatcgat
ggcagacctggcccaattggcccagctggaccaagaggtgaagctggcaacatcggattc
cctggacccaaaggcccctctggtgatcctggcaaacctggtgagaaaggccatcccggt
ctcgctggtgctcggggagctccaggtcccgacggcaacaatggtgctcagggccctcct
ggacctcagggtgttcaaggtggcaaaggtgaacagggccctgctggtcctccaggcttc
cagggtctcccgggtccctcaggtaccgctggagaagttggcaagccaggagaaaggggt
cttcccggtgaattcggtctccctggtcctgctggtccaagaggagaacgtggtccccca
ggtgagagtggagctgctggtccttctggtcctattggaagccgaggtccctctggagcc
ccagggcctgatgggaacaagggtgaagctggtgcagtcggtgctccaggctctgctggt
gcctctggtcctggtgggcttccaggagagaggggtgctgctggcatacctggaggcaaa
ggagaaaagggtgaaactggtctcagaggtgaaatcggcaaccctggtagagatggtgct
cgaggtgctcctggtgctataggtgcccctggtcccgctggagcctcaggtgaccggggc
gaagctggtgctgctggtccttctggccctgctggtcctcgaggtagccctggtgaacgt
ggtgaggttggtcctgctggccccaatggatttgctggtcctgctggttctgctggccaa
cctggtgctaaaggagaaaagggaaccaaagggcctaagggagaaaatggcatcgttggt
ccaaccggacctgtcggagctgctggcccatctggtcctaatggcccacctggccctgct
ggaagtcgtggtgatggagggccccctggtatgactggcttccctggtgctgctggacgg
actggtccccccggacccagtggtattactggcccccctggtccccctggtgctgctggt
aaagaaggtattcgtggtccccgtggtgaccaaggtccagttggccgaactggagaaata
ggtgcatctggtccccctggatttgctggcgagaagggtccttctggagaacctggtacc
acaggaccccctggcactgcaggtcctcagggtcttcttggtgctcccggtattctgggt
ctccctggctctcgaggtgaacgtggcctgccaggtattgctggtgctctgggtgaacct
ggtcctctgggcattgcaggccctcctggggcccgtggtccccctggtgctgtgggcagt
cctggagttaatggtgcccctggtgaagccggtcgtgatggtaaccctggcagtgatggt
cccccaggtcgtgatggtcaacccggacacaagggagagcgtggttaccctggcaacatt
ggtcccactggtgctgcaggtgctcctggtcctcatggttctgtgggtcctgctgggaaa
catggaaaccgtggtgagcctggtcctgctggttctgttggtcccgttggtgctgtcggt
ccaagaggtcctagtggtcctcaaggtattcgaggtgacaagggagagcctggtgacaaa
ggggccagaggtcttcctggcttaaaaggacacaatggattacagggtcttcctggtctt
gctgggctgcatggtgaccaaggagctcctggtcccgtgggtcctgctggtcccaggggt
cctgctggtccttctggccccattggcaaagatggtagatctggacatcctggaccagtc
ggtcctgctggcgttcgtggctctcagggtagccaaggtcccgctggtccccctggtccc
cctgggcctcctgggcctcctggtgtcagcggaggaggctatgactttggtttcgaagga
gacttctacagggctgaccagcctcgctcacagccttcactcagacccaaggactatgaa
gttgatgcaactctgaaatctctcaataaccaaatcgagacccttctcactcctgaaggc
tctagaaagaaccctgcccgcacatgccgtgacttaagactcagccacccagagtggaag
agcgattactactggattgaccctaaccaaggatgcactatggatgccatcaaagtgtac
tgcgatttctctactggtgaaacctgcatccaggcccaacctgtcaacaccccagccaag
aatgcatacagccgtgcccaggccaacaagcatgtctggttaggagagaccatcaatggt
ggcagccagtttgaatacaacgcagaaggggtgtcctccaaggaaatggcaactcagctc
gccttcatgcgcctgctagccaaccgtgcttctcagaacatcacctaccactgcaagaac
agcattgcgtacctggacgaggagacaggccgcctgaataaggctgtcattctgcagggc
tccaacgacgtcgaacttgttgctgagggcaacagcagattcacctacactgtccttgtc
gatggctgctccaaaaagacaaatgaatgggacaagacaatcattgaatacaaaacgaat
aagccatctcgcctgccattccttgacattgcacctctggacatcggtggtactaaccaa
gaattccgtgtggaggttggccctgtctgtttcaaataa

DBGET integrated database retrieval system