KEGG   Pan paniscus (bonobo): 100972953
Entry
100972953         CDS       T02283                                 
Symbol
SOS1
Name
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
pps  Pan paniscus (bonobo)
Pathway
pps01521  EGFR tyrosine kinase inhibitor resistance
pps01522  Endocrine resistance
pps04010  MAPK signaling pathway
pps04012  ErbB signaling pathway
pps04014  Ras signaling pathway
pps04062  Chemokine signaling pathway
pps04068  FoxO signaling pathway
pps04072  Phospholipase D signaling pathway
pps04150  mTOR signaling pathway
pps04151  PI3K-Akt signaling pathway
pps04510  Focal adhesion
pps04540  Gap junction
pps04630  JAK-STAT signaling pathway
pps04650  Natural killer cell mediated cytotoxicity
pps04660  T cell receptor signaling pathway
pps04662  B cell receptor signaling pathway
pps04664  Fc epsilon RI signaling pathway
pps04714  Thermogenesis
pps04722  Neurotrophin signaling pathway
pps04810  Regulation of actin cytoskeleton
pps04910  Insulin signaling pathway
pps04912  GnRH signaling pathway
pps04915  Estrogen signaling pathway
pps04917  Prolactin signaling pathway
pps04926  Relaxin signaling pathway
pps04935  Growth hormone synthesis, secretion and action
pps05034  Alcoholism
pps05160  Hepatitis C
pps05161  Hepatitis B
pps05163  Human cytomegalovirus infection
pps05165  Human papillomavirus infection
pps05200  Pathways in cancer
pps05205  Proteoglycans in cancer
pps05206  MicroRNAs in cancer
pps05207  Chemical carcinogenesis - receptor activation
pps05208  Chemical carcinogenesis - reactive oxygen species
pps05210  Colorectal cancer
pps05211  Renal cell carcinoma
pps05213  Endometrial cancer
pps05214  Glioma
pps05215  Prostate cancer
pps05220  Chronic myeloid leukemia
pps05221  Acute myeloid leukemia
pps05223  Non-small cell lung cancer
pps05224  Breast cancer
pps05225  Hepatocellular carcinoma
pps05226  Gastric cancer
pps05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pps00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100972953 (SOS1)
   04012 ErbB signaling pathway
    100972953 (SOS1)
   04014 Ras signaling pathway
    100972953 (SOS1)
   04630 JAK-STAT signaling pathway
    100972953 (SOS1)
   04068 FoxO signaling pathway
    100972953 (SOS1)
   04072 Phospholipase D signaling pathway
    100972953 (SOS1)
   04151 PI3K-Akt signaling pathway
    100972953 (SOS1)
   04150 mTOR signaling pathway
    100972953 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100972953 (SOS1)
   04540 Gap junction
    100972953 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100972953 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100972953 (SOS1)
   04660 T cell receptor signaling pathway
    100972953 (SOS1)
   04662 B cell receptor signaling pathway
    100972953 (SOS1)
   04664 Fc epsilon RI signaling pathway
    100972953 (SOS1)
   04062 Chemokine signaling pathway
    100972953 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100972953 (SOS1)
   04912 GnRH signaling pathway
    100972953 (SOS1)
   04915 Estrogen signaling pathway
    100972953 (SOS1)
   04917 Prolactin signaling pathway
    100972953 (SOS1)
   04926 Relaxin signaling pathway
    100972953 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    100972953 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100972953 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    100972953 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100972953 (SOS1)
   05206 MicroRNAs in cancer
    100972953 (SOS1)
   05205 Proteoglycans in cancer
    100972953 (SOS1)
   05207 Chemical carcinogenesis - receptor activation
    100972953 (SOS1)
   05208 Chemical carcinogenesis - reactive oxygen species
    100972953 (SOS1)
   05231 Choline metabolism in cancer
    100972953 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100972953 (SOS1)
   05225 Hepatocellular carcinoma
    100972953 (SOS1)
   05226 Gastric cancer
    100972953 (SOS1)
   05214 Glioma
    100972953 (SOS1)
   05221 Acute myeloid leukemia
    100972953 (SOS1)
   05220 Chronic myeloid leukemia
    100972953 (SOS1)
   05211 Renal cell carcinoma
    100972953 (SOS1)
   05215 Prostate cancer
    100972953 (SOS1)
   05213 Endometrial cancer
    100972953 (SOS1)
   05224 Breast cancer
    100972953 (SOS1)
   05223 Non-small cell lung cancer
    100972953 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100972953 (SOS1)
   05160 Hepatitis C
    100972953 (SOS1)
   05163 Human cytomegalovirus infection
    100972953 (SOS1)
   05165 Human papillomavirus infection
    100972953 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    100972953 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100972953 (SOS1)
   01522 Endocrine resistance
    100972953 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF SOS1_NGEF_PH PH PH_10 PH_19 IQ_SEC7_PH RHG20_PH Takusan
Other DBs
NCBI-GeneID: 100972953
NCBI-ProteinID: XP_003817680
Ensembl: ENSPPAG00000027483
Position
12:87342622..87487223
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISL
TKGTEEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSLSD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcctactctcgagtct
aatgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataaatgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgcagac
attttaaagctggttgggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatatattatctttaactgatgaagagccttccacctcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgcatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacagtagaaatgacagatgaaggcagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcgacctggttttcatgatcgtttccttagtcagttatcaaagcctggggca
gcactttatttgcagtcaataggcgaaggtttcaaagaagctgttcaatatgttttaccc
aggctgcttctggcccctgtttaccactgtctccattactttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgtaatgaatttataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcacgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacgtaccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactgaaaagatttagaaaagaa
tatatacagcctgtgcaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcatatcttttgcaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaaaatgattcgacataccaccaacctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaattattgagattctacaa
gtctttcaagagttgaacaactttaatggtgtccttgaggttgtcagtgctatgaattca
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccgatgggaaatagc
atggagaaggaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagcaatgataccgtc
tttatccaagttactctgccccatggcccaagatctgcttctgtatcatctataagttta
accaaaggcactgaggaagtgcctgtccctcctcctgttcctccacgaagacgaccagaa
tctgccccagcagaatcttcaccatctaagattatgtctaagcatttggacagtccccca
gccattcctcctaggcaacccacatcaaaagcctattcaccacgatattcactatcagac
cggacctctatctcagaccctcctgaaagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatggcaatgccttcttcccaaacagcccttccccctttacaccacctcct
cctcaaacaccttctcctcacggcacaagaaggcatctgccatcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcctgttcctccacgacaaagcacttctcaa
catatccctaaactccctccaaaaacttacaaaagagagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga

KEGG   Pan paniscus (bonobo): 100980038
Entry
100980038         CDS       T02283                                 
Symbol
SOS2
Name
(RefSeq) LOW QUALITY PROTEIN: son of sevenless homolog 2
  KO
K03099  son of sevenless
Organism
pps  Pan paniscus (bonobo)
Pathway
pps01521  EGFR tyrosine kinase inhibitor resistance
pps01522  Endocrine resistance
pps04010  MAPK signaling pathway
pps04012  ErbB signaling pathway
pps04014  Ras signaling pathway
pps04062  Chemokine signaling pathway
pps04068  FoxO signaling pathway
pps04072  Phospholipase D signaling pathway
pps04150  mTOR signaling pathway
pps04151  PI3K-Akt signaling pathway
pps04510  Focal adhesion
pps04540  Gap junction
pps04630  JAK-STAT signaling pathway
pps04650  Natural killer cell mediated cytotoxicity
pps04660  T cell receptor signaling pathway
pps04662  B cell receptor signaling pathway
pps04664  Fc epsilon RI signaling pathway
pps04714  Thermogenesis
pps04722  Neurotrophin signaling pathway
pps04810  Regulation of actin cytoskeleton
pps04910  Insulin signaling pathway
pps04912  GnRH signaling pathway
pps04915  Estrogen signaling pathway
pps04917  Prolactin signaling pathway
pps04926  Relaxin signaling pathway
pps04935  Growth hormone synthesis, secretion and action
pps05034  Alcoholism
pps05160  Hepatitis C
pps05161  Hepatitis B
pps05163  Human cytomegalovirus infection
pps05165  Human papillomavirus infection
pps05200  Pathways in cancer
pps05205  Proteoglycans in cancer
pps05206  MicroRNAs in cancer
pps05207  Chemical carcinogenesis - receptor activation
pps05208  Chemical carcinogenesis - reactive oxygen species
pps05210  Colorectal cancer
pps05211  Renal cell carcinoma
pps05213  Endometrial cancer
pps05214  Glioma
pps05215  Prostate cancer
pps05220  Chronic myeloid leukemia
pps05221  Acute myeloid leukemia
pps05223  Non-small cell lung cancer
pps05224  Breast cancer
pps05225  Hepatocellular carcinoma
pps05226  Gastric cancer
pps05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pps00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100980038 (SOS2)
   04012 ErbB signaling pathway
    100980038 (SOS2)
   04014 Ras signaling pathway
    100980038 (SOS2)
   04630 JAK-STAT signaling pathway
    100980038 (SOS2)
   04068 FoxO signaling pathway
    100980038 (SOS2)
   04072 Phospholipase D signaling pathway
    100980038 (SOS2)
   04151 PI3K-Akt signaling pathway
    100980038 (SOS2)
   04150 mTOR signaling pathway
    100980038 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100980038 (SOS2)
   04540 Gap junction
    100980038 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100980038 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100980038 (SOS2)
   04660 T cell receptor signaling pathway
    100980038 (SOS2)
   04662 B cell receptor signaling pathway
    100980038 (SOS2)
   04664 Fc epsilon RI signaling pathway
    100980038 (SOS2)
   04062 Chemokine signaling pathway
    100980038 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100980038 (SOS2)
   04912 GnRH signaling pathway
    100980038 (SOS2)
   04915 Estrogen signaling pathway
    100980038 (SOS2)
   04917 Prolactin signaling pathway
    100980038 (SOS2)
   04926 Relaxin signaling pathway
    100980038 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    100980038 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100980038 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    100980038 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100980038 (SOS2)
   05206 MicroRNAs in cancer
    100980038 (SOS2)
   05205 Proteoglycans in cancer
    100980038 (SOS2)
   05207 Chemical carcinogenesis - receptor activation
    100980038 (SOS2)
   05208 Chemical carcinogenesis - reactive oxygen species
    100980038 (SOS2)
   05231 Choline metabolism in cancer
    100980038 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100980038 (SOS2)
   05225 Hepatocellular carcinoma
    100980038 (SOS2)
   05226 Gastric cancer
    100980038 (SOS2)
   05214 Glioma
    100980038 (SOS2)
   05221 Acute myeloid leukemia
    100980038 (SOS2)
   05220 Chronic myeloid leukemia
    100980038 (SOS2)
   05211 Renal cell carcinoma
    100980038 (SOS2)
   05215 Prostate cancer
    100980038 (SOS2)
   05213 Endometrial cancer
    100980038 (SOS2)
   05224 Breast cancer
    100980038 (SOS2)
   05223 Non-small cell lung cancer
    100980038 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100980038 (SOS2)
   05160 Hepatitis C
    100980038 (SOS2)
   05163 Human cytomegalovirus infection
    100980038 (SOS2)
   05165 Human papillomavirus infection
    100980038 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    100980038 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100980038 (SOS2)
   01522 Endocrine resistance
    100980038 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF SOS1_NGEF_PH PH IQ_SEC7_PH PH_19 RHG20_PH
Other DBs
NCBI-GeneID: 100980038
NCBI-ProteinID: XP_034793406
Position
15:complement(51258304..51371362)
AA seq 1227 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIRRKKQAQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSNDDPPAIPPRQPPPPKVKPRVPVPTGAFMGLC
IVHLRHHQEILFLIPLHQFPFGLQNTL
NT seq 3684 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccttacgagttcttcagcgaggagaacagtccgaaatggcgg
ggactgttggtctcggccctgcggaaggttcaggaacaagtgcatcccactctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccagccaaggactgttcaagatgtagaggagcgagttcagaagacctttcctcac
ccaattgataaatgggccattgctgatgcacaatctgctatagaaaaacgaaaacgaaga
aatcctcttttactgcctgtggacaaaatccatccttcgttgaaggaagtattagggtac
aaagtggactaccatgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcggataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttctggtgaattaaactactat
gatcttgtcagaactgaaatcgcagaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgatagaaagctgtttaaaccttctgatatc
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatcacaggacatt
ctttcaccagagtttcatgaacatttcaataaattaatggccagacctgcagttgctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgtctt
atgctggtgccagtgtatcactgttggcactactttgagttactaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgtcgacctggagat
cctgtttgccctttttatagtcaccaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatattgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgccaaacatgaacggcatattttt
ctgtttgatggcttaatgatcagttgtaaacctaatcatggccagactcggcttccaggt
tacagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacaaatttgt
gataaagaagatacttgtgagtacaagcatgcatttgaattagtatccaaagatgagaac
agcataatatttgctgctaagtctgctgaagaaaaaaacaactggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtatatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggcatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttaccacgtatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacggtttgaaattccagagccagaacctactgatgcagacaaattggca
atagagaaaggcgagcagccaatcagtgcagaccttaaaagatttcgcaaggaatatgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagacttggaattgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaaaaaatgggtagagtcaatcgctaagatcatcaggaggaagaagcaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggagtctgatctctacaggaaagttcaaccgtct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtatca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaaaggaaaattttggac
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttactgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaacccaatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttttccttaaaatctcctggaataaggcct
aacacaggccgacatggctctacctcaggtactttacgaggtcacccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcatcttcgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagcccctgattcctcctcctcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctaatgatgatcctcctgctattccaccgagacagcct
cctcctccaaaggtaaaacccagagttcctgttcctactggtgcatttatgggcctctgc
atagtccacctccgccaccaccaagagatcctcttcctgatacccctccaccagttcccc
ttcggcctccagaacactttataa

DBGET integrated database retrieval system