KEGG   Danio rerio (zebrafish): 432372
Entry
432372            CDS       T01004
Symbol
cpsf1
Name
(RefSeq) cleavage and polyadenylation specificity factor subunit 1
KO
K14401  cleavage and polyadenylation specificity factor subunit 1
Organism
dre  Danio rerio (zebrafish)
Pathway
dre03015  mRNA surveillance pathway
Brite
KEGG Orthology (KO) [BR:dre00001]
 09120 Genetic Information Processing
  09122 Translation
   03015 mRNA surveillance pathway
    432372 (cpsf1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03021 Transcription machinery [BR:dre03021]
    432372 (cpsf1)
   03019 Messenger RNA biogenesis [BR:dre03019]
    432372 (cpsf1)
Transcription machinery [BR:dre03021]
 Eukaryotic type
  RNA polymerase II system
   Other transcription-related factors
    Transcription termination factor
     432372 (cpsf1)
Messenger RNA biogenesis [BR:dre03019]
 Eukaryotic type
  mRNA processing factors
   3' end processing
    Cleavage and polyadenylation specificity factor (CPSF) complex
     432372 (cpsf1)
Motif
Pfam: CPSF_A MMS1_N UspB
Other DBs
NCBI-GeneID: 432372
NCBI-ProteinID: NP_001108153
ZFIN: ZDB-GENE-040709-2
Ensembl: ENSDARG00000034178
UniProt: A0A8M1NHR0
Position
19:complement(11897631..11950455)
AA seq 1449 aa
MYAVYRQAHPPTAVEFAVYCNFISSQEKNLVVAGTSQLYVYRIIYDVESTSKSEKSSDGK
SRKEKLEQVASFSLFGNVMSMASVQLVGTNRDALLLSFKDAKLSVVEYDPGTHDLKTLSL
HYFEEPELRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDTLADEQEGIVGEG
QKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSI
VAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSL
NSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAF
HFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQPPN
KKKRVDSNWAGCPGKGNLPDELDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCASASM
GEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIY
CEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTSG
FATQGPTVYAGNIGDNKYIIQVSPMGIRLLEGVNQLHFIPVDLGSPIVHCSVADPYVVIM
TAEGVVTMFVLKNDSYMGKSHRLALQKPQIHTQSRVITLCAYRDVSGMFTTENKVSFLAK
EEIAIRTNSETETIIQDISNTVDDEEEMLYGESNPLTSPNKEESSRGSAAASSAHTGKES
GSGRQEPSHWCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSASQSATQGELKK
EEVTRQGDIPLVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKK
MPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWML
VTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVR
KIPLRCTVHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKF
SIQLISPVSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVT
CRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWS
LKDNDLTGMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLE
VYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRM
PCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHA
GLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLL
EIERVTAHF
NT seq 4350 nt
atgtacgcggtgtaccgtcaggctcacccgcccaccgcagtggagtttgccgtttactgc
aacttcatctccagccaagagaaaaacttggtggtggcaggaacatctcagctgtacgtc
tacaggatcatttatgatgtggagagcacttccaagtctgaaaagtcatcagatggcaag
agtcgtaaagagaaactggagcaggtggcatctttctctctctttggaaatgtgatgtca
atggccagcgttcagctggtgggcaccaacagagacgccttgcttctcagcttcaaagat
gccaagctgtcagtagtggagtatgaccctggcacgcatgatctcaagactctgtcgctg
cattattttgaggaaccggagttgagagatgggtttgttcagaatgtgcacattcctatg
gttcgagtggatccagagaatcgatgtgctgtcatgctggtgtacggcacttgcctggtg
gttctgcctttcaggaaggacacgcttgctgacgaacaggaggggattgtgggagagggg
cagaaatctagtttcctacccagctacatcatcgatgttcgtgaactggatgagaaactc
ctaaacatcatcgatatgaagtttctccatggctactatgagcccacactgcttattctg
ttcgagcccaatcaaacatggccagggcgtgtggcggtgcgtcaggacacttgcagtatt
gtggcgatctctctaaacatcatgcagaaggttcatccagtcatctggtctctgagtaac
ctgccctttgactgcaatcaggtcatggctgtccccaaacccatcggtggagttgtggtg
tttgctgtgaattcgttgctgtatcttaatcagagtgttcctccgtttggagtgtctctc
aacagtctgactaacggaaccacagctttccctttgagacctcaggaggaagtaaagatc
accctggattgttctcaagcctccttcatcacctctgacaagatggtcatctcactaaaa
ggaggagaaatttatgtgttgacgctcatcactgatggcatgagaagtgttcgtgcgttt
cactttgataaggctgctgccagtgtcctgactacctgtatgatgactatggagccaggc
tatctgtttttgggctctcgcctgggaaactcactgctgctcaggtacactgagaaactc
caggaaacacccatggaggagggcaaagagaatgaggagaaggagaaacagcctccaaat
aagaaaaaacgtgtggattccaactgggcagggtgtccaggaaagggcaatctacctgat
gagctggatgaaattgaggtgtatgggagtgaagctcagtcaggcactcagctggccaca
tactcatttgaggtttgtgacagcatactaaacattggaccctgtgccagtgcctccatg
ggagaaccagcttttctgtctgaagagtttcaaaccaaccctgagccagatctggaggtg
gtggtgtgttcaggatatggcaagaatggagcactgtctgtattacagaaaagcatcaga
ccacaggtggtgacgacgttcgaacttcctggatgccatgacatgtggacagtcatatac
tgtgaggagaaacccgaaaagccttcagctgagggtgatggagagagtcctgaggaggag
aagcgcgagcccacaatagaggatgacaagaagaaacacggcttcctcatcctcagcaga
gaagactccaccatgattctccagacgggtcaagagatcatggaactggacaccagtgga
ttcgccacacagggtcctacagtttacgcgggcaacatcggagacaataaatacatcatc
caggtgtcccccatgggcatcagactgctcgagggagttaatcagctccactttatccct
gtggacctgggctctcccatagtgcactgctctgtggccgatccttacgtggtcatcatg
acggcagagggagtggtcactatgtttgtcttgaagaatgactcgtatatgggaaagagt
caccgcctggcacttcagaaaccacagatacacacgcaatcacgtgttattacgttatgt
gcatatcgcgatgtcagtggtatgttcacaactgagaacaaggtcagcttcctggcaaaa
gaggaaattgctatcagaaccaactcagagacagaaaccatcatccaggacatcagtaat
acagtggatgatgaagaggagatgctgtacggggagtcaaaccccctgacgagccccaac
aaagaagagtccagccgtggctctgcagcagcaagctccgctcacacaggaaaagagagc
ggctccggcagacaggagccctcacactggtgcctgctggttagagagaatggagtcatg
gagatctaccagcttccagactggcgtttggttttcctggtgaaaaatttcccagtcggt
cagcgtgttttggttgacagctctgccagccagtcagctactcaaggggagcttaaaaaa
gaggaagtgacgaggcaaggtgacattccattggtcaaagaggttgctcttgtgtcatta
ggctacaaccacagccgtccctaccttctggctcatgttgaacaagagcttctcatctat
gaggcctttccgtacgaccagcagcaagctcaaagcaacctgaaagtgcgcttcaagaag
atgcctcataacattaactacagagagaagaaggtcaaagtgaggaaggacaagaagcct
gagggtcagggagaggacactctcggcgtcaaaggtcgtgtggccagattcagatacttc
caggacatctctggatactcaggggtgtttatctgtggtccctctccacactggatgttg
gtgacctctcgcggggcgatgcgcctccaccccatgacaatagatggagccattgagtct
ttctctccctttcacaatatcaactgccccaaaggatttctctacttcaacaaacagggt
gagctgaggatcagtgtgttgccaacatacctctcttatgatgctccctggcctgtgcgc
aagatccctcttaggtgcaccgtccactatgtatcataccatgtggaatccaaggtgtat
gcagtgtgcaccagtgttaaagagccgtgcacgcgcatccccagaatgactggagaagag
aaggaatttgagaccattgaaagagacgagcgatatattcaccctcaacaggacaagttc
tcaattcagctcatctctccagtgagctgggaggcgattccaaacacacgcgttgacctt
gaggagtgggagcatgtgacgtgcatgaagacggtggctctgaagagtcaggagacagtg
tcaggactaaagggatacgtggctcttgggacgtgtctcatgcagggagaagaggtcacc
tgtagaggcagaatcctgattctggatgtgatagaggtggtcccagagcccggacagccc
ctcaccaagaacaagtttaaagtgctgtatgaaaaggagcagaaaggtcccgtcactgct
ctttgtcactgcagtggattcctggtgtctgctattggacagaagatcttcctgtggagt
ctgaaggataatgatctgacgggaatggcattcattgacacacagctctacattcatcag
atgtacagcatcaagaacttcatcctcgcagcagatgtgatgaagagcatctctctgctg
cgctaccagccggagagcaagacgctttcactcgtcagcagggatgcaaagccccttgag
gtttacagcatagagtttatggtagacaataaccagcttggttttttggtttcagacaga
gacaaaaaccttatggtctacatgtacctgcctgaagccaaagagagttttggaggcatg
cgtttgttgcggagagcagattttaacgtgggatcgcatgtgaacgccttctggaggatg
ccatgccgcgggacactggacacagccaacaagaaggccctcacctgggacaacaaacac
atcacctggtttgctacgctggatggtggtgtcggtctgcttttaccaatgcaggagaag
acgtaccgcagactgctgatgttgcagaacgctctcaccaccatgctgccacatcatgcc
gggctcaatccaaaggctttcaggatgctgcactgtgaccggcgaacgctccagaatgct
gtcaagaacatcctggatggagagctgctcaataaatacctgtatctcagtaccatggag
agaagtgaactggccaagaagatcggcacgacccctgacattatcttggatgatcttctg
gagattgaaagagtcactgctcacttctga
