KEGG   Theobroma cacao (cacao): 18610643
Entry
18610643          CDS       T02994                                 
Name
(RefSeq) DNA mismatch repair protein MLH3
  KO
K08739  DNA mismatch repair protein MLH3
Organism
tcc  Theobroma cacao (cacao)
Pathway
tcc03430  Mismatch repair
Brite
KEGG Orthology (KO) [BR:tcc00001]
 09120 Genetic Information Processing
  09124 Replication and repair
   03430 Mismatch repair
    18610643
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03400 DNA repair and recombination proteins [BR:tcc03400]
    18610643
DNA repair and recombination proteins [BR:tcc03400]
 Eukaryotic type
  SSBR (single strand breaks repair)
   MMR (mismatch excision repair)
    MutL homologs
     18610643
Motif
Pfam: DNA_mis_repair MutL_C HATPase_c_3 HATPase_c
Other DBs
NCBI-GeneID: 18610643
NCBI-ProteinID: XP_007046491
Position
1:240112..248761
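
The Position field can be split into a sequence identifier and a coordinate range. The following is a minimal sketch, assuming the simple `id:start..end` form shown above with 1-based inclusive coordinates; the function name `parse_position` is hypothetical, and forms such as `complement(...)` or `join(...)` are deliberately not handled.

```python
def parse_position(position: str):
    """Split a simple 'id:start..end' position string.

    Assumption: coordinates are 1-based and inclusive, and the string
    contains no complement(...) or join(...) wrappers.
    """
    seq_id, _, coords = position.partition(":")
    start_text, _, end_text = coords.partition("..")
    start, end = int(start_text), int(end_text)
    # Inclusive span length on the genomic sequence.
    span = end - start + 1
    return seq_id, start, end, span


seq_id, start, end, span = parse_position("1:240112..248761")
print(seq_id, start, end, span)
```

Under these assumptions the genomic span is longer than the 3657 nt coding sequence listed in this entry, which is what one would expect if the gene contains introns.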
AA seq 1218 aa
MGSIKPLPEAVRSSVRSAIILFDLTRVVEELIFNSLDASASKVSVFVSVGSSYVKVVDDG
SGISRDGLVSLGERYVTSKLYHLGDLDAASRSFGFRGEALASISDVALVEIITKAYGKPN
GYRKVIKGSKCLYLGIDDDRKDAGTTVVVRDLFYNQPVRKKHMQSCPKKVLHSVKKCVFR
MALVHPMVYFNVIDIESEDELLSTHPSSSPLSLLMSGFGIEDCTSLQKLNADDGSLKLSG
YITGSWDNFAVKAFQFVYINSRFVCKGPIHKLLNNLATSFESLDSKKANNWTKKGKRSRP
QVFPSYILNISCPPSFYDLTLEPSKTYVEFKDWASILTLIEKTIQHLWRKNICRANGLGQ
AETLKEDDNILHVEEDFFDEGPSVDSEFATRKRWTQKYRPSSSLEKLTTDHLFLTDHEDI
PFEECHVNNAQFRDQQNNMKFVHWTDYSFQSWDDSLVKGTSSVFERSDCCLLTTNNNSLV
EDYFLENRFTASGRSNCHVNNNGICSKLGNASDVVESDVTNGTDRNIFPFDYHEHYNDSQ
FRKNISKPFLQSCSSQRTLPLDRELVESEKGIEPPMDSFKTKAKQVCSNERFNMLKTDSS
DQTMWQDGGPCGQIYPKLVSKGGIARDLDVLTRASAKSFLSCGDVSIEENGLPSDSVTPI
EKTGSGHQSLSSEWCSGTSNPFEQFSYKNPIEGCFRSEERTNFGHFSAGEDEDYQFSFDL
ISRSSSQEKCIYDCPNTGLEIDYAKSSRDFHGFLQQYNLNHTFSPEDSNVAIEERDWLCT
DSSINEYKRQIDWFQYQDVEQNPIPKERARRSQSAPPFCSYKRRFISLHHCLASGEPTFS
EVRGPFTSPEIGEKKPPQQSSGVDNLHFEPSFGKNRSNMNNKPNMVFSTVVRKCEDIEQP
HCLEGPESAPVQVFISKGNQDPANSGTKWRSGFAQNTSNSKLCDIDYEYNVLDIASGLPF
VATKSLVPESINKNCLRDAKVLQQVDKKFIPIVAGGTLAIIDQHAADERIQLEELRQKVL
SGKGKTVTYLDTEQELILPEIGYQLLHNYSEQIRNWGWICDIHTQDSKPFKKNLNLIRRK
PAVVKLLAVPCILGVNLSHVDLLEFLQQLADTDGSSTMPPSIIRILNSKACRGAIMFGDS
LLPSECSLIVEELKQTSLCFQCAHGRPTTVPVVKLEALHRQIAKMQMKDGGPRELWHGLC
RHRVSLERASLRLSAAGG
NT seq 3657 nt
atggggagcattaagcccttgccagaggctgttcgtagttcggtgcgttctgccattata
ttgtttgacttgactagggttgtggaggagctcattttcaacagcctcgatgcttctgct
tcaaaggtgtcagtctttgtaagtgtcgggagcagctatgtcaaagtggtggatgatgga
tctggtatatctcgtgatggattggtgtcactgggagaaagatatgtaacatcaaagctt
taccatctgggtgatttggatgctgccagcaggagctttggctttcggggagaagcactg
gcttctatatctgatgtagccttggtggaaataataacaaaagcttacggaaagccaaat
gggtaccgcaaggtcattaagggatccaagtgtttgtatcttggaattgatgatgatagg
aaagatgcaggtacaacagttgtcgtgcgtgatttattttacaaccaacctgttcggaag
aagcatatgcaatcctgccctaagaaggtgttgcactcagttaaaaagtgcgtattcaga
atggcccttgtgcacccaatggtttacttcaatgtgattgatattgaaagtgaggatgag
cttctcagtacgcatccttcctcttctcctttgtcacttttaatgagtggttttgggatt
gaggactgtacctctctgcagaagctgaatgctgatgatggttccctcaagctttctggc
tacataactggctcctgggacaattttgctgttaaggcctttcaatttgtttatatcaat
tcaaggtttgtctgcaagggtcccattcataagttgctgaacaacttggccactagtttt
gagtctttagattcaaagaaggctaacaactggaccaagaaaggaaagaggagtagacct
caagtatttccgtcctacatactgaatattagttgccctccttctttctatgatttaacc
ttagaaccatcaaagacatatgttgaattcaaggattgggcatctatacttaccttaatt
gagaagacaattcaacacctctggaggaaaaatatttgtcgtgccaatggattaggacaa
gctgaaactttgaaggaagatgacaatatcttacatgtggaagaagatttttttgatgaa
ggaccatctgtggactcagaatttgcaacaaggaaacgttggactcaaaaatatcggcct
tcttcttcattagagaagctaacaacagatcatttgtttcttacagaccatgaagatatt
ccatttgaggagtgccatgtgaataatgcacaatttagagatcaacaaaacaatatgaaa
tttgttcattggactgactattcttttcaaagttgggatgattcccttgtcaaaggcaca
tcctcagtatttgaaaggagtgattgttgtcttttgacaactaataacaattctttagtt
gaggattacttcttggaaaatagattcactgcttcaggaagatcaaactgtcatgtgaac
aacaatggtatatgttcaaagttaggtaatgcatccgatgtggttgagagtgatgtgacc
aatggaacagataggaacatatttccttttgattatcatgaacattacaatgactcacag
ttcagaaagaatatcagcaagccttttctgcaaagttgctcctcccaaagaaccttgcca
cttgacagggagttggttgaaagtgagaaaggaattgaaccaccaatggatagctttaag
accaaagcgaagcaggtttgctcaaatgaaaggttcaatatgctgaaaactgattccagt
gatcagaccatgtggcaggatggaggaccatgcggtcaaatttatcccaaacttgtaagt
aaaggtgggattgctagagatttggatgttctaacaagggcttctgccaaatcgttcctg
tcatgtggagatgtctctattgaagagaatggccttccatctgattcagtcacaccaata
gaaaaaactggctctggtcatcagtccttaagttctgaatggtgttcaggaacctctaat
ccctttgagcagttcagttataaaaatccaattgaagggtgcttcagatctgaagaaagg
accaactttgggcatttctctgctggtgaagatgaggactaccaatttagctttgaccta
atctcaaggagctccagccaagaaaaatgcatctatgattgtccaaacactggactagaa
attgactatgccaaatctagtagagattttcatggattccttcaacaatacaatctaaat
catacattttctccagaagattccaatgtagcaattgaagagagagactggttgtgtaca
gactcaagtattaatgaatataaaagacaaatcgattggtttcaatatcaagatgttgaa
caaaatcctattcctaaagaaagagcaagaagaagccagtcagctcctccattttgcagc
tacaagaggaggtttatctccttacatcattgtttggcatcaggggaacccacttttagt
gaagtccgtggtccattcacttctccagagattggtgagaagaagcctccccaacaatct
tctggtgtggacaatctacattttgaaccaagttttggaaagaatagatcaaatatgaat
aacaagccaaacatggtgttcagcactgtagttcgaaaatgtgaagacattgaacaacct
cattgcctagagggtcctgaatcagctccggtgcaagtatttatctcaaagggaaatcag
gatccagcaaattctggaaccaaatggcggagtggttttgcacagaatacaagcaacagc
aaattatgtgatattgactatgaatataatgtacttgacattgcgtccggattgcccttt
gttgccactaaatcattggttcctgaatctatcaataagaattgtctcagagatgccaag
gttctgcaacaggtggataagaaattcatcccaattgtagctggcggaacacttgctatt
attgatcagcatgcggcagatgaaagaattcaactagaagaacttcgacaaaaggtttta
tctgggaaagggaagacagtcacctatttggatacagagcaagagctgatcctgccagag
attggctatcagttactgcacaattattctgaacaaataagaaattggggttggatctgt
gacattcacacccaagattcaaagcccttcaagaagaatttgaaccttattcgtcgtaag
ccggctgttgtcaaacttcttgcagtaccttgcattttaggtgtcaatttatctcatgtt
gatctcctggaatttctacaacagcttgctgatacagatggatcatcaacaatgcctcca
tcaattattcgaattcttaattctaaagcatgcagaggtgcaattatgtttggagactcc
ttgctaccttcagaatgttccttaattgttgaagagctgaagcagacgtccctgtgcttc
caatgtgctcatgggcgaccaaccactgtcccggttgtgaagttggaggcattgcatagg
cagatagctaaaatgcaaatgaaggatggtggtccaagggaattgtggcacgggctatgt
cgacacagagtcagccttgaacgagccagcttgcgcttaagtgcagctggaggttag
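
The AA and NT lengths stated in this entry can be checked against each other. The sketch below assumes the NT sequence is a complete coding sequence that includes its stop codon; the helper names are hypothetical, and the two strings are the first and last NT lines copied from this entry rather than the full sequence.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def expected_cds_length(aa_count: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_count residues, plus an optional stop codon."""
    return 3 * aa_count + (3 if include_stop else 0)


def has_start_codon(nt: str) -> bool:
    """True if the sequence begins with the standard ATG start codon."""
    return nt.lower().startswith("atg")


def has_stop_codon(nt: str) -> bool:
    """True if the sequence ends with one of the standard stop codons."""
    return nt.lower()[-3:] in STOP_CODONS


# First and last NT lines of this entry (not the full sequence).
first_line = "atggggagcattaagcccttgccagaggctgttcgtagttcggtgcgttctgccattata"
last_line = "cgacacagagtcagccttgaacgagccagcttgcgcttaagtgcagctggaggttag"

print(expected_cds_length(1218) == 3657)
print(has_start_codon(first_line), has_stop_codon(last_line))
```

Both checks pass for the values in this entry: 1218 codons plus one stop codon account for the 3657 nt, and the sequence starts with `atg` and ends with `tag`.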
