ID E1ZEZ8_CHLVA Unreviewed; 1131 AA.
AC E1ZEZ8;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
GN ORFNames=CHLNCDRAFT_134093 {ECO:0000313|EMBL:EFN55582.1};
OS Chlorella variabilis (Green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=554065 {ECO:0000313|Proteomes:UP000008141};
RN [1] {ECO:0000313|EMBL:EFN55582.1, ECO:0000313|Proteomes:UP000008141}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NC64A {ECO:0000313|EMBL:EFN55582.1,
RC ECO:0000313|Proteomes:UP000008141};
RX PubMed=20852019; DOI=10.1105/tpc.110.076406;
RA Blanc G., Duncan G., Agarkova I., Borodovsky M., Gurnon J., Kuo A.,
RA Lindquist E., Lucas S., Pangilinan J., Polle J., Salamov A., Terry A.,
RA Yamada T., Dunigan D.D., Grigoriev I.V., Claverie J.M., Van Etten J.L.;
RT "The Chlorella variabilis NC64A genome reveals adaptation to
RT photosymbiosis, coevolution with viruses, and cryptic sex.";
RL Plant Cell 22:2943-2955(2010).
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family. MSH3
CC subfamily. {ECO:0000256|ARBA:ARBA00007094}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL433844; EFN55582.1; -; Genomic_DNA.
DR RefSeq; XP_005847684.1; XM_005847622.1.
DR AlphaFoldDB; E1ZEZ8; -.
DR STRING; 554065.E1ZEZ8; -.
DR GeneID; 17355057; -.
DR KEGG; cvr:CHLNCDRAFT_134093; -.
DR eggNOG; KOG0218; Eukaryota.
DR InParanoid; E1ZEZ8; -.
DR OMA; HIAPCEL; -.
DR OrthoDB; 168255at2759; -.
DR Proteomes; UP000008141; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF122; DNA MISMATCH REPAIR PROTEIN MSH3; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 4.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF53150; DNA repair protein MutS, domain II; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008141}.
FT DOMAIN 510..879
FT /note="DNA mismatch repair protein MutS core"
FT /evidence="ECO:0000259|SMART:SM00533"
FT DOMAIN 892..1059
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|SMART:SM00534"
FT REGION 1..97
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 247..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 633..658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 973..1012
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1068..1087
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 49..87
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 250..265
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 979..1001
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1131 AA; 119659 MW; BBFE37AB1DA191ED CRC64;
MAPKPKKQGA PSRGQRSITM FFPGGAGGSP GGEAPRSPPA KRPRTERKQQ QQPPPPPPDS
TAGGSEPSPP PAAEPPRPVP PRAPLPASDP AQAREAMQQK LALVGERRGA VARAAQPIEP
PQLTGGGGEA PKLTPLEQQI VALKRQHPGI LLCVEVGYKM KMFGHDAEVA SRELGVYAYQ
DRAFLCAGFP MLRLHVHVRR LVQAGHKVGV VRQVETAAIK KAGDNRAGPF ERQLTALYTR
ATLEAGELGG MADDGEEDGD EEEPEAAAAA ASAAAGGNEQ QGGAPGSPSR AAAPPAAPPR
RRWRQAAAQP AAPSSYLVCC VEEAVQDVGG AAVQAVDVGL VAVEPSTGSV LHAQFRDGLL
RQELECKLLF AAPAELVLAE PASAETRRLL GAIGQQLGAA PRLEAVPGAR YRQAGAALDA
VLAFYGRRDS TPGGGTGGEP APAAALEAVR ALPPLVARAL AHLLDYLRPF GLQSVLRLGA
SFQPWSGAQE MRLSANTLRQ LEIFRNVDGS PKGSLVSVMD HTKTAFGRRL LHSWVGKPLR
QRAAIEARLD AVQELAEAGG DHPVLCRLVS TLRKLPDLER GITRAFHATI RPSDLAALLH
QFSQLQEQLG VGAGAAAPGT SGDTGGSCCE GAGPVAAASP QPAGGGSEAE GGASGAAAPP
VALPGVRSQL LRQLLRAAAD PSVAAAARQM LDCMDLQAAA ANDKANVLKP SERFPEVLRW
REEAAAAERA LQGLLPQYRR ALQISTLQFQ TLQNQGEYLV EVPTDLVKRV PKDWQKVSSV
KKGVRYHPPE VTAGLQRLAL AREHLKAACD AAWRQFLLDF GSHYLGFRAA VQAAAALDCL
CSLATLASSV GYCRPQLAGE GEAAQVHIEA GRHPVLELLL EGRGQQVVPN DTHLQVQCAL
IAIMAQVGSY VPAAAARLRP LDGVFTRMGA SDCLALGRST FAEELGYPDV CFRLQQALPE
AVACYRMAHV EEEGEQQQPE QQPEQQQQPE QPEQQVGQEQ EQEQEQEQRQ RRRRQAALGR
VVFLFKLVPG VAPASFGINV ARMAQLPDTV LQRAAEVAAR VQDSAERRRR QQAAAAAGGG
GGCGGGAAEE ERAAQLARQC RQFLQLSLSA GRVDVEAGRQ LQQQARALLA G
//