ID W1QL20_OGAPD Unreviewed; 938 AA.
AC W1QL20;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 41.
DE SubName: Full=DNA mismatch repair protein MSH2 {ECO:0000313|EMBL:ESX03021.1};
GN ORFNames=HPODL_02329 {ECO:0000313|EMBL:ESX03021.1};
OS Ogataea parapolymorpha (strain ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL
OS Y-7560 / DL-1) (Yeast) (Hansenula polymorpha).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Pichiaceae; Ogataea.
OX NCBI_TaxID=871575 {ECO:0000313|EMBL:ESX03021.1, ECO:0000313|Proteomes:UP000008673};
RN [1] {ECO:0000313|EMBL:ESX03021.1, ECO:0000313|Proteomes:UP000008673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1
RC {ECO:0000313|Proteomes:UP000008673};
RX PubMed=24279325; DOI=10.1186/1471-2164-14-837;
RA Ravin N.V., Eldarov M.A., Kadnikov V.V., Beletsky A.V., Schneider J.,
RA Mardanova E.S., Smekalova E.M., Zvereva M.I., Dontsova O.A., Mardanov A.V.,
RA Skryabin K.G.;
RT "Genome sequence and analysis of methylotrophic yeast Hansenula polymorpha
RT DL1.";
RL BMC Genomics 14:837-837(2013).
RN [2] {ECO:0000313|Proteomes:UP000008673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1
RC {ECO:0000313|Proteomes:UP000008673};
RA Ravin N.V., Mardanov A.V., Eldarov M.A., Kadnikov V.V., Beletsky A.V.,
RA Zvereva M.I., Smekalova E.M., Dontsova O.A., Skryabin K.G.;
RT "Genome sequence of the methylotrophic yeast Hansenula polymorpha DL1.";
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|RuleBase:RU003756}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|ARBA:ARBA00006271, ECO:0000256|RuleBase:RU003756}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESX03021.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEOI02000003; ESX03021.1; -; Genomic_DNA.
DR RefSeq; XP_013937432.1; XM_014081957.1.
DR AlphaFoldDB; W1QL20; -.
DR STRING; 871575.W1QL20; -.
DR GeneID; 25771782; -.
DR KEGG; opa:HPODL_02329; -.
DR eggNOG; KOG0219; Eukaryota.
DR HOGENOM; CLU_002472_10_0_1; -.
DR OMA; LVRFPQK; -.
DR OrthoDB; 168255at2759; -.
DR Proteomes; UP000008673; Chromosome I.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR011184; DNA_mismatch_repair_Msh2.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF35; DNA MISMATCH REPAIR PROTEIN MSH2; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF005813; MSH2; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|RuleBase:RU003756};
KW DNA repair {ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU003756};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|RuleBase:RU003756}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008673}.
FT DOMAIN 749..765
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|PROSITE:PS00486"
FT COILED 551..578
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 938 AA; 104654 MW; BD712C9A097CBD59 CRC64;
MSSIRPELKF ADDRDQSSFY SKYFRLPPKS ARTLRVVDKG EYYIVLDEDA ELVADLIYKT
QSVVKTATAE KRTVQYITLS PAVFANLLKL VVVDSGHKLE IYSKNWDNMR TASPGNLSEI
EELINTADLN AVSILAALKL VSSSSEGKKL GLSFYDPNAK ILGVTEFYDN DLFSNLEALL
IQTGVKECLV PASASSDPDM DKIKQLIDRC DIVVSEGRSA DFSDKNIEQD VARLTGNELT
LSANELSSLH VGLACCNAIL VYLSLLADQS NFGSINVVKY DLEQFMKLDY AAVRATNLFP
PPNYNNTMNK TSSLFGLLNN CKTVGGTRLL SQWLKQPLVD VQEIQNRHSI VGHLIDDLNL
RESLQTQFLN EVPDISRLVK RLANPRGTKS LDDVIRLYQL CIRLPDLLDF LGTSMDSLEP
ENAVRKLFQE FWIEPIAQYA GALSKFQEMV ETTVDLESLD NASSAQGSMV AINPEFDASL
MEISHKIEQV KSQMRHEHEL AGEDLGMELD KKLKLEIHHV HGWCFRLTRN DSSCIRGKKK
YRELSTVKAG VYFTTSELSQ LNSEVKSLEE QYDNGQSEVV KEIVTITATY SSIFLKLSIE
LSKLDVLVSF AHTCAFAPVP YTRPEKIHGL GSPERRVRLR EARHPCLEQQ DGLSFIPNDI
NFCRDSEEFL IITGPNMGGK STYIRTMGVI ALMNQIGCYV PAGEGAELCI FDSVLARVGA
SDSQLKGVST FMAEMLEMSS ILKTATSNSL IIVDELGRGT STYDGFGLAW AISEHICKEI
RAFTLFATHF YELTALADKY TAVKNLQVVA HTDTSSDAKN ITLLYKVEPG VSDQSFGVHV
AEIVKFPRKI IEMAKRKASD LDDINKRTAK QTRSGSPEIL EANEVLKTAL KKWKKRVEER
GGLESLGPDE AEAELRKVVE SDFQKDFQSG VLQEMLKL
//