ID A0A1E4S6N4_CYBJN Unreviewed; 889 AA.
AC A0A1E4S6N4;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=DNA mismatch repair protein {ECO:0000313|EMBL:ODV75181.1};
DE Flags: Fragment;
GN ORFNames=CYBJADRAFT_135303 {ECO:0000313|EMBL:ODV75181.1};
OS Cyberlindnera jadinii (strain ATCC 18201 / CBS 1600 / BCRC 20928 / JCM 3617
OS / NBRC 0987 / NRRL Y-1542) (Torula yeast) (Candida utilis).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Phaffomycetaceae; Cyberlindnera.
OX NCBI_TaxID=983966 {ECO:0000313|EMBL:ODV75181.1, ECO:0000313|Proteomes:UP000094389};
RN [1] {ECO:0000313|EMBL:ODV75181.1, ECO:0000313|Proteomes:UP000094389}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 18201 / CBS 1600 / BCRC 20928 / JCM 3617 / NBRC 0987 /
RC NRRL Y-1542 {ECO:0000313|Proteomes:UP000094389};
RX PubMed=27535936; DOI=10.1073/pnas.1603941113;
RA Riley R., Haridas S., Wolfe K.H., Lopes M.R., Hittinger C.T., Goeker M.,
RA Salamov A.A., Wisecaver J.H., Long T.M., Calvey C.H., Aerts A.L.,
RA Barry K.W., Choi C., Clum A., Coughlan A.Y., Deshpande S., Douglass A.P.,
RA Hanson S.J., Klenk H.-P., LaButti K.M., Lapidus A., Lindquist E.A.,
RA Lipzen A.M., Meier-Kolthoff J.P., Ohm R.A., Otillar R.P., Pangilinan J.L.,
RA Peng Y., Rokas A., Rosa C.A., Scheuner C., Sibirny A.A., Slot J.C.,
RA Stielow J.B., Sun H., Kurtzman C.P., Blackwell M., Grigoriev I.V.,
RA Jeffries T.W.;
RT "Comparative genomics of biotechnologically important yeasts.";
RL Proc. Natl. Acad. Sci. U.S.A. 113:9882-9887(2016).
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|RuleBase:RU003756}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|ARBA:ARBA00006271, ECO:0000256|RuleBase:RU003756}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV453926; ODV75181.1; -; Genomic_DNA.
DR RefSeq; XP_020072220.1; XM_020213129.1.
DR AlphaFoldDB; A0A1E4S6N4; -.
DR STRING; 983966.A0A1E4S6N4; -.
DR GeneID; 30987525; -.
DR OMA; LVRFPQK; -.
DR OrthoDB; 168255at2759; -.
DR Proteomes; UP000094389; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd03285; ABC_MSH2_euk; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR011184; DNA_mismatch_repair_Msh2.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR001079; Galectin_CRD.
DR InterPro; IPR032642; Msh2_ATP-bd.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF35; DNA MISMATCH REPAIR PROTEIN MSH2; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF005813; MSH2; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR PROSITE; PS51304; GALECTIN; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|RuleBase:RU003756};
KW DNA repair {ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU003756};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|RuleBase:RU003756}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000094389}.
FT DOMAIN 1..90
FT /note="Galectin"
FT /evidence="ECO:0000259|PROSITE:PS51304"
FT REGION 842..864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 460..487
FT /evidence="ECO:0000256|SAM:Coils"
FT NON_TER 889
FT /evidence="ECO:0000313|EMBL:ODV75181.1"
SQ SEQUENCE 889 AA; 99911 MW; 9E552DCA1AD821A8 CRC64;
MSSTRPELKF SDTADERSFY RKFQKLEPKQ ENTIRIVDKG DYYSVFAEDA RFVAELIYRT
TSVIKEAQGV EYVTVSPAVL SNLLQQCLFE KGLKIEFYDK AWNLIKFASP GNIEAVEDLF
NSSEVESALV VVSLKVSNKA DGKTIGFCFV DTNAKEISVS EFLDNDLYSN LESFLIQIDA
KEVIIQQPNS IEDPDFVKLA GLIERCGPRI TQVKPSDFNT NDVEQDLTRL VGDDLALSVG
EEAKSIIGLG AAAAIIRYLG LLTDDSNFGA FKLKPHALNQ FMKLDSSAVK SLNLLPASKT
NNSGKNSSVF DLLNHCKSVG GVRLLHQWIK QPLVDVDDIT ARHQLVELLI EDTEMRSTLQ
SDLLPSIPDI RRLNKKLAKS KHANLEDVVR IYQFLIKVAD IIDLLESKQN SITDEELQIL
VDVTWTTEIK RCYEPLMKLQ EMVETTVDLD SLERHEFVIK PDYDEQLLEY RQRLDNIEDE
IRSIHQEVAQ ELGLDPDKKL KLELHPNHGW CMRLTRTEER SIRGKPEFIE LQTVKAGVFF
TTETMREISA ESIDIQHKYA RQQSSLVKEI VSITVTYAPV LESLSLVLAN LDVIVSFAHV
SAYAPVAYVR PKMHGLNSNV GRTILKEARH PCVEMQDEVT FIANDYELIK SETEFLIITG
PNMGGKSTYI RQLGVISLMA QIGCFVPASE AELCVIDAIL ARIGAGDSQL KGVSTFMVEM
LETASILKTA TTNSLIIIDE LGRGTSTYDG FGLAWAISEH ISQNLHCFTL FATHFHELTR
LSETVPTVKN LHVVAHVGES HNTEDITLLY KVEPGISDQS FGIHVAEVVK FPHKIISMAK
RKASELEGEN DNKDDPYVSD KRTKCTSKEI EEGSKLLRKI LKQWRATVD
//