ID I3LHZ9_PIG Unreviewed; 1333 AA.
AC I3LHZ9;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
GN Name=MSH6 {ECO:0000313|Ensembl:ENSSSCP00000023698.2,
GN ECO:0000313|VGNC:VGNC:90420};
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000023698.2, ECO:0000313|Proteomes:UP000008227};
RN [1] {ECO:0000313|Ensembl:ENSSSCP00000023698.2, ECO:0000313|Proteomes:UP000008227}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000023698.2,
RC ECO:0000313|Proteomes:UP000008227};
RG Porcine genome sequencing project;
RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSSCP00000023698.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR PeptideAtlas; I3LHZ9; -.
DR Ensembl; ENSSSCT00000028757.3; ENSSSCP00000023698.2; ENSSSCG00000020931.4.
DR VGNC; VGNC:90420; MSH6.
DR GeneTree; ENSGT00550000075024; -.
DR HOGENOM; CLU_002472_1_3_1; -.
DR TreeFam; TF105842; -.
DR Proteomes; UP000008227; Chromosome 3.
DR Bgee; ENSSSCG00000020931; Expressed in hindlimb bud and 44 other cell types or tissues.
DR ExpressionAtlas; I3LHZ9; baseline and differential.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd05837; PWWP_MSH6; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR000313; PWWP_dom.
DR PANTHER; PTHR11361:SF148; DNA MISMATCH REPAIR PROTEIN MSH6; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR Pfam; PF00855; PWWP; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 2.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 1: Evidence at protein level;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677};
KW Proteomics identification {ECO:0007829|PeptideAtlas:I3LHZ9};
KW Reference proteome {ECO:0000313|Proteomes:UP000008227}.
FT DOMAIN 94..156
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 14..67
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 250..279
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1333 AA; 149388 MW; F29736F580A8EDEB CRC64;
MSRQSTLYSF FPKSPALTNA NKAPARVSSE GSAPAAAAGA SPSPGGDAAW SETGPGSGSP
AGSASRAEAR KLNGGLRRSA APAVPASSSC DFSPGDLVWA KMEGYPWWPC LVYNHPFDGT
FIREKGKSAR VHVQFFDDSP TRGWVSRRLL KPYTGSKSKE AQKGGHFYSA KPEILRAMQR
ADEALNKDKI KRLELAVCDE PSEPEEEEEM EAGAPYASEK SEEENEMESE EEVQPKVQGS
RRSSRHIKKR RVISDSESDV GGSDVEFKPD TKEEGSSDEM SSGVGDSDSE GLDSPVKIAP
KRKRMVTGNG SLKRKSSRKE MPSATKRAIG ISSETRSTLS AFSAPQNSEP QAHISGGCDD
SNRPTVWYHE TLEWLKEERR RDVHRRRPDH PDFDASTLYV PEDFLNSCTP GMRKWWQIKS
QNFDLVIFYK VGKFYELYHM DALIGVSELG LVFMKGNWAH SGFPEIAFGR YSDSLVQKGY
KVARVEQTET PEMMEARCRK MAHISKYDRV VRREICRVIT KGTQTYSVLE GDPSENYSKY
LLSLKEKEDD SSGHTRVYGV CFVDTSLGKF FIGQFSDDRH CSRFRTLVAH YPPVQVLFEK
GSLSTETKMI LKGSLSSSLQ EGLIPGSQFW DAGKTLRTLL EEGYFTDKLN EDSGVMLPQV
LKGMTSESDS IGLTPGEKSE LALSALGGCV FYLKKCLIDQ ELLSMANFEE YIPLDSDVVS
ASRPGAVFAK ANQRMVLDAV TLNNLEIFLN ATNGSPEGTL LEKIDTCHTP FGKRLLKQWL
CAPLCNPYAI SDRLDAIEDL MVVPDKISEV VDLLKKLPDL ERLLSKIHNV GSPLKSQNHP
DSRAIMYEET TYSKKKIIDF LSALEGFKVI CKIRGIMEEV IDDFKSKILK QVLTLQTKNP
EGRFPDLTVE LNRWDTAFDH EKARKTGLIT PKAGFDSDYD QALADIRENE QSLLEYLEKQ
RSRIGCRTIV YWGIGRNRYQ LEIPENFTTR NLPEEYELKS TKKGCKRYWT KTIEKKLANL
INAEERRDVS LKDCMRRLFY NFDKNYKDWQ AAVECIAVLD VLLCLANYSR GGDGPMCRPV
ILLPGEDTPP FLYLKGSRHP CITKTFFGDD FIPNDILIGC EEEEEENDKA YCVLVTGPNM
GGKSTLMRQA GLLAVMAQMG CYVPAEVCRL TPIDRVFTRL GASDRIMSGR GTATFDGTAI
ANAVVKELAE NIKCRTLFST HYHSLVEDYS QNVAVRLGHM ACMVENECED PSQETITFLY
KFIKGACPKS YGFNAARLAN LPEEVIQKGH RKAREFEKMT QSLRLFREVC LASERSTVDA
EAVHKLLTLM EEL
//