ID A0A0B2VA79_TOXCA Unreviewed; 1255 AA.
AC A0A0B2VA79;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
GN Name=Msh6 {ECO:0000313|EMBL:KHN80366.1};
GN ORFNames=Tcan_09759 {ECO:0000313|EMBL:KHN80366.1};
OS Toxocara canis (Canine roundworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Ascaridomorpha; Ascaridoidea; Toxocaridae; Toxocara.
OX NCBI_TaxID=6265 {ECO:0000313|EMBL:KHN80366.1, ECO:0000313|Proteomes:UP000031036};
RN [1] {ECO:0000313|EMBL:KHN80366.1, ECO:0000313|Proteomes:UP000031036}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PN_DK_2014 {ECO:0000313|EMBL:KHN80366.1};
RA Zhu X.-Q., Korhonen P.K., Cai H., Young N.D., Nejsum P.,
RA von Samson-Himmelstjerna G., Boag P.R., Tan P., Li Q., Min J., Yang Y.,
RA Wang X., Fang X., Hall R.S., Hofmann A., Sternberg P.W., Jex A.R.,
RA Gasser R.B.;
RT "Genetic blueprint of the zoonotic pathogen Toxocara canis.";
RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KHN80366.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JPKZ01001747; KHN80366.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0B2VA79; -.
DR STRING; 6265.A0A0B2VA79; -.
DR OMA; TPMMAQY; -.
DR Proteomes; UP000031036; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 6.10.140.80; -; 1.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF148; DNA MISMATCH REPAIR PROTEIN MSH6; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 3.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF53150; DNA repair protein MutS, domain II; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677};
KW Reference proteome {ECO:0000313|Proteomes:UP000031036}.
FT DOMAIN 682..1026
FT /note="DNA mismatch repair protein MutS core"
FT /evidence="ECO:0000259|SMART:SM00533"
FT DOMAIN 1051..1219
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|SMART:SM00534"
FT REGION 1..162
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..47
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..84
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..115
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 116..162
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1255 AA; 140830 MW; 4B66AD393A2BB806 CRC64;
MSSSRKQTSL FSFFTRSDDK EDGSPSRATF SSSVMKQREN FLTEKMNRNG SMKAPERDET
EERKTCGAPK RTAQDDEVCS PLKRNHTPKR CRIVLSSDSE PDDECGHENS KQVEISVEAD
HSPTSSDNPT SSEPCTPDLN VKPKNRTSTP RSVPRTSAAS TPRLVSEMAH SFIGCFRVNE
EDMDAPNVSM SSLDRSIYKV EESAACGTSK KEGGLAVLSD CEAGRFPHLD FDFLQPDKIR
DANGRRISDP YYCPRTLFVP DAFIKQQTPG HRQWWLAKSA YFDTVLFFKV GKFYEMYHMD
AVIGVENLNL TYMRGKFAHC GFPEIAYGRF ADQLVSRGYK VARVEQTETP TQLEERNRLE
KNREKVVRRE ICRVTSAGTR TYGVLDTCDG DCALDAVESS ARHLLSFAEK VLPNGLSTYG
VCFIDTSVGR FYVGQFVDDA NRSSMRTLFA HYEPSQILYE RGRISAASQS LLNSVASAVI
KEALVPKKEF PDAESAVKML TSKLYFGEVV REWPQTVRSL LVDADALDAK CAPEFNECMA
ALGAVLWYLK RSLIDVDMVT MRTFERYIPP CLLGHPASQN DSIEGMQSGE GYWRHRRLVL
DGVSLYNLNI VPPLDGVRRS GMRDSISSKY SLYNTINKCI TPFGEGYWRH RRLVLDGVSL
YNLNIVPPLD GVRRSGMRDS ISSKYSLYNT INKCITPFGK RVLRQWVCAP SCDGDVLRAR
QDAIQWLMSP TAKLFTDKAT EVLRKMPDLE RLLQKIHTLG LKYRASQHPD SRAVMFEPLR
YNRRKIGDLL SALSGFERVL ELVRLYGKLF TEAEQRPKLI DRCFGACFPD ISGDLSHFQE
AFDHEKAKAE GIIVPEKGVD AEYDGAVEDV HECIHNLDAY LFAIRKRLGC GSIQYFGSGR
SRYQLEIPET IAKNLGGDFE LKSSRKGYKR FSTEESVRLF EELVEAETRC DIIRRDVMRR
VFADFDTRAA KWAAVTERVA LFDVLLSLAR YAKSSGLAMC RPEFVFDSDT PFLNIAAGYH
PCLAAKMCAG REREASFTYI PNDTQLGGDH PLTMLLTGPN MGGKSTLMRQ VAVLVVLAQI
GSLVPAKKMR LSPVDRIFTR IGANDRISAG QSTFFVELNE ANIIMRDASV YSLVVMDELG
RGTRSVYLSL HNLMLRKVDR LPGLLSFMDN ENDVDPTLEN VTFLYSLSDG VCPKSYGFFA
AKVSGIKPEV IRTAFAASQR LDMGIDGKSS RLALLVREAK KGCDVGQLSS MLASM
//