ID A0A061F4P4_THECC Unreviewed; 818 AA.
AC A0A061F4P4;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 59.
DE SubName: Full=MUTS {ECO:0000313|EMBL:EOY12305.1};
GN ORFNames=TCM_030844 {ECO:0000313|EMBL:EOY12305.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY12305.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY12305.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|ARBA:ARBA00006271}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001885; EOY12305.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F4P4; -.
DR STRING; 3641.A0A061F4P4; -.
DR EnsemblPlants; EOY12305; EOY12305; TCM_030844.
DR Gramene; EOY12305; EOY12305; TCM_030844.
DR eggNOG; KOG0221; Eukaryota.
DR HOGENOM; CLU_002472_8_2_1; -.
DR InParanoid; A0A061F4P4; -.
DR OMA; METHQDG; -.
DR Proteomes; UP000026915; Chromosome 7.
DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:EnsemblPlants.
DR GO; GO:0043073; C:germ cell nucleus; IEA:EnsemblPlants.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0003690; F:double-stranded DNA binding; IBA:GO_Central.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0051026; P:chiasma assembly; IBA:GO_Central.
DR GO; GO:0010777; P:meiotic mismatch repair involved in reciprocal meiotic recombination; IEA:EnsemblPlants.
DR CDD; cd03281; ABC_MSH5_euk; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR011184; DNA_mismatch_repair_Msh2.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR PANTHER; PTHR11361:SF20; MUTS PROTEIN HOMOLOG 5; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF005813; MSH2; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 656..672
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|PROSITE:PS00486"
SQ SEQUENCE 818 AA; 92459 MW; 2D4E6F192AED0827 CRC64;
MDEEMDETEA VPQVYMACIQ HGHRIGISYY DSSIRQLNVL EVWDDGSSDF PMIELVKYQA
KPVVIYTSTK AEESFLSALQ GSDGMTEAPT VKLVKSSIFT YEQAWHRLIY LRVTGMDDGL
NIKERICYLS SMMDMGSDVQ VRVSGGLLAI LENERIVDTL EQKECGNASI TIDSVVEISL
DKFLKLDAAA HEALQIFQVD KHPSHMGIGR AKEGFSVFGM MNKCVTPMGR RLLRNWFLRP
ILDLENLNNR LNAVSFYSLI IWGMIKISFF LSSEELMVSL RETLKSVKDI PHILKKFNSP
NSMCTSSDWM AFLKSVCSLL HVNKIFEVGI SENLREHMEY LNLDIVAKAS SCITADLAFV
YELVIGVIDV NRSKDKGYGT MVKEGFCDEL DELRHIYEEL PEFLEEVASL ELAQLPHLRK
EEFAPRIVYI HQIGYLMCFF EEKIDEITQE KLQDFEFAFS DSGGITKRFF YRTPKTRELD
DLLGDIYHKI LDMERAIIRD LVSHVSTFST HLIKAVNFVA ELDCFLSLAM VARQNNYVRP
TLTMETFLDI QNGRHVLQEM TVDTFIPNDT KILDEGRIHI ITGPNYSGKS IYIKQVALIV
FLSHIGSFVP ADAATVGLTD RIFCGMGSKL MTAEQSTFMI DLHQVGMMLR QATSRSLCLL
DEFGKGTLTE DGIGLLGGTI NHFVTSYVPP KVLVCTHLTE LFNESCLPKS EKINFYTMSV
LRPDDNATNV EDIIFLYRLV PGHAALSYGL HCALLAGVPK EVINRAALVL DAIENNKNVE
RLCDEKISAK DRQYKGAVDK MLAFDALKGD LSAFFRDI
//