ID A0A3P8N6E9_ASTCA Unreviewed; 1379 AA.
AC A0A3P8N6E9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
OS Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000000335.1, ECO:0000313|Proteomes:UP000265100};
RN [1] {ECO:0000313|Ensembl:ENSACLP00000000335.1, ECO:0000313|Proteomes:UP000265100}
RP NUCLEOTIDE SEQUENCE.
RA Datahose.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACLP00000000335.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSACLT00000000349.1; ENSACLP00000000335.1; ENSACLG00000000235.1.
DR GeneTree; ENSGT00550000075024; -.
DR OrthoDB; 168255at2759; -.
DR Proteomes; UP000265100; Chromosome 13.
DR Bgee; ENSACLG00000000235; Expressed in testis and 6 other cell types or tissues.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd05837; PWWP_MSH6; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR000313; PWWP_dom.
DR PANTHER; PTHR11361:SF148; DNA MISMATCH REPAIR PROTEIN MSH6; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR Pfam; PF00855; PWWP; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF53150; DNA repair protein MutS, domain II; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677}.
FT DOMAIN 95..157
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 1..75
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..214
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..55
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..316
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1379 AA; 153734 MW; ECBE95B3192A0684 CRC64;
MAKQSSLFNF FTKSPPLVAK PKPSPSPEEA DLPCSIEKSN SSPKEQAQQT PQQTKDTNKA
KKESKSKPAS GGFKKLFGDK AQTTKESSPP CLFSAGALVW AKMEGYPWWP CMVVPQPLTG
QEMRGHGRDQ RIHVHFFDEP PTRGWVSSNR VREYQGSDSS DAKPGGVYFC GKPVIRRAME
LADGVMFDSP EKRLKIPLCT DPSDAEEDDD EEMELDSLVV TDEEGSDEYE NKSEALKPSK
VSTRLFCALM EKGSKAKRRR IIAASDSDGS DEEFKPEDAA SSSEDEEDRR KESCGEKEES
TEKSDVESPI KPAKRKRPAE KSAKTKAKTP TAPSVAPKRA PAAVATDTKS RLSAFSAPES
FESQATGSGS TAGAVWDHEK LEWLQDGKRK DGKRRRQTDD DYDPSTLYVP NDFMNQITPG
IRRWWQLKSE MFDTVIFYKV GKFYELYHMD AVIGVNELGL TFMKGTWAHS GFPEIGFGRF
SDVLVQKGYK VARVEQTETP EMMEARCKTM LKPTKLDRVV KREVCRIITR GTQTYSVLDG
APSESQSKFL LSLKEKAEEE SSGRCRTYGV CFIDTSVGCF HVGQFSDDRH CSRLRTLIAH
YAPAEVLFEK GNPSVETRKI LKASLSSALQ EGLNAGTQFW DAQKTLKTLS EEDYFKEATG
KEEGTGSSFL PALLKEMTSE SDSLCLTPKE GYELALSALG GCIFYLKKCL VDQELLSMAN
FEEYVPVDVE VEKAAGPANF FAKTRQRMVL DGVTLANLEI FQNGSGGTEG TLLERLDTCS
TPFGKRLLKQ WLCAPLCNPT SIRDRLDAVE DLMGVQAQAT EVSDLLKKLP DLERLLSKIH
SIGTPLKSQD HPDSRAVLYE DVTYSKRKIA VFLSALEGFK TMQDIVSLFA PVSGEFRSTL
LCRVISLNSE KNGLFPDLSG ELKRWDTAFD HQKARTTGVI TPKAGFDPEY DQALTGIKNC
ERELQDYLDK QKKRLGCKSM CFWGTGKNRY QMEVPDSVLE KNIPDEYELK STKKGWKRYV
TKKTERMFSE LQGFEEKRDA ALKDCMRMLF YNFDKNYSDW KTAVECMAVL DVLLAFSRYS
QGGDGPMARP EVVLPEDDAQ VAAFIDLKGS RHPCVTKTFF GDDFIPNDIF IGCPGTGENS
EDDSLASCIL VTGPNMGGKS TLMRQCGLVI ILAQLGCYIP AESLRFTPVD RVFTRLGASD
RIMAGESTFF VELSETASIL RHATKHSLVL LDELGRGTAT YDGTAIASAV VKELAEKICC
RTLFSTHYHS LVEDYANNPA VRLGHMACMV ENECEDQSQE TITFLYKFIT GACPKSYGFN
AARLANLPEE VIQSGHRKAR EFETSTVNLR LFKKLCQFAE DATLDSTRFT SLIQLLSTL
//