ID A0A182M414_9DIPT Unreviewed; 1173 AA.
AC A0A182M414;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
OS Anopheles culicifacies.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles; culicifacies species complex.
OX NCBI_TaxID=139723 {ECO:0000313|EnsemblMetazoa:ACUA008912-PA, ECO:0000313|Proteomes:UP000075883};
RN [1] {ECO:0000313|Proteomes:UP000075883}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A-37 {ECO:0000313|Proteomes:UP000075883};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles culicifacies species A.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACUA008912-PA}
RP IDENTIFICATION.
RC STRAIN=A-37 {ECO:0000313|EnsemblMetazoa:ACUA008912-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AXCM01004089; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A182M414; -.
DR STRING; 139723.A0A182M414; -.
DR EnsemblMetazoa; ACUA008912-RA; ACUA008912-PA; ACUA008912.
DR VEuPathDB; VectorBase:ACUA008912; -.
DR OrthoDB; 168255at2759; -.
DR Proteomes; UP000075883; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd03286; ABC_MSH6_euk; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF148; DNA MISMATCH REPAIR PROTEIN MSH6; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677, ECO:0000256|RuleBase:RU003756};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677}.
FT DOMAIN 1024..1040
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|PROSITE:PS00486"
FT REGION 1..193
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 749..776
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1..58
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 149..187
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1173 AA; 131599 MW; FA1B89831DE33A2C CRC64;
MSKNKEKFPS SPNTLHNYFA KSPASTKNPA KPSTPTSALA VERSSSFNQA NGTPKSSKAA
VKKETESVGK EKRSAAVKSA VNDDEEEEIA TQKKRRRILL LDDTSDNEED TAGQNNENKP
NNNNNQDDHD DQTGTKKFAL LSSFERPNET IEIEQRKEAN KRESDDGPAQ KKIKLEPVNE
PNEKPGDGLD EPTVWTHQKL DFLKPNKIKD IHGNKPGSEK YDNRTLYVPD SYLSTLTPAM
RQWWILKSKN FDCVLFFKVG KFYELYHMDA EVGVTELGFS YMKGDFAHSG FPEAAYDRMS
TTLVEKGYKV ARVEQTETPE MMQERCKTER TNSKYDKVVR REICQITLMG TEVFGQQVNI
TPNHQPRYML AITEAVRQGM GSRYGVCFID TSIGLFHLGE FDDDNQQSRL LTFLSHYPPV
LVLHERAGSG NVSEGTQRIL RTLLANVKRE ALTHGSQFWS GETTLKHLAE TVYGGSMSEE
SKWPPVLRTM LDDADSLGLT PKESYQLALK ALGGCVWYLQ RCLLDQQVLS LATFEEYVPF
DENKETVSGT IEQRLGAAQV KRFMVLDSIT LNNLKIVGSE GSLVDRMDHC CTKFGKRLLY
SWVCAPSCVK EVILQRQEAV SELIEKVDLL QDVRQILGQL PDMERHLAQI HGFGLALNNH
PARRAILYEE HVYGKKKMRD FIATLKGFQS LLALPRLFAG ISSKLLVRLT QKANSNSPGA
FPSMEKQIEF FENSFDHENA LKSGSIVPEK GLDAEYDAIE QEIKDLNGEL EEYLVQQGKF
FGCTVKYFGN DKKRFQLEVP EARAKKATSD YTLEGTKTGK NAAKRFHTEE TRHFLKQMMQ
LEDRRKSVLK DLARRIFERF SRDYEMWKNC IDLVATLDVL TSLAEYARSE GLACVPELLD
KDDAVNGSKP FIEIEEGVHP CLASDAAENF IPNGIAIGGD GQANLVLLTG PNMGGKSTLM
RQVGLLAVMA QIGSRIPAQS CRMTIIDRIF TRLGASDDIM AGHSTFLVEL NETSAILKHA
TADSLVLLDE LGRGTATYDG TAVAGAVVHF LADLKCRTMF STHYHNLVDS FHDDPRIALG
HMACMVENED GDDPTQETVT FLYRYTDGAC PKSYGFNAAK LAGMPPDIIK RAYELSKTVE
AEALKRKILM KLLKNAPQNE IKDLVVKFKS CQF
//