ID A0A2U1NFN6_ARTAN Unreviewed; 1218 AA.
AC A0A2U1NFN6;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=DNA mismatch repair protein {ECO:0000256|PIRNR:PIRNR037677};
GN ORFNames=CTI12_AA272220 {ECO:0000313|EMBL:PWA72260.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA72260.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA72260.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA72260.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- FUNCTION: Component of the post-replicative DNA mismatch repair system
CC (MMR). {ECO:0000256|PIRNR:PIRNR037677}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutS family.
CC {ECO:0000256|PIRNR:PIRNR037677}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA72260.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01002927; PWA72260.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1NFN6; -.
DR STRING; 35608.A0A2U1NFN6; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd03286; ABC_MSH6_euk; 1.
DR Gene3D; 1.10.1420.10; -; 1.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR017261; DNA_mismatch_repair_MutS/MSH.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR007860; DNA_mmatch_repair_MutS_con_dom.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF153; DNA MISMATCH REPAIR PROTEIN MSH7; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05188; MutS_II; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PIRSF; PIRSF037677; DNA_mis_repair_Msh6; 2.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF55271; DNA repair protein MutS, domain I; 1.
DR SUPFAM; SSF53150; DNA repair protein MutS, domain II; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR037677};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|PIRNR:PIRNR037677};
KW DNA repair {ECO:0000256|PIRNR:PIRNR037677};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR037677};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR037677};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 1035..1051
FT /note="DNA mismatch repair proteins mutS family"
FT /evidence="ECO:0000259|PROSITE:PS00486"
FT REGION 1..158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 259..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 77..109
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..141
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1218 AA; 134475 MW; DF88BD6B7E5A33D3 CRC64;
MKRQQSILSF LHKPPSTDKP PSNPNPPRGN PTDDVTGTDT PPEKEQRSFF AAGVNGETKG
SGGSMFDCIK HKFMRPNNTL QKPTDRNTLD VGHSSLSSNK NSLSNGRDKE SLLSGFPRMK
NVIDLDETPG EGDERHPLLV DSESDITGPE TPGTRPLVPR LKRVQEDGFN FGSATTHSSI
DSTKRVRFSN DLSAEGKKDG VASEISKRVT FSLDSPTENR KPELSSVNSK RVKFSHDLLA
EGKKDGVASE ISKRVTFSLD SPAENKKPEL SSDNSKRGKF SYHLPADIKK PELATDNINR
SKFFHNLSAE IKKPELASDN SKRSKYFHDL PAEIKKEELA SDMGSKFDWL HPSKIKDANG
RRPGNPLYDK RTLYIPPDVL RKMSASQKQY WSVKSEYMDV LIFFKVGKFY ELYEVDAEIG
HKELDWKMTM SGVGKCRQVG ISESGIDDAV EKLLARGYKV GRVEQLETSE QAKSRGGTAV
IQRKLVNVLT PSTLIHGNIG PQAVHLLALK EGMRGLDDGT TAYGFAFVDC AALQFWVGSV
TDDASCAALG ALLMQVSPAE VIFDSQGLSK EAQKALNKYS LIGSVATQMT PTQPATGFVD
SSEVCSFIKM NGYFKGSSNV WDRALDGVVH QEIAVCALGG LANHLSRLKL DDALRNGSLL
PYEVYRGCLR MDGQTMANLE IFSNSEDGTL YKYLDNCITL SGKRLLRKWL CHPLKDIQEI
NQRLNLVEEL MGHTEIMQLI AQHLKKLPDL ERFLGQIKAT CNSTALLLLP LIGSKILKQR
VKAFGFLVKG LRAGMHLLML LQKEDNVFSL LSKLFTLPVL SGSDGLDKFL TQFEAAVDSD
FPNYQAHELK DSDAELLSIL IELFMEKANE WFQVIFALNC IDVLTSFATT ANFSCMAMSR
PVIIPRSSSD FSQGSKGPSL HMKGLCHPYA LAESGGTPVP NDLCLGDNEF GYNPRTLLLT
GPNMGGKSTI LRATCLAVIL AQLGSYVPCE MCVLSPVDTI FTRLGATDRI MTGESTFLIE
CTETASVLQN ASQDSLVILD ELGRGTSTFD GYAIAYAVFR HLVEKVNCRL LFATHYHPLT
KEFAAHPHVT LQHMACAFKS ISGTSQSTSQ RLVFLYRLTN GACPESYGMQ VALMAGIPQK
VVEAASKAAE VMKTKIGVSF QSSERRSEFS TLHEEWLKSL LAMARVDDHK LEGGDENDVF
DELFCLWHEL KCSNKKLM
//