ID B8C801_THAPS Unreviewed; 749 AA.
AC B8C801;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 24-JAN-2024, entry version 73.
DE SubName: Full=Mlh1-like protein {ECO:0000313|EMBL:EED90196.1};
GN ORFNames=THAPSDRAFT_263509 {ECO:0000313|EMBL:EED90196.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED90196.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED90196.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED90196.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED90196.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED90196.1};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutL/HexB family.
CC {ECO:0000256|ARBA:ARBA00006082}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000645; EED90196.1; -; Genomic_DNA.
DR RefSeq; XP_002292221.1; XM_002292185.1.
DR AlphaFoldDB; B8C801; -.
DR STRING; 35128.B8C801; -.
DR PaxDb; 35128-Thaps263509; -.
DR EnsemblProtists; EED90196; EED90196; THAPSDRAFT_263509.
DR GeneID; 7449303; -.
DR KEGG; tps:THAPSDRAFT_263509; -.
DR eggNOG; KOG1979; Eukaryota.
DR HOGENOM; CLU_004131_2_0_1; -.
DR InParanoid; B8C801; -.
DR OMA; ANYHVKK; -.
DR Proteomes; UP000001449; Chromosome 9.
DR GO; GO:0032389; C:MutLalpha complex; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0016887; F:ATP hydrolysis activity; IBA:GO_Central.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IBA:GO_Central.
DR CDD; cd16926; HATPase_MutL-MLH-PMS-like; 1.
DR Gene3D; 3.30.230.10; -; 1.
DR Gene3D; 3.30.565.10; Histidine kinase-like ATPase, C-terminal domain; 1.
DR InterPro; IPR014762; DNA_mismatch_repair_CS.
DR InterPro; IPR013507; DNA_mismatch_S5_2-like.
DR InterPro; IPR036890; HATPase_C_sf.
DR InterPro; IPR032189; Mlh1_C.
DR InterPro; IPR002099; MutL/Mlh/PMS.
DR InterPro; IPR038973; MutL/Mlh/Pms-like.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR014721; Ribsml_uS5_D2-typ_fold_subgr.
DR NCBIfam; TIGR00585; mutl; 1.
DR PANTHER; PTHR10073; DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL; 1.
DR PANTHER; PTHR10073:SF12; DNA MISMATCH REPAIR PROTEIN MLH1; 1.
DR Pfam; PF01119; DNA_mis_repair; 1.
DR Pfam; PF13589; HATPase_c_3; 1.
DR Pfam; PF16413; Mlh1_C; 1.
DR SMART; SM01340; DNA_mis_repair; 1.
DR SUPFAM; SSF55874; ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase; 1.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 1.
DR PROSITE; PS00058; DNA_MISMATCH_REPAIR_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001449}.
FT DOMAIN 231..352
FT /note="DNA mismatch repair protein S5"
FT /evidence="ECO:0000259|SMART:SM01340"
FT REGION 129..155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 429..464
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..149
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..455
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EED90196.1"
SQ SEQUENCE 749 AA; 81307 MW; 1F92AFAC8AA507DB CRC64;
SKKIRPLPKE VVDRIAAGEV VQRPVSVVKE LLENSLDADG TQIDIQCQKG GLESITITDN
GTGISPTSLP LACTRFATSK LVTVDDLKSI RTFGFRGEAL ASASMVGRVC ISKTLQNNNC
AFKMHYRDGN PTSDPKLPTN KNSTATIKPK PSAGKEGTTI TVQDLFYNIP SRRRAMEGRR
SERDEYDRIL NCVQRYAVHE AKRGVGFVCR GGGGGGGGKG AAGRATTKDV IGHIFGTAVS
RELLPLNAGE GDVEAVTTGL ITNGSYSAPK SSAAFLLFIN DRLVESASLR RAVESIYSDA
LPKGGKPFVY LSLELPGPHV DVNVHPTKRE VAFLHEDRLC VALAAAVKEV IGSATSSRTF
AVAASGALLA PEEKRVRVQP KKAIAMGERE GLVVSTASNV TTEIGESDSG AVDDAMAVDE
PEPQQIVDNT NQQQDQQQNE QPKKRHADES KQPPLSKKPY DPSRLVRTNS AAPAAPNAIV
VRINANNNIV RPKKISPTEC DYESIAKLRG DIVSRNHQNL NETLRGASFV GAVSRSRSLI
QYGIDLLMIN HRELARETFY QIALMKFNGM PIATLGGGGV DSDATHESLR VKVNKTNATL
AKQATSCLSE KADMLEEYFS IKFERRGKSL FVTGLPVLLE GHSPQPHALP LFLMRLATEV
NWMDERLCFQ NVCTELGSYY SEPPVANDEE ENAATEAPDY IDDEAKAFVK HTLFPAISFL
LVPPKEFATN GTVLKLANLT SLYKVFERC
//