ID H2R3J1_PANTR Unreviewed; 1427 AA.
AC H2R3J1; K7C1P8;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=MutL homolog 3 {ECO:0000313|EMBL:JAA36496.1, ECO:0000313|Ensembl:ENSPTRP00000045604.5};
GN Name=MLH3 {ECO:0000313|EMBL:JAA36496.1,
GN ECO:0000313|Ensembl:ENSPTRP00000045604.5,
GN ECO:0000313|VGNC:VGNC:12561};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000045604.5, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000045604.5, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|EMBL:JAA36496.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Skeletal muscle {ECO:0000313|EMBL:JAA36496.1};
RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT "De novo assembly of the reference chimpanzee transcriptome from NextGen
RT mRNA sequences.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPTRP00000045604.5}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutL/HexB family.
CC {ECO:0000256|ARBA:ARBA00006082}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04008417; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; GABE01008243; JAA36496.1; -; mRNA.
DR RefSeq; XP_001158557.1; XM_001158557.4.
DR Ensembl; ENSPTRT00000047363.5; ENSPTRP00000045604.5; ENSPTRG00000006548.7.
DR GeneID; 453043; -.
DR KEGG; ptr:453043; -.
DR CTD; 27030; -.
DR VGNC; VGNC:12561; MLH3.
DR eggNOG; KOG1977; Eukaryota.
DR GeneTree; ENSGT00800000124176; -.
DR HOGENOM; CLU_002376_0_0_1; -.
DR OrthoDB; 9570at2759; -.
DR TreeFam; TF329597; -.
DR Proteomes; UP000002277; Chromosome 14.
DR Bgee; ENSPTRG00000006548; Expressed in skeletal muscle tissue and 21 other cell types or tissues.
DR GO; GO:0032300; C:mismatch repair complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd16926; HATPase_MutL-MLH-PMS-like; 1.
DR CDD; cd03486; MutL_Trans_MLH3; 1.
DR Gene3D; 3.30.230.10; -; 1.
DR Gene3D; 3.30.565.10; Histidine kinase-like ATPase, C-terminal domain; 1.
DR Gene3D; 3.30.1540.20; MutL, C-terminal domain, dimerisation subdomain; 1.
DR Gene3D; 3.30.1370.100; MutL, C-terminal domain, regulatory subdomain; 1.
DR InterPro; IPR014762; DNA_mismatch_repair_CS.
DR InterPro; IPR013507; DNA_mismatch_S5_2-like.
DR InterPro; IPR036890; HATPase_C_sf.
DR InterPro; IPR002099; MutL/Mlh/PMS.
DR InterPro; IPR038973; MutL/Mlh/Pms-like.
DR InterPro; IPR014790; MutL_C.
DR InterPro; IPR042120; MutL_C_dimsub.
DR InterPro; IPR042121; MutL_C_regsub.
DR InterPro; IPR037198; MutL_C_sf.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR014721; Ribsml_uS5_D2-typ_fold_subgr.
DR NCBIfam; TIGR00585; mutl; 1.
DR PANTHER; PTHR10073; DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL; 1.
DR PANTHER; PTHR10073:SF47; DNA MISMATCH REPAIR PROTEIN MLH3; 1.
DR Pfam; PF01119; DNA_mis_repair; 1.
DR Pfam; PF13589; HATPase_c_3; 1.
DR SMART; SM01340; DNA_mis_repair; 1.
DR SMART; SM00853; MutL_C; 1.
DR SUPFAM; SSF55874; ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase; 1.
DR SUPFAM; SSF118116; DNA mismatch repair protein MutL; 1.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 1.
DR PROSITE; PS00058; DNA_MISMATCH_REPAIR_1; 1.
PE 2: Evidence at transcript level;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277}.
FT DOMAIN 211..349
FT /note="DNA mismatch repair protein S5"
FT /evidence="ECO:0000259|SMART:SM01340"
FT DOMAIN 1189..1346
FT /note="MutL C-terminal dimerisation"
FT /evidence="ECO:0000259|SMART:SM00853"
FT REGION 624..643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 929..959
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1427 AA; 160666 MW; E8E9A04068B5241F CRC64;
MIKCLSVEVQ AKLRSGLAIS SLGQCVEELA LNSIDAEAKC VAVRVNMETF QVQVIDNGFG
MGSDDVEKVG NRYFTSKCHS VQDLENPRFY GFRGEALANI ADMASAVEIS SKKNRTMKTF
VKLFQSGKAL KACEADVTRA SAGTTVTVYN LFYQLPVRRK CMDPRLEFEK VRQRIEALSL
MHPSISFSLR NDVSGSMVLQ LPKTKDVCSR FCQIYGLGKS QKLREISFKY KEFELSGYIS
SEAHYNKNMQ FLFVNKRLVL RTKLHKLIDF LLRKESIICK PKNGPTSRQM NSSLRHRSTP
ELYGIYVINV QCQFCEYDVC MEPAKTLIEF QNWDTLLFCI QEGVKMFLKQ EKLFVELSGE
DIKEFSEDNG FSLFDATLQK RMTSDERSNF QEACNNILDS YEMFNLQSKA VKRKTTAENV
NTQNSRDSEG TRKNTNDAFL YIYESGGPGH SKMTEPSLQN KDSSCSESKM LEQETIVASE
AGENEKHKKS CLEHSSLENP CGTSLEMFLS PFQTPCHFEE SGEDLEIWKE STTVNGMAAN
ILKNNSIQNQ PKRFKDATEV GCQPLPFATT LWGVHSAQTE KEKKKESSNC GRRNVFSYGR
VKLCSTGFIT HVVQNEKTKS TETEHSFKNY VRPGPTRAQE TFGNRTRHSV EIPDIKDLAS
TLSKESGQLP NKKNCRTNIS YGLEDEPTAT YTMFSAFQEG SKKSDCILSD TSPSFPWYRH
VSNDSRKTDK LIGFSKPIVR KKLSLSSQLG SLEKFKRQYG KVENPLDTEV EESNGVTTNL
SLQVEPDILL KDKNRLENSD VCKITTMEHS DADSSCQPAS HTLDSEKFPF SKDEDCLEQQ
MPSLRESPMT LKELSLFNRK PLDLEKSSES LASKLSRLKG SERETQTMGM MSRFNELPNS
DSSRKDSKLC SVLTQDYCML FNNKHEKTEN GVIPTSDSAT QDNSFNKNSK THSNSNTTEN
CVVSETPLVL PYNNSKVTGK DSDVLIRASE QQIGSLDSPS GMLMNPVEDA TGDQNGICFQ
SEESKARACS ETEESNTCCS DWQRHFDVAL GRMVYVNKMT GLSTFIAPTE DIQAACTKDL
TTVAVDVVLE NGSQYRCQPF RSDLVLPFLP RARAERTVMR QDNRDTVDDT VSSESLQSLF
SEWDNPVFAR YPEVAVDVSS GQAESLAVKI HNILYPYRFT KGMIHSMQVL QQVDNKFIAC
LMSTKTEENG EADSYEKQQA QGSGRKKLLS STLIPPLEIT VTEEQRRLLW CYHKNLEDLG
LEFAFPDTSD SLVLVGKVPL CFVEREANEL RRGRSTVTKS IVEEFIREQL ELLQTTGGIQ
GTLPLTVQKV LASQACHGAI KFNDGLSLQE SCRLIEALSS CQLPFQCAHG RPSMLPLADI
DHLEQEKQIK PNLTKLRKMA QAWHLFGKAE CDTRQSLQQS MPPCEPP
//