ID G3X0P6_SARHA Unreviewed; 1414 AA.
AC G3X0P6;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=MutL homolog 3 {ECO:0000313|Ensembl:ENSSHAP00000021251.2};
GN Name=MLH3 {ECO:0000313|Ensembl:ENSSHAP00000021251.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021251.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021251.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021251.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutL/HexB family.
CC {ECO:0000256|ARBA:ARBA00006082}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000021251; -.
DR Ensembl; ENSSHAT00000021423.2; ENSSHAP00000021251.2; ENSSHAG00000018012.2.
DR eggNOG; KOG1977; Eukaryota.
DR GeneTree; ENSGT00800000124176; -.
DR HOGENOM; CLU_002376_0_0_1; -.
DR InParanoid; G3X0P6; -.
DR TreeFam; TF329597; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005712; C:chiasma; IEA:Ensembl.
DR GO; GO:0001673; C:male germ cell nucleus; IEA:Ensembl.
DR GO; GO:0032300; C:mismatch repair complex; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0000795; C:synaptonemal complex; IEA:Ensembl.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0019237; F:centromeric DNA binding; IEA:Ensembl.
DR GO; GO:0003682; F:chromatin binding; IEA:Ensembl.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0007144; P:female meiosis I; IEA:Ensembl.
DR GO; GO:0007140; P:male meiotic nuclear division; IEA:Ensembl.
DR GO; GO:0006298; P:mismatch repair; IEA:Ensembl.
DR GO; GO:0008104; P:protein localization; IEA:Ensembl.
DR GO; GO:0007130; P:synaptonemal complex assembly; IEA:Ensembl.
DR CDD; cd16926; HATPase_MutL-MLH-PMS-like; 1.
DR CDD; cd03486; MutL_Trans_MLH3; 1.
DR Gene3D; 3.30.230.10; -; 1.
DR Gene3D; 3.30.565.10; Histidine kinase-like ATPase, C-terminal domain; 1.
DR Gene3D; 3.30.1540.20; MutL, C-terminal domain, dimerisation subdomain; 1.
DR Gene3D; 3.30.1370.100; MutL, C-terminal domain, regulatory subdomain; 1.
DR InterPro; IPR014762; DNA_mismatch_repair_CS.
DR InterPro; IPR013507; DNA_mismatch_S5_2-like.
DR InterPro; IPR036890; HATPase_C_sf.
DR InterPro; IPR002099; MutL/Mlh/PMS.
DR InterPro; IPR038973; MutL/Mlh/Pms-like.
DR InterPro; IPR014790; MutL_C.
DR InterPro; IPR042120; MutL_C_dimsub.
DR InterPro; IPR042121; MutL_C_regsub.
DR InterPro; IPR037198; MutL_C_sf.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR014721; Ribsml_uS5_D2-typ_fold_subgr.
DR NCBIfam; TIGR00585; mutl; 1.
DR PANTHER; PTHR10073; DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL; 1.
DR PANTHER; PTHR10073:SF47; DNA MISMATCH REPAIR PROTEIN MLH3; 1.
DR Pfam; PF13589; HATPase_c_3; 1.
DR Pfam; PF08676; MutL_C; 1.
DR SMART; SM01340; DNA_mis_repair; 1.
DR SMART; SM00853; MutL_C; 1.
DR SUPFAM; SSF55874; ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase; 1.
DR SUPFAM; SSF118116; DNA mismatch repair protein MutL; 1.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 1.
DR PROSITE; PS00058; DNA_MISMATCH_REPAIR_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 211..350
FT /note="DNA mismatch repair protein S5"
FT /evidence="ECO:0000259|SMART:SM01340"
FT DOMAIN 1159..1340
FT /note="MutL C-terminal dimerisation"
FT /evidence="ECO:0000259|SMART:SM00853"
FT REGION 717..738
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1414 AA; 159643 MW; 7A4AEE70BCE4628E CRC64;
MIKCLSEEVQ VKLRSGVAVS SISQCVEELA LNSIDAEAKC VAVRVNMETF KVQVIDNGSG
IERDDVERVG KQYFTSKCKS VQDLENPKFY GFRGEALSSI VNMASAVEIA SKTNKTVETF
VKLFQNGKAL ETCEAELTRP SSGTTVTIFN LFYQLPVRRK CMDPRLEFEK VRQRIEALSL
MHPSISFSLR NDISGSMVLQ LPKTKDTCSR FCQIHGLGKS QKLREINFKH KEFELNGYIS
CEAHYNKNLQ FLFVNKRLVL RTRLHKLIDF LLRKESIICR PKGGPASKQM TSSPPRHRSN
SELHGIYVIN VKCQFCEYDV CLDPAKTLIE FRNWDTVLVC IQEGIKCFLK QEHLFIELSG
DDIKEFNEDS DLTLFNAILQ PAISDEKCVQ NNFQEAYENM DSYEIFNMKS KTVKRKAVVE
DISLKTFRTE EDIKHIKDCQ VLTNSDPNEM CINDMIELSE PCQDSTYSKP NVLVQQEAKI
TDSEKNNTKN ICLEPECSSN HHGASSEVLK SSFQIPHHLE ANGENPCLQK EKINDNGIVV
NSVCEGQQRL KDIPEMACEH TPFGRILLET CDTLKEDEGT KKESDCNRGK TIFSYGKVQL
CSTGFITHVM QTQQSKTSEM DFTLKNNFQP GPISAREIFG NKTQSSVETP NIDLNTNLSE
ESAKSVKQNF CLPNIKAGPK SKTIGICKNE AFFLKKSSKQ THTSTLLPNA SLTFPWNTHT
SNNGKNTEKL TGSKPFPHKK LNLFSQPVSL EKFKRQYKKV ESPMPTRIQN ISNDFELTTN
NGSQVELDIS QRGNSHLDHF NICEIPPINN SESNVRNQPD NHIPSEQFRI SKKKEILEQQ
NASVTENPIT LTDYSEFNRK PLNVNKPLGS LASKLSRMKG HNKETVITES IEHSNDSGSS
MKDNSLCSVL AQDSHELSHN IHKITEDSIH LPDSETAVQD NTCNKNTNSA FRNQSLLIST
TEDYPMKYNV PLMLSRNTAF EVCEVPNNPL LSSEQQLETA NPSSTALISH VDISTNDQTA
ACFQKENSGT RISKDEESMA QSFDWQAHFD VSLGRMVYVN KITGLSTFSA PQEENLASCT
KDLTTMAVSV GFRDDTVDEA IGTDCLQTLF SEWENPVFAR YPEVALDVSN EQAESLTVKI
HNILYPYRFT KEMVHSMQVL QQVDNKFIAC LMSTKQEENG KAGGNLLVLV DQHAAHERIR
LEQLICDSYE KEQPKSFHRK KLLSSTIYPP MKVTVTEEQR RLLQCYHKAL EDLGLKLIFP
DPPSSHILVG EVPLCFVERE ANEVRRGRPT VTKSILEEFI REQVELLQTT GGAQGTLPLT
IQKLLASQAC HGAIKFNDSL SLEESCRLIE ALSWCQLPFQ CAHGRPSMLP LADINHLEQE
KQNSKPNLAK LCKMAQAWHL FKKAEAHDEK QNKG
//