ID A0A0V0WJ63_9BILA Unreviewed; 1311 AA.
AC A0A0V0WJ63;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=DNA mismatch repair protein Msh2 {ECO:0000313|EMBL:KRX75251.1};
GN Name=Msh2 {ECO:0000313|EMBL:KRX75251.1};
GN ORFNames=T06_16285 {ECO:0000313|EMBL:KRX75251.1};
OS Trichinella sp. T6.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=92179 {ECO:0000313|EMBL:KRX75251.1, ECO:0000313|Proteomes:UP000054673};
RN [1] {ECO:0000313|EMBL:KRX75251.1, ECO:0000313|Proteomes:UP000054673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS34 {ECO:0000313|EMBL:KRX75251.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX75251.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDK01000127; KRX75251.1; -; Genomic_DNA.
DR STRING; 92179.A0A0V0WJ63; -.
DR Proteomes; UP000054673; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.1420.10; -; 2.
DR Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR11361:SF35; DNA MISMATCH REPAIR PROTEIN MSH2; 1.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF01624; MutS_I; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF05190; MutS_IV; 1.
DR Pfam; PF00488; MutS_V; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000054673}.
FT DOMAIN 1051..1111
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1053..1112
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ SEQUENCE 1311 AA; 149854 MW; EA9961CC0F321787 CRC64;
MDKDLYGIFV KIIEGKSPTN ICMFDRADYY FCIGDDAFYI AKEILKTVTI LKHILVEDVP
QQYITLNKNT FERVVRELLL VHLYRIEVYR CSSTRSQDWE LSAKASPGYI REFENILTNC
EELQEQSRIL AFKITEEAGN ICKLNVSMVC CDLFYHTFQI CDILDNDNLA NFKRILTQLR
PKECLTVDSS VTLNQMRKNI LAQIAIPVTT LKKSDFNSSN VLNDLDMLVK FKKGSHGFCG
SLPEMKVSDS TLQCFAGLIK YFGLVNDDCF LKQFQIQYFT SEKHLRVEWS TFESLNLFRF
GGEKSADNLY GILNHCQTAV GQRLLHEYLK MPLLDKNKID ERLDIVDLFV QNGEIRNILQ
QELLCRFPDL HRLCRKFVIK RAGLPEVYKV YSAVNCASDM LKLLDKMNTS VIKDNFTEPL
LITLDDFQKL TEMVSMTLDL DCIARTGEYR VKPEFSQFLQ DLNNCMQNID QKMTSYLTKA
MKMYGFEQGK SLKLEYSAQW GHVFRVSRKD EKIIRNQKAV ELLDMQKGSV RFTDEILKNM
NSEMMKAKDS YEEFQKRIVD ELVATVATYM DPMKSLAQVV GHLDVMVSFA VASVNAPVQY
VRPKILQPGS GILNLTQARH PCLEMQPDVS FIANDLKLAK GETELIILTG PNMGGKSTYL
RQTAMIIIMA QMGCFVPCQE ATISILDAVY TRIGASDNQY KGLSTFMTEM VEVSEILELP
SENSLIVIDE LGRGTSTFDG LGIAWAVAET IATKVRAFCL FASHFYEMTM MASELFTVKN
YQALATFANN NLVLLYKIKP GICDRSYGIN VAEMVGFPDI VLKEAWKEAR RLEAHRHGGT
TDEDGNDQQF GLNGSDTVCL LNEDAFHGSG REKSFLNGTR AKIKNDWPSL VKSRRTVQEH
QHSVLKINAL PSDTSASMSI PSDAPYVRQS PNWKKVVLAL SFDNIGCMFR IANLLNLNEQ
TRNSPANSKI TVQLPDSVDH ATKAKATNPG VVEPCPLPVV FFSVSNESMR WPLRTCPVDE
TLWRRENTYN GLYWTPFNSV SPVFCGSSRC MKQRRPRTSF TSQQLVELES KFKEFKYLSR
PQRYEIATAL SLSENQVKIW FQNRRMKWKR YRMAEREKEK EANFCKNHGT SKRKKFYSLV
RRINKSKVLG PRPENVMRMS VLRCSWMFLK WLADRCSMMC DGVSQICVYS RISSQMFCVV
IRKRGGGAQI RCVYQLALVF HVEQQGVFKE QVGTEQRLRL SNDFILLLYV ASSGQAGTAF
SRSGRQFLVE IHSIFTTTGG ERDHSGIIPK WEATESLWAQ HMLVESPLEH F
//