GenomeNet

Database: UniProt
Entry: A0A0V0WJ63_9BILA
LinkDB: A0A0V0WJ63_9BILA
Original site: A0A0V0WJ63_9BILA 
ID   A0A0V0WJ63_9BILA        Unreviewed;      1311 AA.
AC   A0A0V0WJ63;
DT   16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT   16-MAR-2016, sequence version 1.
DT   24-JAN-2024, entry version 32.
DE   SubName: Full=DNA mismatch repair protein Msh2 {ECO:0000313|EMBL:KRX75251.1};
GN   Name=Msh2 {ECO:0000313|EMBL:KRX75251.1};
GN   ORFNames=T06_16285 {ECO:0000313|EMBL:KRX75251.1};
OS   Trichinella sp. T6.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC   Trichinellida; Trichinellidae; Trichinella.
OX   NCBI_TaxID=92179 {ECO:0000313|EMBL:KRX75251.1, ECO:0000313|Proteomes:UP000054673};
RN   [1] {ECO:0000313|EMBL:KRX75251.1, ECO:0000313|Proteomes:UP000054673}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ISS34 {ECO:0000313|EMBL:KRX75251.1};
RA   Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT   "Evolution of Trichinella species and genotypes.";
RL   Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KRX75251.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JYDK01000127; KRX75251.1; -; Genomic_DNA.
DR   STRING; 92179.A0A0V0WJ63; -.
DR   Proteomes; UP000054673; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR   GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR   GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.1420.10; -; 2.
DR   Gene3D; 3.40.1170.10; DNA repair protein MutS, domain I; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR   Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR   InterPro; IPR007695; DNA_mismatch_repair_MutS-lik_N.
DR   InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR   InterPro; IPR007861; DNA_mismatch_repair_MutS_clamp.
DR   InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR   InterPro; IPR016151; DNA_mismatch_repair_MutS_N.
DR   InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   InterPro; IPR036678; MutS_con_dom_sf.
DR   InterPro; IPR045076; MutS_family.
DR   InterPro; IPR027417; P-loop_NTPase.
DR   PANTHER; PTHR11361:SF35; DNA MISMATCH REPAIR PROTEIN MSH2; 1.
DR   PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF01624; MutS_I; 1.
DR   Pfam; PF05192; MutS_III; 1.
DR   Pfam; PF05190; MutS_IV; 1.
DR   Pfam; PF00488; MutS_V; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00534; MUTSac; 1.
DR   SMART; SM00533; MUTSd; 1.
DR   SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR   PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW   DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000054673}.
FT   DOMAIN          1051..1111
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        1053..1112
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ   SEQUENCE   1311 AA;  149854 MW;  EA9961CC0F321787 CRC64;
     MDKDLYGIFV KIIEGKSPTN ICMFDRADYY FCIGDDAFYI AKEILKTVTI LKHILVEDVP
     QQYITLNKNT FERVVRELLL VHLYRIEVYR CSSTRSQDWE LSAKASPGYI REFENILTNC
     EELQEQSRIL AFKITEEAGN ICKLNVSMVC CDLFYHTFQI CDILDNDNLA NFKRILTQLR
     PKECLTVDSS VTLNQMRKNI LAQIAIPVTT LKKSDFNSSN VLNDLDMLVK FKKGSHGFCG
     SLPEMKVSDS TLQCFAGLIK YFGLVNDDCF LKQFQIQYFT SEKHLRVEWS TFESLNLFRF
     GGEKSADNLY GILNHCQTAV GQRLLHEYLK MPLLDKNKID ERLDIVDLFV QNGEIRNILQ
     QELLCRFPDL HRLCRKFVIK RAGLPEVYKV YSAVNCASDM LKLLDKMNTS VIKDNFTEPL
     LITLDDFQKL TEMVSMTLDL DCIARTGEYR VKPEFSQFLQ DLNNCMQNID QKMTSYLTKA
     MKMYGFEQGK SLKLEYSAQW GHVFRVSRKD EKIIRNQKAV ELLDMQKGSV RFTDEILKNM
     NSEMMKAKDS YEEFQKRIVD ELVATVATYM DPMKSLAQVV GHLDVMVSFA VASVNAPVQY
     VRPKILQPGS GILNLTQARH PCLEMQPDVS FIANDLKLAK GETELIILTG PNMGGKSTYL
     RQTAMIIIMA QMGCFVPCQE ATISILDAVY TRIGASDNQY KGLSTFMTEM VEVSEILELP
     SENSLIVIDE LGRGTSTFDG LGIAWAVAET IATKVRAFCL FASHFYEMTM MASELFTVKN
     YQALATFANN NLVLLYKIKP GICDRSYGIN VAEMVGFPDI VLKEAWKEAR RLEAHRHGGT
     TDEDGNDQQF GLNGSDTVCL LNEDAFHGSG REKSFLNGTR AKIKNDWPSL VKSRRTVQEH
     QHSVLKINAL PSDTSASMSI PSDAPYVRQS PNWKKVVLAL SFDNIGCMFR IANLLNLNEQ
     TRNSPANSKI TVQLPDSVDH ATKAKATNPG VVEPCPLPVV FFSVSNESMR WPLRTCPVDE
     TLWRRENTYN GLYWTPFNSV SPVFCGSSRC MKQRRPRTSF TSQQLVELES KFKEFKYLSR
     PQRYEIATAL SLSENQVKIW FQNRRMKWKR YRMAEREKEK EANFCKNHGT SKRKKFYSLV
     RRINKSKVLG PRPENVMRMS VLRCSWMFLK WLADRCSMMC DGVSQICVYS RISSQMFCVV
     IRKRGGGAQI RCVYQLALVF HVEQQGVFKE QVGTEQRLRL SNDFILLLYV ASSGQAGTAF
     SRSGRQFLVE IHSIFTTTGG ERDHSGIIPK WEATESLWAQ HMLVESPLEH F
//
DBGET integrated database retrieval system