ID A0A0V1BIQ6_TRISP Unreviewed; 1890 AA.
AC A0A0V1BIQ6;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=MutS-like protein 5 {ECO:0000313|EMBL:KRY36909.1};
GN Name=MSH5 {ECO:0000313|EMBL:KRY36909.1};
GN ORFNames=T01_11944 {ECO:0000313|EMBL:KRY36909.1};
OS Trichinella spiralis (Trichina worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6334 {ECO:0000313|EMBL:KRY36909.1, ECO:0000313|Proteomes:UP000054776};
RN [1] {ECO:0000313|EMBL:KRY36909.1, ECO:0000313|Proteomes:UP000054776}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS3 {ECO:0000313|EMBL:KRY36909.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRY36909.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDH01000038; KRY36909.1; -; Genomic_DNA.
DR Proteomes; UP000054776; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0007127; P:meiosis I; IEA:UniProt.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR CDD; cd15729; FYVE_endofin; 1.
DR CDD; cd07067; HP_PGM_like; 1.
DR Gene3D; 1.10.1420.10; -; 1.
DR Gene3D; 3.30.500.40; -; 1.
DR Gene3D; 3.30.1360.220; Domain of unknown function (DUF3480), N-terminal subdomain; 1.
DR Gene3D; 3.30.420.110; MutS, connector domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.40.50.1240; Phosphoglycerate mutase-like; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR013078; His_Pase_superF_clade-1.
DR InterPro; IPR029033; His_PPase_superfam.
DR InterPro; IPR036678; MutS_con_dom_sf.
DR InterPro; IPR045076; MutS_family.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR022557; SARA-like_C.
DR InterPro; IPR000306; Znf_FYVE.
DR InterPro; IPR017455; Znf_FYVE-rel.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR11361; DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER; 1.
DR PANTHER; PTHR11361:SF20; MUTS PROTEIN HOMOLOG 5; 1.
DR Pfam; PF01363; FYVE; 1.
DR Pfam; PF00300; His_Phos_1; 1.
DR Pfam; PF05192; MutS_III; 1.
DR Pfam; PF00488; MutS_V; 1.
DR Pfam; PF11979; SARA_C; 2.
DR SMART; SM01421; DUF3480; 1.
DR SMART; SM00064; FYVE; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF53254; Phosphoglycerate mutase-like; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR PROSITE; PS50178; ZF_FYVE; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000054776};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00091}.
FT DOMAIN 132..191
FT /note="FYVE-type"
FT /evidence="ECO:0000259|PROSITE:PS50178"
FT REGION 251..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 825..846
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..272
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1890 AA; 214969 MW; 1A6EA7F170E04493 CRC64;
MFFFAILKSS VQNDSLSSKG DGSEIDAEVE RVLTFIVLQT CTLFDEKDSA LIHQMEENAD
RLNSRSTSLL GFPDAGSSMP CSGKIFESSL RVELTSDEVS SVNLLDRGDH SDSNMAVFKE
RIGLVPPEWI PDEQWRICMS CSARFTLIKR RHHCRACGRV LCCDCCHLRV KLQYLENKKA
RVCQLCASLL DQYDSQQGSA EQISSDVLRA SCVLKKSAYP KAQSKNVTFE DGVCPGEVDS
RRVNMFADSL SSSHTVQPKN DKRDRRSREK ESLLPNTAAL PPSTFQKCST ELQMTYTAVC
RDEDLKVLLN EGKEVVFHIK RNLCVNVKLM QDSSLAKPFF WRFISNGFCN LGISEKMFFL
ERTNDEILPP RDMFLYYNQL YNKFLDGELT NDSDCSFVDV IPGCFPFGFL GNKENAAVLF
FRPNRTIINQ HKLPKTYYHI GLLVHRSEMI WAEILPLRLL LRFGFSDGVY PWSVVSSPMR
CSFFGETGHT VMSLLNDMRN FTYTIPMVTN SMITVDKKLV VITIAEDSYQ QDIDSGTYVT
KRFSKNRSSS PEVTGSAFII FNASLKSASL AVKNSIVEDG VMVQIHLDNL NDLRECLRNK
KDYFLKSNEE GGCSLEIRWD AHITASSNFS IASHIDSYNL DLKYRYLLPF RFVEAYRDGQ
LVRLTDVFTL PVDVNETLHS EPESFAASCR RIASATCKAL LQFAEDLLVE NSSVVSLRLF
MSADVIDYKF SVKSQSLLRM MMAALDDAIV SVLHLEAINS ISSWKAEFIF RFLSQIFHSF
TIEKRKLSFL FRTMMSSEED DLDPKKTLRV ALNNCNTTKG LAVRKSKRRR RRRSMKQPDM
SNRRIGPDAI PKLMMMRHGE RLDSCRFDIR RCFESGSYTP LQLNHPSFLP TRGNGQLISC
IEDWVEDTPL SNMGRAAAFL MGRAMAREDE ALDYVFASPA HRCVETADEV VRGYESVYDL
LPEYKLKVKI EDGLFEFAFG KLNRVPPFLS LTALKEQYCV DENYVPFFPR EKLSIDESYV
EFVRRTQAVV EHFARFSIAN KCSTLLVSHA PYMDALTSLY KGGEPRDPND WIHLVQNTPY
LYMRAVKWNG KKWQTTFFDM IILLFRRVVY KYTVFITSFT NIYIKMSAFS RNSTALEDFC
FDSSSFSDEI ESHSGQIVLC IYLSNQKLGA AYYDTDSSKI FTLNDVAESM NEFSLLIDVL
NQVRPTTILL SSKADERLCK LIDDFQNNKC QQNAEVEEAE NENDLIIVAA SEFELSRCKN
TVERFISSIG SGHASVEDKI RATLFVDMES VCMIRALGVL INYCEIESIS TDHFGRPVAF
CLRSFEVNDM LVMDDNAYES LQIFKKQFHP SVYKAGRDGF KEGFGLYSVC NRCCSSVGAA
KLRRWFLRPT RNLNKLKQRL DSIEILSCDS NFHFIQAVQK PLKQIKSVTA IFNRLRAAKV
NPSDWISLYK TINACIFVAE FLKNRKNLKV ILSEVDFKAM VEERQFTVKP GIDRLLDEKK
ELLKRLPELL TEVIKSEIDN LPFPLAACTC VYIPTIGYLI AVPRSRSCSK DGDYEKPGLE
FMFITNDRVH YKNETMRQLD AELGDVKLDI KDLESSIMIS LQNFVLKHAT VIQHAVECAA
TVDCLISMAL TAREYNWVRP ELVEENVIDI TGSRHPLQEL CTAPFVNNSV KSDETEFGKM
HVLTGPNACG KSVYLKQVGL LVYLAHIGSF VPASAARIGL VDRICTRLHS TGSINDGMST
FAADVKQVAM ATKYATKKSL IIIDEFGKGT LTDVGVALLG ACLTYWLERD KCPHVFVSTH
LHRLFNILPQ STLLRYNTMK VMRQGEDVLF LYSICEGQAT ESYAACAASK AGLPERVCNR
IRQVCQNQKR GYQLFRPEST AEEIKEEFKR
//