ID A0A1Y1X4B6_9FUNG Unreviewed; 1468 AA.
AC A0A1Y1X4B6;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 08-NOV-2023, entry version 28.
DE RecName: Full=DNA mismatch repair protein S5 domain-containing protein {ECO:0000259|SMART:SM01340};
GN ORFNames=BCR32DRAFT_268985 {ECO:0000313|EMBL:ORX80206.1};
OS Anaeromyces robustus.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Anaeromyces.
OX NCBI_TaxID=1754192 {ECO:0000313|EMBL:ORX80206.1, ECO:0000313|Proteomes:UP000193944};
RN [1] {ECO:0000313|EMBL:ORX80206.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX80206.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ORX80206.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX80206.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the DNA mismatch repair MutL/HexB family.
CC {ECO:0000256|ARBA:ARBA00006082}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORX80206.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFG01000152; ORX80206.1; -; Genomic_DNA.
DR STRING; 1754192.A0A1Y1X4B6; -.
DR OrthoDB; 9570at2759; -.
DR Proteomes; UP000193944; Unassembled WGS sequence.
DR GO; GO:0032300; C:mismatch repair complex; IEA:InterPro.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0140664; F:ATP-dependent DNA damage sensor activity; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR Gene3D; 3.30.230.10; -; 1.
DR Gene3D; 3.30.565.10; Histidine kinase-like ATPase, C-terminal domain; 1.
DR Gene3D; 3.30.1540.20; MutL, C-terminal domain, dimerisation subdomain; 2.
DR InterPro; IPR014762; DNA_mismatch_repair_CS.
DR InterPro; IPR013507; DNA_mismatch_S5_2-like.
DR InterPro; IPR036890; HATPase_C_sf.
DR InterPro; IPR002099; MutL/Mlh/PMS.
DR InterPro; IPR038973; MutL/Mlh/Pms-like.
DR InterPro; IPR042120; MutL_C_dimsub.
DR InterPro; IPR037198; MutL_C_sf.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR014721; Ribsml_uS5_D2-typ_fold_subgr.
DR NCBIfam; TIGR00585; mutl; 1.
DR PANTHER; PTHR10073; DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL; 1.
DR PANTHER; PTHR10073:SF47; DNA MISMATCH REPAIR PROTEIN MLH3; 1.
DR SMART; SM01340; DNA_mis_repair; 1.
DR SUPFAM; SSF55874; ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase; 1.
DR SUPFAM; SSF118116; DNA mismatch repair protein MutL; 2.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 1.
DR PROSITE; PS00058; DNA_MISMATCH_REPAIR_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW Reference proteome {ECO:0000313|Proteomes:UP000193944}.
FT DOMAIN 224..379
FT /note="DNA mismatch repair protein S5"
FT /evidence="ECO:0000259|SMART:SM01340"
FT REGION 491..532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 557..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 617..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 666..723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 760..906
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 944..975
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1264..1290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1312..1347
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 491..506
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 509..526
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 566..591
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 617..635
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 667..695
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 696..723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..823
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 824..839
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 852..891
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 944..974
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1312..1330
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1468 AA; 172004 MW; 6A609ECAA241B79B CRC64;
MVLRKLDFNT RNLLRSTYVI TSIFQCMIEL VENKIYLNYI YEKGLDANAT KIEIEIDFNR
YDIHIIDNGT GISFENLKII GNRYVTSKCH TIEDLKNIKT YGFRGEALAS IFQVADIEIN
TKIENSEKVY SLYVNESKVK YCDISDNYIK YKSGTSIYVQ NLFSKIPVRQ KLLKKRRYIY
DGIKKNLEIL ALIEPKVEFS VINKKTGKMI LHTKPNNISS LDVFKDLHND LLCKDLVSID
LNSKNITEYK YNKYEYSISG FIGFSGNPKK YQYIYLNKRY LESNEIYKLI NKIFNENQNI
QKYNEHFQKN YNDLFGPYMP YFILNTDKKL RKYPIFFLNI SCSTNIYDIC MDPSKRVVEF
EDWESILNFV KLTINNALEQ YKVIIQEQEK QKRKEEHNNK LSSITQSKFF NSQLPTNNEN
NFGIISSGSN SSVNKYDYEF NNNSNNDNNK IGYNSLYFNK KGTSSMDPQD NNNNDFDDIF
SSSNTSISDV FNDINVTDNN NSNNNDDNDD NIEKEEEEEE EDNNDDNNKD FDLKEKNENN
IEYDKISNTP LSLSTSFSFS ENSISSSDDE NDKNDKNDKN DKNDKNDKND QIVQNDLSFM
ELNSFSFNLI SDIEDTNTYS NNKTISNNTN NDINDDNNGK NDKSNENLSF KVSQDLFNDI
DEDNNIINSD EQEKDNNNDN NDINKETKSN NKKYNKNKIN VEINNDNNNN NNNNNNDINN
NYNNNQKEIK EVNEIEEIEE IEIEEKEELE EKENNISENS YINNSTKEGG KEEVMIEEIE
EEEEEEEEEE EKEKEEEMID EIEEEEEEEE EKEKEEEMIE DIENDISEKY YINNRIEEDE
KEEETIEEIE EEKEEKEKYY MNHNNENSID IENEEKGESK SECNGELEMI ESRDESENDS
ENDSDNDFIF QKQIDFIKNK FTYESKLKSN SKLPIIRKAN DIKTRKRKQE EDDDVDIINN
SLSDHDDKNN GNDGDDVDII NNSLSNHDDK NNDDKYYRFN YQSIYNKFLN HKIIFDKEHL
KHLQVIGQAD QKFIVCKLSK YENKEQSDDQ HAIDERIQLE HLLDQYQHGN SQGNGPEITQ
LTPPIRIILP NHEIERIKIF ISEFKCIGIY FVEEDFKQSS IKSIRKKLTI EEEKIIENYG
SSYFDKDHQL KTSLYNLFNK NSILNNKKGN KNNEEYNNKY NEEYNDEVEQ DEDNIGEINE
INNNFNSISF QKIDSNSLLS PLSIIRIIKL PRLIVERCLT NVEKLTSIVR NCLYELENSS
TTKYKNNTNL FSDDSYSSSS SSSPFPSPSS SPSHFLSNTS LYNHFSTSQS ILSSNNNNNN
NNNNNNNNNN KDKNKNKDKD KNNQLSFNTT TTTTTIITNN GNNTNIIQSF HFSDRKVKFI
PSSIYSILCS LACHKAIKFD DILTLKQCRE IVEQMPTLKF PFQCAHGRPT MIPLINLTYL
KHISRMNNKD KDSFNHISNL LKKFNKNK
//