ID K2RSI6_MACPH Unreviewed; 691 AA.
AC K2RSI6;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE SubName: Full=DNA repair protein {ECO:0000313|EMBL:EKG15687.1};
GN ORFNames=MPH_07122 {ECO:0000313|EMBL:EKG15687.1};
OS Macrophomina phaseolina (strain MS6) (Charcoal rot fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetes incertae sedis; Botryosphaeriales; Botryosphaeriaceae;
OC Macrophomina.
OX NCBI_TaxID=1126212 {ECO:0000313|EMBL:EKG15687.1, ECO:0000313|Proteomes:UP000007129};
RN [1] {ECO:0000313|EMBL:EKG15687.1, ECO:0000313|Proteomes:UP000007129}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MS6 {ECO:0000313|EMBL:EKG15687.1,
RC ECO:0000313|Proteomes:UP000007129};
RX PubMed=22992219; DOI=10.1186/1471-2164-13-493;
RA Islam M.S., Haque M.S., Islam M.M., Emdad E.M., Halim A., Hossen Q.M.M.,
RA Hossain M.Z., Ahmed B., Rahim S., Rahman M.S., Alam M.M., Hou S., Wan X.,
RA Saito J.A., Alam M.;
RT "Tools to kill: Genome of one of the most destructive plant pathogenic
RT fungi Macrophomina phaseolina.";
RL BMC Genomics 13:493-493(2012).
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. GEN subfamily.
CC {ECO:0000256|ARBA:ARBA00038112}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKG15687.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHHD01000293; EKG15687.1; -; Genomic_DNA.
DR AlphaFoldDB; K2RSI6; -.
DR STRING; 1126212.K2RSI6; -.
DR VEuPathDB; FungiDB:MPH_07122; -.
DR eggNOG; KOG2519; Eukaryota.
DR HOGENOM; CLU_007575_0_0_1; -.
DR InParanoid; K2RSI6; -.
DR OrthoDB; 1779469at2759; -.
DR Proteomes; UP000007129; Unassembled WGS sequence.
DR GO; GO:0008821; F:crossover junction DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR CDD; cd09906; H3TH_YEN1; 1.
DR CDD; cd09870; PIN_YEN1; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR041177; GEN1_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR InterPro; IPR037316; Yen1_H3TH.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR PANTHER; PTHR11081:SF59; FLAP ENDONUCLEASE GEN HOMOLOG 1; 1.
DR Pfam; PF18380; GEN1_C; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000007129}.
FT DOMAIN 1..98
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 109..182
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 464..492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 691 AA; 76691 MW; F2FFA5DAC39869CF CRC64;
MGIHGIYKEI GPGERIALSK LAIEKFEDTG RPLRIAIDTS IWLFQIQSSK GGTNPALRTF
YYRLLRLISL SIHPLFVFDG PNKPPFKRNK RTGPNVASIP EFLAKQLLKQ FGFPFHIAPG
EAEAECALLQ REGIVDVVLS EDVDTLMFGS RITLRNWSPE QKSSKVPTHV NVYDAGKTKS
GPSGLDREGM ILVALMSGGD YLPEGIPGCG PKTACEAARA GFGHRLCAIK KKDTAALQAW
REDLARELRT NESKFFKRKH GTLSVPEDFP RADILGYYVS PAISSPEALE RLKRNLRWDQ
DLNFAGLRTF TADAFEWVKV TGAKKFIRNL APALLVRHLR MRGQAAAEGR LSEDPDVIEA
EEGKLVTSIH GTRQHPSTDN TPELRVAFSP IDIVNIDLDA EEPDDEPGED EDEEEEMALA
GNCDGEAPKK RGPYLYDPTH LDKIWIFETL VKIGAPLTVQ DWEEKQRRKT APKPAKVTKP
CAPKKTTGGM QKGAMDSFTR ITKANAGLAL AKRRSPSIDR VDLSRTSEGL ELELGEAPMA
SSSRATFKPS SRPDSHSVVM KAPVIIDILS SPEAPCQGSH LPAEKLDEHY RSPNITKRIR
TLRRSQTSPD LSNGWRSFYI DDAWIDRRIN ILPPANSERT EGCSNNKLFT RYQTQHTVQE
ATAAAEEEED RGFLAVEAAN DAPGVAWLVK E
//