ID A0A137QHN3_9AGAR Unreviewed; 901 AA.
AC A0A137QHN3;
DT 11-MAY-2016, integrated into UniProtKB/TrEMBL.
DT 11-MAY-2016, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=XPG-I domain-containing protein {ECO:0000259|SMART:SM00484};
GN ORFNames=AN958_09763 {ECO:0000313|EMBL:KXN86679.1};
OS Leucoagaricus sp. SymC.cos.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Agaricomycetidae; Agaricales; Agaricineae; Agaricaceae; Leucoagaricus.
OX NCBI_TaxID=1714833 {ECO:0000313|EMBL:KXN86679.1, ECO:0000313|Proteomes:UP000070249};
RN [1] {ECO:0000313|EMBL:KXN86679.1, ECO:0000313|Proteomes:UP000070249}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SymC.cos {ECO:0000313|EMBL:KXN86679.1,
RC ECO:0000313|Proteomes:UP000070249};
RA Hu H.;
RT "Leucoagaricus sp. SymC.cos WGS genome.";
RL Submitted (JAN-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ962334; KXN86679.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A137QHN3; -.
DR STRING; 1714833.A0A137QHN3; -.
DR OrthoDB; 1779469at2759; -.
DR Proteomes; UP000070249; Unassembled WGS sequence.
DR GO; GO:0008821; F:crossover junction DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR CDD; cd09906; H3TH_YEN1; 1.
DR CDD; cd09870; PIN_YEN1; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR041177; GEN1_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR037316; Yen1_H3TH.
DR PANTHER; PTHR11081:SF71; ENDONUCLEASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_3G13260)-RELATED; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF18380; GEN1_C; 1.
DR Pfam; PF00867; XPG_I; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000070249}.
FT DOMAIN 133..199
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 395..415
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 478..515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 545..699
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 799..852
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..576
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 670..686
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 824..852
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 901 AA; 99788 MW; 122E09F7A08BEAF7 CRC64;
MGVAGLWDVR NSAISHQKNK LIFLLLPQVL KPAATTRSLT ELAVTEGFKA NPRGLRGYRI
GIDASIWFFH AEYGKEGENP VLRTLFFRCA TLMKAPFLPL FVFDGPKRPE FKRGKKINKA
GNKLIPGMKR IVEAFGFEWR MAPGEAEAEL AYLNRIGVID GILSDDVDNF LFGALTVIRN
QSNNLSGNKA NPVLNSEGRD DRNHTRVFRF QDLHDHPDIK LTRGGLILIG IMSGGDYEGG
LERCGIATAH ALARCGFGDT LFDAARNLTR EDLVAFLDNW RQELRHELKT NSRGLMARKA
LSLAKSLPDT FPNIDVLLSY VTPVTSESLG HEGSNPKITW SKEPNVAALA SACELFFEWG
YKEAIIKRFR TVIWPGTVLR ILRRSVLEVD ALSTTPTTPH KCHRKTVDDE EGYGTPSKMI
AQHFADISLN DSDDDNSTDE HRLIASISRE RAHASTDGLP EYRLEVAPKQ LVEIAVSGLQ
GLRQPEGPNE WASDEDVDDD DDEGAPSKKV KVKEPVDPMS HLRLWMPTCM VDPVERRLVR
EYRNKEERKK QKKRAPVKKK ASTKKAQEEE EPMVSPPPAR KKASTKKAPE KKLPDDSPPC
KTTSATSTST TRKVFQPRPQ AKANSKTLPK LLVETPPRSV PQPFPIDFVA RREEEEESSD
DDEGELPTVP ILPKPKPLPP PAPTTILEKP SSSTRPKSKI NQVAAIFEST TSSTTTSLPS
LQAIEDALAL SESDEELLMN LYQTTRPTKP IKVATSNIIP GYLSLSDDDI NFHKKSVSPT
KRKKLTLKKQ GTIQTMLSGF VQRVSRKPDP DNQDDRPGTP PVQKTPRKSK TQNSPRSTDS
ATKTTKEGSS IIIISDESDS GYSGLSRPLI APLQLARSRA KGASTSACSK PTSICDVIDL
T
//