ID A0A178ZUG4_9EURO Unreviewed; 555 AA.
AC A0A178ZUG4;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 05-FEB-2025, entry version 27.
DE RecName: Full=XPG-I domain-containing protein {ECO:0000259|SMART:SM00484};
GN ORFNames=AYL99_02353 {ECO:0000313|EMBL:OAP63126.1};
OS Fonsecaea erecta.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; Fonsecaea.
OX NCBI_TaxID=1367422 {ECO:0000313|EMBL:OAP63126.1, ECO:0000313|Proteomes:UP000078343};
RN [1] {ECO:0000313|EMBL:OAP63126.1, ECO:0000313|Proteomes:UP000078343}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 125763 {ECO:0000313|EMBL:OAP63126.1,
RC ECO:0000313|Proteomes:UP000078343};
RA Weiss V.A., Vicente V.A., Raittz R.T., Moreno L.F., De Souza E.M.,
RA Pedrosa F.O., Steffens M.B., Faoro H., Tadra-Sfeir M.Z., Najafzadeh M.J.,
RA Felipe M.S., Teixeira M., Sun J., Xi L., Gomes R., De Azevedo C.M.,
RA Salgado C.G., Da Silva M.B., Nascimento M.F., Queiroz-Telles F.,
RA Attili D.S., Gorbushina A.;
RT "Draft genome of Fonsecaea erecta CBS 125763.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OAP63126.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LVYI01000002; OAP63126.1; -; Genomic_DNA.
DR RefSeq; XP_018696493.1; XM_018833869.1.
DR AlphaFoldDB; A0A178ZUG4; -.
DR STRING; 1367422.A0A178ZUG4; -.
DR GeneID; 30006523; -.
DR OrthoDB; 2959108at2759; -.
DR Proteomes; UP000078343; Unassembled WGS sequence.
DR GO; GO:0017108; F:5'-flap endonuclease activity; IEA:TreeGrafter.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-ARBA.
DR FunFam; 3.40.50.1010:FF:000037; Rad2-like endonuclease, putative (AFU_orthologue AFUA_3G13260); 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR041177; GEN1_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR PANTHER; PTHR11081:SF75; ENDONUCLEASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_3G13260)-RELATED; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF18380; GEN1_C; 1.
DR Pfam; PF00867; XPG_I; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000078343}.
FT DOMAIN 7..82
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 302..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 370..391
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 424..518
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 531..555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 373..388
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..467
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..545
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 555 AA; 61655 MW; 62ED1CF748BF3BB7 CRC64;
MSKGLISLFQ FPTHVAPGEA EAECAMLQRE GVVDAVMTQD VDALMFGSGL TLRDWSKEAN
GKKGNKTPTH VSVLDLPRVK NMCGLDPEGM ILVALLSGGD YDEDGVAGIG CKLACEIARA
GFGSDLVELV RNGDEDGITE WRERLQFELE TNESGYFKMK RKSVRIPDSF PNRKILGYYM
NPAVTPVSEM ERLERKWAKT WDEEIDVQAL RDYAADMFEW LYKPGAWKFV RVMAPALLAD
GLRRDRATSH LTSVDQITEQ RQHFVSDGIP ELRVTVVPAK VVGLNLDAEE DSPTYLQLLA
EEENEGAEGE TEHAEIDENA PLSPSKKRKS PAWLPDAPEK MWIAQTIVEI GAREHVEQWK
RIQFENANDP KKLATRKCPK KKESKKPKVI GGMQPGALLS YVVPADQSTE LEVSIAVPIK
SPRRRAGANI SAPRRSKPKS SKLEEGYPPT MLGFFKSTKG SQPSHSTMPA RDAASYDLVK
PGAGNDDDPL TLSNESRGDK HPRPSTSSPP ETGQRIGLHW IKAENFGNWD TGREAIRQPN
ERNDANHLFA KPGRF
//