ID A0A1V6SQY4_9EURO Unreviewed; 874 AA.
AC A0A1V6SQY4;
DT 07-JUN-2017, integrated into UniProtKB/TrEMBL.
DT 07-JUN-2017, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=XPG-I domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=PENFLA_c029G00655 {ECO:0000313|EMBL:OQE16079.1};
OS Penicillium flavigenum.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium.
OX NCBI_TaxID=254877 {ECO:0000313|EMBL:OQE16079.1, ECO:0000313|Proteomes:UP000191342};
RN [1] {ECO:0000313|Proteomes:UP000191342}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IBT 14082 {ECO:0000313|Proteomes:UP000191342};
RX PubMed=28368369; DOI=10.1038/nmicrobiol.2017.44;
RA Nielsen J.C., Grijseels S., Prigent S., Ji B., Dainat J., Nielsen K.F.,
RA Frisvad J.C., Workman M., Nielsen J.;
RT "Global analysis of biosynthetic gene clusters reveals vast potential of
RT secondary metabolite production in Penicillium species.";
RL Nat. Microbiol. 2:17044-17044(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OQE16079.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MLQL01000029; OQE16079.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1V6SQY4; -.
DR STRING; 254877.A0A1V6SQY4; -.
DR OrthoDB; 1779469at2759; -.
DR Proteomes; UP000191342; Unassembled WGS sequence.
DR GO; GO:0008821; F:crossover junction DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR CDD; cd09906; H3TH_YEN1; 1.
DR CDD; cd09870; PIN_YEN1; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR041177; GEN1_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR InterPro; IPR037316; Yen1_H3TH.
DR PANTHER; PTHR11081:SF71; ENDONUCLEASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_3G13260)-RELATED; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF18380; GEN1_C; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Reference proteome {ECO:0000313|Proteomes:UP000191342}.
FT DOMAIN 1..96
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 111..189
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 405..444
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 484..510
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 682..816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 833..865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 698..718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 723..745
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 787..816
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 840..855
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 874 AA; 96305 MW; 2C8038BAC42C3CD9 CRC64;
MGIPGLINAI GPGERISLAK LAVTHLERTA RPIRIAVDIS IWLFQVQAGR GGRNPELRTL
FYRLLKLLAL PVHPLFVYDG RQKPAFKRGK AVSARSYGNA PIIKRSKDLI ERFRFPWHEA
PGEAEAECAR LQQAGIVDAV MSNDVDALMF GSSLTIMNFS KESGSGSSSA THVTCYAMGQ
DGHPSNIPLD RPGMILFAML SGGDYLPSGV PKCGSKLAAE IAKAGFGEDL LQELASQADV
DTGLNEWRER LQYELEENES GYFTRKHPAV RIPHVFPDRQ ILEYYARPKV SSDEEMSSLK
NRLAQAWDND IDPLAIRTFA ADHFEWNYRS GARKLIRLLA EPLVSYRLRL QRPVLGVSPG
FSFVPDCPPG MQKVYRSRSS FGTDAMPELQ LDMLPADIVG LDLLAEEPNP PVPSQTSVQE
GEEEDDPEVI AGSPPPTPSK SRVTKRFDPF SLEKVWVFET VARIGVPGVV KSWEKEQAEK
AAKAAEAAAK KKTSTRRTGP KKKGPIDAGM KRGSILKYGT LTKEKSELST TKQTHLLEAA
TSTQNPLCQL VAGSGSSSPV VVDQENDPFS SPSMYIQQRP SLTRHYVSRE VDNLLDTFSS
MCSLAPSANI KRHPMSNQSR IRSRCTALGT GGVDIEESDA LSMNTDHSPS RLARRMGLKI
RYSVSDVCES DGFLEDFLVG TPMTMPSPSR KTLSKSKKIP KVQREEKAEV EHIEKAIESL
SLSSKIDEGQ HENTATHSLR QPSTKKAPQP RKVRTRSSKV DGVEPPSPYS QAAPKPKLEP
GHTLPLNLKT CENNETNVNR SPKKSSKEIT STISKHKTTG HLENIILHDG FWSIDQSPPE
QDPCVGSTAD SLSNSRRQAA KKKRIPRVSI LDLV
//