ID A0A2T7A1W0_TUBBO Unreviewed; 746 AA.
AC A0A2T7A1W0;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=XPG-I domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=B9Z19DRAFT_1062433 {ECO:0000313|EMBL:PUU81729.1};
OS Tuber borchii (White truffle).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Pezizomycetes;
OC Pezizales; Tuberaceae; Tuber.
OX NCBI_TaxID=42251 {ECO:0000313|EMBL:PUU81729.1, ECO:0000313|Proteomes:UP000244722};
RN [1] {ECO:0000313|EMBL:PUU81729.1, ECO:0000313|Proteomes:UP000244722}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tbo3840 {ECO:0000313|EMBL:PUU81729.1,
RC ECO:0000313|Proteomes:UP000244722};
RG DOE Joint Genome Institute;
RA Murat C., Kuo A., Barry K.W., Clum A., Dockter R.B., Fauchery L., Iotti M.,
RA Kohler A., Labutti K., Lindquist E.A., Lipzen A., Ohm R.A., Wang M.,
RA Grigoriev I.V., Zambonelli A., Martin F.M.;
RT "Draft genome sequence of Tuber borchii Vittad., a whitish edible
RT truffle.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PUU81729.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NESQ01000041; PUU81729.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2T7A1W0; -.
DR STRING; 42251.A0A2T7A1W0; -.
DR Proteomes; UP000244722; Unassembled WGS sequence.
DR GO; GO:0008821; F:crossover junction DNA endonuclease activity; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR CDD; cd09906; H3TH_YEN1; 1.
DR CDD; cd09870; PIN_YEN1; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR041177; GEN1_C.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR InterPro; IPR037316; Yen1_H3TH.
DR PANTHER; PTHR11081:SF71; ENDONUCLEASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_3G13260)-RELATED; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF18380; GEN1_C; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Reference proteome {ECO:0000313|Proteomes:UP000244722}.
FT DOMAIN 1..105
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 109..184
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 390..412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 438..515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 531..581
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 616..708
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 536..579
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..661
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 746 AA; 81927 MW; AD2363CB412B6B09 CRC64;
MGIVGLYQEL GPGPRVSLAK LSADHYISTG RRLRLAVDAS IWAFQVQAGK GGNNPALRTL
YYRLIRLLHL NITPLFIFDG PNRPVFKRNH KTNTVMTPTL TRSTKALLKL FGFPYHEAPG
EAEAECANLQ THGIVDAVLS EDVDTLMFGC EVTFRNWSGE GGKSKAPTHV SVYERESVEA
MSGLTPEGMV LIALMRGGDY SPEGVPRCGI KIAADAARAG FGELLCALDV QDKEGLAAWR
RRLQRELETN ESGYFKRKNK TIRIPEGFPD ITILGHYKKP VVSSKEKVER LRNSICWSGE
INFSGLRESA RWAFEWRGKI GAAKFIRCLA PAVMSWNLCT STEGVRELVS GFHGRREHFS
TGECRELRIS FVPGNVVKVD MDAEADDDVE VQPDDDEEFA QEGEDGSRKA RDYDPYTLER
MWVLEVFARQ GVPEMVEEYE NPKPKAKKTA AATTKKSRLA RSKDKSVEAV HTMPEYLTVS
KPGKQRPTPS KHPQEPRSSS SARRTQREVS VSLSSLTSHL SALQILEDDA FTHPAPSPTS
VPTSKGNPCR QPLTSNPSTQ NHHPLPSERE TSQEQENIWT APAWPCETIG LSPPRGRGHP
ALGAYGAAVS PQAVSPAVTA PAPEEERVTV SIPSSPEVET VRRRRVDQRR SSTDKKKNTS
RGKTPVCILD ISDSESEGEV PDRRASPPLS LSGGRNEKKA VASGRANKKA VLRTSVEGSW
KFADGEALAS GGGGWVDVEV MDLTGA
//