ID G1WY35_ARTOA Unreviewed; 1248 AA.
AC G1WY35;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE RecName: Full=DNA repair protein rad2 {ECO:0008006|Google:ProtNLM};
GN ORFNames=AOL_s00004g195 {ECO:0000313|EMBL:EGX54162.1};
OS Arthrobotrys oligospora (strain ATCC 24927 / CBS 115.81 / DSM 1491)
OS (Nematode-trapping fungus) (Didymozoophaga oligospora).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Orbiliomycetes;
OC Orbiliales; Orbiliaceae; Orbilia; Orbilia oligospora.
OX NCBI_TaxID=756982 {ECO:0000313|EMBL:EGX54162.1, ECO:0000313|Proteomes:UP000008784};
RN [1] {ECO:0000313|EMBL:EGX54162.1, ECO:0000313|Proteomes:UP000008784}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 24927 / CBS 115.81 / DSM 1491
RC {ECO:0000313|Proteomes:UP000008784};
RX PubMed=21909256; DOI=10.1371/journal.ppat.1002179;
RA Yang J., Wang L., Ji X., Feng Y., Li X., Zou C., Xu J., Ren Y., Mi Q.,
RA Wu J., Liu S., Liu Y., Huang X., Wang H., Niu X., Li J., Liang L., Luo Y.,
RA Ji K., Zhou W., Yu Z., Li G., Liu Y., Li L., Qiao M., Feng L., Zhang K.-Q.;
RT "Genomic and proteomic analyses of the fungus Arthrobotrys oligospora
RT provide insights into nematode-trap formation.";
RL PLoS Pathog. 7:E1002179-E1002179(2011).
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. XPG subfamily.
CC {ECO:0000256|ARBA:ARBA00005283}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGX54162.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADOT01000005; EGX54162.1; -; Genomic_DNA.
DR RefSeq; XP_011117147.1; XM_011118845.1.
DR AlphaFoldDB; G1WY35; -.
DR STRING; 756982.G1WY35; -.
DR GeneID; 22888044; -.
DR eggNOG; KOG2520; Eukaryota.
DR HOGENOM; CLU_003018_0_0_1; -.
DR InParanoid; G1WY35; -.
DR OMA; PNSMDFS; -.
DR OrthoDB; 5479162at2759; -.
DR Proteomes; UP000008784; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd09904; H3TH_XPG; 1.
DR CDD; cd09868; PIN_XPG_RAD2; 2.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR003903; UIM_dom.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR001044; XPG/Rad2_eukaryotes.
DR InterPro; IPR019974; XPG_CS.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR16171:SF7; DNA EXCISION REPAIR PROTEIN ERCC-5; 1.
DR PANTHER; PTHR16171; DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR PRINTS; PR00066; XRODRMPGMNTG.
DR SMART; SM00279; HhH2; 1.
DR SMART; SM00726; UIM; 3.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
DR PROSITE; PS50330; UIM; 2.
DR PROSITE; PS00841; XPG_1; 1.
DR PROSITE; PS00842; XPG_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008784}.
FT DOMAIN 1..98
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 879..948
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 421..450
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 484..517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1132..1248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 112..139
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 835..862
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 424..438
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..509
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1146..1163
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1194..1223
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1248 AA; 141222 MW; 9F429A2FC071EFFE CRC64;
MGVTGLWTVV QPVARPVKLE TLAQKRLAID ASIWIYQFLK AVRDKEGNAL RNAHVVGFFR
RIVKLLFHGI RPVFVFDGGA PLLKRQTIAN RKSHRQDRKL DASNTAKKLL SIQMQKRAKE
EAEERRRALQ NQYHNVREEE PVPENPVYAS QVFEAPQERV KVKAPFRRLD QYHLPEMDGS
LEAMRGANDP RIMSMEELED YARRFEGGED INLYDFSKID FDSPFFTSLP PGDQYNILNA
ARIRSRLRMG LSKEQLEKMF PNRLEFSRFQ IDRVRERNEL TQRLMHLNGM NDDLISVRRV
AGEKDREYVL VKNEGAEGGW ALGVVGQRGP REGETVHKAI VVDEEDKVED EDDDSEDEIE
FEDIPIEGLN RLPKLRAPQI ELTRHRQTSE EREIQMAIRL SRQEQQRKAL KDSEATIFIQ
DDDLSESNEA ENELDSLFED LDDAKGTRDE DEELRKAIAL SLERDQREGE EEERALELAL
KMSLEQDDEN DSGATVQPSS NSKPRAAEQS LNPNKPKFVE FDDIVPPAKS NIFSDTAPYR
VSSDLHKSGL NKKHETITAK PANLENASIT SFKGVDFNPF ALEGDGFTLE NAMEYPTEPS
NTNMQSSKAN DFSFDNNILP FESMKLGGLA GLQKRLNDAK ANQVEERTEE QPKVPLPPWF
APEENSAFKA GKAGYQVDNY DREDETERIV STLPEHLRPI ENRASIQPKS KQVIVLSEDE
DNDIQMVDLT NSKPQTQNEL VALSNETNAS NRKAVEVVEA ISPEVEAQDS DEPVEWEASD
IETGTPLIRV PAENLAANEP ESEDDAGMDS EEEQLMHQMR AENEEHARFA STFNQRSTEQ
NAMDYERELR TLRDQQKKDR RDADEVTKIM ITECQQLLQM FGIPYITAPM EAEAQCAELV
NLGLVDGIVT DDSDIFLFGG TRVYKNMFNQ AKYVECYLAS DLENEYSLDR KKMIRLAHLL
GSDYTEGLVG VGPVTALEVL ANFGGGDDAL HDFKAWWTRI QSGFRDPADD KSKLKKSLKK
LEDKLFLPPS FPDDQVDMAY LNPEVDKDST PFEWGVPDLH GLRTFLMATI GWTSERTDEV
LVPVIRDMNR KVAEGHQVNL TNFFSGSTGA GAFAPRVREA NKSKRMENAL MSLHRQSVKR
QGSNLSLDGE GEEHKKIDES SSTDEELQGV VSLNGRVRIT KGSNKKAPAK KKRKTVTKDN
ESGSSESEDN PEVIEDLKQK GKKVKAAGQP RNSSGRGRGR GRGRAKKT
//