ID E3Q4E2_COLGM Unreviewed; 380 AA.
AC E3Q4E2;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 11-JAN-2011, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Mismatch-specific thymine-DNA glycosylate {ECO:0000313|EMBL:EFQ25454.1};
GN ORFNames=GLRG_00598 {ECO:0000313|EMBL:EFQ25454.1};
OS Colletotrichum graminicola (strain M1.001 / M2 / FGSC 10212) (Maize
OS anthracnose fungus) (Glomerella graminicola).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Glomerellales; Glomerellaceae; Colletotrichum;
OC Colletotrichum graminicola species complex.
OX NCBI_TaxID=645133 {ECO:0000313|Proteomes:UP000008782};
RN [1] {ECO:0000313|Proteomes:UP000008782}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=M1.001 / M2 / FGSC 10212 {ECO:0000313|Proteomes:UP000008782};
RX PubMed=22885923; DOI=10.1038/ng.2372;
RA O'Connell R.J., Thon M.R., Hacquard S., Amyotte S.G., Kleemann J.,
RA Torres M.F., Damm U., Buiate E.A., Epstein L., Alkan N., Altmueller J.,
RA Alvarado-Balderrama L., Bauser C.A., Becker C., Birren B.W., Chen Z.,
RA Choi J., Crouch J.A., Duvick J.P., Farman M.A., Gan P., Heiman D.,
RA Henrissat B., Howard R.J., Kabbage M., Koch C., Kracher B., Kubo Y.,
RA Law A.D., Lebrun M.-H., Lee Y.-H., Miyara I., Moore N., Neumann U.,
RA Nordstroem K., Panaccione D.G., Panstruga R., Place M., Proctor R.H.,
RA Prusky D., Rech G., Reinhardt R., Rollins J.A., Rounsley S., Schardl C.L.,
RA Schwartz D.C., Shenoy N., Shirasu K., Sikhakolli U.R., Stueber K.,
RA Sukno S.A., Sweigard J.A., Takano Y., Takahara H., Trail F.,
RA van der Does H.C., Voll L.M., Will I., Young S., Zeng Q., Zhang J.,
RA Zhou S., Dickman M.B., Schulze-Lefert P., Ver Loren van Themaat E.,
RA Ma L.-J., Vaillancourt L.J.;
RT "Lifestyle transitions in plant pathogenic Colletotrichum fungi deciphered
RT by genome and transcriptome analyses.";
RL Nat. Genet. 44:1060-1065(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG697332; EFQ25454.1; -; Genomic_DNA.
DR RefSeq; XP_008089474.1; XM_008091283.1.
DR AlphaFoldDB; E3Q4E2; -.
DR STRING; 645133.E3Q4E2; -.
DR EnsemblFungi; EFQ25454; EFQ25454; GLRG_00598.
DR GeneID; 24405963; -.
DR VEuPathDB; FungiDB:GLRG_00598; -.
DR eggNOG; KOG4120; Eukaryota.
DR HOGENOM; CLU_042829_1_2_1; -.
DR OrthoDB; 1378435at2759; -.
DR Proteomes; UP000008782; Unassembled WGS sequence.
DR GO; GO:0000700; F:mismatch base pair DNA N-glycosylase activity; IEA:InterPro.
DR GO; GO:0006285; P:base-excision repair, AP site formation; IEA:InterPro.
DR CDD; cd10028; UDG-F2_TDG_MUG; 1.
DR Gene3D; 3.40.470.10; Uracil-DNA glycosylase-like domain; 1.
DR InterPro; IPR015637; MUG/TDG.
DR InterPro; IPR005122; Uracil-DNA_glycosylase-like.
DR InterPro; IPR036895; Uracil-DNA_glycosylase-like_sf.
DR PANTHER; PTHR12159; G/T AND G/U MISMATCH-SPECIFIC DNA GLYCOSYLASE; 1.
DR PANTHER; PTHR12159:SF9; G_T MISMATCH-SPECIFIC THYMINE DNA GLYCOSYLASE; 1.
DR Pfam; PF03167; UDG; 1.
DR SUPFAM; SSF52141; Uracil-DNA glycosylase-like; 1.
PE 4: Predicted;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000008782}.
FT DOMAIN 180..364
FT /note="Uracil-DNA glycosylase-like"
FT /evidence="ECO:0000259|Pfam:PF03167"
FT REGION 28..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..109
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 110..125
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 380 AA; 40425 MW; 6C873EB45B4DA5F6 CRC64;
MESPFFDREP RPQAASFRGR LQLQDYMFSA ATTSAPPGGG AAGTPATAAS PGLPSPSPTP
LRRSPRKSTA LGPVPSPNRV SKHRNASGRN NSNSTAAATA IKSAASTPEP SQKTPPPPAS
GVLSPKPEPP DSDSIDLALV PANITPPAGN AKPKRRRKAA GYAPPSTYAH LPHLTDILAP
GLLILFVGLN PGLRTAALGH AYAHPSNLFW KLLHSSGITP RLCAPTEDRD LPRLYGLGNT
NIVARPSRNG AELSKAEMDE GVDVLEDKVR GCRPEVVCIV GKGIWESIWR VRRGRGIRKE
EFRYGWQDER EDMGVVVAAK GGDAAADGAS PTAWAGARVF VATSTSGLAA TLRLAEKERI
WRELGEWCER RRAERGAEAS
//