ID C1DY40_MICCC Unreviewed; 827 AA.
AC C1DY40;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=XPG-I domain-containing protein {ECO:0000259|SMART:SM00484};
GN ORFNames=MICPUN_56071 {ECO:0000313|EMBL:ACO61359.1};
OS Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) (Picoplanktonic
OS green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=296587 {ECO:0000313|EMBL:ACO61359.1, ECO:0000313|Proteomes:UP000002009};
RN [1] {ECO:0000313|EMBL:ACO61359.1, ECO:0000313|Proteomes:UP000002009}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RCC299 / NOUM17 {ECO:0000313|Proteomes:UP000002009};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP001323; ACO61359.1; -; Genomic_DNA.
DR RefSeq; XP_002500101.1; XM_002500055.1.
DR AlphaFoldDB; C1DY40; -.
DR STRING; 296587.C1DY40; -.
DR GeneID; 8240817; -.
DR KEGG; mis:MICPUN_56071; -.
DR eggNOG; KOG2519; Eukaryota.
DR InParanoid; C1DY40; -.
DR OMA; DGVECEC; -.
DR OrthoDB; 318721at2759; -.
DR Proteomes; UP000002009; Chromosome 2.
DR GO; GO:0004518; F:nuclease activity; IEA:InterPro.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR PANTHER; PTHR11081:SF54; FI23547P1; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF00867; XPG_I; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00484; XPGI; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002009}.
FT DOMAIN 165..239
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 9..31
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 546..568
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 607..827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 614..633
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 658..676
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 687..706
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 716..732
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 812..827
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 827 AA; 89311 MW; 3BE1425C49E374A4 CRC64;
MGITNLWCAP AVPAPPEPDT ASRFDAPPRR GNAKKLLDEE GAIVRVDGSD EDDPDAQRII
RESVDGKALA VDLSLWIIQA CTQQALDEVY NEDLGFDDPD ASKTAKVVFE RALNYLRHGC
VPVGVIDGQA PWQKLGALRA RWGAHAGGGG GAFGRCGEVA LEVLRALGLP GVEAPGEAEA
TCAVMDSMGI VDGCVTSDGD SLLFGARTVF KTLRLSQNDQ RDLFMERVEA ADVGKRLMLG
DDVEHVAPAL TALALLTGGD YDLTGAKNVG GTKALLVVKA LAKDEARRTR AEPGGRDRRQ
RSLPARLDAF LASAPDPTIE ALDKCTGCAR CKHDGCLKNK VKNHVRGCSE CGTDAGCVER
DGVECECPFH ARADERWLHK VRERANATDG YARSFRDAAR GYAAQARDAE EALGDASDVE
WVERAENDGD EFDSFRKMRW RRRPDVAALQ EIMETYCQWE PRRTREKLLP VLVEWDMKAA
RRALRRVEDP STVDPGRRWE LLRRRGVEFV VERIAKVSGN NKAPPWRYLL EVNSADPAHW
AAWQRAATGD PAAGESAPKG GTAVLPPNFH ERGDDLAFLK GLAQRRSVRM SLVRETCPHL
VARFDAGGSK AKSAPSTPAK SKSKPTTPRK TPKTVRGTKT RSPDQHDITS FFSQRRPAAP
PSAPPSAPPS APLASTPAPR TPASKRKPPT EVGDDSAHDS DSDEILGAGL RPDLGAVRRL
EAKKKLSFVA TEEEAQDEGL ASPARPKSSA VNVVDLRTPP GSPARTPAPD SSRKGRPGGA
PASPAPRSPL FSPTKKARQA TLFECLSPKK AATPSQPETV DLCTPQK
//