GenomeNet

Database: UniProt
Entry: W6UPR8_ECHGR
LinkDB: W6UPR8_ECHGR
Original site: W6UPR8_ECHGR 
ID   W6UPR8_ECHGR            Unreviewed;       940 AA.
AC   W6UPR8;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   07-NOV-2018, entry version 28.
DE   SubName: Full=DNA repair protein UVH3 {ECO:0000313|EMBL:EUB55399.1};
GN   ORFNames=EGR_09745 {ECO:0000313|EMBL:EUB55399.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Platyhelminthes; Cestoda; Eucestoda;
OC   Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB55399.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB55399.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y.,
RA   Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|SAAS:SAAS00537425}.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:EUB55399.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; APAU02000164; EUB55399.1; -; Genomic_DNA.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR   GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR   GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR   InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR   InterPro; IPR008918; HhH2.
DR   InterPro; IPR029060; PIN-like_dom_sf.
DR   InterPro; IPR006086; XPG-I_dom.
DR   InterPro; IPR006084; XPG/Rad2.
DR   InterPro; IPR001044; XPG/Rad2_eukaryotes.
DR   InterPro; IPR019974; XPG_CS.
DR   InterPro; IPR006085; XPG_DNA_repair_N.
DR   Pfam; PF00867; XPG_I; 1.
DR   Pfam; PF00752; XPG_N; 1.
DR   PRINTS; PR00853; XPGRADSUPER.
DR   PRINTS; PR00066; XRODRMPGMNTG.
DR   SMART; SM00279; HhH2; 1.
DR   SMART; SM00484; XPGI; 1.
DR   SMART; SM00485; XPGN; 1.
DR   SUPFAM; SSF47807; SSF47807; 1.
DR   SUPFAM; SSF88723; SSF88723; 1.
DR   PROSITE; PS00841; XPG_1; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Complete proteome {ECO:0000313|Proteomes:UP000019149};
KW   Nucleus {ECO:0000256|SAAS:SAAS00022948};
KW   Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT   DOMAIN        1     94       XPGN. {ECO:0000259|SMART:SM00485}.
FT   DOMAIN      541    616       XPGI. {ECO:0000259|SMART:SM00484}.
FT   COILED      278    298       {ECO:0000256|SAM:Coils}.
SQ   SEQUENCE   940 AA;  104334 MW;  7D4A9EFAFB3D70FB CRC64;
     MGVKGLWQIL EPSRRRVDLE YFRGKRMAID MNIWLHQALK ANVKGGRNSH LAILFRRICK
     LLFFGIRPIF VFDGAVPALK KATMAARRIS RSTAKAKSCQ ARDRLLKRLF RRLAESAAKS
     QTPSEELIAE FVRRFNSTEE IRKAELDVEM FGSQSSSIEA APPTAVLQLE EEQESASQLA
     WDFVDNSPSI DLQSDAFSAL PIHAQLRARA GESFSQTQVN RLIARRDLAL KKVEIESKMN
     EVLVASVTPR NLPSGLDMSV TAQRIASQDE GHAILMRKRS SKERADELKE RLDRLLKGSV
     TEPVCDDEED SKKALELENS PRDEYQIAED HKVEMVEKIM KTLEEHSSSE SEVCQAAHDN
     AQLEGGDEST DGPSDQSNES TVDIEADDVK NEGELKLKKE SDSEVSSTDL AEFTEVLDDT
     TTSKASSDLK HLASQSEGNE DFDHKGRITS EQSNIEVETA VIEITSEAPS SDSGEFADVS
     EPPTPIQPVE TLPKTQTFLE HEEEDDLAID DDILRAEAEK LECQAQEATT SCVAEAQRLV
     QLFGFPLINS PEEAEAQCCY LQQLGLVDIV ASDDSDVWVF GATLVCRHLF GRNKGKSGSG
     SSSLYCLKDI REQLGLDRRQ FVRIALLCGS DYTDGLDNIG PIKALEILST FATTSDPTFT
     SEEHEILLPL TEFRRRCQER GGGGRWTSTK FPADFPSEAV VKGYLSPRVE NASEYAGFHW
     DTPNLLPLVK YPFQARQKSE GTLAPVIGRF KATYEGAVAS TIPLITDFFP RIETKKKASS
     RLTQATNRLK YATEFQDLPT LDTDWSTDAD NDDEGAKPST DTSRRRKSNE DLPKGWKNGS
     GALASPSRGL KTGCRCRPRH CVVTHLLQLS TGSNVPLGPL TKEFYSLKTT LAPLLVVTPT
     RFRVSTMLVI HIRCLSYKAC WRDFRSLGMG SLTAANNVAS
//
DBGET integrated database retrieval system