GenomeNet

Database: UniProt
Entry: A0A182I7E2_ANOAR
ID   A0A182I7E2_ANOAR        Unreviewed;      1445 AA.
AC   A0A182I7E2;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   24-JAN-2024, entry version 40.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1};
OS   Anopheles arabiensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1, ECO:0000313|Proteomes:UP000075840};
RN   [1] {ECO:0000313|Proteomes:UP000075840}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1}
RP   IDENTIFICATION.
RC   STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1};
RG   EnsemblMetazoa;
RL   Submitted (AUG-2022) to UniProtKB.
CC   -!- COFACTOR:
CC       Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC         Evidence={ECO:0000256|ARBA:ARBA00001946};
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. XPG subfamily.
CC       {ECO:0000256|ARBA:ARBA00005283}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APCN01003386; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EnsemblMetazoa; AARA009496-RA; AARA009496-PA; AARA009496.
DR   VEuPathDB; VectorBase:AARA009496; -.
DR   VEuPathDB; VectorBase:AARA21_007378; -.
DR   OrthoDB; 26655at2759; -.
DR   Proteomes; UP000075840; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR   GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR   CDD; cd09904; H3TH_XPG; 1.
DR   CDD; cd09868; PIN_XPG_RAD2; 2.
DR   Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR   Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR   InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR   InterPro; IPR008918; HhH2.
DR   InterPro; IPR029060; PIN-like_dom_sf.
DR   InterPro; IPR003903; UIM_dom.
DR   InterPro; IPR006086; XPG-I_dom.
DR   InterPro; IPR006084; XPG/Rad2.
DR   InterPro; IPR001044; XPG/Rad2_eukaryotes.
DR   InterPro; IPR019974; XPG_CS.
DR   InterPro; IPR006085; XPG_DNA_repair_N.
DR   PANTHER; PTHR16171:SF7; DNA EXCISION REPAIR PROTEIN ERCC-5; 1.
DR   PANTHER; PTHR16171; DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED; 1.
DR   Pfam; PF00867; XPG_I; 1.
DR   Pfam; PF00752; XPG_N; 1.
DR   PRINTS; PR00853; XPGRADSUPER.
DR   PRINTS; PR00066; XRODRMPGMNTG.
DR   SMART; SM00279; HhH2; 1.
DR   SMART; SM00484; XPGI; 1.
DR   SMART; SM00485; XPGN; 1.
DR   SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR   SUPFAM; SSF88723; PIN domain-like; 1.
DR   PROSITE; PS50330; UIM; 1.
DR   PROSITE; PS00841; XPG_1; 1.
DR   PROSITE; PS00842; XPG_2; 1.
PE   3: Inferred from homology;
KW   DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW   DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW   Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242}.
FT   DOMAIN          1..98
FT                   /note="XPG N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00485"
FT   DOMAIN          983..1052
FT                   /note="XPG-I"
FT                   /evidence="ECO:0000259|SMART:SM00484"
SQ   SEQUENCE   1445 AA;  160450 MW;  E49134F036C432F6 CRC64;
     MGVTGLWKLI EQSGKPVPLD TLENKVLAVD ISIWLHQVVK GFQDSKGSAL PNAHVLGLFH
     RLCKLMYYRI KPIFVFDGGA PLLKKQTIAK RQQSKNNYQN EADRIQQLLL ETLAKEKVVQ
     QALGSATNIL ISPSKKAITN GGPSTSKQPD REEEPDAMFK LPPLKAPEEP IDLDRSDSSM
     DEKASRHYYH LNLNAIDVTS VYFKNLPADV RHEILNDIKE TRKQSSWGRL HELPVQSDSF
     SSFQMKRLLK RRQVQVELEE AEKEMGGKCL SLTELESLLN EEGVETSSNR AAQKIASDEN
     TRFLLVRDVQ KAIEKAKARE EAEKLAPAPP KVPKISKDEV YSQLQEEADD KEMDEDLQLA
     IKMSLMQDET PHAVIELDED ELRMSRTQKR VLGNAAQSLA RGFMLEYGGL TTEEFNELLH
     QTQDVDSGDI NESMSQMFVP SEEPSTVGRE RETLTEIREE QEMILESIKE SEKKTEAKPP
     SEGAADSDAE TESDSDFVDV PEDNLNDSVT GISLPLNSTN HFKPHYNPIV DFTLDDLKQL
     SQAGPSKKKE VVQVVIKPEE IGVCDQDDIF ADIFTVKKEA PETPVKEKGE KEVNHPDGPN
     PPMISIPKKQ MNEIMQEQHE NNSSDSDKQP SVDGAVAPKA FTGLKIKKVE TINAQLKEEL
     ENLKKAPPVI DLGNILGQPV LPVPDPVLPV TDLKSISETL KQQLEELKAS ANAIKLDEIN
     LDAVVANSEE PKDRNDDEEN DSEATIIYDA DLDAEKTPTK QIVSQNDKKE EAKSPSTATE
     DDLLKSKSPI TTIGDEESIT KSPSTSKEAE GANTASPVIE IIDSPAKAGT LEHLIIARTP
     GKDSHKSTEP EESIPRVPKP FFVNKTPPSA KKANAEEDAE PTKQTTPSKP VAKELFPAEP
     VPSTSKQAPP PPPAPEPQPV RAEDLITEMA DTLKEAHTPL ELKRMALNLA ETERELERER
     NKQSRIGLSI TEQMRRDCME LLQIFGVPFI VAPMEAEAQC AFLNQLDMTD GTITDDSDIW
     LFGGKKVYKN FFNQQKLVLE FTIDGIEQMF QMDRKKLIQL ALLVGSDYTT GIHGIGAVTA
     LEILASFPPT PEQPGETSEM MSMLSGLRKF RDWWHHGRNG ATGTRISLKS KLKNIEIGEG
     FPSTGVVEAY LQPTVDCSEE EFTWGYPDAD RLRDYARQKF GWSQTKTNDI LLPVLKRLDE
     RKSQSSIKNY FKVQSAVGHN RLKVSKRVQH AVDTMAGKID PEEEAKPKKR SPAKAKQPGG
     RKRKQPAAAK NAIETVDLEV IEEEEDEKKK EESPKATVAE EDDDFVEPAK TTQKAGRGRG
     AAGRKKANDA AAPGTAAKPK RGRKAAPKSD TTNDTQQPEG GEPSTVQRPT HSLANIGGII
     ANINQQSADS MGVEESVEGR RKRIGNKMPD FNPAIPQRVK DEQIMAERKR QAAELFKKLK
     ANGKK
//