ID A0A182I7E2_ANOAR Unreviewed; 1445 AA.
AC A0A182I7E2;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1};
OS Anopheles arabiensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1, ECO:0000313|Proteomes:UP000075840};
RN [1] {ECO:0000313|Proteomes:UP000075840}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1}
RP IDENTIFICATION.
RC STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA009496-PA.1};
RG EnsemblMetazoa;
RL Submitted (AUG-2022) to UniProtKB.
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. XPG subfamily.
CC {ECO:0000256|ARBA:ARBA00005283}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APCN01003386; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EnsemblMetazoa; AARA009496-RA; AARA009496-PA; AARA009496.
DR VEuPathDB; VectorBase:AARA009496; -.
DR VEuPathDB; VectorBase:AARA21_007378; -.
DR OrthoDB; 26655at2759; -.
DR Proteomes; UP000075840; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd09904; H3TH_XPG; 1.
DR CDD; cd09868; PIN_XPG_RAD2; 2.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR003903; UIM_dom.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR001044; XPG/Rad2_eukaryotes.
DR InterPro; IPR019974; XPG_CS.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR16171:SF7; DNA EXCISION REPAIR PROTEIN ERCC-5; 1.
DR PANTHER; PTHR16171; DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR PRINTS; PR00066; XRODRMPGMNTG.
DR SMART; SM00279; HhH2; 1.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
DR PROSITE; PS50330; UIM; 1.
DR PROSITE; PS00841; XPG_1; 1.
DR PROSITE; PS00842; XPG_2; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242}.
FT DOMAIN 1..98
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 983..1052
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
SQ SEQUENCE 1445 AA; 160450 MW; E49134F036C432F6 CRC64;
MGVTGLWKLI EQSGKPVPLD TLENKVLAVD ISIWLHQVVK GFQDSKGSAL PNAHVLGLFH
RLCKLMYYRI KPIFVFDGGA PLLKKQTIAK RQQSKNNYQN EADRIQQLLL ETLAKEKVVQ
QALGSATNIL ISPSKKAITN GGPSTSKQPD REEEPDAMFK LPPLKAPEEP IDLDRSDSSM
DEKASRHYYH LNLNAIDVTS VYFKNLPADV RHEILNDIKE TRKQSSWGRL HELPVQSDSF
SSFQMKRLLK RRQVQVELEE AEKEMGGKCL SLTELESLLN EEGVETSSNR AAQKIASDEN
TRFLLVRDVQ KAIEKAKARE EAEKLAPAPP KVPKISKDEV YSQLQEEADD KEMDEDLQLA
IKMSLMQDET PHAVIELDED ELRMSRTQKR VLGNAAQSLA RGFMLEYGGL TTEEFNELLH
QTQDVDSGDI NESMSQMFVP SEEPSTVGRE RETLTEIREE QEMILESIKE SEKKTEAKPP
SEGAADSDAE TESDSDFVDV PEDNLNDSVT GISLPLNSTN HFKPHYNPIV DFTLDDLKQL
SQAGPSKKKE VVQVVIKPEE IGVCDQDDIF ADIFTVKKEA PETPVKEKGE KEVNHPDGPN
PPMISIPKKQ MNEIMQEQHE NNSSDSDKQP SVDGAVAPKA FTGLKIKKVE TINAQLKEEL
ENLKKAPPVI DLGNILGQPV LPVPDPVLPV TDLKSISETL KQQLEELKAS ANAIKLDEIN
LDAVVANSEE PKDRNDDEEN DSEATIIYDA DLDAEKTPTK QIVSQNDKKE EAKSPSTATE
DDLLKSKSPI TTIGDEESIT KSPSTSKEAE GANTASPVIE IIDSPAKAGT LEHLIIARTP
GKDSHKSTEP EESIPRVPKP FFVNKTPPSA KKANAEEDAE PTKQTTPSKP VAKELFPAEP
VPSTSKQAPP PPPAPEPQPV RAEDLITEMA DTLKEAHTPL ELKRMALNLA ETERELERER
NKQSRIGLSI TEQMRRDCME LLQIFGVPFI VAPMEAEAQC AFLNQLDMTD GTITDDSDIW
LFGGKKVYKN FFNQQKLVLE FTIDGIEQMF QMDRKKLIQL ALLVGSDYTT GIHGIGAVTA
LEILASFPPT PEQPGETSEM MSMLSGLRKF RDWWHHGRNG ATGTRISLKS KLKNIEIGEG
FPSTGVVEAY LQPTVDCSEE EFTWGYPDAD RLRDYARQKF GWSQTKTNDI LLPVLKRLDE
RKSQSSIKNY FKVQSAVGHN RLKVSKRVQH AVDTMAGKID PEEEAKPKKR SPAKAKQPGG
RKRKQPAAAK NAIETVDLEV IEEEEDEKKK EESPKATVAE EDDDFVEPAK TTQKAGRGRG
AAGRKKANDA AAPGTAAKPK RGRKAAPKSD TTNDTQQPEG GEPSTVQRPT HSLANIGGII
ANINQQSADS MGVEESVEGR RKRIGNKMPD FNPAIPQRVK DEQIMAERKR QAAELFKKLK
ANGKK
//