ID A0A2R9AB08_PANPA Unreviewed; 312 AA.
AC A0A2R9AB08;
DT 20-JUN-2018, integrated into UniProtKB/TrEMBL.
DT 20-JUN-2018, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Abasic site processing protein HMCES {ECO:0000256|ARBA:ARBA00015888, ECO:0000256|RuleBase:RU364100};
DE Short=ES cell-specific 5hmC-binding protein {ECO:0000256|RuleBase:RU364100};
DE EC=3.4.-.- {ECO:0000256|RuleBase:RU364100};
DE AltName: Full=Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein {ECO:0000256|ARBA:ARBA00030390, ECO:0000256|RuleBase:RU364100};
DE AltName: Full=Peptidase HMCES {ECO:0000256|ARBA:ARBA00030898, ECO:0000256|RuleBase:RU364100};
DE AltName: Full=SRAP domain-containing protein 1 {ECO:0000256|ARBA:ARBA00031130, ECO:0000256|RuleBase:RU364100};
GN Name=HMCES {ECO:0000313|Ensembl:ENSPPAP00000014449.1};
OS Pan paniscus (Pygmy chimpanzee) (Bonobo).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9597 {ECO:0000313|Ensembl:ENSPPAP00000014449.1, ECO:0000313|Proteomes:UP000240080};
RN [1] {ECO:0000313|Ensembl:ENSPPAP00000014449.1, ECO:0000313|Proteomes:UP000240080}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22722832; DOI=10.1038/nature11128;
RA Prufer K., Munch K., Hellmann I., Akagi K., Miller J.R., Walenz B.,
RA Koren S., Sutton G., Kodira C., Winer R., Knight J.R., Mullikin J.C.,
RA Meader S.J., Ponting C.P., Lunter G., Higashino S., Hobolth A., Dutheil J.,
RA Karakoc E., Alkan C., Sajjadian S., Catacchio C.R., Ventura M.,
RA Marques-Bonet T., Eichler E.E., Andre C., Atencia R., Mugisha L.,
RA Junhold J., Patterson N., Siebauer M., Good J.M., Fischer A., Ptak S.E.,
RA Lachmann M., Symer D.E., Mailund T., Schierup M.H., Andres A.M., Kelso J.,
RA Paabo S.;
RT "The bonobo genome compared with the chimpanzee and human genomes.";
RL Nature 486:527-531(2012).
RN [2] {ECO:0000313|Ensembl:ENSPPAP00000014449.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Sensor of abasic sites in single-stranded DNA (ssDNA)
CC required to preserve genome integrity by promoting error-free repair of
CC abasic sites. Acts as an enzyme that recognizes and binds abasic sites
CC in ssDNA at replication forks and chemically modifies the lesion by
CC forming a covalent cross-link with DNA: forms a stable thiazolidine
CC linkage between a ring-opened abasic site and the alpha-amino and
CC sulfhydryl substituents of its N-terminal catalytic cysteine residue.
CC The HMCES DNA-protein cross-link is then degraded by the proteasome.
CC Promotes error-free repair of abasic sites by acting as a 'suicide'
CC enzyme that is degraded, thereby protecting abasic sites from
CC translesion synthesis (TLS) polymerases and endonucleases that are
CC error-prone and would generate mutations and double-strand breaks. Has
CC preference for ssDNA, but can also accommodate double-stranded DNA with
CC 3' or 5' overhang (dsDNA), and dsDNA-ssDNA 3' junction.
CC {ECO:0000256|RuleBase:RU364100}.
CC -!- SIMILARITY: Belongs to the SOS response-associated peptidase family.
CC {ECO:0000256|ARBA:ARBA00008136, ECO:0000256|RuleBase:RU364100}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJFE02103497; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103498; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103499; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103500; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103501; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103502; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103503; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AJFE02103504; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A2R9AB08; -.
DR Ensembl; ENSPPAT00000037133.1; ENSPPAP00000014449.1; ENSPPAG00000030370.1.
DR GeneTree; ENSGT00390000018439; -.
DR Proteomes; UP000240080; Chromosome 3.
DR Bgee; ENSPPAG00000030370; Expressed in prefrontal cortex and 6 other cell types or tissues.
DR GO; GO:0008233; F:peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR GO; GO:0006974; P:DNA damage response; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 3.90.1680.10; SOS response associated peptidase-like; 2.
DR InterPro; IPR003738; SRAP.
DR InterPro; IPR036590; SRAP-like.
DR PANTHER; PTHR13604:SF0; ABASIC SITE PROCESSING PROTEIN HMCES; 1.
DR PANTHER; PTHR13604; DC12-RELATED; 1.
DR Pfam; PF02586; SRAP; 2.
DR SUPFAM; SSF143081; BB1717-like; 1.
PE 3: Inferred from homology;
KW Covalent protein-DNA linkage {ECO:0000256|ARBA:ARBA00023124};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU364100};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU364100};
KW Reference proteome {ECO:0000313|Proteomes:UP000240080}.
FT REGION 250..312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 251..265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..312
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 312 AA; 35557 MW; F800EE02297D6745 CRC64;
MCGRTSCHLP RDVLTRACAY QDRRGQQRLP EWRDPDKYCP SYNKSPQSNS PVLLSRLHFE
KDADSSERII APMRWGLVPS WFKESDPSKL QFNTTNCRSD TIMEKRSFKS GSIGAADSPE
NWEKVWDNWR LLTMAGIFDC WEPPEGGDVL YSYTIITVDS CKGLSDIHHR MPAILDGEEA
VSKWLDFGEV STQEALKLIH PTENITFHAV SSVVNNSRNN TPECLAPVDL VVKKELKASG
SSQRMLQWLA TKSPKKEDSK TPQKEESDVP QWSSQFLQKS PLPTKRGTAG LLEQWLKREK
EEEPVAKRPY SQ
//