GenomeNet

Database: UniProt
Entry: H31_CANAL
LinkDB: H31_CANAL
Original site: H31_CANAL 
ID   H31_CANAL               Reviewed;         136 AA.
AC   Q59VN2; A0A1D8PD88;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   23-JAN-2007, sequence version 3.
DT   27-MAR-2024, entry version 105.
DE   RecName: Full=Histone H3.1/H3.2;
GN   Name=HHT21; Synonyms=HHT1; OrderedLocusNames=CAALFM_C104260WA;
GN   ORFNames=CaO19.1061, CaO19.8663;
GN   and
GN   Name=HHT2; Synonyms=HHT22; OrderedLocusNames=CAALFM_CR06810WA;
GN   ORFNames=CaO19.1853, CaO19.9411;
OS   Candida albicans (strain SC5314 / ATCC MYA-2876) (Yeast).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida.
OX   NCBI_TaxID=237561;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=SC5314 / ATCC MYA-2876;
RX   PubMed=15123810; DOI=10.1073/pnas.0401648101;
RA   Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., Magee B.B.,
RA   Newport G., Thorstenson Y.R., Agabian N., Magee P.T., Davis R.W.,
RA   Scherer S.;
RT   "The diploid genome sequence of Candida albicans.";
RL   Proc. Natl. Acad. Sci. U.S.A. 101:7329-7334(2004).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=SC5314 / ATCC MYA-2876;
RX   PubMed=17419877; DOI=10.1186/gb-2007-8-4-r52;
RA   van het Hoog M., Rast T.J., Martchenko M., Grindle S., Dignard D.,
RA   Hogues H., Cuomo C., Berriman M., Scherer S., Magee B.B., Whiteway M.,
RA   Chibana H., Nantel A., Magee P.T.;
RT   "Assembly of the Candida albicans genome into sixteen supercontigs aligned
RT   on the eight chromosomes.";
RL   Genome Biol. 8:RESEARCH52.1-RESEARCH52.12(2007).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND GENOME REANNOTATION.
RC   STRAIN=SC5314 / ATCC MYA-2876;
RX   PubMed=24025428; DOI=10.1186/gb-2013-14-9-r97;
RA   Muzzey D., Schwartz K., Weissman J.S., Sherlock G.;
RT   "Assembly of a phased diploid Candida albicans genome facilitates allele-
RT   specific measurements and provides a simple model for repeat and indel
RT   structure.";
RL   Genome Biol. 14:RESEARCH97.1-RESEARCH97.14(2013).
CC   -!- FUNCTION: Core component of nucleosome. Nucleosomes wrap and compact
CC       DNA into chromatin, limiting DNA accessibility to the cellular
CC       machineries which require DNA as a template. Histones thereby play a
CC       central role in transcription regulation, DNA repair, DNA replication
CC       and chromosomal stability. DNA accessibility is regulated via a complex
CC       set of post-translational modifications of histones, also called
CC       histone code, and nucleosome remodeling (By similarity). {ECO:0000250}.
CC   -!- SUBUNIT: The nucleosome is a histone octamer containing two molecules
CC       each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and
CC       two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of DNA
CC       (By similarity). {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}. Chromosome {ECO:0000250}.
CC   -!- PTM: Phosphorylated by IPL1 to form H3S10ph. H3S10ph promotes
CC       subsequent H3K14ac formation and is required for transcriptional
CC       activation through TBP recruitment to the promoters (By similarity).
CC       {ECO:0000250}.
CC   -!- PTM: Mono-, di- and trimethylated by the COMPASS complex to form
CC       H3K4me1/2/3. H3K4me activates gene expression by regulating
CC       transcription elongation and plays a role in telomere length
CC       maintenance. H3K4me enrichment correlates with transcription levels,
CC       and occurs in a 5' to 3' gradient with H3K4me3 enrichment at the 5'-end
CC       of genes, shifting to H3K4me2 and then H3K4me1. Methylated by SET2 to
CC       form H3K36me. H3K36me represses gene expression. Methylated by DOT1 to
CC       form H3K79me. H3K79me is required for association of SIR proteins with
CC       telomeric regions and for telomeric silencing. The COMPASS-mediated
CC       formation of H3K4me2/3 and the DOT1-mediated formation of H3K79me
CC       require H2BK123ub1 (By similarity). {ECO:0000250}.
CC   -!- PTM: Acetylation of histone H3 leads to transcriptional activation.
CC       H3K14ac formation by GCN5 is promoted by H3S10ph. H3K14ac can also be
CC       formed by ESA1. H3K56ac formation occurs predominantly in newly
CC       synthesized H3 molecules during G1, S and G2/M of the cell cycle and
CC       may be involved in DNA repair (By similarity). {ECO:0000250}.
CC   -!- SIMILARITY: Belongs to the histone H3 family. {ECO:0000305}.
CC   -!- CAUTION: To ensure consistency between histone entries, we follow the
CC       'Brno' nomenclature for histone modifications, with positions referring
CC       to those used in the literature for the 'closest' model organism. Due
CC       to slight variations in histone sequences between organisms and to the
CC       presence of initiator methionine in UniProtKB/Swiss-Prot sequences, the
CC       actual positions of modified amino acids in the sequence generally
CC       differ. In this entry the following conventions are used: H3K4me1/2/3 =
CC       mono-, di- and trimethylated Lys-5; H3K9ac = acetylated Lys-10; H3K9me1
CC       = monomethylated Lys-10; H3S10ph = phosphorylated Ser-11; H3K14ac =
CC       acetylated Lys-15; H3K14me2 = dimethylated Lys-15; H3K18ac = acetylated
CC       Lys-19; H3K18me1 = monomethylated Lys-19; H3K23ac = acetylated Lys-24;
CC       H3K23me1 = monomethylated Lys-24; H3K27ac = acetylated Lys-28;
CC       H3K27me1/2/3 = mono-, di- and trimethylated Lys-28; H3K36ac =
CC       acetylated Lys-37; H3K36me1/2/3 = mono-, di- and trimethylated Lys-37;
CC       H3K56ac = acetylated Lys-57; H3K64ac = acetylated Lys-65; H3K79me1/2/3
CC       = mono-, di- and trimethylated Lys-80. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CP017623; AOW26100.1; -; Genomic_DNA.
DR   EMBL; CP017630; AOW31378.1; -; Genomic_DNA.
DR   RefSeq; XP_711082.1; XM_705990.1.
DR   RefSeq; XP_713665.1; XM_708572.1.
DR   AlphaFoldDB; Q59VN2; -.
DR   SMR; Q59VN2; -.
DR   STRING; 237561.Q59VN2; -.
DR   EnsemblFungi; C1_04260W_A-T; C1_04260W_A-T-p1; C1_04260W_A.
DR   EnsemblFungi; CR_06810W_A-T; CR_06810W_A-T-p1; CR_06810W_A.
DR   GeneID; 3644707; -.
DR   GeneID; 3647326; -.
DR   KEGG; cal:CAALFM_C104260WA; -.
DR   KEGG; cal:CAALFM_CR06810WA; -.
DR   CGD; CAL0000187299; HHT2.
DR   CGD; CAL0000180612; HHT21.
DR   VEuPathDB; FungiDB:C1_04260W_A; -.
DR   VEuPathDB; FungiDB:CR_06810W_A; -.
DR   eggNOG; KOG1745; Eukaryota.
DR   HOGENOM; CLU_078295_4_0_1; -.
DR   InParanoid; Q59VN2; -.
DR   OrthoDB; 5482964at2759; -.
DR   Proteomes; UP000000559; Chromosome 1.
DR   Proteomes; UP000000559; Chromosome R.
DR   GO; GO:0000786; C:nucleosome; IEA:UniProtKB-KW.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR   GO; GO:0030527; F:structural constituent of chromatin; IEA:InterPro.
DR   GO; GO:0009303; P:rRNA transcription; IBA:GO_Central.
DR   Gene3D; 1.10.20.10; Histone, subunit A; 1.
DR   InterPro; IPR009072; Histone-fold.
DR   InterPro; IPR007125; Histone_H2A/H2B/H3.
DR   InterPro; IPR000164; Histone_H3/CENP-A.
DR   PANTHER; PTHR11426; HISTONE H3; 1.
DR   PANTHER; PTHR11426:SF268; HISTONE H3; 1.
DR   Pfam; PF00125; Histone; 1.
DR   PRINTS; PR00622; HISTONEH3.
DR   SMART; SM00428; H3; 1.
DR   SUPFAM; SSF47113; Histone-fold; 1.
DR   PROSITE; PS00322; HISTONE_H3_1; 1.
DR   PROSITE; PS00959; HISTONE_H3_2; 1.
PE   3: Inferred from homology;
KW   Acetylation; Chromosome; DNA-binding; Methylation; Nucleosome core;
KW   Nucleus; Phosphoprotein; Reference proteome.
FT   INIT_MET        1
FT                   /note="Removed"
FT                   /evidence="ECO:0000250"
FT   CHAIN           2..136
FT                   /note="Histone H3.1/H3.2"
FT                   /id="PRO_0000270586"
FT   REGION          1..43
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOD_RES         5
FT                   /note="N6,N6,N6-trimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         5
FT                   /note="N6,N6-dimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         5
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         10
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         10
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         11
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         15
FT                   /note="N6,N6-dimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         15
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         19
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         19
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         24
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         24
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         28
FT                   /note="N6,N6,N6-trimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         28
FT                   /note="N6,N6-dimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         28
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         28
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         37
FT                   /note="N6,N6,N6-trimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         37
FT                   /note="N6,N6-dimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         37
FT                   /note="N6-acetyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         37
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         57
FT                   /note="N6-acetyllysine"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         65
FT                   /note="N6-acetyllysine"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         80
FT                   /note="N6,N6,N6-trimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         80
FT                   /note="N6,N6-dimethyllysine; alternate"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         80
FT                   /note="N6-methyllysine; alternate"
FT                   /evidence="ECO:0000250"
SQ   SEQUENCE   136 AA;  15360 MW;  B1B9A5859C0AC67A CRC64;
     MARTKQTARK STGGKAPRKQ LASKAARKSA PSTGGVKKPH RYKPGTVALR EIRRFQKSTE
     LLIRKLPFQR LVREIAQDFK TDLRFQSSAI GALQEAVEAY LVGLFEDTNL CAIHAKRVTI
     QKKDMQLARR LRGERS
//
DBGET integrated database retrieval system