ID A0A445GP18_GLYSO Unreviewed; 149 AA.
AC A0A445GP18;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 22-FEB-2023, entry version 12.
DE RecName: Full=Histone H2A {ECO:0000256|RuleBase:RU003767};
GN ORFNames=D0Y65_039912 {ECO:0000313|EMBL:RZB62968.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB62968.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB62968.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB62968.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBUNIT: The nucleosome is a histone octamer containing two molecules
CC each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and
CC two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of
CC DNA. {ECO:0000256|RuleBase:RU003767}.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|RuleBase:RU003767}.
CC -!- SIMILARITY: Belongs to the histone H2A family.
CC {ECO:0000256|ARBA:ARBA00010691, ECO:0000256|RuleBase:RU003767}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB62968.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000015; RZB62968.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445GP18; -.
DR Proteomes; UP000289340; Chromosome 15.
DR GO; GO:0000786; C:nucleosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR GO; GO:0030527; F:structural constituent of chromatin; IEA:InterPro.
DR CDD; cd00074; H2A; 1.
DR Gene3D; 1.10.20.10; Histone, subunit A; 1.
DR InterPro; IPR009072; Histone-fold.
DR InterPro; IPR002119; Histone_H2A.
DR InterPro; IPR007125; Histone_H2A/H2B/H3.
DR InterPro; IPR032454; Histone_H2A_C.
DR InterPro; IPR032458; Histone_H2A_CS.
DR PANTHER; PTHR23430; HISTONE H2A; 1.
DR PANTHER; PTHR23430:SF377; HISTONE H2A; 1.
DR Pfam; PF00125; Histone; 1.
DR Pfam; PF16211; Histone_H2A_C; 1.
DR PRINTS; PR00620; HISTONEH2A.
DR SMART; SM00414; H2A; 1.
DR SUPFAM; SSF47113; Histone-fold; 1.
DR PROSITE; PS00046; HISTONE_H2A; 1.
PE 3: Inferred from homology;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454, ECO:0000256|RuleBase:RU003767};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU003767};
KW Nucleosome core {ECO:0000256|ARBA:ARBA00023269,
KW ECO:0000256|RuleBase:RU003767};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU003767};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340}.
FT DOMAIN 20..97
FT /note="Histone H2A/H2B/H3"
FT /evidence="ECO:0000259|Pfam:PF00125"
FT DOMAIN 100..134
FT /note="Histone H2A C-terminal"
FT /evidence="ECO:0000259|Pfam:PF16211"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 127..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 128..142
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 149 AA; 15806 MW; 70087E5D6F1539D7 CRC64;
MDADGKIKKG AGGRKGGGPK KKPVSRSVKA GLQFPVGRIG RYLKKGRYAQ RVGTGAPVYL
AAVLEYLAAE VLELAGNAAR DNKKNRIIPR HVLLAVRNDE ELGKLLAGVT IAHGGVLPNI
NPVLLPKKTE RASKEPKSPS KATKSPKKA
//