ID G3QJ94_GORGO Unreviewed; 638 AA.
AC G3QJ94;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=SUMO-activating enzyme subunit 2 {ECO:0000256|PIRNR:PIRNR039133};
DE EC=2.3.2.- {ECO:0000256|PIRNR:PIRNR039133};
GN Name=UBA2 {ECO:0000313|Ensembl:ENSGGOP00000002481.3};
OS Gorilla gorilla gorilla (Western lowland gorilla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Gorilla.
OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000002481.3, ECO:0000313|Proteomes:UP000001519};
RN [1] {ECO:0000313|Ensembl:ENSGGOP00000002481.3, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Scally A.;
RT "Insights into the evolution of the great apes provided by the gorilla
RT genome.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGGOP00000002481.3, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22398555; DOI=10.1038/nature10842;
RA Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT "Insights into hominid evolution from the gorilla genome sequence.";
RL Nature 483:169-175(2012).
RN [3] {ECO:0000313|Ensembl:ENSGGOP00000002481.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: The heterodimer acts as an E1 ligase for SUMO1, SUMO2, SUMO3,
CC and probably SUMO4. It mediates ATP-dependent activation of SUMO
CC proteins followed by formation of a thioester bond between a SUMO
CC protein and a conserved active site cysteine residue on UBA2/SAE2.
CC {ECO:0000256|PIRNR:PIRNR039133}.
CC -!- PATHWAY: Protein modification; protein sumoylation.
CC {ECO:0000256|ARBA:ARBA00004718, ECO:0000256|PIRNR:PIRNR039133}.
CC -!- SUBUNIT: Heterodimer of SAE1 and UBA2/SAE2. The heterodimer corresponds
CC to the two domains that are encoded on a single polypeptide chain in
CC ubiquitin-activating enzyme E1. Interacts with UBE2I.
CC {ECO:0000256|ARBA:ARBA00026003}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR039133}.
CC -!- SIMILARITY: Belongs to the ubiquitin-activating E1 family.
CC {ECO:0000256|ARBA:ARBA00005673, ECO:0000256|PIRNR:PIRNR039133}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABD030113143; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030113144; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030113145; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030113146; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G3QJ94; -.
DR STRING; 9593.ENSGGOP00000002481; -.
DR Ensembl; ENSGGOT00000002534.3; ENSGGOP00000002481.3; ENSGGOG00000002517.3.
DR GeneTree; ENSGT00550000074924; -.
DR HOGENOM; CLU_013325_7_4_1; -.
DR InParanoid; G3QJ94; -.
DR OMA; RFDIKQM; -.
DR UniPathway; UPA00886; -.
DR Proteomes; UP000001519; Chromosome 19.
DR Bgee; ENSGGOG00000002517; Expressed in testis and 5 other cell types or tissues.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0031510; C:SUMO activating enzyme complex; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000287; F:magnesium ion binding; IEA:Ensembl.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:Ensembl.
DR GO; GO:0044388; F:small protein activating enzyme binding; IEA:Ensembl.
DR GO; GO:0019948; F:SUMO activating enzyme activity; IBA:GO_Central.
DR GO; GO:0032183; F:SUMO binding; IEA:Ensembl.
DR GO; GO:0044390; F:ubiquitin-like protein conjugating enzyme binding; IEA:Ensembl.
DR GO; GO:0033235; P:positive regulation of protein sumoylation; IEA:Ensembl.
DR GO; GO:0016925; P:protein sumoylation; IBA:GO_Central.
DR CDD; cd01489; Uba2_SUMO; 1.
DR Gene3D; 1.10.10.520; Ubiquitin activating enzymes (Uba3). Chain: B, domain 2; 1.
DR Gene3D; 3.50.50.80; Ubiquitin-activating enzyme E1, inactive adenylation domain, subdomain 1; 1.
DR Gene3D; 3.10.290.20; Ubiquitin-like 2 activating enzyme e1b. Chain: B, domain 3; 2.
DR InterPro; IPR045886; ThiF/MoeB/HesA.
DR InterPro; IPR000594; ThiF_NAD_FAD-bd.
DR InterPro; IPR028077; UAE_UbL_dom.
DR InterPro; IPR042449; Ub-E1_IAD_1.
DR InterPro; IPR023318; Ub_act_enz_dom_a_sf.
DR InterPro; IPR030661; Uba2.
DR InterPro; IPR032426; UBA2_C.
DR InterPro; IPR035985; Ubiquitin-activating_enz.
DR InterPro; IPR018074; UBQ-activ_enz_E1_CS.
DR InterPro; IPR033127; UBQ-activ_enz_E1_Cys_AS.
DR PANTHER; PTHR10953:SF5; SUMO-ACTIVATING ENZYME SUBUNIT 2; 1.
DR PANTHER; PTHR10953; UBIQUITIN-ACTIVATING ENZYME E1; 1.
DR Pfam; PF00899; ThiF; 1.
DR Pfam; PF14732; UAE_UbL; 1.
DR Pfam; PF16195; UBA2_C; 1.
DR PIRSF; PIRSF039133; SUMO_E1B; 1.
DR SUPFAM; SSF69572; Activating enzymes of the ubiquitin-like proteins; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS00536; UBIQUITIN_ACTIVAT_1; 1.
DR PROSITE; PS00865; UBIQUITIN_ACTIVAT_2; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|PIRNR:PIRNR039133};
KW Metal-binding {ECO:0000256|PIRNR:PIRNR039133,
KW ECO:0000256|PIRSR:PIRSR039133-3};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRNR:PIRNR039133}; Nucleus {ECO:0000256|PIRNR:PIRNR039133};
KW Reference proteome {ECO:0000313|Proteomes:UP000001519};
KW Ubl conjugation {ECO:0000256|PIRNR:PIRNR039133};
KW Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786,
KW ECO:0000256|PIRNR:PIRNR039133};
KW Zinc {ECO:0000256|PIRNR:PIRNR039133, ECO:0000256|PIRSR:PIRSR039133-3}.
FT DOMAIN 13..441
FT /note="THIF-type NAD/FAD binding fold"
FT /evidence="ECO:0000259|Pfam:PF00899"
FT DOMAIN 452..536
FT /note="Ubiquitin/SUMO-activating enzyme ubiquitin-like"
FT /evidence="ECO:0000259|Pfam:PF14732"
FT DOMAIN 547..629
FT /note="SUMO-activating enzyme subunit 2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF16195"
FT REGION 202..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 549..638
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 208..231
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..584
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 585..599
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 600..638
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 173
FT /note="Glycyl thioester intermediate"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-1,
FT ECO:0000256|PROSITE-ProRule:PRU10132"
FT BINDING 24..29
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 48
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 56..59
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 72
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 95..96
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 117..122
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-2"
FT BINDING 158
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-3"
FT BINDING 161
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-3"
FT BINDING 441
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-3"
FT BINDING 444
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-3"
FT CROSSLNK 164
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO1)"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
FT CROSSLNK 190
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
FT CROSSLNK 236
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO1); alternate"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
FT CROSSLNK 257
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
FT CROSSLNK 420
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO1)"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
FT CROSSLNK 621
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000256|PIRSR:PIRSR039133-4"
SQ SEQUENCE 638 AA; 71194 MW; 566391E4EA68D7EF CRC64;
MALSRGLPRE LAEAVAGGRV LVVGAGGIGC ELLKNLVLTG FSHIDLIDLD TIDVSNLNRQ
FLFQKKHVGR SKAQVAKESV LQFYPKANIV AYHDSIMNPD YNVEFFRQFI LVMNALDNRA
ARNHVNRMCL AADVPLIESG TAGYLGQVTT IKKGVTECYE CHPKPTQRTF PGCTIRNTPS
EPIHCIVWAK YLFNQLFGEE DADQEVSPDR ADPEAAWEPT EAEARARASN EDGDIKRIST
KEWAKSTGYD PVKLFTKLFK DDIRYLLTMD KLWRKRKPPV PLDWAEVQSQ GEETNASDQQ
NEPQLGLKDQ QVLDVKSYAR LFSKSIETLR VHLAEKGDGA ELIWDKDDPS AMDFVTSAAN
LRMHIFSMNM KSRFDIKSMA GNIIPAIATT NAVIAGLIVL EGLKILSGKI DQCRTIFLNK
QPNPRKKLLV PCALDPPNPN CYVCASKPEV TVRLNVHKVT VLTLQDKIVK EKFAMVAPDV
QIEDGKGTIL ISSEEGETEG IIHCIYSFLS LRRLLEEYDF LQDYTLLINI LHSEDLGKDV
EFEVVGDAPE KVGPKQAEDA AKSITNGSDD GAQPSTSTAQ EQDDVLIVDS DEEDSSNNAD
VSEEERSRKR KLDEKENLSA KRSRIEQKEE LDDVIALD
//