ID H2PC44_PONAB Unreviewed; 319 AA.
AC H2PC44; A0A2J8WGY1; A0A663D5D7;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=SOX2 isoform 1 {ECO:0000313|EMBL:PNJ69032.1};
DE SubName: Full=SRY-box transcription factor 2 {ECO:0000313|Ensembl:ENSPPYP00000016027.1};
GN Name=SOX2 {ECO:0000313|Ensembl:ENSPPYP00000016027.1};
GN ORFNames=CR201_G0010540 {ECO:0000313|EMBL:PNJ69032.1};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000016027.1, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000016027.1, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PNJ69032.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Susie {ECO:0000313|EMBL:PNJ69032.1};
RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT "High-resolution comparative analysis of great ape genomes.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000016027.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDHI03003390; PNJ69032.1; -; Genomic_DNA.
DR RefSeq; XP_002814367.1; XM_002814321.3.
DR STRING; 9601.ENSPPYP00000016027; -.
DR Ensembl; ENSPPYT00000016670.2; ENSPPYP00000016027.1; ENSPPYG00000014330.2.
DR GeneID; 100460719; -.
DR KEGG; pon:100460719; -.
DR CTD; 6657; -.
DR eggNOG; KOG0527; Eukaryota.
DR GeneTree; ENSGT00940000160614; -.
DR HOGENOM; CLU_021123_0_0_1; -.
DR OMA; AHNPSQM; -.
DR OrthoDB; 2902801at2759; -.
DR TreeFam; TF351735; -.
DR Proteomes; UP000001595; Chromosome 3.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:Ensembl.
DR GO; GO:0035198; F:miRNA binding; IEA:Ensembl.
DR GO; GO:0000976; F:transcription cis-regulatory region binding; IEA:Ensembl.
DR GO; GO:0001714; P:endodermal cell fate specification; IEA:Ensembl.
DR GO; GO:0001654; P:eye development; IEA:Ensembl.
DR GO; GO:0048839; P:inner ear development; IEA:Ensembl.
DR GO; GO:0090090; P:negative regulation of canonical Wnt signaling pathway; IEA:Ensembl.
DR GO; GO:1902807; P:negative regulation of cell cycle G1/S phase transition; IEA:Ensembl.
DR GO; GO:0050680; P:negative regulation of epithelial cell proliferation; IEA:Ensembl.
DR GO; GO:0001649; P:osteoblast differentiation; IEA:Ensembl.
DR GO; GO:0021983; P:pituitary gland development; IEA:Ensembl.
DR GO; GO:0043410; P:positive regulation of MAPK cascade; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0070848; P:response to growth factor; IEA:Ensembl.
DR GO; GO:0009611; P:response to wounding; IEA:Ensembl.
DR GO; GO:0035019; P:somatic stem cell population maintenance; IEA:Ensembl.
DR CDD; cd01388; HMG-box_SoxB; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022097; SOX_fam.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR10270:SF231; TRANSCRIPTION FACTOR SOX-2; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12336; SOXp; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000001595}.
FT DOMAIN 43..111
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 43..111
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..46
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 247..268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 299..319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 10..35
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 247..266
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 319 AA; 34424 MW; 125DBDD129A64700 CRC64;
MYNMMETELK PPGPQQTSGG GGGGGNSTAA AAGGNQKNSP DRVKRPMNAF MVWSRGQRRK
MAQENPKMHN SEISKRLGAE WKLLSETEKR PFIDEAKRLR ALHMKEHPDY KYRPRRKTKT
LMKKDKYTLP GGLLAPGGNS MASGVGVGAG LGAGVNQRMD SYAHMNGWSN GSYSMMQDQL
GYPQHPGLNA HGAAQMQPMH RYDVSALQYN SMTSSQTYMN GSPTYSMSYS QQGTPGMALG
SMGSVVKSEA SSSPPVVTSS SHSRAPCQAG DLRDMISMYL PGAEVPEPAA PSRLHMSQHY
QSGPVPGTAI NGTLPLSHM
//