GenomeNet

Database: UniProt
Entry: A0A087Y201_POEFO
ID   A0A087Y201_POEFO        Unreviewed;       355 AA.
AC   A0A087Y201;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=Transcription factor SOX {ECO:0000256|PIRNR:PIRNR038098};
OS   Poecilia formosa (Amazon molly) (Limia formosa).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC   Poecilia.
OX   NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000012054.1, ECO:0000313|Proteomes:UP000028760};
RN   [1] {ECO:0000313|Proteomes:UP000028760}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA   Schartl M., Warren W.;
RL   Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPFOP00000012054.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR038098}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AYCK01010820; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A087Y201; -.
DR   STRING; 48698.ENSPFOP00000012054; -.
DR   Ensembl; ENSPFOT00000012071.1; ENSPFOP00000012054.1; ENSPFOG00000012067.1.
DR   eggNOG; KOG0527; Eukaryota.
DR   GeneTree; ENSGT00940000161652; -.
DR   OMA; DICQTNW; -.
DR   Proteomes; UP000028760; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR017386; SOX-12/11/4.
DR   PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR   PANTHER; PTHR10270:SF113; TRANSCRIPTION FACTOR SOX-11; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   PIRSF; PIRSF038098; SOX-12/11/4a; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR038098};
KW   Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW   Nucleus {ECO:0000256|PIRNR:PIRNR038098, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW   Transcription {ECO:0000256|PIRNR:PIRNR038098};
KW   Transcription regulation {ECO:0000256|PIRNR:PIRNR038098}.
FT   DOMAIN          46..114
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        46..114
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          1..25
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          113..140
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..23
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   355 AA;  39395 MW;  B0BBB4746CC105FA CRC64;
     MVQHTDHGET DGSMSREATD SEESEFMACS PVAINPDWCK TATGHIKRPM NAFMVWSKIE
     RRKIMEQSPD MHNAEISKRL GKRWKMLKDS EKIPFIREAE RLRLKHMADY PDYKYRPKKK
     PKLDGSKPSA PSPEKCGKIA KTPSKKCSKI KTKSGSKTAH GYAEDCVFPS IKVAKTVKSE
     LTDEDDDDEY EEDYRMGFKA GEEERLRPYG VAKVPASPTL SSSAESEGAS MYEEVRSHNR
     LFYNIKSISK QGALHAASVS PASSRSVSGS SSGEDADDLL FDFSLNFAPS AAGSELGNPN
     SGNLSLSLVD KDLDSFSEGS LGSHFEFPDY CTPELSEMIA GDWLEANFSD LVFTY
//