GenomeNet

Database: UniProt
Entry: H2LY37_ORYLA
LinkDB: H2LY37_ORYLA
Original site: H2LY37_ORYLA 
ID   H2LY37_ORYLA            Unreviewed;       739 AA.
AC   H2LY37;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 66.
DE   SubName: Full=SRY-box transcription factor 5 {ECO:0000313|Ensembl:ENSORLP00000011049.2};
GN   Name=sox5 {ECO:0000313|Ensembl:ENSORLP00000011049.2};
OS   Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC   Oryzias.
OX   NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000011049.2, ECO:0000313|Proteomes:UP000001038};
RN   [1] {ECO:0000313|Ensembl:ENSORLP00000011049.2, ECO:0000313|Proteomes:UP000001038}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000011049.2,
RC   ECO:0000313|Proteomes:UP000001038};
RX   PubMed=17554307; DOI=10.1038/nature05846;
RA   Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA   Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA   Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA   Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA   Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA   Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT   "The medaka draft genome and insights into vertebrate genome evolution.";
RL   Nature 447:714-719(2007).
RN   [2] {ECO:0000313|Ensembl:ENSORLP00000011049.2}
RP   IDENTIFICATION.
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000011049.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2LY37; -.
DR   Ensembl; ENSORLT00000011050.2; ENSORLP00000011049.2; ENSORLG00000008803.2.
DR   eggNOG; KOG0528; Eukaryota.
DR   GeneTree; ENSGT00940000156122; -.
DR   HOGENOM; CLU_018522_0_0_1; -.
DR   TreeFam; TF320471; -.
DR   Proteomes; UP000001038; Chromosome 23.
DR   Bgee; ENSORLG00000008803; Expressed in intestine and 12 other cell types or tissues.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd22042; HMG-box_EGL13-like; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   PANTHER; PTHR45789; FI18025P1; 1.
DR   PANTHER; PTHR45789:SF3; TRANSCRIPTION FACTOR SOX-5; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001038}.
FT   DOMAIN          534..602
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        534..602
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          1..128
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          341..389
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          394..413
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          505..534
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          664..739
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          184..264
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          433..460
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        59..93
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        341..377
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        399..413
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        667..686
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        697..715
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        716..730
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   739 AA;  81112 MW;  8408ECD16C34D780 CRC64;
     LLTEDALCWS MSSKRPASPY GGTDGEVTMA TSRQRVEDEE NEGLGGVIHL PLASYCGKVS
     PRSPSNRNLE SPANTEHDGS KGSSLSPYPQ HNATSPGKEE GVGGGRPCSD GGFGMGPLGT
     PERRKGSLAD VVDTLKQRKM EELIKNEPEE APSIERLLSK DWKDKLLAMG SGHIAEIKGT
     PDSLAEKERQ LMGMIGQLSS LREQLLAAHE EQKKLAASQM EKQRQQMELA KQQQDQIARQ
     QQQLLQQQHK INLLQQQIQQ VQGQLPPLMI PVFPPDQRTL AAAAAQQGFL MPPGFNYKPG
     CSDPYPLQLI PTTMAAATPG LGPLQLQQLY AAQLAAMQVS PGAKQQHGGN LPPQANLGTH
     SPTTNTHPQS DKSRSPPPAN KTKVPAAAAA ATKLGPISSM KHSAPSSIGG PPSRVSSIDL
     LSSLSSTAYM NDHEAVTKAF AEARQMKEQL KREQQVLDAK VAAVSNLGLN NGRSDKDKAA
     LESLSQQLKQ QAEDKFSHAM LDFTMSGDSD GSPSVSDSRI FRESRGRSNN EPHIKRPMNA
     FMVWAKDERR KILQAFPDMH NSNISKILGS RWKAMTNLEK QPYYEEQARL SKQHLEKYPD
     YKYKPRPKRT CLVDGKKLRI GEYKAIMRSR RQEMRQYFSV GQQGQLPLSS AGVVYPGALT
     MAGMPSPQMP SEHSSMSSSP EPTANHHPNP NLAGLKGDEP RIKEEELRLD DSNGDAYDDF
     DYEDEEADYA SDNENHITQ
//
DBGET integrated database retrieval system