ID I3JFV1_ORENI Unreviewed; 773 AA.
AC I3JFV1;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 24-JAN-2024, entry version 65.
DE SubName: Full=SRY-box transcription factor 6 {ECO:0000313|Ensembl:ENSONIP00000007742.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000007742.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000007742.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; I3JFV1; -.
DR Ensembl; ENSONIT00000007747.2; ENSONIP00000007742.2; ENSONIG00000006148.2.
DR eggNOG; KOG0528; Eukaryota.
DR GeneTree; ENSGT00940000156433; -.
DR HOGENOM; CLU_018522_0_1_1; -.
DR TreeFam; TF320471; -.
DR Proteomes; UP000005207; Linkage group LG1.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22042; HMG-box_EGL13-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45789; FI18025P1; 1.
DR PANTHER; PTHR45789:SF1; TRANSCRIPTION FACTOR SOX-6; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT DOMAIN 565..633
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 565..633
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 23..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 99..132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 697..716
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 729..773
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 214..265
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 30..44
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..429
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 743..758
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 773 AA; 85753 MW; 9EDF8EE38E4BF42C CRC64;
LSGFSPPLFM SSKQATSPFA SVVDGDDAMS QEHLSWEKEE SAEAHGTPQL PLHSLLHGKA
PDEMQPLSSV PSESDWDSLV SAQQRMESDS NKVCSLYSFR NNSTSPHKPE EGARERSDLL
SGSAFGTPER RKGSLADVVD TLKQKKLEEM TKTEQDESSC MEKLLSKDWK EKMERLNTSE
LLAEVKGTPE SLAEKEHQLS TMITQLISLR EQLLAAHDEQ KKLAASQMEK QRQQMELARQ
QQEQIARQQQ QLLQQQHKIN LLQQQIQVQG HMPPLMIPIF PHDQRTLAAA AAAQQGFLFP
PGMSYKPGDN YPVQFIPSSV AAAAASGLSP LQLQQLYAAQ LASMQVSPGA KMPPLPQPLT
ASGPISPSSL KNDKSSSSPI TQVKEEGTQP LNLSARPKTT ELVKSPTSPT QNLFPGSKSS
PNSLMSKGGT PSPLGGLGRG SSLDILSSLN STALFGDQDT VMKAIQEARK MREQIQREQL
QHHQQGMEAK LSALSSVGLN NCRADKERAH FDSIGHHLGK LGEEGKLGHR VIDLTRPEDL
DGGAGPTEAR VFRESRGRNN NEPHIKRPMN AFMVWAKDER RKILQAFPDM HNSNISKILG
SRWKAMSNQD KQPYYEEQAR LSKLHLEKYP NYKYKPRPKR TCIIDGKKLR IGEYKQMMRS
RRQEMRQFFT VGQPPQIPIT TSASMVYPGA ITMATTTPSP QMTSECSSAS ASPEPSIPVI
QSTYNMKLEP TGMVNNSEPV NGEDEMDMYE DFEDEPKSDY SSENDTQEPV SAN
//