ID H3BDZ1_LATCH Unreviewed; 658 AA.
AC H3BDZ1;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=SRY-box transcription factor 6 {ECO:0000313|Ensembl:ENSLACP00000020112.1};
GN Name=SOX6 {ECO:0000313|Ensembl:ENSLACP00000020112.1};
OS Latimeria chalumnae (Coelacanth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Coelacanthiformes; Coelacanthidae; Latimeria.
OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000020112.1, ECO:0000313|Proteomes:UP000008672};
RN [1] {ECO:0000313|Proteomes:UP000008672}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT "The draft genome of Latimeria chalumnae.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLACP00000020112.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFYH01025707; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025708; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025709; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025710; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025711; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025712; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025713; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025714; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025715; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01025716; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; H3BDZ1; -.
DR Ensembl; ENSLACT00000020250.1; ENSLACP00000020112.1; ENSLACG00000017676.2.
DR GeneTree; ENSGT00940000156433; -.
DR HOGENOM; CLU_018522_0_1_1; -.
DR Proteomes; UP000008672; Unassembled WGS sequence.
DR Bgee; ENSLACG00000017676; Expressed in post-anal tail muscle and 5 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22042; HMG-box_EGL13-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45789; FI18025P1; 1.
DR PANTHER; PTHR45789:SF1; TRANSCRIPTION FACTOR SOX-6; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000008672}.
FT DOMAIN 451..519
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 451..519
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 211..296
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 583..658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 8..81
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 322..352
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 211..251
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..290
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 583..623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 628..643
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 658 AA; 72800 MW; D2C0C6191033D003 CRC64;
LTGTPEGLAE KERQLSTMIT QLIGLREQLL AAHDEQKKLA ASQIEKQRQQ MELARQQQEQ
IARQQQQLLQ QQHKINLLQQ QIQQVQGHMP PLMIPIFPHD QRTLAAAAAA QQGFLFPPGI
TYKPGDNYPV QFIPSTMAAA AASGLSPLQL QQKGHIESPL FNPTWNGLNN RPLGGSIDTV
KHTTGNLTKI KITIAIKQLY AAQLASMQVS PGTKLTSTPQ PPNTTGPLSP TGLKSEKRGT
SPITQIKDEA AQPLNLSARP KTAEPVKSPT SPTQSLFPVS KTSPISLPNK SGIPSPVGGN
LGRGSSLDIL STLNSTALFG DQDAVMKAIQ EARKMREQIQ REQQQQQQQQ CIDGKLSSMN
SMGLNNCRAD KERSRFENLG PQLTGKPGED VKLGPGVIDL TRPEDIDGSK AVNGSAAKLQ
QYYCWPTGGA SVAEARVYRD PRGRNNSEPH IKRPMNAFMV WAKDERRKIL QAFPDMHNSN
ISKILGSRWK SMSNQEKQPY YEEQARLSKI HLEKYPNYKY KPRPKRTCIV DGKKLRIGEY
KQLMRSRRQE MRQFFTVGQQ PQIPMTTATG VGYPAPITMA TTTPSPQMTS DCSSTSASPE
PSIPVIQSTY GMKTDNGSII GNETMNGEEE MEMYEDYEDE PKSDYSSENE TQEAVSAN
//