GenomeNet

Database: UniProt
Entry: A0A151MC57_ALLMI
ID   A0A151MC57_ALLMI        Unreviewed;       820 AA.
AC   A0A151MC57;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Transcription factor SOX-6 isoform B {ECO:0000313|EMBL:KYO22082.1};
GN   Name=SOX6 {ECO:0000313|EMBL:KYO22082.1};
GN   ORFNames=Y1Q_0000693 {ECO:0000313|EMBL:KYO22082.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO22082.1};
RN   [1] {ECO:0000313|EMBL:KYO22082.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO22082.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO22082.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03006283; KYO22082.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151MC57; -.
DR   STRING; 8496.A0A151MC57; -.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd22042; HMG-box_EGL13-like; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   PANTHER; PTHR45789; FI18025P1; 1.
DR   PANTHER; PTHR45789:SF1; TRANSCRIPTION FACTOR SOX-6; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   4: Predicted;
KW   Activator {ECO:0000256|ARBA:ARBA00023159};
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT   DOMAIN          613..681
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        613..681
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          44..64
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          370..458
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          494..522
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          745..820
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          213..286
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        48..63
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        370..410
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        423..452
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        745..783
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        790..805
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   820 AA;  91072 MW;  2104431EB4F08320 CRC64;
     MPSSSEVLAG VVGESRTEEW KNSHYIRMSS KQATSPFACA ADGEEAMTQD LASRDKEEGN
     NDQHATSHLP LHTIMHNNPH SEELPTLVTT IQQDAEWDSV ISAQHRMESE SNKLCSLYSF
     RNTSTSPHKP DEGGRDRSEL MTSVNFGTPE RRKGSLADVV DTLKQKKLEE MTRTEQEDSS
     CMEKLLSKDW KEKMERLNTS ELLGEIKGTP ESLAEKERQL STMITQLISL REQLLAAHDE
     QKKLAASQIE KQRQQMDLAR QQQEQIARQQ QQLLQQQHKI NLLQQQIQQV QGHMPPLMIP
     IFPHDQRTLA AAAAAQQGFL FPPGITYKPG DNYPVQFIPS TMAAAAASGL SPLQLQQLYA
     AQLASMQVSP GAKMPTTPQP PNTTGALSPT GMKNEKRGTS PVTQVKDEAA QPLNLSARPK
     TAEPVKSPTS PTQNLFPASK SSPVNVSNKS GIPSPIGGPL GRGSSLDILS SLNSPALFGD
     QDTVMKAIQE ARKMREQIQR EQQQQQQQPP PHTVDGKLPS MNNMGLNNCR NEKERTRFEN
     LGPQLTGKPN EDGKLGPGVI DLTRPEDAEG SKAMNGSAAK LQQYYCWPTG GATVAEARVY
     RDSRGRASSE PHIKRPMNAF MVWAKDERRK ILQAFPDMHN SNISKILGSR WKSMSNQEKQ
     PYYEEQARLS KIHLEKYPNY KYKPRPKRTC IVDGKKLRIG EYKQLMRSRR QEMRQFFTVG
     QQPQIPITTG TGVVYPGAIT MATTTPSPQM TSDCSSTSAS PEPSIPVIQS TYGMKTDSGS
     LAGNEMINGE DEIEMYEDYE DDPKSDYSSE NEAPEAVSAN
//