ID L5KP74_PTEAL Unreviewed; 499 AA.
AC L5KP74;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Transcription factor SOX-9 {ECO:0000256|ARBA:ARBA00022377};
GN ORFNames=PAL_GLEAN10014806 {ECO:0000313|EMBL:ELK12418.1};
OS Pteropus alecto (Black flying fox).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Megachiroptera; Pteropodidae;
OC Pteropodinae; Pteropus.
OX NCBI_TaxID=9402 {ECO:0000313|EMBL:ELK12418.1, ECO:0000313|Proteomes:UP000010552};
RN [1] {ECO:0000313|Proteomes:UP000010552}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23258410; DOI=10.1126/science.1230835;
RA Zhang G., Cowled C., Shi Z., Huang Z., Bishop-Lilly K.A., Fang X.,
RA Wynne J.W., Xiong Z., Baker M.L., Zhao W., Tachedjian M., Zhu Y., Zhou P.,
RA Jiang X., Ng J., Yang L., Wu L., Xiao J., Feng Y., Chen Y., Sun X.,
RA Zhang Y., Marsh G.A., Crameri G., Broder C.C., Frey K.G., Wang L.F.,
RA Wang J.;
RT "Comparative analysis of bat genomes provides insight into the evolution of
RT flight and immunity.";
RL Science 339:456-460(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB030661; ELK12418.1; -; Genomic_DNA.
DR RefSeq; XP_006912328.1; XM_006912266.2.
DR STRING; 9402.L5KP74; -.
DR GeneID; 102888164; -.
DR KEGG; pale:102888164; -.
DR CTD; 6662; -.
DR eggNOG; KOG0527; Eukaryota.
DR InParanoid; L5KP74; -.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000010552; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22031; HMG-box_SoxE; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022151; Sox_N.
DR PANTHER; PTHR45803; SOX100B; 1.
DR PANTHER; PTHR45803:SF1; TRANSCRIPTION FACTOR SOX-9; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12444; Sox_N; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000010552};
KW Ubl conjugation {ECO:0000256|ARBA:ARBA00022843}.
FT DOMAIN 105..173
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 105..173
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..67
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 160..273
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 294..433
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 465..499
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 18..52
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 160..187
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 208..233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..325
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..369
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 370..414
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 499 AA; 55078 MW; A7B0A98A3AE8F248 CRC64;
MNLLDPFMKM TDEQEKGLSG APSPTMSEDS AGSPCPSGSG SDTENTRPQE NTFPKGEPDL
KKESEEDKFP VCIREAVSQV LKGYDWTLVP MPVRVNGASK SKPHVKRPMN AFMVWAQAAR
RKLADQYPHL HNAELSKTLG KLWRLLNESE KRPFVEEAER LRVQHKKDHP DYKYQPRRRK
SVKNGQAEAE EAAEQTHISP NAIFKALQAD SPHSSSGMSE VHSPGEHSGQ SQGPPTPPTT
PKTDVQPGKA DLKREGRPLP EGGRQPPIDF RDVDIGELSS DVISNIETFD VNEFDQYLPP
NGHPGVPATH GQVTYTGSYG ISSTAASPAG AGHVWMSKQQ XXXXXXHAPP QQPPAPPQQQ
PPAPPQQPPA HTLTTLSSEP GQSQRTHIKT EQLSPSHYSE QQHSPQQIAY SPFSLPHYSP
SYPPITRSQY DYTDHQNSGS YYSHAAGQGS GLYSTFTYMN PAQRPMYTPI ADTSGVPSIP
QTHSPQHWEQ PVYTQLTRP
//