ID Q9VUD1_DROME Unreviewed; 388 AA.
AC Q9VUD1;
DT 01-MAY-2000, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2000, sequence version 1.
DT 27-MAR-2024, entry version 178.
DE SubName: Full=IP01552p {ECO:0000313|EMBL:AAY51532.1};
DE SubName: Full=Sox21a, isoform A {ECO:0000313|EMBL:AAF49756.1};
GN Name=Sox21a {ECO:0000313|EMBL:AAF49756.1,
GN ECO:0000313|FlyBase:FBgn0036411};
GN Synonyms=Dmel\CG7345 {ECO:0000313|EMBL:AAF49756.1}, Sox B2-3
GN {ECO:0000313|EMBL:AAF49756.1}, Sox21 {ECO:0000313|EMBL:AAF49756.1},
GN sox21a {ECO:0000313|EMBL:AAF49756.1}, SOXB2.3
GN {ECO:0000313|EMBL:AAF49756.1}, SoxB2.3 {ECO:0000313|EMBL:AAF49756.1};
GN ORFNames=CG7345 {ECO:0000313|EMBL:AAF49756.1,
GN ECO:0000313|FlyBase:FBgn0036411}, Dmel_CG7345
GN {ECO:0000313|EMBL:AAF49756.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RA Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S.,
RA Svirskas R., Rubin G.;
RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases.
RN [8] {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [9] {ECO:0000313|EMBL:AAF49756.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
RN [10] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109357; DOI=10.1534/g3.115.018929;
RG FlyBase Consortium;
RA Matthews B.B., Dos Santos G., Crosby M.A., Emmert D.B., St Pierre S.E.,
RA Gramates L.S., Zhou P., Schroeder A.J., Falls K., Strelets V., Russo S.M.,
RA Gelbart W.M.;
RT "Gene Model Annotations for Drosophila melanogaster: Impact of High-
RT Throughput Data.";
RL G3 (Bethesda) 5:1721-1736(2015).
RN [11] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109356; DOI=10.1534/g3.115.018937;
RG FlyBase Consortium;
RA Crosby M.A., Gramates L.S., Dos Santos G., Matthews B.B., St Pierre S.E.,
RA Zhou P., Schroeder A.J., Falls K., Emmert D.B., Russo S.M., Gelbart W.M.;
RT "Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.";
RL G3 (Bethesda) 5:1737-1749(2015).
RN [12] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25589440;
RA Hoskins R.A., Carlson J.W., Wan K.H., Park S., Mendez I., Galle S.E.,
RA Booth B.W., Pfeiffer B.D., George R.A., Svirskas R., Krzywinski M.,
RA Schein J., Accardo M.C., Damia E., Messina G., Mendez-Lago M.,
RA de Pablos B., Demakova O.V., Andreyeva E.N., Boldyreva L.V., Marra M.,
RA Carvalho A.B., Dimitri P., Villasante A., Zhimulev I.F., Rubin G.M.,
RA Karpen G.H., Celniker S.E.;
RT "The Release 6 reference sequence of the Drosophila melanogaster genome.";
RL Genome Res. 25:445-458(2015).
RN [13] {ECO:0000313|EMBL:AAY51532.1}
RP NUCLEOTIDE SEQUENCE.
RA Song W.-J., Kurnit D.M.;
RL Submitted (DEC-2016) to the EMBL/GenBank/DDBJ databases.
RN [14] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RG Berkeley Drosophila Genome Project;
RA Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R.,
RA Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., Yu C.,
RA Rubin G.;
RT "Drosophila melanogaster release 4 sequence.";
RL Submitted (NOV-2022) to the EMBL/GenBank/DDBJ databases.
RN [15] {ECO:0000313|EMBL:AAF49756.1}
RP NUCLEOTIDE SEQUENCE.
RG FlyBase;
RL Submitted (NOV-2022) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014296; AAF49756.1; -; Genomic_DNA.
DR EMBL; BT022137; AAY51532.1; -; mRNA.
DR RefSeq; NP_648694.1; NM_140437.2.
DR AlphaFoldDB; Q9VUD1; -.
DR SMR; Q9VUD1; -.
DR DNASU; 39567; -.
DR EnsemblMetazoa; FBtr0075748; FBpp0075490; FBgn0036411.
DR GeneID; 39567; -.
DR UCSC; CG7345-RA; d. melanogaster.
DR AGR; FB:FBgn0036411; -.
DR CTD; 30543; -.
DR FlyBase; FBgn0036411; Sox21a.
DR VEuPathDB; VectorBase:FBgn0036411; -.
DR GeneTree; ENSGT00940000172703; -.
DR HOGENOM; CLU_781355_0_0_1; -.
DR OMA; SHHNAPN; -.
DR OrthoDB; 2902801at2759; -.
DR BioGRID-ORCS; 39567; 0 hits in 3 CRISPR screens.
DR ChiTaRS; Sox21a; fly.
DR GenomeRNAi; 39567; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0036411; Expressed in midgut and 6 other cell types or tissues.
DR ExpressionAtlas; Q9VUD1; baseline and differential.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR GO; GO:1903703; P:enterocyte differentiation; IMP:FlyBase.
DR GO; GO:0045596; P:negative regulation of cell differentiation; IMP:FlyBase.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0072089; P:stem cell proliferation; IMP:FlyBase.
DR CDD; cd01388; HMG-box_SoxB; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270:SF161; SOX DOMAIN-CONTAINING PROTEIN DICHAETE-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 2: Evidence at transcript level;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803}.
FT DOMAIN 121..189
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 121..189
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..36
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 57..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 182..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 348..388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 64..84
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 231..250
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..388
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 388 AA; 40783 MW; 345B762AA1FDEEEF CRC64;
MTSISALLHR SHSHSNGYTS SGSSNHSHSS SLSPQLPGIN LGLGMVGGLG MGMSVGVGSG
SGNTTPPPMP PADLAVPPAA PTPVAPKMHQ HTHHHGNSHH NAPTSHSNSN TGSHHNSHDH
IKRPMNAFMV WSRGQRRKMA QDNPKMHNSE ISKRLGAEWK LLTEGQKRPF IDEAKRLRAL
HMKEHPDYKY RPRRKPKTLN KSPVPGGGGG GGGGGANGGV NAGGAGNSGP SGPGSVGSPK
DMQPQLSPLG QSLPHLHGHP HQSPYQSHPH HPHPHPHHVQ LAAATLSAKY GFGSPLELSL
PRLPNAFPGL AHYPLDPTLA LDLQARLQAM YAGSIYHPWR YLPLISPETP PSPPSSSGTG
ISSYGCVKSE KSSPNAVVAS AASPPNII
//