Database: UniProt
Entry: Q8SY59_DROME
ID   Q8SY59_DROME            Unreviewed;       919 AA.
AC   Q8SY59; E2QCN6;
DT   01-JUN-2002, integrated into UniProtKB/TrEMBL.
DT   01-JUN-2002, sequence version 1.
DT   24-JAN-2024, entry version 176.
DE   SubName: Full=GH01967p {ECO:0000313|EMBL:AAL68179.1};
DE   SubName: Full=Gemini, isoform A {ECO:0000313|EMBL:AAF58836.2};
DE   SubName: Full=Gemini, isoform G {ECO:0000313|EMBL:AAM68771.3};
GN   Name=gem {ECO:0000313|EMBL:AAF58836.2,
GN   ECO:0000313|FlyBase:FBgn0050011};
GN   Synonyms=anon-AE003830.1 {ECO:0000313|EMBL:AAF58836.2}, BcDNA:HL07889
GN   {ECO:0000313|EMBL:AAF58836.2}, CG11867 {ECO:0000313|EMBL:AAF58836.2},
GN   CG3459 {ECO:0000313|EMBL:AAF58836.2}, dCP2
GN   {ECO:0000313|EMBL:AAF58836.2}, Dmel\CG30011
GN   {ECO:0000313|EMBL:AAF58836.2};
GN   ORFNames=CG30011 {ECO:0000313|EMBL:AAL68179.1,
GN   ECO:0000313|FlyBase:FBgn0050011}, Dmel_CG30011
GN   {ECO:0000313|EMBL:AAF58836.2};
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227 {ECO:0000313|EMBL:AAL68179.1};
RN   [1] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA   An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA   Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA   Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA   Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C.,
RA   Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA   Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA   Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [2] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537568;
RA   Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA   Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA   George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA   Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA   Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA   Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT   "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT   euchromatic genome sequence.";
RL   Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN   [3] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537573;
RA   Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA   Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA   Celniker S.E.;
RT   "The transposable elements of the Drosophila melanogaster euchromatin: a
RT   genomics perspective.";
RL   Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN   [5] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=12537574;
RA   Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA   Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA   Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA   Karpen G.H.;
RT   "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL   Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN   [6] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA   Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA   Ashburner M., Anxolabehere D.;
RT   "Combined evidence annotation of transposable elements in genome
RT   sequences.";
RL   PLoS Comput. Biol. 1:166-175(2005).
RN   [7] {ECO:0000313|EMBL:AAL68179.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Berkeley {ECO:0000313|EMBL:AAL68179.1};
RA   Stapleton M., Carlson J., Chavez C., Frise E., George R., Pacleb J.,
RA   Park S., Wan K., Yu C., Celniker S.;
RL   Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases.
RN   [8] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RA   Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S.,
RA   Svirskas R., Rubin G.;
RL   Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases.
RN   [9] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569856; DOI=10.1126/science.1139815;
RA   Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT   "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL   Science 316:1586-1591(2007).
RN   [10] {ECO:0000313|EMBL:AAF58836.2, ECO:0000313|Proteomes:UP000000803}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX   PubMed=17569867; DOI=10.1126/science.1139816;
RA   Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA   Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA   Dimitri P., Karpen G.H., Celniker S.E.;
RT   "Sequence finishing and mapping of Drosophila melanogaster
RT   heterochromatin.";
RL   Science 316:1625-1628(2007).
RN   [11] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=26109357; DOI=10.1534/g3.115.018929;
RG   FlyBase Consortium;
RA   Matthews B.B., Dos Santos G., Crosby M.A., Emmert D.B., St Pierre S.E.,
RA   Gramates L.S., Zhou P., Schroeder A.J., Falls K., Strelets V., Russo S.M.,
RA   Gelbart W.M.;
RT   "Gene Model Annotations for Drosophila melanogaster: Impact of High-
RT   Throughput Data.";
RL   G3 (Bethesda) 5:1721-1736(2015).
RN   [12] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=26109356; DOI=10.1534/g3.115.018937;
RG   FlyBase Consortium;
RA   Crosby M.A., Gramates L.S., Dos Santos G., Matthews B.B., St Pierre S.E.,
RA   Zhou P., Schroeder A.J., Falls K., Emmert D.B., Russo S.M., Gelbart W.M.;
RT   "Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.";
RL   G3 (Bethesda) 5:1737-1749(2015).
RN   [13] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25589440;
RA   Hoskins R.A., Carlson J.W., Wan K.H., Park S., Mendez I., Galle S.E.,
RA   Booth B.W., Pfeiffer B.D., George R.A., Svirskas R., Krzywinski M.,
RA   Schein J., Accardo M.C., Damia E., Messina G., Mendez-Lago M.,
RA   de Pablos B., Demakova O.V., Andreyeva E.N., Boldyreva L.V., Marra M.,
RA   Carvalho A.B., Dimitri P., Villasante A., Zhimulev I.F., Rubin G.M.,
RA   Karpen G.H., Celniker S.E.;
RT   "The Release 6 reference sequence of the Drosophila melanogaster genome.";
RL   Genome Res. 25:445-458(2015).
RN   [14] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RG   FlyBase;
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
RN   [15] {ECO:0000313|EMBL:AAF58836.2}
RP   NUCLEOTIDE SEQUENCE.
RG   Berkeley Drosophila Genome Project;
RA   Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R.,
RA   Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., Yu C.,
RA   Rubin G.;
RT   "Drosophila melanogaster release 4 sequence.";
RL   Submitted (MAY-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU01313}.
CC   -!- SIMILARITY: Belongs to the grh/CP2 family. CP2 subfamily.
CC       {ECO:0000256|ARBA:ARBA00010852}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AE013599; AAF58836.2; -; Genomic_DNA.
DR   EMBL; AY075312; AAL68179.1; -; mRNA.
DR   EMBL; AE013599; AAM68771.3; -; Genomic_DNA.
DR   RefSeq; NP_724893.3; NM_165748.3.
DR   RefSeq; NP_724895.1; NM_165750.4.
DR   AlphaFoldDB; Q8SY59; -.
DR   SMR; Q8SY59; -.
DR   IntAct; Q8SY59; 2.
DR   DNASU; 36064; -.
DR   EnsemblMetazoa; FBtr0089759; FBpp0088700; FBgn0050011.
DR   EnsemblMetazoa; FBtr0346634; FBpp0312214; FBgn0050011.
DR   GeneID; 36064; -.
DR   UCSC; CG30011-RA; d. melanogaster.
DR   AGR; FB:FBgn0050011; -.
DR   CTD; 2669; -.
DR   FlyBase; FBgn0050011; gem.
DR   VEuPathDB; VectorBase:FBgn0050011; -.
DR   GeneTree; ENSGT00940000173985; -.
DR   HOGENOM; CLU_015127_1_0_1; -.
DR   OrthoDB; 1363858at2759; -.
DR   BioGRID-ORCS; 36064; 0 hits in 3 CRISPR screens.
DR   ChiTaRS; gem; fly.
DR   GenomeRNAi; 36064; -.
DR   Proteomes; UP000000803; Chromosome 2R.
DR   Bgee; FBgn0050011; Expressed in brain and 37 other cell types or tissues.
DR   ExpressionAtlas; Q8SY59; baseline.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   CDD; cd09537; SAM_CP2-like; 1.
DR   Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR   InterPro; IPR007604; CP2.
DR   InterPro; IPR013761; SAM/pointed_sf.
DR   InterPro; IPR041418; SAM_3.
DR   InterPro; IPR040167; TF_CP2-like.
DR   PANTHER; PTHR11037:SF21; GEMINI, ISOFORM C; 1.
DR   PANTHER; PTHR11037; TRANSCRIPTION FACTOR CP2; 1.
DR   Pfam; PF04516; CP2; 2.
DR   Pfam; PF18016; SAM_3; 1.
DR   SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR   PROSITE; PS51968; GRH_CP2_DB; 1.
PE   1: Evidence at protein level;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU01313};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU01313};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:Q8SY59};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000803}.
FT   DOMAIN          381..663
FT                   /note="Grh/CP2 DB"
FT                   /evidence="ECO:0000259|PROSITE:PS51968"
FT   REGION          29..65
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          90..119
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          149..173
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          196..238
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          303..325
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          539..579
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          596..620
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        196..214
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        215..235
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        539..578
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        596..614
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   919 AA;  101140 MW;  4D8F76BCE186F44E CRC64;
     MALSFLSQNS GLLDLQSIFD PQYTHHQQQQ QQLLHLSHHP QHHIQQHQNQ QLQQQAEQIQ
     QHQQERTLST KFDLNLFNEL DQMEFNNLNR SQYQNNNNNN SISNNQNSNN NTNTTNGISE
     NLNQIQNRHF ISGYHHQHIG SDYEQVINFV DSPPNSEESW TDGQSKDSPG PQIIDVQTIY
     LNSGSRKRRM DWDSLDIGQS ENSPTASQSG DLPSKVAHQE KEKHKREKHS GRSSWSDDIG
     FDLNAEFNSN SYLNNENFLS FSPTLTTLKQ EPQTDQIKPS PKVSLDNASA SPSIAIAKLD
     EVQNSPSQAI SGQDSANGSG SAANGKHDVN SGLACGCGSP QGSPLANAEY ELNEKGKPQQ
     LSVLDPAKIE IGSANGATHA EDHKFQYILA AATSIATKNN EETLTYLNQG QSYEIKLKKI
     GDLSLYRDKI LKSVIKICFH ERRLQFMERE QMQQWQQSRP GERIIEVDVP LSYGLCHVSQ
     PLSSGSLNTV EIFWDPLKEV GVYIKVNCIS TEFTPKKHGG EKGVPFRLQI ETYIENTNSA
     TASGSGGSNN SAIASGSGSS GSAAPASPER TPSAGSNGKQ AVHAAACQIK VFKLKGADRK
     HKQDREKIQK RPQSEQEKFQ PSYECTIMND ISLDLVMSAT TTGCYSPEYM KLWPNSPVHI
     PKYDGMLPFA PSAASPATSS SPIAINSVTS TNSPTLKLMD ATNMVSPQHV PADMDDYSQN
     IMPESTPSQV TQWLTNHRLT AYLSTFAQFS GADIMRMSKE DLIQICGLAD GIRMFNILRA
     KTIAPRLTLY ASVDGCSYNA IYLLSNTAKE LQQKLFKMPG FYEFMAKASA QENGAGGAAT
     AAAAALFNNW GMHSKYSGSG SNIFNDANKS CVYISGPSGI LVTITDEVLN NEIKDGSLYA
     LEVQAGKVIL KLINKQDNN
//
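
The record above follows the flat-file layout in which each line starts with a two-letter line code (ID, AC, DE, FT, SQ, ...), sequence lines carry no code, and `//` terminates the entry. A minimal sketch of reading such a record in Python is given below. The function name `parse_flatfile` and the short sample entry `TEST_DROME` are hypothetical and exist only for illustration; the sketch assumes a single entry per input and does not validate the MW or CRC64 values.

```python
from collections import defaultdict


def parse_flatfile(text):
    """Group lines by two-letter line code and collect the sequence (hypothetical helper)."""
    fields = defaultdict(list)
    residues = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):  # record terminator
            break
        if in_sequence:
            # Sequence lines have no line code; residues come in space-separated blocks.
            residues.append(line.replace(" ", ""))
        elif line[:2].isalpha() and line[:2].isupper() and line[2:5] == "   ":
            code, content = line[:2], line[5:]
            fields[code].append(content)
            in_sequence = code == "SQ"  # everything after the SQ header is sequence
    return dict(fields), "".join(residues)


# Hypothetical toy entry, not a real UniProt record.
sample = """ID   TEST_DROME              Unreviewed;        12 AA.
AC   X00001;
FT   REGION          2..5
FT                   /note="Disordered"
SQ   SEQUENCE   12 AA;  0 MW;  0 CRC64;
     MALSFLSQNS GL
//
"""

fields, seq = parse_flatfile(sample)
print(fields["AC"], len(seq))
```

Lines that do not match the line-code pattern, such as page banners before the ID line, are skipped, and anything after `//` is ignored. Applied to a full entry, comparing `len(seq)` with the residue count on the SQ line is a quick consistency check.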