GenomeNet

Database: UniProt
Entry: B8CF75_THAPS
LinkDB: B8CF75_THAPS
Original site: B8CF75_THAPS 
ID   B8CF75_THAPS            Unreviewed;       374 AA.
AC   B8CF75; B8LBK0;
DT   03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT   03-MAR-2009, sequence version 1.
DT   27-MAR-2024, entry version 75.
DE   RecName: Full=HSF-type DNA-binding domain-containing protein {ECO:0000259|Pfam:PF00447};
GN   ORFNames=THAPSDRAFT_11667 {ECO:0000313|EMBL:EED87752.1},
GN   THAPSDRAFT_24396 {ECO:0000313|EMBL:EED87333.1};
OS   Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=35128 {ECO:0000313|EMBL:EED87752.1, ECO:0000313|Proteomes:UP000001449};
RN   [1] {ECO:0000313|EMBL:EED87752.1, ECO:0000313|Proteomes:UP000001449}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1335 {ECO:0000313|EMBL:EED87752.1};
RX   PubMed=15459382; DOI=10.1126/science.1101156;
RA   Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA   Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA   Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA   Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA   Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA   Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA   Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA   Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA   Wilkerson F.P., Rokhsar D.S.;
RT   "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT   metabolism.";
RL   Science 306:79-86(2004).
RN   [2] {ECO:0000313|EMBL:EED87752.1, ECO:0000313|Proteomes:UP000001449}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1335 {ECO:0000313|EMBL:EED87752.1};
RX   PubMed=18923393; DOI=10.1038/nature07410;
RA   Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA   Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA   Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA   Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA   Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA   Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA   Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA   La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA   Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA   Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA   Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA   Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA   Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA   Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA   Grigoriev I.V.;
RT   "The Phaeodactylum genome reveals the evolutionary history of diatom
RT   genomes.";
RL   Nature 456:239-244(2008).
RN   [3] {ECO:0000313|EMBL:EED87752.1}
RP   GENOME REANNOTATION.
RC   STRAIN=CCMP1335 {ECO:0000313|EMBL:EED87752.1};
RG   Diatom Consortium;
RA   Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., Detter J.C.,
RA   Schmutz J., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T.,
RA   Bruce D., Pitluck S., Rokhsar D., Armbrust V.;
RL   Submitted (SEP-2008) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DS999415; EED87333.1; -; Genomic_DNA.
DR   EMBL; CM000653; EED87752.1; -; Genomic_DNA.
DR   RefSeq; XP_002294972.1; XM_002294936.1.
DR   RefSeq; XP_002296637.1; XM_002296601.1.
DR   PaxDb; 35128-Thaps11667; -.
DR   EnsemblProtists; EED87333; EED87333; THAPSDRAFT_24396.
DR   EnsemblProtists; EED87752; EED87752; THAPSDRAFT_11667.
DR   GeneID; 7445421; -.
DR   GeneID; 7449467; -.
DR   KEGG; tps:THAPSDRAFT_11667; -.
DR   KEGG; tps:THAPSDRAFT_24396; -.
DR   eggNOG; ENOG502SSDF; Eukaryota.
DR   HOGENOM; CLU_740798_0_0_1; -.
DR   OMA; HCAARER; -.
DR   Proteomes; UP000001449; Chromosome 11.
DR   Proteomes; UP000001449; Chromosome 22.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR   InterPro; IPR000232; HSF_DNA-bd.
DR   InterPro; IPR036388; WH-like_DNA-bd_sf.
DR   InterPro; IPR036390; WH_DNA-bd_sf.
DR   Pfam; PF00447; HSF_DNA-bind; 1.
DR   SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001449}.
FT   DOMAIN          18..92
FT                   /note="HSF-type DNA-binding"
FT                   /evidence="ECO:0000259|Pfam:PF00447"
FT   REGION          118..147
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          182..262
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          288..349
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        182..260
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        288..302
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   374 AA;  40188 MW;  ABF7AB8928F0852F CRC64;
     MASASAPPPA KDEKTIPFLR SLTEMLQNNK ELISFVPGKK TATETIQGKI LVHDRIRVQT
     EVLPIYFNHA SFASLRRQLS YFSFIRVGKG RQGGVTYVNE GVMVLSDILR LKRRTAQSGG
     GGAAAAKGAA AGKEKSVEQQ QPSQSASLED VASAVLKGTL HAVKTNQYLD TDTAVAAAAA
     STSATHHHGP SNITSSSVCR TNSTTNSSSA SSDNAESTGN NNNGNTDNKD IPRNISPSKL
     PFKSSDSKRP LSQNKSGSVA ISRKDCPRLH RLLYVNNVVP FIHLPPKMKR SKSNEENEDG
     SITDGNAKVV AAAPPSKKAR RGSKQEENTA GTPKSKQHQQ RRQHQEELFQ HCAARERTTS
     ESAISALLAL GVQQ
//
DBGET integrated database retrieval system