ID B8C2S7_THAPS Unreviewed; 931 AA.
AC B8C2S7;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE RecName: Full=WW domain-containing protein {ECO:0000259|PROSITE:PS50020};
GN ORFNames=THAPSDRAFT_22716 {ECO:0000313|EMBL:EED92000.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED92000.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED92000.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED92000.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED92000.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED92000.1};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the NELF-D family.
CC {ECO:0000256|ARBA:ARBA00005726}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000642; EED92000.1; -; Genomic_DNA.
DR RefSeq; XP_002290248.1; XM_002290212.1.
DR AlphaFoldDB; B8C2S7; -.
DR STRING; 35128.B8C2S7; -.
DR PaxDb; 35128-Thaps22716; -.
DR EnsemblProtists; EED92000; EED92000; THAPSDRAFT_22716.
DR GeneID; 7449644; -.
DR KEGG; tps:THAPSDRAFT_22716; -.
DR eggNOG; KOG0152; Eukaryota.
DR HOGENOM; CLU_314391_0_0_1; -.
DR InParanoid; B8C2S7; -.
DR OMA; GHHREIA; -.
DR Proteomes; UP000001449; Chromosome 5.
DR GO; GO:0005685; C:U1 snRNP; IBA:GO_Central.
DR GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR InterPro; IPR006942; TH1.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR12144:SF0; NEGATIVE ELONGATION FACTOR C_D; 1.
DR PANTHER; PTHR12144; NEGATIVE ELONGATION FACTOR D; 1.
DR Pfam; PF04858; TH1; 1.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001449};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 15..42
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 152..185
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT REGION 30..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 322..355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..135
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..151
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 156..170
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 182..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 231..246
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 931 AA; 100780 MW; 3C2ED6165A6D3E29 CRC64;
MATDDAFEER TIGDWTAYQD DEGRTYYYNN ETEESSWDPP PGFEAGGGCG DAALSDDDGV
GARDDAGEGV SPPYAATPVD DKDGITRSPG DDNDVITRSP MDEEDDGITR SPPINHDAEE
ETTTTNNNNS DNFEQGGGAD VEEEDNQAVD GEDIGDGWIA YKDDEGRTYY YNADSGETQW
ERPDVVASSS IAKDTTSSDN AAAEKGDYYS DGETGATPTS DGEEEGDGDI TSSGGEAKDK
TSKAQLHQQE DPATIAENFL QQPDAIMEPS VMDHISTLVN KEGAQVGFPK AVQSLINGYQ
GDTAICGVMG LWLAELKSSG GAAGAEGGKG GGGGRSISDV GANEVESESD RFNQGADAAR
DVAEEVVNRL AKERFTKNGG DAIMSLSKKQ AAFVDEMIQS ERWRNLLIDL SASNKESKLF
IAIAKRINQS DYFGVFNSML ASELSLVGKI AVDGYSKEVS KSIDTTQGQM GTLIADLRRT
CTSTSYTYLY AMEVLDELIS ESKKQSKNGT ADAPGLELAA RKWERLKEEL EEEMLKPHAT
GTTFQRKRRI DVALAMSDLV QRQRRRIDPT ADGENKKMIA STVSPKHALA NTLDDAIAGF
LTKSSLGNQI DKDTAESMLN YAYGGSTDRI GDLLIKHPSA VNSLLRNMFG SKRVRQLETR
QKCARLVALA VIASERSARS SSQIDIAESD EDFLCQTFLK GSQLCEQVES MVSFIAIDSV
ENTDGSVGRQ LSALCIKYSV IAQGVLIWAK ELASGAEFVT TAGYPTLSPC ILSLVRLICV
YHPLARPSVL DLALVFMGHS NSEISHQKMQ SIKEQCLRLL LFLSAQGMSL AVISAVRSSE
IDSALLRYFV SGMLEIVQPP LSLPFVRGFG SLLMERPFVD TLISQHFEAS KRYQIVQMIH
QFEAAFTVKG AAPSEADAAL LTMLKSTYVK G
//