ID A0A024GP15_9STRA Unreviewed; 606 AA.
AC A0A024GP15;
DT 09-JUL-2014, integrated into UniProtKB/TrEMBL.
DT 09-JUL-2014, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=TFIIS N-terminal domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN9_091410 {ECO:0000313|EMBL:CCI48092.1};
OS Albugo candida.
OC Eukaryota; Sar; Stramenopiles; Oomycota; Albuginales; Albuginaceae; Albugo.
OX NCBI_TaxID=65357 {ECO:0000313|EMBL:CCI48092.1, ECO:0000313|Proteomes:UP000053237};
RN [1] {ECO:0000313|EMBL:CCI48092.1, ECO:0000313|Proteomes:UP000053237}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Ac Nc2 {ECO:0000313|EMBL:CCI48092.1,
RC ECO:0000313|Proteomes:UP000053237};
RA Gardiner A., Kemen E., Schultz-Larsen T., MacLean D., Van Oosterhout C.,
RA Jones J.D.G.;
RT "Recombination and specialization in a pathogen metapopulation.";
RL Submitted (MAY-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00649}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCI48092.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAIX01000203; CCI48092.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A024GP15; -.
DR STRING; 65357.A0A024GP15; -.
DR InParanoid; A0A024GP15; -.
DR OrthoDB; 57028at2759; -.
DR Proteomes; UP000053237; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd05162; PWWP; 1.
DR CDD; cd00183; TFIIS_I; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 2.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR014876; DEK_C.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR003617; TFIIS/CRSP70_N_sub.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR Pfam; PF08766; DEK_C; 1.
DR Pfam; PF08711; Med26; 2.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00509; TFS2N; 2.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 2.
DR SUPFAM; SSF109715; DEK C-terminal domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51998; DEK_C; 1.
DR PROSITE; PS50812; PWWP; 1.
DR PROSITE; PS51319; TFIIS_N; 2.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000053237}.
FT DOMAIN 9..70
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 293..348
FT /note="DEK-C"
FT /evidence="ECO:0000259|PROSITE:PS51998"
FT DOMAIN 416..490
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT DOMAIN 530..606
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 131..256
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 352..409
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 151..185
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 186..256
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..394
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..409
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 606 AA; 68981 MW; BF482876992C95A2 CRC64;
MSSGAEFSVN SVGWFILEGF PWWPVYLCDA KSLRTKLYYL GDGHAKILKK ARDYSNDYIL
VYFFGSHQFS LARTRRGVLK PWGCSDQSTL CKGHPKHLAK RGNQIEELKM AIIEVEVDFL
SQPEDCRLPP HFVPSDLNLA LTPPPLKGPS EEDMDDDEDE EMYDDDNEER DDETEKEDEE
EDEPEDKEKT PTKLRAQNGK TEKEIHSSGK RKRKANEKSF NSMEKIELKK SKETKETKET
KGLSREQTVK EPIEKASSDV VPSDDLKKMK VSEEKVSCRP ADSSNDATML SDKVLSDKLE
IEIRWILNNC EFEEMTTKTV RRLLQQRLNM NLRSHKGIIK EVVTKVIAAM EEEDDETNPS
TLQPNTTDSK CQPQASIEPQ LKHQSINSET KQIDQIDAKS TKSEHSPKKV KLISEADLLD
AKAKLSDPNV SHEQILECLE ALTSVPLTIQ LLKKTGISRS VSSLRQHVND KVSASASALR
TRWMKLLKAD DEQQNDPKIK TAQCHSNQLP QEGSHNSQLS KLIEMVETLQ DQVEYKDQVA
TKPTRIVHEK QLRVLHDLYQ MRLSTKEIIE SKVGMAVSRL RKSGNEAIVK AASKLRRKWK
TEAEAA
//