ID B9QI17_TOXGV Unreviewed; 1095 AA.
AC B9QI17; A0A0F7UWU2;
DT 24-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 24-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE RecName: Full=Transcription initiation factor TFIID subunit 5 {ECO:0000256|ARBA:ARBA00044130};
GN ORFNames=BN1205_009680 {ECO:0000313|EMBL:CEL72563.1}, TGVEG_318260
GN {ECO:0000313|EMBL:ESS29696.1};
OS Toxoplasma gondii (strain ATCC 50861 / VEG).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Coccidia;
OC Eucoccidiorida; Eimeriorina; Sarcocystidae; Toxoplasma.
OX NCBI_TaxID=432359 {ECO:0000313|EMBL:ESS29696.1, ECO:0000313|Proteomes:UP000002226};
RN [1] {ECO:0000313|EMBL:ESS29696.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=VEG {ECO:0000313|EMBL:ESS29696.1};
RA Paulsen I.;
RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002226}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50861 / VEG {ECO:0000313|Proteomes:UP000002226};
RA Lorenzi H., Inman J., Amedeo P., Brunk B., Roos D., Caler E.;
RT "Annotation of Toxoplasma gondii VEG.";
RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ESS29696.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=VEG {ECO:0000313|EMBL:ESS29696.1};
RA Sibley D., Venepally P., Karamycheva S., Hadjithomas M., Khan A., Brunk B.,
RA Roos D., Caler E., Lorenzi H.;
RL Submitted (AUG-2013) to the EMBL/GenBank/DDBJ databases.
RN [4] {ECO:0000313|EMBL:CEL72563.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=VEG {ECO:0000313|EMBL:CEL72563.1};
RX PubMed=25875305; DOI=10.1371/journal.pone.0124473;
RA Ramaprasad A., Mourier T., Naeem R., Malas T.B., Moussa E., Panigrahi A.,
RA Vermont S.J., Otto T.D., Wastling J., Pain A.;
RT "Comprehensive Evaluation of Toxoplasma gondii VEG and Neospora caninum LIV
RT Genomes with Tachyzoite Stage Transcriptome and Proteome Defines Novel
RT Transcript Features.";
RL PLoS ONE 10:e0124473-e0124473(2015).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LN714493; CEL72563.1; -; Genomic_DNA.
DR EMBL; AAYL02000286; ESS29696.1; -; Genomic_DNA.
DR AlphaFoldDB; B9QI17; -.
DR STRING; 432359.B9QI17; -.
DR PaxDb; 5811-TGME49_118260; -.
DR EnsemblProtists; ESS29696; ESS29696; TGVEG_318260.
DR VEuPathDB; ToxoDB:TGVEG_318260; -.
DR eggNOG; KOG0263; Eukaryota.
DR OMA; ESACCAV; -.
DR Proteomes; UP000002226; Partially assembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR Gene3D; 1.25.40.500; TFIID subunit TAF5, NTD2 domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR007582; TFIID_NTD2.
DR InterPro; IPR037264; TFIID_NTD2_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19879:SF1; CANNONBALL-RELATED; 1.
DR PANTHER; PTHR19879; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR Pfam; PF04494; TFIID_NTD2; 1.
DR Pfam; PF00400; WD40; 4.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF160897; Taf5 N-terminal domain-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 4.
PE 4: Predicted;
KW Initiation factor {ECO:0000313|EMBL:CEL72563.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Protein biosynthesis {ECO:0000313|EMBL:CEL72563.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000002226};
KW WD repeat {ECO:0000256|PROSITE-ProRule:PRU00221}.
FT DOMAIN 200..312
FT /note="TFIID subunit TAF5 NTD2"
FT /evidence="ECO:0000259|Pfam:PF04494"
FT REPEAT 652..684
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 758..789
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 809..843
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1001..1026
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 1..86
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 140..160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 394..431
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 574..629
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 701..748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 856..943
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..82
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 140..154
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..425
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 576..590
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 856..910
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 921..942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1095 AA; 118143 MW; 6E4CEA6942863BF4 CRC64;
MTEERPVSVS FPLTASQQTR ETVRPMGEPP PSDSSNSSVG DASKLLSSPS SSSASSSASL
SSSSSSSSLS SSSSSSSVSP VDPRGLGASA AAVVPSTSSA SEKLSEIFHL LQQKYHVDSR
MLDSLQEAIS SFKSGPSLSS SLASSSSPQP EQTQAAAPAR AAPSAEEAAV VALGTLGGRL
SGQGASPLAL PGSNALLSFD VYERIYNRFC IWILSLFEDC REELMDVAFA VLLHMFRKLL
TVDAYQAHQL LRRFSPLHAG KHGSLLKQLE DAEPVHPLQL CQIPYFASEE RNPLYISERA
YFLLRTWLVD TRCLLLEVMI QAASRLLPPP AHAPHARGLV YRGLLGPSLS QSLLSPPSSH
PLSSLPAGLR RHEETPAPTT QEKTVAQRAG DLAFFDAPPP FRSQPGDPEK GEKSDEREED
RSALDNSGEL PPVVWGLPRQ FFREETTVIG PSGERTKRVR LLGGENLRET ELAEPNSLLP
LPEPPKDSFL LYRRLIKQQT ERRAPLSAAA GQWPSIACMT VLNSSSESAC CAVSPSTCRL
VAVGGEGEIR LWDLQQYQVT KARRERQRRR WLLRTQQKAQ EGQPLSSFLS ASKPAADDED
GEASGSEKDD APEEKKNSSG KSLHRRKNAA SAAPLALDWG EDAEGEAGVS CLVGTDGRVL
SLAFGEMDDR ILLSGGTDGV VRLWPSYPSA AWASTLLDED EGEALEEKGE ERQLKKAPGT
GESGAQESAE GEGEHAPRSV SRGLPGSRSS VVSPLCVYRG ALAAVWALDV GPYGHYFASG
SSDNCARLWC TSRSFPLRLL QHPAAATDVF HVAFHPNSSL LLTAASDNCV RLFDLRSAQL
ARAWAPLLLP VVDGEARGRA TEEPEEGDGD QRDRQERRER RGTDRKVEKH DSSPFMRRKR
VYEDVKKPGA KHLRLGKKRG SPAFRASDDE EEEKRQAERR RSGRVTALAM SPNGRLAAVG
DSAGGICVFD IPSGRPLAIG SSPSMQKREE RFSPLHFPPS IASLSFCHGS SFLASAAVDG
TVALWDTSGG ALQIPEADGT FRQKSGLFAG RPVTALSLAE TYGASHVAFR SCLFSPENLL
FCLGFSTLCS DEDFL
//