ID V5F963_BYSSN Unreviewed; 1096 AA.
AC V5F963;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=Transcription initiation factor TFIID subunit 5 {ECO:0000256|ARBA:ARBA00044130};
GN ORFNames=PVAR5_1182 {ECO:0000313|EMBL:GAD92589.1};
OS Byssochlamys spectabilis (strain No. 5 / NBRC 109023) (Paecilomyces
OS variotii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Thermoascaceae; Paecilomyces.
OX NCBI_TaxID=1356009 {ECO:0000313|EMBL:GAD92589.1, ECO:0000313|Proteomes:UP000018001};
RN [1] {ECO:0000313|Proteomes:UP000018001}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=No. 5 / NBRC 109023 {ECO:0000313|Proteomes:UP000018001};
RX PubMed=24407650; DOI=10.1128/genomeA.01162-13;
RA Oka T., Ekino K., Fukuda K., Nomura Y.;
RT "Draft genome sequence of the formaldehyde-resistant fungus Byssochlamys
RT spectabilis No. 5 (anamorph Paecilomyces variotii No. 5) (NBRC109023).";
RL Genome Announc. 2:E0116213-E0116213(2014).
CC -!- SIMILARITY: Belongs to the WD repeat TAF5 family.
CC {ECO:0000256|ARBA:ARBA00009435}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAD92589.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BAUL01000031; GAD92589.1; -; Genomic_DNA.
DR AlphaFoldDB; V5F963; -.
DR eggNOG; KOG0263; Eukaryota.
DR HOGENOM; CLU_005884_0_2_1; -.
DR InParanoid; V5F963; -.
DR OrthoDB; 3138699at2759; -.
DR Proteomes; UP000018001; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR CDD; cd08044; TAF5_NTD2; 1.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 1.25.40.500; TFIID subunit TAF5, NTD2 domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR006594; LisH.
DR InterPro; IPR007582; TFIID_NTD2.
DR InterPro; IPR037264; TFIID_NTD2_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19879:SF1; CANNONBALL-RELATED; 1.
DR PANTHER; PTHR19879; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR Pfam; PF04494; TFIID_NTD2; 1.
DR Pfam; PF00400; WD40; 6.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF160897; Taf5 N-terminal domain-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50896; LISH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 6.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 3: Inferred from homology;
KW Initiation factor {ECO:0000313|EMBL:GAD92589.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Protein biosynthesis {ECO:0000313|EMBL:GAD92589.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018001};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 449..579
FT /note="TFIID subunit TAF5 NTD2"
FT /evidence="ECO:0000259|Pfam:PF04494"
FT REPEAT 738..779
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 794..840
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 848..879
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 890..931
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 932..973
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 982..1008
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 1..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 170..406
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 607..626
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 646..668
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1014..1059
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..37
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 38..75
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 76..92
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..151
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..205
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..313
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 381..406
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..662
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1096 AA; 117998 MW; 261490D22095E7BC CRC64;
MSAPGGAPSP APRSASIGPG GGMPIPPQQP MPSPQVGSGT PGPATGSTGV MSQQNLNQIT
RYLAPITSKM TRKSGRASKS GSKARSRRKP KHKDHQVSPD AQANVQPAPG VAAAEDPGDV
AVTKASSVTS DESTIKADSR RQSTSTLQDF EIHESAESLE HFSEMAAASQ EDFLAGPSRT
DRFRYRRLDQ TKDSAQDEIR ALTEQRQARE SVALAIQKDN SPAKASEPSA SLAPPDGFPV
APDGDSSVVT PSKKKSASGS GKKSTNETCS ANPRSQGSIF PASQSVSSAS APPQRAPRQA
ASSQSTRALV QEYFASQVSD PGYDPAPGEQ PKQRASGRRR PKSSTTDPTV GLSVYPEDDP
LLSGFPPSCI QRPIRPRPDK QTNVPPTTDE PAILSPQGNS PVQGTDSQLN SVIDYLAKKG
YSRTEAMLRM ESANQEIDGR PLPPLGEDAR PKYRAGFELL KSWVEDNLDL YKPELRRVLW
PLFVYSFLSL VTSFYPQDAK QFFASNKNLF LPEHNEDIRA LEPISLPEHA QDNSVAKIYR
GNKYRLILSN PAFSNLMQFL ENKQKEGGSV MSAILSSYCT VITKERTADD RFSFAAMLGK
SSDAQTFPLE DEGIPGHHPG SAYTGDNPAM TGTLPRLKLG KLAPDSQLEE DVRGELADED
AKNPPAAGRN TLVQEYDQMI KKEEEDDAPT RAEIPYPPST ARDVAMEVQK VKENRDRFKI
EGRTGGVGPA VSVCMFTFHN TYDGITCLDF SDDNMLVAAG MQESYIRVWS LDGQKIKPTH
EGIDDKPLAN SHRLIGHSGP IYAVAFAPSA TPADGSLAPT NVRFLLSSSA DKTIRLWSLD
LWQCMVVYKG HDHPVWDLSW GPYGYYFVSG GHDKTARLWA TDRIRQQRIF AGHDQDVDCV
CFHPNSAYIF TGSCDRTVRM WAVTTGNAVR MFTGHTGNIT ALACSKNGKI LASADDQGSI
ILWDLAPGRQ LKRMRGHGKG GIWSLSWSAE STVLVSGGAD GTVRVWDVAG PAQDPAATQG
RVVGEGGAGT KIDAGNASAA GAQPSASVGP GANKKKGKDV VVTPDQISAF PTKKSPVYKV
KFTNMNLIVA GGAYLP
//