ID A0A0V0XUD9_TRIPS Unreviewed; 1932 AA.
AC A0A0V0XUD9;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 22-FEB-2023, entry version 23.
DE SubName: Full=Transcription initiation factor TFIID subunit 1 {ECO:0000313|EMBL:KRX91385.1};
GN Name=TAF1 {ECO:0000313|EMBL:KRX91385.1};
GN ORFNames=T4E_7018 {ECO:0000313|EMBL:KRX91385.1};
OS Trichinella pseudospiralis (Parasitic roundworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6337 {ECO:0000313|EMBL:KRX91385.1, ECO:0000313|Proteomes:UP000054815};
RN [1] {ECO:0000313|EMBL:KRX91385.1, ECO:0000313|Proteomes:UP000054815}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS141 {ECO:0000313|EMBL:KRX91385.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX91385.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDU01000138; KRX91385.1; -; Genomic_DNA.
DR STRING; 6337.A0A0V0XUD9; -.
DR Proteomes; UP000054815; Unassembled WGS sequence.
DR GO; GO:0005669; C:transcription factor TFIID complex; IEA:InterPro.
DR GO; GO:0004402; F:histone acetyltransferase activity; IEA:InterPro.
DR GO; GO:0001091; F:RNA polymerase II general transcription initiation factor binding; IEA:InterPro.
DR GO; GO:0017025; F:TBP-class protein binding; IEA:InterPro.
DR GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd05511; Bromo_TFIID; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 2.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR040240; TAF1.
DR InterPro; IPR022591; TAF1_HAT_dom.
DR InterPro; IPR041670; Znf-CCHC_6.
DR PANTHER; PTHR13900; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR PANTHER; PTHR13900:SF0; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 1; 1.
DR Pfam; PF00439; Bromodomain; 2.
DR Pfam; PF12157; DUF3591; 1.
DR Pfam; PF15288; zf-CCHC_6; 1.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00297; BROMO; 2.
DR SUPFAM; SSF47370; Bromodomain; 2.
DR PROSITE; PS50014; BROMODOMAIN_2; 2.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|PROSITE-ProRule:PRU00035};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Initiation factor {ECO:0000313|EMBL:KRX91385.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Protein biosynthesis {ECO:0000313|EMBL:KRX91385.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000054815};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 1534..1604
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 1656..1726
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT REGION 505..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1213..1262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1288..1318
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1375..1407
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1907..1932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1740..1767
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 518..534
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1244..1259
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1381..1407
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1917..1932
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1932 AA; 219682 MW; 99C6616E32DB694E CRC64;
MGGHFEEKIL TSVLFGNLTE DEDSNDSFFD SETLSQLGNL TSVLNLSGLV TADDVEEVSL
PPVKPFFNDS EDFSNIDELA PDGTLEDFIK PTCSRFLKQP LKENNKGIAE ELIDSENESD
DEAPLANMLP EEYRDYDVST LFPEFRKNQV LRFSRLFGPG KPSSMPMIWR SARNWRRPES
GVDSDLLDFN GLEAAIDHKW EPNLIKTAVL TQQDQIESGD HLRILSLPLE DEFEADDETA
LHKLSKSPKQ VTNEKILPPE PKKIAPWRYG VAMKWYDKLN IPPSGEGYGR KKNEELDEEK
IEEKKVNSNI RLSNSSPIGP CCMYESSEGK HCRPLTIREK RFKEMAKNRL AENSDGFEFK
SDSLFPYHVL EWEKSVVLSD EQARNEFLQL IKAKRFPMCG WIASLKIRDP TIFREMYAKY
GWACLLRDSV NLPPETSDEY LERLNTVDNL YDFFGFPPCD IDFKDWEESV ILNLSDINRI
PGPNLVEMTP TECLSLFGVP EDVPKEGSTI QASSEGDDFE NKLPERRQEH QATKKSQLIL
HRVVQRKNEE EEESMSQMQE KDPLNLSNDE YYLPKGALSL NAFVCLGIQH SIPSQNLMQP
YFPTHFSLIK LRQFHRPQLR RFPKTAHQAP ILQNVMSLEK HVEQKKIERE QERLASGGGE
MFFMRKPADL TGKDGDLILL EYCEEHPPLL SQPGMGSKIK NYFKRLPGKD DHEPQFEYGE
TSYSHSTPFL GTLSPGETLQ AIENNLFRAP IFRHRVPNTD FLIMRTRSGL YIRKVQALFC
VGQELPLIEV PSPNSKRACN FVRDFLLAFI YRLFWNSKED PRRLRMDDLR RAFPHHAESS
IRKRLRICSD FKRLGSGIDS NFWVLKKDFR LPNTDELWSM ITPETCCAYY SMLAAEQRLK
DAGYGEKYFF TPEEDEDEDD QVKMADEIKC APWNTTRAYI SAVNGKCLLD ITGVADPTGC
GEGFSFVKLS SKPPKEEVPP PVKKNVTGTD ADLRKLSLKD AKQLLREFGV QEEEINMLSR
WEIIDVIRTL STQQAKSGGT GITKFARGNM RYSMAEVQER YKQDCQRIFQ LQNEVLSCLE
TVTTDDEQSD GNDSDIEEMG KNIENMLTVK KTGPVDPEEH ERLELQKLIS GEPAVEEASV
AEDKPVQQQV DELAGKRLKI IRYYRDSDGN EHVGVEYVTK PQLIEAYMKI RSTKDEAFIK
HFAQMDEQFR EERRRQKRRL QDQLRRIRRK EIKAKLGMTT PRPKPEKPSK PPPPPKPSLL
KMRCSACGAK GHMKTNKNCP LYNRNERVQQ SSSGQNGQNT ATSVPSTSTS SSLSLSSSNT
KTVSVAASSV SAVASAVASI APSSSSSSSA AAFASATTNA SGNELETAPA VLKEETVQHL
SMKTETTVTD SSTVETSTET APSAETSENR CIHLEGMKLK LSWAVIEQAE TATSKKSRFL
KAAAVGSGSD LEYEISESDD DSADAKRSVY SFEATDDEND RHLDDDDEEW QTVDGRLIVH
PKYLEPPKQG VYRRRTDPRV TMSVVLESIL NEIKALPEAE PFLVPVRKRS VPDYHKVVSR
PMSIQTIRMN IAKNQYVTRE EFLKDIRQIL DNSRLYNGDQ SEITISAQHI FTVASRLVAE
KESRLMKLEK EINPLLDDND QVAFSFIIGN IIDECKKIPK SFAFHSPVDV RKVRSYYDKI
KNPMDLGTME MKAKRHQYHS LIDFFNDIHK IRDNSMLFNG PASPFTLKAS EIVSLARKLV
IENKAQLIEL EENLHKLREQ ALEVAKRKGQ TTADCENLRT HPENMKLLEE ELSKSSSNNN
EFVFENALPI QGYDRTTGGK ISRMLEEIYE ETSEEITEPT VEGISDDNFV IEDLEDDLVD
DEISENITVE TKENYCTKEE EMEIVSASSS ETDLSVDDGQ LISPNDLLMD LAMSSDSEDE
PNSKRSRTLS ED
//