ID A0A6A1QBT1_BALPH Unreviewed; 517 AA.
AC A0A6A1QBT1;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 22-FEB-2023, entry version 7.
DE RecName: Full=TATA box binding protein associated factor (TAF) histone-like fold domain-containing protein {ECO:0000259|SMART:SM00803};
GN ORFNames=E2I00_019043 {ECO:0000313|EMBL:KAB0404694.1};
OS Balaenoptera physalus (Fin whale) (Balaena physalus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC Balaenopteridae; Balaenoptera.
OX NCBI_TaxID=9770 {ECO:0000313|EMBL:KAB0404694.1, ECO:0000313|Proteomes:UP000437017};
RN [1] {ECO:0000313|EMBL:KAB0404694.1, ECO:0000313|Proteomes:UP000437017}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FinWhale-01 {ECO:0000313|EMBL:KAB0404694.1};
RX PubMed=31553763;
RA Westbury M.V., Petersen B., Lorenzen E.D.;
RT "Genomic analyses reveal an absence of contemporary introgressive admixture
RT between fin whales and blue whales, despite known hybrids.";
RL PLoS ONE 14:0-e0222004(2019).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TAF6 family.
CC {ECO:0000256|ARBA:ARBA00007688}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAB0404694.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SGJD01000548; KAB0404694.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6A1QBT1; -.
DR Proteomes; UP000437017; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0000124; C:SAGA complex; IEA:InterPro.
DR GO; GO:0046695; C:SLIK (SAGA-like) complex; IEA:InterPro.
DR GO; GO:0005669; C:transcription factor TFIID complex; IEA:InterPro.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR GO; GO:0016251; F:RNA polymerase II general transcription initiation factor activity; IEA:InterPro.
DR GO; GO:0006367; P:transcription initiation at RNA polymerase II promoter; IEA:InterPro.
DR CDD; cd08050; TAF6C; 1.
DR Gene3D; 1.10.20.10; Histone, subunit A; 1.
DR Gene3D; 1.25.40.770; TAF6, C-terminal HEAT repeat domain; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR009072; Histone-fold.
DR InterPro; IPR037796; TAF6.
DR InterPro; IPR011442; TAF6_C.
DR InterPro; IPR046344; TAF6_C_sf.
DR InterPro; IPR004823; TAF_TATA-bd_Histone-like_dom.
DR PANTHER; PTHR10221:SF22; TAF6-LIKE RNA POLYMERASE II P300/CBP-ASSOCIATED FACTOR-ASSOCIATED FACTOR 65 KDA SUBUNIT 6L; 1.
DR PANTHER; PTHR10221; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 6; 1.
DR Pfam; PF02969; TAF; 1.
DR Pfam; PF07571; TAF6_C; 1.
DR SMART; SM00803; TAF; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF47113; Histone-fold; 1.
PE 3: Inferred from homology;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000437017};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 363..385
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 397..422
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 10..73
FT /note="TATA box binding protein associated factor (TAF)
FT histone-like fold"
FT /evidence="ECO:0000259|SMART:SM00803"
SQ SEQUENCE 517 AA; 56839 MW; A2DBD3359BFBEEB1 CRC64;
MSEREERRFV EIPRESVRLM AESTGLELSD EVAALLAEDV CYRLREATQN SSQFMKHTKR
RKLTVEDFNR ALRWSSVEAV CGYGSQEALP LRPAREVHVS YLDGKGNLAP QGSVPSAVSS
LTDDLLKYYQ QVTRAVLGDD PQLMKVALQD LQTNSKIAAL LPYFVYVVSG VKSVSHDLEQ
LHRLLQVARS LVRNPHLCLG PYVRSLVGSV LYCVLEPLAA SINPLNDHWT LRDGAALLLS
HIFWTHGDLV SGLYQQILLS LQKVLADPVR PLCSHYGAVV GLHALGWKAV ERVLYPHLST
YWTNLQAVLD DYSVSNAQVK ADGHKVYGAI LGSFSGSCPL YGVAALNGSS LALSRPSAPS
LCYFVAGASG LLALYCLLLL LFWVYSSCIE DCHRGPIGLR IALAISAIAI FLVLVSACIL
RFGTSSLCKS IISLNITSCS DAQKTPWIPP GTTLQFYSNL HNAETSSWVN LVLWCVVLVL
QVVQWKSEAT PYRPLERGDP VWSSETDALV GSRLSHS
//