ID W5JP73_ANODA Unreviewed; 1194 AA.
AC W5JP73;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=TAFH domain-containing protein {ECO:0000259|PROSITE:PS51119};
GN ORFNames=AND_002025 {ECO:0000313|EMBL:ETN66192.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN66192.1};
RN [1] {ECO:0000313|EMBL:ETN66192.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN66192.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN66192.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC002025-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TAF4 family.
CC {ECO:0000256|ARBA:ARBA00006178}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02000498; ETN66192.1; -; Genomic_DNA.
DR AlphaFoldDB; W5JP73; -.
DR STRING; 43151.W5JP73; -.
DR EnsemblMetazoa; ADAC002025-RA; ADAC002025-PA; ADAC002025.
DR VEuPathDB; VectorBase:ADAC002025; -.
DR VEuPathDB; VectorBase:ADAR2_009831; -.
DR eggNOG; KOG2341; Eukaryota.
DR HOGENOM; CLU_012027_0_0_1; -.
DR OMA; QFYHHHA; -.
DR OrthoDB; 2910924at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005669; C:transcription factor TFIID complex; IEA:InterPro.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR GO; GO:0006352; P:DNA-templated transcription initiation; IEA:InterPro.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:UniProt.
DR CDD; cd08045; TAF4; 1.
DR Gene3D; 1.10.20.10; Histone, subunit A; 1.
DR Gene3D; 1.20.120.1110; TAFH/NHR1 domain; 1.
DR InterPro; IPR009072; Histone-fold.
DR InterPro; IPR045144; TAF4.
DR InterPro; IPR007900; TAF4_C.
DR InterPro; IPR037249; TAFH/NHR1_dom_sf.
DR InterPro; IPR003894; TAFH_NHR1.
DR PANTHER; PTHR15138; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 4; 1.
DR PANTHER; PTHR15138:SF14; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 4; 1.
DR Pfam; PF05236; TAF4; 1.
DR Pfam; PF07531; TAFH; 1.
DR SMART; SM00549; TAFH; 1.
DR SUPFAM; SSF47113; Histone-fold; 1.
DR SUPFAM; SSF158553; TAFH domain-like; 1.
DR PROSITE; PS51119; TAFH; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 467..563
FT /note="TAFH"
FT /evidence="ECO:0000259|PROSITE:PS51119"
FT REGION 46..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 411..469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1067..1105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1122..1163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..69
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1194 AA; 124309 MW; AE1FBE7291AC6609 CRC64;
MASANNFLEE ALKSAVDESA VNAIVGTLEN QLDVNTNLVQ QVGSVGKSET GAGTGESGVK
IQSSAKHETA AASPKRDLVG GLANGEKIVV SNNNTITANN NTASKSVLPK PPPRNNSGSV
VIAGGGARGG GNINIISQQI VVPPPPFNVP PTSQKQLQTS SNMPKNEPVK LVYPANMNNN
RVTLTPGLSN GTITVSQPQP QSGNGGVSGP PTPTLIIKNQ QTNAVMNPGT PGIVTVSKPM
NNQATPNIVG LPGVQIVNVR PGAQPTQAQQ KTVAAVSPRV VIGSQPIVST RPPNASAITL
SALQGQQGST LLLKNEQGQF QLLRIGPAPT GTQITPASLT PSSTNQTIRL QTVPATHASG
TGTIIVSSHS GTTTQTNYIS TQPTPVASVA PVPALAAQQN ITITHTSASP ATVLSTQQQP
QQQPQQQQQQ HSSTGSVGGQ QTSQQQQQQP TVVVTTTPAV TQQRNSLDNT KEKCSKFLTN
LIELSKREPT KVEQNVRTLI QELVDANVDP AEFCERLERL LNASPQPCLI GFLKKSLPLL
RQSLVTKEIT IEGINPPSAA VAFAGTALSS IPAQIRPVAP TIVSQGSMIG QTQIRMLTTQ
SGVTTLPRIG QTTIRPAAPI RIQTPLQQQS TVASGGTTIV GQPRTTTLTA QQIRPNVTTI
GHTTIVQTAG QQLPQTISTT PPALLPVSGN AKFNASAQQQ IRTTTPITSR TLTSIGGTTV
TTTGKQILQS QSVNQIRGQT PVASVAAVAA AAAAAASGSS ATQVKQVTAI TGGGNVVVSL
NQNPPPMQPV SGLGGGSSQG GSLMVSGTQA MPALTLTSAA SAAGFGVSGG VVSGVKTASA
VSSAIVLNSS STVSTTAASA SIGSGTAVSA GLPTASGTTL ATPGLTTIKT ITAKSQNLSG
AAAASAKKKA SASAASSAGT EQDASKRAGA SAQSQFYHHH AAMYGEDDIN DVAAMGGVNL
AEETQRILGS TEFVGTQIRS CKDEVFLHLP ALQSRIRNII ARHGLEEPSN EVAVLISHAC
QERLKNVVEK LAIIAEHRID IIKVDPRYEV TKDVRGQIKF LEELDKAEQK RHEEQEREML
MRAAKSRSKT EDPEQAKLKA KAKEMQRAEM EELRQRDANL TALQAIGPRK KPKLEEGSST
SATPGASGIG SLSGKAPTPL RPRIKRVNLR DMLFYMEQER ESCRSQMLYK AYLK
//