ID A0A2T0FNB5_9ASCO Unreviewed; 1153 AA.
AC A0A2T0FNB5;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 03-MAY-2023, entry version 16.
DE RecName: Full=Transcription initiation factor TFIID subunit 2 {ECO:0000256|ARBA:ARBA00017363};
GN ORFNames=B9G98_04085 {ECO:0000313|EMBL:PRT56465.1};
OS Wickerhamiella sorbophila.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Trichomonascaceae; Wickerhamiella.
OX NCBI_TaxID=45607 {ECO:0000313|EMBL:PRT56465.1, ECO:0000313|Proteomes:UP000238350};
RN [1] {ECO:0000313|EMBL:PRT56465.1, ECO:0000313|Proteomes:UP000238350}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DS02 {ECO:0000313|EMBL:PRT56465.1,
RC ECO:0000313|Proteomes:UP000238350};
RA Ahn J.O.;
RT "Genome sequencing of [Candida] sorbophila.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TAF2 family.
CC {ECO:0000256|ARBA:ARBA00010937}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRT56465.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDIQ01000022; PRT56465.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2T0FNB5; -.
DR STRING; 45607.A0A2T0FNB5; -.
DR OrthoDB; 1342632at2759; -.
DR Proteomes; UP000238350; Unassembled WGS sequence.
DR GO; GO:0005669; C:transcription factor TFIID complex; IEA:InterPro.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd09839; M1_like_TAF2; 1.
DR Gene3D; 1.10.390.10; Neutral Protease Domain 2; 1.
DR Gene3D; 2.60.40.1730; tricorn interacting facor f3 domain; 1.
DR InterPro; IPR042097; Aminopeptidase_N-like_N_sf.
DR InterPro; IPR014782; Peptidase_M1_dom.
DR InterPro; IPR027268; Peptidase_M4/M1_CTD_sf.
DR InterPro; IPR037813; TAF2.
DR PANTHER; PTHR15137; TRANSCRIPTION INITIATION FACTOR TFIID; 1.
DR PANTHER; PTHR15137:SF9; TRANSCRIPTION INITIATION FACTOR TFIID SUBUNIT 2; 1.
DR Pfam; PF01433; Peptidase_M1; 1.
DR SUPFAM; SSF63737; Leukotriene A4 hydrolase N-terminal domain; 1.
DR SUPFAM; SSF55486; Metalloproteases ('zincins'), catalytic domain; 1.
PE 3: Inferred from homology;
KW Initiation factor {ECO:0000313|EMBL:PRT56465.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Protein biosynthesis {ECO:0000313|EMBL:PRT56465.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000238350};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 320..429
FT /note="Peptidase M1 membrane alanine aminopeptidase"
FT /evidence="ECO:0000259|Pfam:PF01433"
SQ SEQUENCE 1153 AA; 130490 MW; A5A5D834C1A10440 CRC64;
MTEPSRGFRV AHQKVSVDVD LFRQRLIGHT ELTIVPTTRE LHQVRLDARQ LVVQNVLING
KHATFAHNDL VASNPERLNS WIPEQHRMFY EHRKQLFQAT VPGELLIDVP EDVSILDQDT
SSTYIVSHAS DEQNYQPLTV RVAFEVSGSN SGFNFVGGQN SSLPRSKWHA YTTSALLGAS
TSCWLPCVDG LWELSTWEIE VSVPKTLKDV DRRGLMTDNT KDEDNEEDEE GEHEILAVCN
NNSPSQVVDP TASHKKIVSF ELFNPVSAHH LGFAVGPFVQ TPMTQSTDDL DESGTSSVPF
SVFALRESVD MVKHCCGIFV KAMDFFNRDF GSFPYSSYSL CFVSDMPETT ADAAGLTILS
DGFLFPSNVI EPVFQNVEPL ICALAAQWCG VSIVPKTWND IWVTQGVAMY MTHLFVRKLM
GNNEYRFRMR KYVDEITRQD INMPPLAGPD FQFPITNPDL GFIQLKAPLV LYILDRRMTK
TDRSLGLSRV IPKLFLQSMS GDLHSTISTA HFIKLCERVA HHRLTKFFQE WVYGSGYPIF
RVTQRFNKKR MFIEMGIGQV QSRELPPEAL QNDTFLAQAL RDLDKAPEGA KYKPSDVFTG
PMTIRIHEAD GTPYEHVVDL KEGFTKLDIQ YNTKYKRLKR TQRYRNGNAH RDEFEDDGSG
LIHCLGDVLM GDRETEEWKL SDWGQDDEEY MFNEAFEWIR VDSDFEWICK MSVGQPDYMY
ASQLQQDRDV VAQYEAVTFF SAQPPNAIHS SILLRTLMDR RYFYGIRILA AFGVATCAVS
QLKYIGKYHL MRAFQVMYCF PGSLVPKAND FQDFANYFVQ RSIPVALSTI QEDGRAPKDV
AEFLLDLVRY NENSGNPYSD SYYVSTLITS IVNSLKPPKD TVVDYDDAAL QDYVSRVTSE
ISKCQRLDRW LPSFKHVITT TALTQTEALI RDGYGKPKFS KLLSLSAPEN PPEVRLAAFS
SLLNLGGYRT PEILSYALVA AVKDPSMVVR TGTLRAVATA IGQVAVYGEY FSADDPAKPD
SNDPLVDRQH RIARRAVHTA IPLLQETIAK HPHVAKSIWS TLRTPGLGVF EQKILFEVLR
VAVEAKNSMV VTLKTPKLYK LAAKRKRDFL VVVKYRSVLR REKKTVVPRL TLPVIPSEPA
KPPALKLKLG FSL
//