GenomeNet

Database: UniProt
Entry: A0A2T0FKW5_9ASCO
ID   A0A2T0FKW5_9ASCO        Unreviewed;       581 AA.
AC   A0A2T0FKW5;
DT   18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT   18-JUL-2018, sequence version 1.
DT   22-FEB-2023, entry version 16.
DE   SubName: Full=Protein SGM1 {ECO:0000313|EMBL:PRT55612.1};
GN   ORFNames=B9G98_03232 {ECO:0000313|EMBL:PRT55612.1};
OS   Wickerhamiella sorbophila.
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Trichomonascaceae; Wickerhamiella.
OX   NCBI_TaxID=45607 {ECO:0000313|EMBL:PRT55612.1, ECO:0000313|Proteomes:UP000238350};
RN   [1] {ECO:0000313|EMBL:PRT55612.1, ECO:0000313|Proteomes:UP000238350}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DS02 {ECO:0000313|EMBL:PRT55612.1,
RC   ECO:0000313|Proteomes:UP000238350};
RA   Ahn J.O.;
RT   "Genome sequencing of [Candida] sorbophila.";
RL   Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PRT55612.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NDIQ01000021; PRT55612.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2T0FKW5; -.
DR   STRING; 45607.A0A2T0FKW5; -.
DR   OrthoDB; 2054925at2759; -.
DR   Proteomes; UP000238350; Unassembled WGS sequence.
DR   Gene3D; 1.10.287.1490; -; 1.
DR   InterPro; IPR022091; TMF_TATA-bd.
DR   PANTHER; PTHR46515:SF1; TATA ELEMENT MODULATORY FACTOR; 1.
DR   PANTHER; PTHR46515; TATA ELEMENT MODULATORY FACTOR TMF1; 1.
DR   Pfam; PF12325; TMF_TATA_bd; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000238350}.
FT   DOMAIN          471..579
FT                   /note="TATA element modulatory factor 1 TATA binding"
FT                   /evidence="ECO:0000259|Pfam:PF12325"
FT   REGION          20..76
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          367..391
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          169..244
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          485..578
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        368..387
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   581 AA;  63991 MW;  D4D5D64CF1447280 CRC64;
     MASEWSNYLK RAVATVERRL DEALEMPPSS GNTENPEKAD ETHIESTGEA GSGSSDEAEP
     GPPALEPGPP LTFPDLSGAV SAVQKLALDI SEAGDENLKD TSLNALDLCE AAGREYDAFI
     EDLNAKISAL LSKIKYLSHS RVQKLSSSKE AAAARESQIA SLIEEGQRLA QQELKLNMVI
     KKLRSAEKHT PQVVVKPQME VNTAELEKAL SDLEIERTRA TSLQESFEKE LAGEKSNVDA
     LRKQIFDQSA LIEHYRSLAE ASSQDQHVRD LQRRLDTAQA HHAAATDNWN NIEAGLLSRI
     AELEAKASDD EKQRTGTQQK LQATQKSLFE ARDAASASAA AAEAAQTEAK RAFRQVEELK
     EQIEKLESVT KSQSAELNAA RQEAEHTSES FSQARVEWEQ READLVAAAT PTPLTGRNDS
     FFGSEVSLPS LHPLDACTPT PRSPIESRRD SLASDFDHSM FMSNASSINM RSESSHMLGK
     LNMTVRRLET ELTSTKQALE LANAEKTDAY NELLGAINAS EELEEMRKES ESHKTKIAEL
     EAKIDSLLVR LGEKSERVSE LEADVQDLKE AYREQIEALL R
//