ID A0A1Z5TPI2_HORWE Unreviewed; 334 AA.
AC A0A1Z5TPI2;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 28-JUN-2023, entry version 18.
DE RecName: Full=Transcriptional activator HAP2 {ECO:0000256|RuleBase:RU367155};
GN ORFNames=BTJ68_02437 {ECO:0000313|EMBL:OTA37914.1};
OS Hortaea werneckii EXF-2000.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Mycosphaerellales; Teratosphaeriaceae; Hortaea.
OX NCBI_TaxID=1157616 {ECO:0000313|EMBL:OTA37914.1, ECO:0000313|Proteomes:UP000194280};
RN [1] {ECO:0000313|EMBL:OTA37914.1, ECO:0000313|Proteomes:UP000194280}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=EXF-2000 {ECO:0000313|EMBL:OTA37914.1,
RC ECO:0000313|Proteomes:UP000194280};
RA Sinha S., Flibotte S., Neira M., Lenassi M., Gostincar C., Stajich J.E.,
RA Nislow C.E.;
RT "The recent genome duplication of the halophilic yeast Hortaea werneckii:
RT insights from long-read sequencing.";
RL Submitted (JAN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Component of the sequence-specific heterotrimeric
CC transcription factor (NF-Y) which specifically recognizes a 5'-CCAAT-3'
CC box motif found in the promoters of its target genes.
CC {ECO:0000256|RuleBase:RU367155}.
CC -!- SUBUNIT: Heterotrimer. {ECO:0000256|RuleBase:RU367155}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU367155}.
CC -!- SIMILARITY: Belongs to the NFYA/HAP2 subunit family.
CC {ECO:0000256|RuleBase:RU367155}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OTA37914.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MUNK01000015; OTA37914.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Z5TPI2; -.
DR STRING; 1157616.A0A1Z5TPI2; -.
DR VEuPathDB; FungiDB:BTJ68_02437; -.
DR InParanoid; A0A1Z5TPI2; -.
DR OrthoDB; 5490901at2759; -.
DR Proteomes; UP000194280; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:UniProtKB-UniRule.
DR Gene3D; 6.10.250.2430; -; 1.
DR InterPro; IPR001289; NFYA.
DR PANTHER; PTHR12632:SF6; NUCLEAR TRANSCRIPTION FACTOR Y SUBUNIT ALPHA; 1.
DR PANTHER; PTHR12632; TRANSCRIPTION FACTOR NF-Y ALPHA-RELATED; 1.
DR Pfam; PF02045; CBFB_NFYA; 1.
DR PRINTS; PR00616; CCAATSUBUNTB.
DR SMART; SM00521; CBF; 1.
DR PROSITE; PS51152; NFYA_HAP2_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU367155};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367155};
KW Reference proteome {ECO:0000313|Proteomes:UP000194280};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU367155};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|RuleBase:RU367155}.
FT REGION 1..70
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 102..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 239..334
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..47
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 56..70
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..166
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..202
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..326
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 334 AA; 36389 MW; B64FF0AAC1594DA2 CRC64;
MDYAQHQYQP PPSYPHTHPQ PQQSPVMQTQ PPPNYGQHQQ QPQMAGQHMM MQPGYNPAYS
MSQPYGVSPS QAAAMATAAA SGYPAYSMGD SLQSLPQTSP RIQQVKQDGP GMQRASPQSP
RQNQMSLSST MAGQMPMPTA QQMTPQQMQQ AQRRMSHSVA QSPAGVPQQQ APPPMQAVPP
RQSVPPAPQQ QPPQPSQGPQ GSPDAAPPVA GTAEESPLYV NAKQFHRILK RRMARQKLEE
QLRLTSKGRK PYLHESRHNH AMRRPRGPGG RFLTADEVAA MEARGGDGGG EDEKDVKENG
LTNGSTKRKA GKDSDTPSKR SKSSPEDDDE DDDG
//