ID W9YFW8_9EURO Unreviewed; 322 AA.
AC W9YFW8;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 22-FEB-2023, entry version 25.
DE RecName: Full=Transcriptional activator HAP2 {ECO:0000256|RuleBase:RU367155};
GN ORFNames=A1O1_05495 {ECO:0000313|EMBL:EXJ88565.1};
OS Capronia coronata CBS 617.96.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; Capronia.
OX NCBI_TaxID=1182541 {ECO:0000313|EMBL:EXJ88565.1, ECO:0000313|Proteomes:UP000019484};
RN [1] {ECO:0000313|EMBL:EXJ88565.1, ECO:0000313|Proteomes:UP000019484}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 617.96 {ECO:0000313|EMBL:EXJ88565.1,
RC ECO:0000313|Proteomes:UP000019484};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Capronia coronata CBS 617.96.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Component of the sequence-specific heterotrimeric
CC transcription factor (NF-Y) which specifically recognizes a 5'-CCAAT-3'
CC box motif found in the promoters of its target genes.
CC {ECO:0000256|RuleBase:RU367155}.
CC -!- SUBUNIT: Heterotrimer. {ECO:0000256|RuleBase:RU367155}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU367155}.
CC -!- SIMILARITY: Belongs to the NFYA/HAP2 subunit family.
CC {ECO:0000256|RuleBase:RU367155}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXJ88565.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMWN01000004; EXJ88565.1; -; Genomic_DNA.
DR RefSeq; XP_007724571.1; XM_007726381.1.
DR AlphaFoldDB; W9YFW8; -.
DR STRING; 1182541.W9YFW8; -.
DR GeneID; 19160370; -.
DR eggNOG; KOG1561; Eukaryota.
DR HOGENOM; CLU_045001_0_0_1; -.
DR OrthoDB; 5490901at2759; -.
DR Proteomes; UP000019484; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:UniProtKB-UniRule.
DR Gene3D; 6.10.250.2430; -; 1.
DR InterPro; IPR001289; NFYA.
DR PANTHER; PTHR12632:SF6; NUCLEAR TRANSCRIPTION FACTOR Y SUBUNIT ALPHA; 1.
DR PANTHER; PTHR12632; TRANSCRIPTION FACTOR NF-Y ALPHA-RELATED; 1.
DR Pfam; PF02045; CBFB_NFYA; 1.
DR PRINTS; PR00616; CCAATSUBUNTB.
DR SMART; SM00521; CBF; 1.
DR PROSITE; PS51152; NFYA_HAP2_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU367155};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367155};
KW Reference proteome {ECO:0000313|Proteomes:UP000019484};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU367155};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|RuleBase:RU367155}.
FT REGION 1..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 221..322
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..67
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 322 AA; 34666 MW; 9F7BD6A4EE93F6A2 CRC64;
MEYPPQYQQH NQHPHLQGAY QTSPQSAGPG SLQSPSGPQS HMQQHNPNQA SPILPSQTQN
HYQPQPAGGV HSSMGYPQYG VGAGMAPGYG ISPTQAAAMA TAAASGESIY SMDSRHELMD
PTGQMPPLPS GVGMPQANQM GQQRRMSQHL TSPHGQVSQP ALNHSMPRVS VPPAMPPQQP
VQAPQEELVA GGAEESPLYV NAKQFHRILK RRVARQKLEE QLRLTSKGRK PYLHESRHNH
AMRRPRGPGG RFLTADEVAA LEKGENPLGE NGTPATKKTE NTSSGTKRKA TDADNDTPAK
KSKVASESAE EEDEDDGEAD LG
//