ID A0A369H4R2_9HYPO Unreviewed; 564 AA.
AC A0A369H4R2;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=HSF-type DNA-binding domain-containing protein {ECO:0000259|SMART:SM00415};
GN ORFNames=CP533_2321 {ECO:0000313|EMBL:RDA91679.1};
OS Ophiocordyceps camponoti-saundersi (nom. inval.).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Ophiocordycipitaceae; Ophiocordyceps.
OX NCBI_TaxID=2039874 {ECO:0000313|EMBL:RDA91679.1, ECO:0000313|Proteomes:UP000253071};
RN [1] {ECO:0000313|EMBL:RDA91679.1, ECO:0000313|Proteomes:UP000253071}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BCC 79314 {ECO:0000313|EMBL:RDA91679.1,
RC ECO:0000313|Proteomes:UP000253071};
RA Kobmoo N., Wichadakul D., Arnamnart N., Rodriguez De La Vega R.C.,
RA Luangsa-Ard J.-J., Giraud T.;
RT "A genome scan of diversifying selection in the zombie-ant fungus
RT Ophiocordyceps unilateralis complex supports a role of enterotoxins in
RT coevolution and host-specificity.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HSF family. {ECO:0000256|RuleBase:RU004020}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RDA91679.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PDHQ01000065; RDA91679.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A369H4R2; -.
DR STRING; 2039874.A0A369H4R2; -.
DR Proteomes; UP000253071; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000232; HSF_DNA-bd.
DR InterPro; IPR027725; HSF_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR10015:SF396; FLOCCULATION SUPPRESSION PROTEIN; 1.
DR PANTHER; PTHR10015; HEAT SHOCK TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00447; HSF_DNA-bind; 1.
DR PRINTS; PR00056; HSFDOMAIN.
DR SMART; SM00415; HSF; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000253071}.
FT DOMAIN 87..190
FT /note="HSF-type DNA-binding"
FT /evidence="ECO:0000259|SMART:SM00415"
FT REGION 38..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 344..564
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 40..72
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..215
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..391
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 392..406
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 453..470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..518
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..564
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 564 AA; 61164 MW; 0B69582BC41C190B CRC64;
MGAPVAVPAA VPVVDMSVPG PGFDAMDVAF HRSHDLEVNG KSTSARSNRS SPPDQTPSSA
NVNMPAPPSS SSSAAAAAAA AVQPKIVQTA FIHKLYNMLE DSSIQHLISW SANAESFVMS
PSPDFSKVLS QYFKHTNISS FVRQLNMYGF HKERDVFHTG NPDTTLWEFK HGNGNFKRGD
LTGLREIKRR ASRHALVHRE TSYTKQAQAQ QGATADPAQM AAETWEARLA NMENSMYDMS
ARLQRGEETA YLRNQAMTDV INRLLHFNQE LSRALLSIAP ADNPAHRDVA TLQTEIQRQV
DALRLFDEPH ESAAASARQH FLGAVENAPV SPRQLAQDDI RRSAGLTVPS GRGQPLLYRP
GPPTSSGMLA SARRPYGSIS SSSTTQSSPL RNANPPAPPP HPLAHVEPPS ASLARRHTAA
DIRACGWQPG PSTFSTTSNP PTALWPPSPE DQRLRDSLST YTLQSASHPR SRPATPPPPP
SLAPSSHGVN GGNNGCDTFG SWSWNSSATT SNREVKSLAL RDSSAPPTRR GSMAHILNPS
DTAERSDEDE EARGDDDRKR KRMQ
//