ID A0A452E0J9_CAPHI Unreviewed; 524 AA.
AC A0A452E0J9;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=Heat shock transcription factor 1 {ECO:0000313|Ensembl:ENSCHIP00000005441.1};
GN Name=HSF1 {ECO:0000313|Ensembl:ENSCHIP00000005441.1};
OS Capra hircus (Goat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Capra.
OX NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000005441.1, ECO:0000313|Proteomes:UP000291000};
RN [1] {ECO:0000313|Ensembl:ENSCHIP00000005441.1, ECO:0000313|Proteomes:UP000291000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT "Polished mammalian reference genomes with single-molecule sequencing and
RT chromosome conformation capture applied to the Capra hircus genome.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCHIP00000005441.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HSF family. {ECO:0000256|ARBA:ARBA00006403,
CC ECO:0000256|RuleBase:RU004020}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWLT01000011; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A452E0J9; -.
DR Ensembl; ENSCHIT00000013127.1; ENSCHIP00000005441.1; ENSCHIG00000009445.1.
DR GeneTree; ENSGT00940000158421; -.
DR OMA; MPIFFEL; -.
DR Proteomes; UP000291000; Chromosome 14.
DR Bgee; ENSCHIG00000009445; Expressed in longissimus thoracis muscle and 16 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000232; HSF_DNA-bd.
DR InterPro; IPR027725; HSF_fam.
DR InterPro; IPR010542; Vert_HSTF_C.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR10015:SF274; HEAT SHOCK FACTOR PROTEIN 1; 1.
DR PANTHER; PTHR10015; HEAT SHOCK TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00447; HSF_DNA-bind; 1.
DR Pfam; PF06546; Vert_HS_TF; 1.
DR PRINTS; PR00056; HSFDOMAIN.
DR SMART; SM00415; HSF; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00434; HSF_DOMAIN; 1.
PE 3: Inferred from homology;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000291000}.
FT DOMAIN 57..81
FT /note="HSF-type DNA-binding"
FT /evidence="ECO:0000259|PROSITE:PS00434"
FT REGION 273..326
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 339..364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 493..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..326
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..353
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 438..452
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 524 AA; 56589 MW; A1933E3F228D9537 CRC64;
MDLPVGPGAA GPSNVPAFLT KLWTLVSDPD TDALICWSPS GNSFHVLDQG QFAKEVLPKY
FKHSNMASFV RQLNMYGFRK VVHIEQGGLV KPERDDTEFQ HPCFLRGQEQ LLENIKRKVT
SVSTLRSEDI KIRQDSVTKL LTDVQLMKGK QESMDSKLLA MKHENEALWR EVASLRQKHA
QQQKVVNKLI QFLISLVQSN RILGVKRKIP LMLNDSSPAH PMPKYGRQYS LEHIHGPGSY
PAASPAYSGS SLYSPDAVTS SGPIISDITE LAPGSPVASA GRSVDERPLS SSPLVRVKEE
PPSPPQSPRA EGASPSRPSS MVETPLSPTT LIDSILRESE PTPAASTTPL ADTGGRPASP
LPASAPEKCL SVACLDKTEL SDHLDAMDSN LDNLQTMLTT HGFSVDTSTL LDVSPSPARP
TPSPPRSQAV QPLGDGSRHE PEPPRPLEAE KSSPDSGKQL VHYTAQPLLL LDPGSVDVGS
SDLPVLFELG EGSYFSEGDD YSDDPTISLL TGSEPPKAKD PTVS
//