ID S8CMM3_9LAMI Unreviewed; 95 AA.
AC S8CMM3;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 22-FEB-2023, entry version 40.
DE RecName: Full=Homeobox-leucine zipper protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=HD-ZIP protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=Homeodomain transcription factor {ECO:0000256|RuleBase:RU369038};
DE Flags: Fragment;
GN ORFNames=M569_06366 {ECO:0000313|EMBL:EPS68404.1};
OS Genlisea aurea.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Lamiales; Lentibulariaceae; Genlisea.
OX NCBI_TaxID=192259 {ECO:0000313|EMBL:EPS68404.1, ECO:0000313|Proteomes:UP000015453};
RN [1] {ECO:0000313|EMBL:EPS68404.1, ECO:0000313|Proteomes:UP000015453}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23855885;
RA Leushkin E.V., Sutormin R.A., Nabieva E.R., Penin A.A., Kondrashov A.S.,
RA Logacheva M.D.;
RT "The miniature genome of a carnivorous plant Genlisea aurea contains a low
RT number of genes and short non-coding sequences.";
RL BMC Genomics 14:476-476(2013).
CC -!- FUNCTION: Transcription factor. {ECO:0000256|RuleBase:RU369038}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class I subfamily.
CC {ECO:0000256|ARBA:ARBA00025748, ECO:0000256|RuleBase:RU369038}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPS68404.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AUSU01002631; EPS68404.1; -; Genomic_DNA.
DR AlphaFoldDB; S8CMM3; -.
DR OrthoDB; 679932at2759; -.
DR Proteomes; UP000015453; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:UniProtKB-UniRule.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR045224; HDZip_class_I_plant.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003106; Leu_zip_homeo.
DR PANTHER; PTHR24326; HOMEOBOX-LEUCINE ZIPPER PROTEIN; 1.
DR PANTHER; PTHR24326:SF225; HOMEOBOX-LEUCINE ZIPPER PROTEIN HOX23; 1.
DR Pfam; PF02183; HALZ; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000015453};
KW Transcription {ECO:0000256|RuleBase:RU369038};
KW Transcription regulation {ECO:0000256|RuleBase:RU369038}.
FT DOMAIN 1..61
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 3..62
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 53..90
FT /evidence="ECO:0000256|SAM:Coils"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EPS68404.1"
FT NON_TER 95
FT /evidence="ECO:0000313|EMBL:EPS68404.1"
SQ SEQUENCE 95 AA; 11184 MW; 1C12CA5AFDC40869 CRC64;
GGGGEKKRRL SPDQVRTLER SFESGDRLEP ERRMELARGL GLEPRQVSVW FQNRRARWKA
KQLEKDYEAL RRELQEIRAL NDALKTHNNK LVSQV
//