ID A0A1Q3CBL0_CEPFO Unreviewed; 357 AA.
AC A0A1Q3CBL0;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Homeobox domain-containing protein/HALZ domain-containing protein {ECO:0000313|EMBL:GAV77620.1};
GN ORFNames=CFOL_v3_21091 {ECO:0000313|EMBL:GAV77620.1};
OS Cephalotus follicularis (Albany pitcher plant).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Oxalidales; Cephalotaceae; Cephalotus.
OX NCBI_TaxID=3775 {ECO:0000313|EMBL:GAV77620.1, ECO:0000313|Proteomes:UP000187406};
RN [1] {ECO:0000313|Proteomes:UP000187406}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. St1 {ECO:0000313|Proteomes:UP000187406};
RA Fukushima K., Hasebe M., Fang X.;
RT "Cephalotus genome sequencing.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class II subfamily.
CC {ECO:0000256|ARBA:ARBA00006074}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAV77620.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BDDD01001651; GAV77620.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Q3CBL0; -.
DR InParanoid; A0A1Q3CBL0; -.
DR OrthoDB; 419995at2759; -.
DR Proteomes; UP000187406; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR006712; HD-ZIP_N.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003106; Leu_zip_homeo.
DR PANTHER; PTHR45714; HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT14; 1.
DR PANTHER; PTHR45714:SF39; HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT14; 1.
DR Pfam; PF02183; HALZ; 1.
DR Pfam; PF04618; HD-ZIP_N; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00340; HALZ; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000187406}.
FT DOMAIN 181..241
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 183..242
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 47..132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 167..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 302..357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 48..66
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..132
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..190
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..346
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 357 AA; 39532 MW; 7141D117E093AC9C CRC64;
MELGLSLGDA SKPFGIMDKS REVVTTTKGV GFCMALSIGS NLIDHQQTTE VKKTKPSSDE
DRHQISALND PPIQLNLLPN TPVLWPSSDN GSSEGGSSGQ MGRGFDVNRF PNSVEDTQEA
VSSSPPNSAA SSFHQMDFCM YTSTGSLGRH MRDLVADGGV GNNEVVDAER TSSRASDDDE
NGSSRKKLRL SKEQSAFLEE SFKEHSTLNP KQKLALAKQL NLRPRQVEVW FQNRRARTKL
KQTEVDCDHL KRCCETLTEE NRRLHKELQE LRALKASNPF YMQLPATTLT MCPSCERVAT
TTTTTTTTIS NTDSTDPTSK ANGFSLTRPR FYPFSQAQNH TQASTTSRDH VVKHVEK
//