ID A0A2G5F3C1_AQUCA Unreviewed; 315 AA.
AC A0A2G5F3C1;
DT 31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT 31-JAN-2018, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=Homeobox-leucine zipper protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=HD-ZIP protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=Homeodomain transcription factor {ECO:0000256|RuleBase:RU369038};
GN ORFNames=AQUCO_00200482v1 {ECO:0000313|EMBL:PIA62491.1};
OS Aquilegia coerulea (Rocky mountain columbine).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Ranunculales; Ranunculaceae; Thalictroideae;
OC Aquilegia.
OX NCBI_TaxID=218851 {ECO:0000313|EMBL:PIA62491.1, ECO:0000313|Proteomes:UP000230069};
RN [1] {ECO:0000313|EMBL:PIA62491.1, ECO:0000313|Proteomes:UP000230069}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Goldsmith {ECO:0000313|Proteomes:UP000230069};
RA Hodges S., Kramer E., Nordborg M., Tomkins J., Borevitz J., Derieg N.,
RA Yan J., Mihaltcheva S., Hayes R.D., Rokhsar D.;
RT "WGS assembly of Aquilegia coerulea Goldsmith.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Transcription factor. {ECO:0000256|RuleBase:RU369038}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class I subfamily.
CC {ECO:0000256|ARBA:ARBA00025748, ECO:0000256|RuleBase:RU369038}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KZ305019; PIA62491.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2G5F3C1; -.
DR InParanoid; A0A2G5F3C1; -.
DR OrthoDB; 466194at2759; -.
DR Proteomes; UP000230069; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:UniProtKB-UniRule.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.20.5.400; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR045224; HDZip_class_I_plant.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003106; Leu_zip_homeo.
DR PANTHER; PTHR24326; HOMEOBOX-LEUCINE ZIPPER PROTEIN; 1.
DR PANTHER; PTHR24326:SF606; HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-54; 1.
DR Pfam; PF02183; HALZ; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000230069};
KW Transcription {ECO:0000256|RuleBase:RU369038};
KW Transcription regulation {ECO:0000256|RuleBase:RU369038}.
FT DOMAIN 80..140
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 82..141
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 175..214
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..191
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..214
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 315 AA; 36076 MW; C9CEA628CEF1D0F1 CRC64;
MASRKVYSAS NVPVLLQNDG ISCSNGAFEA LLMPNSTSCF NGKRSMVNFE DACRGNTQGM
SFYHTRDQEE NGDDDLDDCF RQPEKKRRLS VDQVQFLEKS FEVENKLEPE RKVQLAKDLG
LQPRQVAIWF QNRRARWKTK QLEKDYDALK ASYDSLKSNY ENLLKEKEQL KAEVSSLTDK
QFHKENVSEN SETSKPMEPS QPLAQPDSVP ETKTSTIICK QEDLSSVNSD VFDSDSPHYA
DGVHSSFLET GDSSHVFEPD QSDLSQDEED NLSKNLRHVV YNFPKLEDPV YPDQSADCGN
YEFPVEDQAL WFWSY
//