ID A0A103XQR4_CYNCS Unreviewed; 397 AA.
AC A0A103XQR4;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Basic-leucine zipper domain-containing protein {ECO:0000313|EMBL:KVH95158.1};
GN ORFNames=Ccrd_002764 {ECO:0000313|EMBL:KVH95158.1};
OS Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC Carduinae; Cynara.
OX NCBI_TaxID=59895 {ECO:0000313|EMBL:KVH95158.1, ECO:0000313|Proteomes:UP000243975};
RN [1] {ECO:0000313|EMBL:KVH95158.1, ECO:0000313|Proteomes:UP000243975}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2C {ECO:0000313|EMBL:KVH95158.1};
RX PubMed=26786968; DOI=10.1038/srep19427;
RA Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT "The genome sequence of the outbreeding globe artichoke constructed de novo
RT incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL Sci. Rep. 6:19427-19427(2016).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KVH95158.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LEKV01004398; KVH95158.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A103XQR4; -.
DR STRING; 59895.A0A103XQR4; -.
DR EnsemblPlants; KVH95158; KVH95158; Ccrd_002764.
DR Gramene; KVH95158; KVH95158; Ccrd_002764.
DR OMA; MAIMPMS; -.
DR Proteomes; UP000243975; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd14702; bZIP_plant_GBF1; 1.
DR Gene3D; 1.20.5.170; -; 1.
DR InterPro; IPR004827; bZIP.
DR InterPro; IPR045314; bZIP_plant_GBF1.
DR InterPro; IPR046347; bZIP_sf.
DR InterPro; IPR044827; GBF-like.
DR InterPro; IPR012900; MFMR.
DR PANTHER; PTHR45967:SF2; BZIP TRANSCRIPTION FACTOR 68; 1.
DR PANTHER; PTHR45967; G-BOX-BINDING FACTOR 3-RELATED; 1.
DR Pfam; PF00170; bZIP_1; 1.
DR Pfam; PF07777; MFMR; 1.
DR Pfam; PF16596; MFMR_assoc; 1.
DR SMART; SM00338; BRLZ; 1.
DR SUPFAM; SSF57959; Leucine zipper domain; 1.
DR PROSITE; PS50217; BZIP; 1.
DR PROSITE; PS00036; BZIP_BASIC; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000243975}.
FT DOMAIN 311..374
FT /note="BZIP"
FT /evidence="ECO:0000259|PROSITE:PS50217"
FT REGION 1..40
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 116..194
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 209..232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 310..332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 373..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..194
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 211..232
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 376..390
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 397 AA; 42397 MW; 2EEE1B6E0329285A CRC64;
MGSRDMDKPA KEAKDSAKET PTSQEQSSGN AMGAVPDWTG FQAYSPMPPH GYLASSPQPH
PYMWGVQHLM PPYGTAPHPY VAMYPHGGIY AHPSMPPGSY PFSPFAMPSP NGVAEVSGNT
PGSMEVNGKS PEGKEKLPIK RSKGSLGSLN MITGKNSEPS KAGASANGSY PKSAESGSEG
SSEGSDANSE NVEFPNEIRI QARFYGRFGS GEASQNGNTV HGSQNGGPNT PHSVVNPTMG
IVPISAGATL GSVAGPTTNL NIGMDYWSGA NTSNIPAMRG HVTSAPVAGG MVAAGSRESM
QPQLWIQDER ELKRQRRKQS NRESARRSRL RKQAECDELA QRAETLKEEN ASLRAEVSRI
RSDYEQLLAE NASLKERVGE SQEEARENHQ TEVVQSG
//