ID A0A3B6GNY6_WHEAT Unreviewed; 1146 AA.
AC A0A3B6GNY6;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=CFC21_046288 {ECO:0000313|EMBL:KAF7035404.1};
OS Triticum aestivum (Wheat).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Triticum.
OX NCBI_TaxID=4565 {ECO:0000313|EnsemblPlants:TraesCS3D02G114300.2};
RN [1] {ECO:0000313|EMBL:KAF7035404.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7035404.1};
RX PubMed=29069494;
RA Zimin A.V., Puiu D., Hall R., Kingan S., Clavijo B.J., Salzberg S.L.;
RT "The first near-complete assembly of the hexaploid bread wheat genome,
RT Triticum aestivum.";
RL Gigascience 6:1-7(2017).
RN [2] {ECO:0000313|EnsemblPlants:TraesCS3D02G114300.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Chinese Spring
RC {ECO:0000313|EnsemblPlants:TraesCS3D02G114300.2};
RX PubMed=30115783; DOI=10.1126/science.aar7191;
RG International wheat genome sequencing consortium (IWGSC);
RT "Shifting the limits in wheat research and breeding using a fully annotated
RT reference genome.";
RL Science 361:EAAR7191-EAAR7191(2018).
RN [3] {ECO:0000313|EnsemblPlants:TraesCS3D02G114300.2}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (OCT-2018) to UniProtKB.
RN [4] {ECO:0000313|EMBL:KAF7035404.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7035404.1};
RA Zimin A.V., Puiu D., Shumante A., Alonge M., Salzberg S.L.;
RT "The second near-complete assembly of the hexaploid bread wheat (Triticum
RT aestivum) genome.";
RL Submitted (MAR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM022219; KAF7035404.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3B6GNY6; -.
DR SMR; A0A3B6GNY6; -.
DR STRING; 4565.A0A3B6GNY6; -.
DR EnsemblPlants; TraesCS3D02G114300.2; TraesCS3D02G114300.2; TraesCS3D02G114300.
DR Gramene; TraesCS3D02G114300.2; TraesCS3D02G114300.2; TraesCS3D02G114300.
DR Gramene; TraesCS3D03G0237900.3; TraesCS3D03G0237900.3.CDS; TraesCS3D03G0237900.
DR Proteomes; UP000019116; Chromosome 3D.
DR Proteomes; UP000815260; Chromosome 3D.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0005667; C:transcription regulator complex; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR018501; DDT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR044977; RLT1-3.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR PANTHER; PTHR36968; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR PANTHER; PTHR36968:SF8; HOMEOBOX-DDT DOMAIN PROTEIN RLT3 ISOFORM X1; 1.
DR Pfam; PF02791; DDT; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000019116}.
FT DOMAIN 8..68
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 10..69
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 69..109
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 581..626
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 869..893
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 78..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..277
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..608
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1146 AA; 128358 MW; 39CD30D490646AB2 CRC64;
MMAKGFLAKS DNAGTKKSPL QIQMLESFYS EVQYPKPEDL TEYAASVGLT YNQVRIWFKE
RRRKERRHME AAEVHVETQA SARSNWPRCS SSRSSNSSQS PMQGIAGLQP EDDITLGRSM
SLVGEKHTLR SQVLFPKDYI LRKVFRKDGP SLGGDPDLLP ERAHVHVRVA DTTGHHSYQD
QSVLKKRKIM SPTAQRSTLP FENNDPVRKH GKGKGLMTVW HAMYSQTAEI QDCSSFIDES
GCLRSLRPFE DFGGKLAQKQ TVPRKKVNKK SRPPPSKRKV PCGRVTDLKE HPPVECHLSV
DESESSELRT EQATLVDDEE LELSELQAGP NPLRCSAHIS STGRHGCPLC KDLLARFPPP
SVRMKQPFPT KPWESSPEMV VRFVYTHFGS MDVHPFTFDE FAQAFHDKDS SLLGKVHVSL
LKLLMLNTER GSGSVFVPRS SKDSRFSSFL NFVREQEFDV NFWIKSLNSL TWVEILRQVL
VASGFGSDHH MLNRNFFNKE KNQMVKYGLR PRTLKGELFT LLSKKGSGGL KVAELAKSPQ
IIGLNLSGAS EVEQLIFSTL SSDITLFEKI APSAYRLRVD PRIKGKEDPR SDTEDSGTVD
DDGDASSSGD ESDGPQESYP EHESRIVRWR QKNVHKNMNK CSEIDESYSG ERWLLGLMEG
EYSDLSIDEK LDCLVALIDV VSGAGSVPRL EEPQSVLSNI QRAQSHASGG KIKKCTRTIY
QSSDEYLNRP GSSHSFDSSM QGQSGTLRSQ DYIADSGANE SPTGFAHQPQ IVLLGSDRRY
NNYWLFLGPC RADDPGHRRV YFESSEDGHW EVIDSPQDLL SLLSVLDIRG TREAYLLASM
KKRQSCLFEG MKKHLEDGCV VALTASSDSS RSETSSGNRY SPKPSSGDGA SPLSDIDSAS
VPTYLAGNLQ NASSAIGIEV GRRSDEKMLK WERLQALDKW IWTSFYSSLT AVKCGKRSFK
ESLVHCESCH DLYWRDEKHC RICHSTFEVG FDLEERYAIH VATCREPEDL YDVPNHKVLP
SQLQALKAAI HAIEARMPTA AFAGLWMKSS HNLWVKRLRR TSSLPELLQV LVDFVGAIDE
DWLYQSSSAV SFSSYLDDIT VYFQTMPQTT SAVALWVVKL DALIAPDLAQ ADSCRGLGKG
SIQTRA
//