ID D7KI49_ARALL Unreviewed; 531 AA.
AC D7KI49;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=ARALYDRAFT_472196 {ECO:0000313|EMBL:EFH69321.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/BELL homeobox family.
CC {ECO:0000256|ARBA:ARBA00006454}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348713; EFH69321.1; -; Genomic_DNA.
DR RefSeq; XP_002893062.1; XM_002893016.1.
DR AlphaFoldDB; D7KI49; -.
DR STRING; 81972.D7KI49; -.
DR EnsemblPlants; fgenesh2_kg.1__2150__AT1G19700.1; fgenesh2_kg.1__2150__AT1G19700.1; fgenesh2_kg.1__2150__AT1G19700.1.
DR Gramene; fgenesh2_kg.1__2150__AT1G19700.1; fgenesh2_kg.1__2150__AT1G19700.1; fgenesh2_kg.1__2150__AT1G19700.1.
DR eggNOG; KOG0773; Eukaryota.
DR HOGENOM; CLU_011058_6_1_1; -.
DR OrthoDB; 681284at2759; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR ExpressionAtlas; D7KI49; baseline.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR006563; POX_dom.
DR PANTHER; PTHR11850:SF394; BEL1-LIKE HOMEODOMAIN PROTEIN 10; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF07526; POX; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00574; POX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008694}.
FT DOMAIN 344..407
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 346..408
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 193..216
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 429..493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..469
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 477..493
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 531 AA; 60045 MW; 705F5FFC41B9EE17 CRC64;
MAVYYPSNVG CYQQEPIFLN HQQQQASSSS AAASFTVTGS DNVRNEMVFI PPTTTGDVVT
GNGAVSSSDL SFHDGQGLSL SLGTQISVAP FHFHQYQLGF TQNPSTSVKE TSPFNVDEMS
VKSKEMMLLS QSDPSSGYAG SGFYNNYRYN ETSGGFMSSV LRSRYLKPAQ NLLDEVVSVK
KELNQMGKKK MKVNDFNNGS KEIEGGGSGE LSNDLNGKSM ELSTVEREEL QNKKNKLLTM
VDEVDKRYNQ YYHQMEALAS SFEIVAGLGS AKAYTSVALN RISRHFRALR DAIKEQIQII
REKLGEKGGE SLDEQQGERI PRLRYLDQRL RQQRALHQQL GMVRPAWRPQ RGLPENSVSV
LRAWLFEHFL HPYPKESEKI MLAKQTGLSK NQVANWFINA RVRLWKPMIE EMYKEEFGDE
SELLISKSSQ EPNSTNQEDS SSQQQQQQEN NNNLTYSSAD TTNIVFSSET KPDRVLGNDN
EPQQPQINRS SDYDTLMNYH GFGVDDYRYI SGSNQQESRF SNSHHLHDFV V
//