ID A0A1J6HYF9_NICAT Unreviewed; 747 AA.
AC A0A1J6HYF9;
DT 15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2017, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE SubName: Full=Homeobox protein hat3.1 {ECO:0000313|EMBL:OIS97315.1};
GN Name=HAT3.1 {ECO:0000313|EMBL:OIS97315.1};
GN ORFNames=A4A49_18300 {ECO:0000313|EMBL:OIS97315.1};
OS Nicotiana attenuata (Coyote tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=49451 {ECO:0000313|EMBL:OIS97315.1, ECO:0000313|Proteomes:UP000187609};
RN [1] {ECO:0000313|EMBL:OIS97315.1, ECO:0000313|Proteomes:UP000187609}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. UT {ECO:0000313|Proteomes:UP000187609};
RC TISSUE=Leaves {ECO:0000313|EMBL:OIS97315.1};
RA Xu S., Brockmoeller T., Gaquerel E., Navarro A., Kuhl H., Gase K., Ling Z.,
RA Zhou W., Kreitzer C., Stanke M., Tang H., Lyons E., Pandey P., Pandey S.P.,
RA Timmermann B., Baldwin I.T.;
RT "The genome of Nicotiana attenuata.";
RL Submitted (NOV-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the PHD-associated homeobox family.
CC {ECO:0000256|ARBA:ARBA00007427}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OIS97315.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MJEQ01037193; OIS97315.1; -; Genomic_DNA.
DR RefSeq; XP_019256173.1; XM_019400628.1.
DR AlphaFoldDB; A0A1J6HYF9; -.
DR STRING; 49451.A0A1J6HYF9; -.
DR EnsemblPlants; OIS97315; OIS97315; A4A49_18300.
DR GeneID; 109234581; -.
DR Gramene; OIS97315; OIS97315; A4A49_18300.
DR KEGG; nau:109234581; -.
DR OMA; KHSKHIA; -.
DR OrthoDB; 473783at2759; -.
DR Proteomes; UP000187609; Chromosome 11.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:EnsemblPlants.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:EnsemblPlants.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd15504; PHD_PRHA_like; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR045876; PRHA-like_PHD-finger.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR12628:SF13; HOMEOBOX PROTEIN HAT3.1; 1.
DR PANTHER; PTHR12628; POLYCOMB-LIKE TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00628; PHD; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Reference proteome {ECO:0000313|Proteomes:UP000187609};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 222..279
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 562..622
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 564..623
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 310..572
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 630..650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 690..747
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..41
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..68
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 75..89
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..368
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 376..390
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..458
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 473..494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..534
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 535..552
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 690..718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 719..747
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 747 AA; 82741 MW; 829320407288B6DC CRC64;
MRTQLEDQTE MSTLGNTAVS PGKVARTTAR SHNTASAGKM SENPGVEQLG DACGNAGQNL
NLSECQEKTP GQPRKRKSTS GTPISSTRLL RSKSKEKSGA SEANNTVVTH EANEEKKRKR
RKKKHSKHIA VNEFTSIRGH LRYLLQRIKY EQTLIEAYSG EGWKGQSLEK IKLEKELERA
KAHIFRYKLK IRDLFQRVDT LLTQGRLPES LFDNEGEIDS EDIFCAKCGA KDLPADNDII
LCDGACERGF HQLCLEPPLL KEDIPPDDEG WLCPGCDCKV DCIDLLNDLQ GTNLSITDSW
EKVYPKEAAA AASGEKLDDI SGLPSDDSED DDYNPENPDV EKNDSGDESS SDESDFFSAS
EDLEEVPPKD DELLGLPSED SEDDDYTPDD PDKDEPVKTE SSSSDFTSDS EDLGLIVDTN
RLPGDELGVS SSVDNSKHSS ASQEEKPKGG RAKRNSLNDE LSDLMQSHSP LVSCKRHIER
LDYKKLHDET YGNESSDSSD EDFEGDPLPK VREIRSAKAA MTSPNSTPAD TKYQSGKKKV
SRHTDRGLCK KLKIGGMDTS EPHSSGKKKT YGEGAIKRLY ESFKENQYPD RDAKEKLGKE
LGLTAHQVSK WFENARHCHR HSSRWDTIMS QKVSKESPSS PNIMGEPLGT ESISTINNVL
CNGVGKMEPP KQCLNGEKCH AIDKSEGDLL IQEASGKKSR KPKAKNDTTD RGLDDTPTNK
TSKKQNTQIN SPNSQNVRRS SRLQKQG
//