ID A0A1U8N1W6_GOSHI Unreviewed; 1970 AA.
AC A0A1U8N1W6;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Histone-lysine N-methyltransferase ASHH2-like {ECO:0000313|RefSeq:XP_016733040.1, ECO:0000313|RefSeq:XP_016733041.1};
GN Name=LOC107943759 {ECO:0000313|RefSeq:XP_016733040.1,
GN ECO:0000313|RefSeq:XP_016733041.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016733040.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016733040.1, ECO:0000313|RefSeq:XP_016733041.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016733040.1,
RC ECO:0000313|RefSeq:XP_016733041.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016733040.1; XM_016877551.1.
DR RefSeq; XP_016733041.1; XM_016877552.1.
DR STRING; 3635.A0A1U8N1W6; -.
DR PaxDb; 3635-A0A1U8N1W6; -.
DR GeneID; 107943759; -.
DR KEGG; ghi:107943759; -.
DR OrthoDB; 54704at2759; -.
DR Proteomes; UP000189702; Unplaced.
DR GO; GO:0000785; C:chromatin; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central.
DR CDD; cd19172; SET_SETD2; 1.
DR Gene3D; 3.30.40.100; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044437; SETD2/Set2_SET.
DR InterPro; IPR011124; Znf_CW.
DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF07496; zf-CW; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS51050; ZF_CW; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000189702};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 920..973
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DOMAIN 1034..1084
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1078..1203
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1211..1227
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 422..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 498..540
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 820..849
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 868..887
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1389..1420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1680..1710
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1910..1970
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..540
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1686..1709
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1917..1942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1943..1970
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1970 AA; 217208 MW; C613E319AD98DB33 CRC64;
MCLCESTALV NEPLSVVASA EQHSCSESME NLVPEPRDCI IRDSSGDSTG NRYDDTVVYL
EENRGESNGD SADYSYENHC ENVDCSGLKE FLGARIDDHV ACLNVSPGKI DVHNSENDQL
CLENRLFSGK YVPTAINGSS GLSQDEYSAC LSSGTEIDTE IYNRIQSVKD SNLTLESIAI
AGCRSGCAQQ NGQNDNNIVR GPLLDGKNCA SSMIKSVTSS EISAICCAQT LSSLQGSGDS
VSSCDWLNQK DDMSSRDLSL ESFNKAVEAK SIGDTYSKLL ASKCCISSFE TLHRAESLCT
KQNAQIDNKN FIVLSGDSVA KVSEERTDIA AGAKVETSSE IMNAGDSFEL SENSLCDKLV
PLSCHPFDIV ENGLSGRLDP PDCLTNGAYA ALNSSSSIDF CGQRQNEGKV VIKADCVSEI
KHHPTESSSS RRGGRKGKSS QKTNAKTRNY RNKLQQPPES IELHFRASRR KRSCSSRPGR
SSIQGLFSNI TQFLEPCDDP EFNEVQNQKP SNGRDGQGSR KSCKDQSGQS IKGSGGLSKS
STSCLRFRIK VGKGVGPSNL NSVVAEVVNL PVSVDTSFSI YGKGTGLQFP KLANVAEDKV
GELGIERQFL NKEDQEKVKT CLDASFMDLK LTNNVSGSAE YLKKYAEDAL GDYLVSKPNA
LAESSGRAID NKYSGSGTSP DSVVINSIPD AQVGLIHQEE LHDPVLNNSG FLASPGGVKS
SMVSKKGKKD NHRSPRTVCL RKAKSSNNCR GRTKTRDNEF ISNKAISSSA GANSSRGNGL
GVSEEAMKMD INMDAKACCS HHVPETKKFK NLSSTKYTLN QLSKSSKSQG VRKERSKVSD
SAGSRKGNAC KQWGDELKSV SKIKVKEKGS NQEIVTRGGK HPLTGNHISD DFENSDAGNS
SASAYMTNID SVSDVIKQHR QPDNAWVCCD DCHKWRRIPV ILLNSIDEAC RWICGDNMDK
TFADCSIPQE KSNADINAEL GVSDAEEDGC DGLNYKEFDK GFNNKRVTVP PPSHFWRIDS
NKFLHRGCKT QTIDEIMICH CKRPPDGNLG CGDECLNRML NIECVQDTCP CGELCSNQQF
QKRKYAKMMW DRFGRKGFGL RMLESISAGQ FLIEYVGEVL DMQAYEARQK EYASRGQRHF
YFMTLNGSEV IDAYVKGNLG RFINHSCDPN CRTEKWMVNG EICIGLFALR DIKKGEEITF
DYNYVRVFGA AAKKCHCGSS HCRGYIGGDS LSEGVIVYDD SDVESPEPMM LEDGETWNGY
ANVISRSSPF VGAEMQPVER VITDGVRKLE KMPEAEGSVY HSASASSKLD ISAEIEDLQG
NFQLPIEPEE VSPLTAPYEP VQQDDTIQQK AMKKTSRLIH ILDTFLNMSD NKLPSVFIDA
NKESKFNTAE DKRVPPKSHP LMKASCLSSS HKKGKLSSNS LNGTKVRMIS DKSQVPSFKL
KKFSETSSSC RFEAVEEKLN ELLDSEGGIT KRKDASKGYL KLLLLTATSG DSCNGEAIQS
TRELSMILDA LLKTKSRLVL TDIIDKNGLQ MLHNIMKKYR RDFNKIPVLR KLLKVLEYLA
RRKILTVERI NGGPPCAGRE SFLESILSFT EHYDKTVHEI ARNFRDTWIP KPLRKHSYRD
KVERRMEFCR YLDCNRVSAS HNHSREQAIR STEAITVVEK TTLDTSHEIC SSSPTGVCQT
NGTKIRKRKS RWDQPAETEK IDSRSPKKHE YSQLTILGKP TSNHMNKLSR WDKECHDILC
KGEAVNVVNG KHRFQGDAPP GFSSPCSASL VSSTAALTAT SFPQPKTCQL KCPEMTIAHP
QTRLISRLPV SYGIPLPIVQ RFGAPKDESV ESWVIAPGMP FHPYPPLPPS PCPHGRKDTP
PVCAANSIGN NEDAKDEQQD CCRPATSYPD NSIRSTAHCN EPNSEIPCAN IQRTSKRTRE
SSNDLGKYFR QQKRKGPLWH KSESTGSKHN NIGGTSFLDV GNVKNDVRNS
//