ID C5YRA3_SORBI Unreviewed; 1033 AA.
AC C5YRA3;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 01-SEP-2009, sequence version 1.
DT 06-MAR-2013, entry version 19.
DE SubName: Full=Putative uncharacterized protein Sb08g002530;
GN Name=Sb08g002530; ORFNames=SORBIDRAFT_08g002530;
OS Sorghum bicolor (Sorghum) (Sorghum vulgare).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
OC PACMAD clade; Panicoideae; Andropogoneae; Sorghum.
OX NCBI_TaxID=4558;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623;
RX PubMed=19189423; DOI=10.1038/nature07723;
RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A.,
RA Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K.,
RA Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E.,
RA Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W.,
RA Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M.,
RA Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S.,
RA McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman M., Ware D.,
RA Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.;
RT "The Sorghum bicolor genome and the diversification of grasses.";
RL Nature 457:551-556(2009).
CC -!- SIMILARITY: Contains 1 SET domain.
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; CM000767; EES16607.1; -; Genomic_DNA.
DR RefSeq; XP_002442769.1; XM_002442724.1.
DR EnsemblPlants; Sb08g002530.1; Sb08g002530.1; Sb08g002530.
DR GeneID; 8074853; -.
DR KEGG; sbi:SORBI_08g002530; -.
DR Gramene; C5YRA3; -.
DR eggNOG; COG2940; -.
DR HOGENOM; HOG000085131; -.
DR OMA; CARASIN; -.
DR ProtClustDB; CLSN2698998; -.
DR InterPro; IPR001214; SET_dom.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00317; SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Complete proteome; Reference proteome.
SQ SEQUENCE 1033 AA; 114820 MW; 0D5C1D6495542281 CRC64;
MVSGIGCSYP VTGRKRVKLL ATELSDSEPL VCCVPICHDS LGDLLDRCSE RHHVASGSGD
PTENTGVFPA MQEYACSTVN NGVVYPQSAM GYSSGQIGAQ GAYMQHQHFE GCMYMSGNGQ
MCGPYPPEQL YEGLSTGFLP QSLAVYAVFG GKTADPVQLI FLKQFLSQWN VGAMASTPNA
STETAKVASH AKMIYHVHGK FGPFTLVSLM GMWSGEHTER SEATDNDSAS FNGLVGDIVD
DVSHQLHAGI MKSARRVLID EIFSSILPDL IVSKKTEKQL AAKLKNQVTK PDSVSNKKDS
TVKATPVIQP QWIVRRQFNL QQCMRNCLLN LPSTSISVTP DDKIKAQDSD ELSPKDLDAT
ECDMDFPPGF GPCWKSPESS LSPSLLEVNG SAKTDGKSES GTTLFSGPLA VVQRMLANDL
YISSKQSLFH YFEEVIAEEI TNCLCFGLES SIDQEQIGTP IHAPESPVSA ETSMHETLNP
IEMTGGDELN IVEMAMSTRT TPIEMAVDEE LNTVEATRTS LTEPTSAETS TAAEMATDKM
PTSHVEEHLS MSYARIFEKM DICKTAELDE KFDEVPPGME TGLVPVPLMD KNIYQPSKSM
NSIPLISRYI TLALCRQKLH ENVVREWTSL FSDTISKCLG SWYTRRNAVP KSADGSSKPK
EKTYYRKRKF EKTCQAKSSK KPVEISMDEQ LSKPLCQLVD RKIYVKNIQE SNKALTSKKV
SFVDKPSKKG AKPVANDAHD LNIQQDLTLL SSEVPKRARS SHPTKKHMVA NRTPTVNDNV
ANNSMLTKHV KKKKGRDISS ETSQKVKPMI SCPESDGCAR ASINGWEWRN WARNATPSER
ARVRGYRVRT ILSASNNNVW KNSQAKVSSA RTNRVKLRNL LAAAEGAELL KITQMKARKK
RLRFQRSKIH EWGLVALELI EAEDFVIEYV GQLIHRRVSD IRESQYEKSG IGSSYLFRLD
DDFVVDATKR GGLARFINHS CEPNCYTKVI TVDGQKKIFI YAKRRIYAGE EITYNYKFPL
EEEKIPCHCG SRR
//