ID A0A921Z477_MANSE Unreviewed; 795 AA.
AC A0A921Z477;
DT 22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2023, sequence version 1.
DT 28-JAN-2026, entry version 15.
DE RecName: Full=Myb protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=O3G_MSEX006889 {ECO:0000313|EMBL:KAG6451034.1};
OS Manduca sexta (Tobacco hawkmoth) (Tobacco hornworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC Sphingidae; Sphinginae; Sphingini; Manduca.
OX NCBI_TaxID=7130 {ECO:0000313|EMBL:KAG6451034.1, ECO:0000313|Proteomes:UP000791440};
RN [1] {ECO:0000313|EMBL:KAG6451034.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=27522922;
RA Kanost M.R., Arrese E.L., Cao X., Chen Y.R., Chellapilla S.,
RA Goldsmith M.R., Grosse-Wilde E., Heckel D.G., Herndon N., Jiang H.,
RA Papanicolaou A., Qu J., Soulages J.L., Vogel H., Walters J.,
RA Waterhouse R.M., Ahn S.J., Almeida F.C., An C., Aqrawi P.,
RA Bretschneider A., Bryant W.B., Bucks S., Chao H., Chevignon G.,
RA Christen J.M., Clarke D.F., Dittmer N.T., Ferguson L.C.F., Garavelou S.,
RA Gordon K.H.J., Gunaratna R.T., Han Y., Hauser F., He Y., Heidel-Fischer H.,
RA Hirsh A., Hu Y., Jiang H., Kalra D., Klinner C., Konig C., Kovar C.,
RA Kroll A.R., Kuwar S.S., Lee S.L., Lehman R., Li K., Li Z., Liang H.,
RA Lovelace S., Lu Z., Mansfield J.H., McCulloch K.J., Mathew T., Morton B.,
RA Muzny D.M., Neunemann D., Ongeri F., Pauchet Y., Pu L.L., Pyrousis I.,
RA Rao X.J., Redding A., Roesel C., Sanchez-Gracia A., Schaack S., Shukla A.,
RA Tetreau G., Wang Y., Xiong G.H., Traut W., Walsh T.K., Worley K.C., Wu D.,
RA Wu W., Wu Y.Q., Zhang X., Zou Z., Zucker H., Briscoe A.D., Burmester T.,
RA Clem R.J., Feyereisen R., Grimmelikhuijzen C.J.P., Hamodrakas S.J.,
RA Hansson B.S., Huguet E., Jermiin L.S., Lan Q., Lehman H.K., Lorenzen M.,
RA Merzendorfer H., Michalopoulos I., Morton D.B., Muthukrishnan S.,
RA Oakeshott J.G., Palmer W., Park Y., Passarelli A.L., Rozas J.,
RA Schwartz L.M., Smith W., Southgate A., Vilcinskas A., Vogt R., Wang P.,
RA Werren J., Yu X.Q., Zhou J.J., Brown S.J., Scherer S.E., Richards S.,
RA Blissard G.W.;
RT "Multifaceted biological insights from a draft genome sequence of the
RT tobacco hornworm moth, Manduca sexta.";
RL Insect Biochem. Mol. Biol. 76:118-147(2016).
RN [2] {ECO:0000313|EMBL:KAG6451034.1}
RP NUCLEOTIDE SEQUENCE.
RA Kanost M.;
RL Submitted (DEC-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAG6451034.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH668398; KAG6451034.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A921Z477; -.
DR Proteomes; UP000791440; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:TreeGrafter.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:TreeGrafter.
DR CDD; cd00167; SANT; 3.
DR FunFam; 1.10.10.60:FF:000010; Transcriptional activator Myb isoform A; 1.
DR FunFam; 1.10.10.60:FF:000016; Transcriptional activator Myb isoform A; 1.
DR InterPro; IPR015395; C-myb_C.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR050560; MYB_TF.
DR InterPro; IPR001005; SANT/Myb.
DR PANTHER; PTHR45614:SF25; MYB PROTEIN; 1.
DR PANTHER; PTHR45614; MYB PROTEIN-RELATED; 1.
DR Pfam; PF09316; Cmyb_C; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 3.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000791440};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 65..115
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 65..115
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 116..171
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 116..167
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 168..218
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 172..222
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 20..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 237..258
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 350..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 438..463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 519..564
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..51
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 247..256
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..529
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 548..561
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 795 AA; 89039 MW; 65DA080F07EE2AF2 CRC64;
MQYHNEDDDF HYRIFGEAFN LPGENSLNSG RRKKKSGYDS ETSEYSEDST AYEDAPVATK
SSGQRKNINR GRWTKEEDKR LKMYVKAYKE NWERISAEFP DRSDVQCQQR WTKVVNPELV
KGPWTKEEDE KVVELVAKYG PKKWTLIARQ LKGRIGKQCR ERWHNHLNPS IKKTAWTEHE
DRVIYQAHKQ LGNQWAKIAK LLPGRTDNAI KNHWNSTMRR KYEPDLLDSF ESLRRKRSWA
EPPSEPAPSN SSSQPSRHVL HCRRVLQSQL HSLQPHSLPP RRAGLVAADV QLVDSPFKFV
NIESLPSNSP IKNYLSQASA TNECNQDTVT YSVQADELKE IVVPTVYTSP KKKQMETNSP
PPILRRKKKT QVPTPTPNPW ADTISKALEC RESGVTPIKA LPFSPSQFLN SPAGVSFSTV
ESTPLRHAAH TKEWSASPLL HTPTPVNITP GPNHLKNNTP KTPTPFKIAM AEIGKKSGLK
YEPSSPGLLV EDITEMIKRE ENSDSTVQLN DSMLSSTADL ENAQVNDSGI GSLKRRGSDS
HGKENVPGAG GGGGGGGGGG AAHKRARKAL AARWGATSTP HAHHHTLVPD VAFILETPSK
TLEGDSSVMF SPPSIVKNSL LEESTSLHSL VSVENTPDPK FEDIDIEAVI TEKTKVQHRS
NVLTNSSPLD PNQYPFKEIT NENKSPNPDP LIYQYPFRDI TNEKNFVRLN ADRLRTINSA
NDIACDNLIA PKDVLTNALA DHIYKVTNQK PSTSKIEEVL TNKMKESQSV GQWWGRDTGQ
DDVFSNDYYM FSSNL
//