ID A0A2P6VKW7_9CHLO Unreviewed; 1147 AA.
AC A0A2P6VKW7;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Myb A {ECO:0000313|EMBL:PSC74728.1};
GN ORFNames=C2E20_2403 {ECO:0000313|EMBL:PSC74728.1};
OS Micractinium conductrix.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Micractinium.
OX NCBI_TaxID=554055 {ECO:0000313|EMBL:PSC74728.1, ECO:0000313|Proteomes:UP000239649};
RN [1] {ECO:0000313|EMBL:PSC74728.1, ECO:0000313|Proteomes:UP000239649}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SAG 241.80 {ECO:0000313|EMBL:PSC74728.1,
RC ECO:0000313|Proteomes:UP000239649};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PSC74728.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPF02000004; PSC74728.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P6VKW7; -.
DR STRING; 554055.A0A2P6VKW7; -.
DR Proteomes; UP000239649; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR CDD; cd00167; SANT; 3.
DR Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR PANTHER; PTHR46621; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR PANTHER; PTHR46621:SF1; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000239649}.
FT DOMAIN 5..55
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 5..52
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 56..110
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 56..106
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 107..157
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 111..161
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 158..210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 251..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 411..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 453..472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 492..511
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 545..585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 972..1023
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 158..184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..210
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 570..584
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 972..1005
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1147 AA; 119220 MW; 7B2156CB492D45E7 CRC64;
MDSDKKAHKP GVWTEEEDTL LALWQGRVGN KWSEVAKHIP GKTGQQCAQR WRHRVNPNIS
REKWTADEDV TLAELVNKYG NSWAEISRRL PGRTDQQCMG RWRRHLDPSI KREGWEPAED
EQLRELYAEY GSSWSCISKS MTGRTAQQCR ARWCQLTNAE RPSRSAASRS KPASKQSQAQ
TAAAPTAAAD PANPRRQSTA NTSGAQQTPG GAVASAADLF AAAAAGGAVA VAATTAPASA
RRPAARGAAG AARRLSSVSE VGGNDLSWSH GSEEDEEQEE DSSPEFTVRR TTARTAGRGV
PRTSSGAGSA GLDVAAAAMA AVGLPLPPAV AVLQQQQLRH QQQQQQLQQQ LQQPVSSWQQ
QQQQQQHNHL QPPASVAAAA AVHAVPTPAP VLPQALLPLN LDLPATALQQ RGAADGLPPP
DEEEDSPALT QNPGSAATAA AADLLSPLRA PLPAAPPLFT PSSSGRKRTA ASRMAAFGPA
LPPLVIPEGA ATAAKGTPVR PTPGSGSGPM SLELRSPNVL TLLQSPPPMM DDLFRSPAFG
SMLTPPWAKS GSAGSGGGCK TAGGHERPPS AAAVEEERRQ SVARRLDTAP SFSHLAAAQA
HQVAQQQQQQ QQAAQQAAQQ AAAASSLHHA ASFDFGVILS PPKKSKMNSP ASSSLTMATA
ALAAGGGTPC WLPGAASAGL MLPAESGGGT SLAMAVQQHL LLASTAKARS WPRRPGAAAR
PTPSAATLLL VLACLTLWPG HAQGLSGYGW KQSMPCVGFW IRLPEENDES FKVTITELRS
ELARLARSST DEDPSAVDIV EIALVAPGED AGLDVWTVVR FQGGRPVAAL AEHLRQRGTA
ALASKFPGAT ASWEAAEEEA FQAVVVLTTH MAADAIPDVT IYNASESNST AAAATAAAAA
ATARRAGGAA KAARHHAAAQ LGADVRLRYG REELAEAKRT AQALVTGKLP LLRMVFPGAT
VFNVRLNGKP VHPPSPPLPP PPPAKLPATS LDSPPPPPPL ARAKPRTPPR KPRGPEPPAR
LPANSLHVAA PCILPASGVG TTTATLMVVA AQHAAHITLN CTVAAAGLGA IRKAGDRRAP
LQGLRLATKV ARLPAGAAAR ARQGAAFPLR GLTPASTYLC TATAFTDAGQ ASKPSPVARF
TTRAGRL
//