ID K0R2Q2_THAOC Unreviewed; 548 AA.
AC K0R2Q2;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK46988.1};
DE Flags: Fragment;
GN ORFNames=THAOC_34321 {ECO:0000313|EMBL:EJK46988.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK46988.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK46988.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK46988.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK46988.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01047425; EJK46988.1; -; Genomic_DNA.
DR AlphaFoldDB; K0R2Q2; -.
DR EnsemblProtists; EJK46988; EJK46988; THAOC_34321.
DR eggNOG; KOG0048; Eukaryota.
DR eggNOG; KOG0724; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR CDD; cd00167; SANT; 3.
DR Gene3D; 1.10.10.60; Homeodomain-like; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR006447; Myb_dom_plants.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR NCBIfam; TIGR01557; myb_SHAQKYF; 1.
DR PANTHER; PTHR45614; MYB PROTEIN-RELATED; 1.
DR PANTHER; PTHR45614:SF307; TRANSCRIPTION FACTOR MYB3R-2; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 3.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 53..107
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 53..103
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 61..107
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT DOMAIN 304..357
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 312..361
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 358..410
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 363..414
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 1..58
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 122..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 155..174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 262..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 405..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..58
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 405..445
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 548
FT /evidence="ECO:0000313|EMBL:EJK46988.1"
SQ SEQUENCE 548 AA; 59780 MW; 9FA7D35C11B05E88 CRC64;
MPTASKLRIP TANPEPSTVD GGAVDPDNAD NARPDDAPSG EAGEATTSSA RQAKNSNIGL
WTAEEHRLFV EGLECHGKNW AEVATHVGSR TVDQIRSHAR QYFEKLANGS PAQWNFAEVA
KQKDANPPSG EVRRSSGRTP KPVVNFGKEV FASAPNASNS TRKSFGRREK KGLRADSRWK
AGGKITHQLV VHEKPRGTSS SEDIAAKARR LANESARAKS GVVAGVPTRA ALNQILQECC
DVADRKGLSE LKRMLEQFLS EAKASGKRSS RASKLEPPSK KSKLAAMTPL PASGSGDKKP
AVTTKRKEPN FWTEEEDLRL KELVRGFGSG PVKWTRLATE MPGREGKCCQ ARWSCRLDPS
ISRSPFTAEE VRAIVRFQAD EEKAGKWAEL AKVLPGRTRE QIRSQWNSMT SHSNSLTSNS
KAVSEQRQST PSTNAILSPK KSNAETTAEM VVGVGGKKPE PISSQANDLS LGRGKEKTLH
SAAPKLEPPS NGTHSARRRK ETAGLSDEAK AYLTKWFYDQ KAYPYPTRQK KVELCNVLGI
SDLNQLDR
//