ID K0TQ33_THAOC Unreviewed; 1611 AA.
AC K0TQ33;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 14-DEC-2022, entry version 30.
DE RecName: Full=MBD domain-containing protein {ECO:0000259|PROSITE:PS50982};
GN ORFNames=THAOC_02930 {ECO:0000313|EMBL:EJK75347.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK75347.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK75347.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK75347.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK75347.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01002985; EJK75347.1; -; Genomic_DNA.
DR EnsemblProtists; EJK75347; EJK75347; THAOC_02930.
DR eggNOG; ENOG502QX0I; Eukaryota.
DR OMA; DHERENA; -.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR Pfam; PF01429; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 933..1000
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 118..287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 344..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 611..650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 986..1016
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1062..1092
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1301..1323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..154
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 161..182
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 193..208
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..238
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..287
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 374..392
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..429
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1611 AA; 180603 MW; A753607A9E698930 CRC64;
MSDPTTTAVA TSTVDGSAAA TVDIVLSMRN PRYKEIMWTR FNLLGRQRDH SREDAIITEV
FQLFKSQRAK FYKQTKQGDR VPVNEEEALA KITYDIKRRL KSVHNWLQVN ETMREYHRGV
SDAENSSDST SESHDERQAP IGGCDEKPHH MIDVDNTFES GGAPLSRSES VSVQSSQIGS
PKSAKPFEEC PARSTRKTNS GSNHLDTSMD TDGGHHLDDE IVEGFLGEDP RAKSDRASLK
RSPPPAPRAG AGTRKPRRNS RIPQRDIPLR RSRTTVELSN SVDDNPTEND IVLSLLDDRY
REEMHKHYNL LGPKRPSDRE REEAMSATIF RVFKRRLGRR MENPSWLAVP QLPQQPEAGP
DSTRRKQRRR KNPPISNESG APSSNESIGR WRDQLTPSPV DHKMDELSES MATPSSNYHA
LPPINSRSRR RGQSLRKEEL TKLLVASHLG QRLAPLKEIP HIESIETPNQ LVHYLLLATQ
QLNELNGTHI KHCRVLRLYN LAELNEDRSA ILEAPSLARL AKNMSTTSEQ LHSLFCVYLD
ALASGFDFYT HGDDLFQDHS GIFNMKRDPE VHQWTADGER ALNTFCHSVL EEMIRQEHSD
NGAHQQVANL FGEGTSTDRR RLPPRQVRDG KSSSLTRIPD WLKDDIDPPK SKRQKVTFDA
ILDAGQPNNR SETCHVSKSN ENVFSTHLRQ NQARLCALSE RPDVDQLLVA SISLDTIVKR
LKCGSKTRTL CMISGCKKHA QTRCDGLCNA HYNTISTGKS KMPISSGANS ATGENAPWSI
RTAGSEERAR LATLLGPEEK KDAALLGPEE KKESFNSAAN LEPPRRLSDV DIRRLIPDAR
IYVKWSGDGR IYQATVKKLM LQRDEPVVKI HYDGKKKHIF NNIPVSMVHS FIGEDKVPVE
GTEHQEESSP EQEGNLDFRQ LNHLYPCVLK DDEDICETSC PRLGPAWLVR VVRRKNASGN
KADRSFISPS GAIFRSIPEL ERHFENDPRE FGGSLREPAA APQVKSPGSR STDDLTAKAP
DMKEAAKLPV EARAQSIGHA CHESNVAEST DECDEIAAIN VDVNTPSPPA KDREENSLSR
TRQSSSFSPL KVQNGGFIDR PITYSPITIH SGQLVQHALS ETSQSGIVSD HTKSTTCQLP
SKAVENSAFS RAPRTLVRPT LTHRLCGDPL SLFCSIRGVG SSKPLASLKR DMAVESGFNA
KRPRRKVATA MQGTPATPAL PCAEKPDTAL SALCHCPACD KANLSIQGLY AHWGKVHEGK
VSWKAVTFSC PFCPPKSTAR SKLYKSFRDL DSHVRQAHPK CMVLGPKSPR GPRGLAGGHN
SSGGIEDRTA AKVSIGGHFH GSNSASSGYE PVPQTPHKWQ DLEFTRLVSD IKTDDGDQLF
QMIDGQCLKQ EALVEEARHQ RRKQCKAEAE SEMKTFDEER LAYQRGIRDR RQLADQETLE
KQRFVEFWLY RQHRGQKRSR DNIEIEQVCT RPIKFSRSNQ SRTSQGKKLC HEPDCMFCQK
DSSYLRHVLL DDELECFEVG ESTFRAPAIQ KGATILSPYF HYVSDQFLDQ AAAEETEATT
RQSRRALSTA RRVKAEEDKL WSLTKTKHSL AFIKKYNKGL IRNAWQGVNK L
//