ID K0SYJ3_THAOC Unreviewed; 1566 AA.
AC K0SYJ3;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 24-JAN-2024, entry version 57.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK66066.1};
GN ORFNames=THAOC_13031 {ECO:0000313|EMBL:EJK66066.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK66066.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK66066.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK66066.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- SIMILARITY: Belongs to the SNF2/RAD54 helicase family. ISWI subfamily.
CC {ECO:0000256|ARBA:ARBA00009687}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK66066.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01015292; EJK66066.1; -; Genomic_DNA.
DR EnsemblProtists; EJK66066; EJK66066; THAOC_13031.
DR eggNOG; KOG0385; Eukaryota.
DR OMA; TEARINM; -.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140658; F:ATP-dependent chromatin remodeler activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0031491; F:nucleosome binding; IEA:InterPro.
DR CDD; cd17997; DEXHc_SMARCA1_SMARCA5; 1.
DR CDD; cd00167; SANT; 1.
DR CDD; cd18793; SF2_C_SNF; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 3.40.50.10810; Tandem AAA-ATPase domain; 1.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR044754; Isw1/2_DEXHc.
DR InterPro; IPR036306; ISWI_HAND-dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR015195; SLIDE.
DR InterPro; IPR038718; SNF2-like_sf.
DR InterPro; IPR000330; SNF2_N.
DR PANTHER; PTHR45623; CHROMODOMAIN-HELICASE-DNA-BINDING PROTEIN 3-RELATED-RELATED; 1.
DR PANTHER; PTHR45623:SF49; ISW-1; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF09011; HMG_box_2; 1.
DR Pfam; PF09111; SLIDE; 1.
DR Pfam; PF00176; SNF2-rel_dom; 1.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SUPFAM; SSF101224; HAND domain of the nucleosome remodeling ATPase ISWI; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 3..101
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 306..501
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 634..788
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DNA_BIND 3..101
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 109..202
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 223..281
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 396..419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1302..1414
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1530..1566
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..202
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1304..1324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1334..1349
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1379..1411
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1541..1566
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1566 AA; 178553 MW; BAEDE981BA5388AD CRC64;
MVVTKAKTAF GFWQVSNRSS SMFDQFGPTR SRIQISTLNL LQPQADKLAG IMRRDDVDGM
GKAMKILSAE WQTLSDLDRK PYLDREEEDR ARYQRECRDA DEAAYLAHQE RKKKNQMPAE
DENLVSSSRG ARAQQDAERA KRDAKAQARR EAADARLTEE EREERRAAKA AKRAEVLERQ
RKKDAEEKAV ADRHKKLDKE ATKKTADRLK YLLAQSDIFS RLKDGKSRTT EDASRADAGG
GYKSKHSPAK KKGRPKKDEE AAPDGEGLDE DDEEEGESHT FLTKQPSCIK FGTLKPYQLE
GLNWMIHLAE KGLNGILADE MGLGKTLQSI SILAYHYEYL KIQGPHLICV PKSTLSNWMN
ELNRWCPSLR VIRFHGMKDE REELVEDYFT NEAAAHDGRR PTRQIRNEET GEMEDDNSDN
PRAWDVCVTT YEMANMEKRT LGRFAWKYLI IDEAHRLKNE ASMFSTTVRE FNTANRLLLT
GTPLQNNLHE LWALLNFLLP DIFSSSQQFD EWFNLEIDDA DAKKKMISQL HGVLRPFMIR
RLKADVAKGL PPKTETLVMV GMSKMQKQLY KRLLLRDIKA ITGKNTNSGK TAVLNIVMQL
RKCCNHPYLF EGIEDRTLDP LGEHLVDNCG KLNMVDKLLK KLKERGSRVL IFTQMTRILD
IMEDFMHMRG YKYCRLDGQT DYETRERSIR EFNAEGSEKF CFILSTRAGN PQADLQAQDR
CHRLGQKKPV SIFRLVTENT VEQKIIERAQ QKLKLDAMVV QQGRLKEKDK VTKEEVMAAV
RFGADAVFRS EESTISNDDI DVILERGKAK TKELADKITK AEKGDLLDFR LDGGVSSQTF
EGIDYSDAQL RNQLRLLAAD SMGKRERKPP PVTYTEIIQP KKSMVIKNQK IRLPRPLRLP
SMEDHQFYNR ARLLEISDKE FQLFATLKET GKVPSYEYME QKRTILPPNL AKEKWELLAE
GFSEINRSQF FHFIKACTKY GRADFDNISE AMGLPTDLIR VYSTAFWEYG PTELKSDEWE
RVKGQIEKGE RRLEKQREQE KMLGKFVATF DNPETDITFA NKGNAHFSLE QDRAILCAVA
KHGYGNWEKV REEIRNDSNL LFHHTVQGMN IDAISKRCDY RLRQMEKELE TREKKINVKP
PAVVEAESTL EAIRQMEEYE IEKQDMEMRG QMVPPIQAYV SNGLEVFEDY LRDQQECIDG
LREIETQIRG CNVLAEETKE CINRGDQYVN YSHIVLRGGG SRGGLNDDGR GWGVSLEERI
NASVLSIPEC GACQFCVDPS TNKICIQRRY VRDVLVQEEM AKVPTTSPVP SSDSKSSKAK
SLGPSASSKV RKKPGPKPGS KKSGPKPKKA RTTSPKPDAQ RSESQLAGDP MEVEHLDPAS
QPPQKEKKKP GPKPKKAEPK SESEKRKSGG FKMSVPENLL PEFAKLIGVD GTNLRNDIIN
DFTATHPNTS ARQVTIKFSE MATKNRPETA KEAPKQGGRG RAVWYYLRPM FYHLLEESER
PSGWEEAAKV DNALYEKEIA EKERATAEKK LKASMKAKDA NDGMTTMTSS AANTDDEVGS
TVSSKF
//