ID A0A059F2H7_9MICR Unreviewed; 1244 AA.
AC A0A059F2H7;
DT 09-JUL-2014, integrated into UniProtKB/TrEMBL.
DT 09-JUL-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KCZ81473.1};
GN ORFNames=H312_01050 {ECO:0000313|EMBL:KCZ81473.1};
OS Anncaliia algerae PRA339.
OC Eukaryota; Fungi; Fungi incertae sedis; Microsporidia; Tubulinosematoidea;
OC Tubulinosematidae; Anncaliia.
OX NCBI_TaxID=1288291 {ECO:0000313|EMBL:KCZ81473.1, ECO:0000313|Proteomes:UP000030655};
RN [1] {ECO:0000313|Proteomes:UP000030655}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PRA339 {ECO:0000313|Proteomes:UP000030655};
RG The Broad Institute Genome Sequencing Platform;
RA Cuomo C., Becnel J., Sanscrainte N., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KCZ81473.1, ECO:0000313|Proteomes:UP000030655}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PRA339 {ECO:0000313|EMBL:KCZ81473.1,
RC ECO:0000313|Proteomes:UP000030655};
RG The Broad Institute Genome Sequencing Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Cuomo C., Becnel J., Sanscrainte N., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Anncaliia algerae insect isolate PRA339.";
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK365141; KCZ81473.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A059F2H7; -.
DR STRING; 1288291.A0A059F2H7; -.
DR VEuPathDB; MicrosporidiaDB:H312_01050; -.
DR HOGENOM; CLU_000315_8_5_1; -.
DR Proteomes; UP000030655; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0140658; F:ATP-dependent chromatin remodeler activity; IEA:InterPro.
DR CDD; cd18659; CD2_tandem; 1.
DR CDD; cd18793; SF2_C_SNF; 1.
DR Gene3D; 2.40.50.40; -; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.40.50.10810; Tandem AAA-ATPase domain; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR038718; SNF2-like_sf.
DR InterPro; IPR049730; SNF2/RAD54-like_C.
DR InterPro; IPR000330; SNF2_N.
DR PANTHER; PTHR45623; CHROMODOMAIN-HELICASE-DNA-BINDING PROTEIN 3-RELATED-RELATED; 1.
DR PANTHER; PTHR45623:SF11; KISMET, ISOFORM C; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF00176; SNF2-rel_dom; 1.
DR SMART; SM00298; CHROMO; 2.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 2.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Helicase {ECO:0000256|ARBA:ARBA00022806};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000030655};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 206..264
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT DOMAIN 301..468
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 629..782
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 111..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 908..944
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 582..614
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 18..34
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..137
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 908..923
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1244 AA; 145995 MW; 95DBF364F677BAB3 CRC64;
MKEDTDTTSF DTSSDVPEVQ VVEKRRRGRP RKNEYQQPQH NPNYMPMGYQ YVVPRQFPIT
MGYNNSLYKQ PSFKYYTENP LKGGNNVYLK GAYIYSYNQA PYAPITIPAP SPPVQQNVSP
APPIKKEKKK KRYEEEEDDE FQIEEELDQY EKLLAQDDDR FLVKFRNKSY LHCDWVDQSE
LVTSRAAAMK IKRFKPKEVP FDPEYLKVDR IITEENDNYF VKWKSLPYES ATWESKDDLA
KVENFAEEKE QFYDRRNVAR TSLPMDWRPN KEYFLKFTES PTFKNNNTLR SYQLEGLNWL
LNRWFYKQSC IMADEMGLGK TVQSVSFVNT LYTKYNYTMP VLIVSPLSTI IHWEREFKNW
TDLRVIVFHG TAAGRQIISD YELYLKGRRL FDVMITTYET VMSSLRQFQE INFSIGIFDE
AHRLKNSNSK AVQCLKAVYF NHKILLSGTP LQNNLSELWS LLNFISPQQF SCINSFLEEF
KLEKSEDVER LQQVLKPIML RRMKEDVEKS IPLKEETIIE VELTMIQKRY YRAILEKNLE
FLRKNNSENA PNLLNAMMEI RKCCIHPYLI KGAEESIVHD YIEQKKKRKE EASIELLENN
KEDINLLYKE LQVDEHYKVL INSSGKLVLL DKLLQKLQGS HKVLIFSQMT KCLDLLADYL
NYRQYKYERI DGGIRGDARQ AAIDRFSTTD VFVFLLCTRA GGVGINLTAA DTVIIFDSDW
NPQNDLQAQA RCHRIGQTQE VKIYRLVTRN TYEREMFDKA GLKLGLDRAV LQKMSFDKET
VLKKKDAVEI LLKKGAYGVL METDEASRKF CEEDIDQILE RRTKIIKHQD GGNVFSKASF
QVDEEIDDPD FWENLLSKKE KQASEGRIKR IIRKLAREGN LSNEEKESID KELKTLLDNY
DDLYGKDKAS VKDDHKPKKI KTNIQNTKDN NSISESEEED GLQPKDDSFL KERIVFLMLL
RKGIKSLATL KFDYSEIVQL IYSYCLSKIL VKKHKEDLMM FLDIKEEKMV DFTFYDDWCI
KLLLRVQVPI ILQALLKQTN LTGEKSKGFS LEDDKRLVNY TLSHGYDNYP TSTGNEKGFF
KGKNTEDLNQ RVRRIILNLN LKIESIDGDS TDPSLVATIL QFGKCTERNK DVLLDLLGEE
KLEEYNSVVD QILEKKKKGK RRDAEEQLLF DRICLFSAFF NMEEIPIIKK RLNKKWNNET
DYKLKDRLEC FGLTEEIVKE FNVTEDAILL RIRELIDKSY ELSE
//