ID W2THV8_NECAM Unreviewed; 1040 AA.
AC W2THV8;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Protein, SNF2 family {ECO:0000313|EMBL:ETN81393.1};
GN ORFNames=NECAME_08522 {ECO:0000313|EMBL:ETN81393.1};
OS Necator americanus (Human hookworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC Necator.
OX NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN81393.1, ECO:0000313|Proteomes:UP000053676};
RN [1] {ECO:0000313|Proteomes:UP000053676}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24441737; DOI=10.1038/ng.2875;
RA Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT "Genome of the human hookworm Necator americanus.";
RL Nat. Genet. 46:261-269(2014).
CC -!- SIMILARITY: Belongs to the SNF2/RAD54 helicase family. ISWI subfamily.
CC {ECO:0000256|ARBA:ARBA00009687}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI658737; ETN81393.1; -; Genomic_DNA.
DR RefSeq; XP_013303620.1; XM_013448166.1.
DR AlphaFoldDB; W2THV8; -.
DR STRING; 51031.W2THV8; -.
DR EnsemblMetazoa; NECAME_08522; NECAME_08522; NECAME_08522.
DR GeneID; 25348551; -.
DR KEGG; nai:NECAME_08522; -.
DR CTD; 25348551; -.
DR OMA; TAFYRKE; -.
DR OrthoDB; 5482994at2759; -.
DR Proteomes; UP000053676; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0140658; F:ATP-dependent chromatin remodeler activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0031491; F:nucleosome binding; IEA:InterPro.
DR CDD; cd17997; DEXHc_SMARCA1_SMARCA5; 1.
DR CDD; cd00167; SANT; 1.
DR CDD; cd18793; SF2_C_SNF; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR Gene3D; 1.20.5.1190; iswi atpase; 1.
DR Gene3D; 1.10.1040.30; ISWI, HAND domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.40.50.10810; Tandem AAA-ATPase domain; 1.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR044754; Isw1/2_DEXHc.
DR InterPro; IPR015194; ISWI_HAND-dom.
DR InterPro; IPR036306; ISWI_HAND-dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR015195; SLIDE.
DR InterPro; IPR038718; SNF2-like_sf.
DR InterPro; IPR049730; SNF2/RAD54-like_C.
DR InterPro; IPR000330; SNF2_N.
DR PANTHER; PTHR45623; CHROMODOMAIN-HELICASE-DNA-BINDING PROTEIN 3-RELATED-RELATED; 1.
DR PANTHER; PTHR45623:SF49; SWI_SNF RELATED, MATRIX ASSOCIATED, ACTIN DEPENDENT REGULATOR OF CHROMATIN, SUBFAMILY A, MEMBER 1; 1.
DR Pfam; PF09110; HAND; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF09111; SLIDE; 1.
DR Pfam; PF00176; SNF2-rel_dom; 1.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF101224; HAND domain of the nucleosome remodeling ATPase ISWI; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
DR PROSITE; PS51293; SANT; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000053676}.
FT DOMAIN 172..337
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 467..618
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DOMAIN 823..876
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 27..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 90..118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 996..1040
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..52
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..115
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 996..1014
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1016..1040
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1040 AA; 120681 MW; B3B07721AE62D810 CRC64;
MSAELAANGE EANFTLLLLC QAKPEPVEDV KMENDENYDD EMEAEEGEDS LESKFEADSF
KRFELLLKKT ENFSHCLSAG DVAAYKGSPV KKKRGRTASG VDGDHRHRKT EQEEDEEMVE
AAEREDSITI FDKSPFYIQN GELRLVSSTL FFIFIHALRF LDYQIRGLNW LISLYHNGIN
GILADEMGLG KTLQTISLLG YMKHYKNMAS PHLIIVPKST LKNWMNEFAK WCPSLTTCCI
IGDEQERNQV IRDVILPQKF DVCCTTYEMV LKVKTQLKKL VWKYIIIDEA HRIKNEKSKL
SEVVREIKSK NRLLITGTPL QNNLHELWAL LNFLLPDMFS SSEDFDSWFT DGSMQGNTDI
ISRLHKVLQP FLLRRIKSDV EKTLLPKKEV KIYVGLSKMQ REWYTKVLMK DIDVINGAGK
VEKARLMNIL MHLRKAANHP YLFDGAEPGP PYTTDQHLVD NSGKMVVLDK LLQKLKEQGS
RVLIFSQFSR ILDLLEDYCW WRQYQYCRLD GNTAHVDRQE AIDAYNAPDS EKFIFMLTTR
AGGLGINLAT ADVVVIFDSD WNPQSDLQAM DRAHRIGQKK QVRVFRLITE NTVDERIIER
AEVKLRLDSI VIQQGRVAEA QKTLGKDDMI NMIRHGAELV FASKDSTITD EDIDSILQRA
EVKTAELNAK MEEMGESNLR NLTFDNSKSV YNFEGENWKG KQNDGMGHFW IEPPKRERKA
NYQVDAYFRE AMRQGQPVEK QSRAPRPKQP AVFDFQFYPP RLMELLDRET YHYRKTIGYK
AEKPKECGPK EAEKRQKEEQ RLIDTAQPLT EEEQQEKNEL LTQGLANWSK RDFTAFVRAN
EKYGRHDIEN IANEMIETKS RDEVEYYAKI FWERFEELQD HEKILAQIEK GEARIQRRQS
VKRALDAKIA KYKAPFHQLR IAYGTNKGKT YTEEEDRFLV CELHRLGFDK ETVYEELRQS
VRMAPQFRFD WFIKSRTAME LQRRCNTLIT LIEKEMGETE VKHRTKKDNK EKASAPSEAG
SATPSRTPSG KKLGRPKTSK
//