ID A0A182P5N4_9DIPT Unreviewed; 2033 AA.
AC A0A182P5N4;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=TATA-binding protein-associated factor 172 {ECO:0008006|Google:ProtNLM};
OS Anopheles epiroticus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI002224-PA, ECO:0000313|Proteomes:UP000075885};
RN [1] {ECO:0000313|Proteomes:UP000075885}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AEPI002224-PA}
RP IDENTIFICATION.
RC STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI002224-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 199890.A0A182P5N4; -.
DR EnsemblMetazoa; AEPI002224-RA; AEPI002224-PA; AEPI002224.
DR VEuPathDB; VectorBase:AEPI002224; -.
DR OrthoDB; 180798at2759; -.
DR Proteomes; UP000075885; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0140658; F:ATP-dependent chromatin remodeler activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0017025; F:TBP-class protein binding; IEA:InterPro.
DR CDD; cd17999; DEXHc_Mot1; 1.
DR CDD; cd18793; SF2_C_SNF; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.40.50.10810; Tandem AAA-ATPase domain; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR044972; Mot1.
DR InterPro; IPR044078; Mot1_ATP-bd.
DR InterPro; IPR022707; Mot1_central_dom.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR038718; SNF2-like_sf.
DR InterPro; IPR049730; SNF2/RAD54-like_C.
DR InterPro; IPR000330; SNF2_N.
DR PANTHER; PTHR36498; TATA-BINDING PROTEIN-ASSOCIATED FACTOR 172; 1.
DR PANTHER; PTHR36498:SF1; TATA-BINDING PROTEIN-ASSOCIATED FACTOR 172; 1.
DR Pfam; PF12054; DUF3535; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF00176; SNF2-rel_dom; 1.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Helicase {ECO:0000256|ARBA:ARBA00022806};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1446..1615
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 1778..1932
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT REGION 88..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 285..324
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 617..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 742..781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1108..1142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 310..324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 617..634
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1108..1127
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2033 AA; 223702 MW; 2D073810BEF0C025 CRC64;
MTSRLDRLFI LLESGSAAVT RKAAAKQIGE VQKLHPHELH NLLSRLLTYL HSNSWDTRIA
ASQAVQAILE NVPQWEPKPL PNLLKMETKE EPSEEGEEED KSCDSSSSTR GSSNRNGGPN
SHRLSFDSFD LNAVLFKGAR LMGSEGTEFD PLDENEVIDL REKLARQRAL LNEKLGLSSG
LNIEELVTLD DVRNNGSGSS GGRDHQTGAS GERLVPVQEI LKLNRVDSVE GLSCREKNRA
RRKARQQQQA FQNPIAAVTG GGGGGGGVAA VSAAGNGANG LGSTGALGPN EIGAGEPDRK
RIKTEKGGEP LTPTTPSTGS SILTGTWTGE AVPDLTGAWV DAVDWPLDSF CSKLFLDLFS
PRWETRHGSA TALRELLKSH ADGGGKSVYM TKAEMERQHQ LWLEDATLRL LCVLALDRFG
DFVSDQVVAP VRETCAQVLG TVLRQLPLPK VHQTVTILLT FVKQKEWEVR HGGLLGIKYM
LVVREDLIQT FLPVIINDVL TGLFDAVDDV GAVAASTLIP IATWLPKLLS RAQVSHIVKL
LWDLLLDQDE LASACNSFMG LLASILSLPS ASSWIQMEPM SMLVPRLWPF LSHCSSSVRR
STLQTLKTLT SVCTEKVTNG GASDGSTTPP ASNGKSDEPV GRAVLTMPNA EESNLGLNFG
VKDWPPPLLQ EALRHIFQRV LVEHVEDIQS LAEDVWNNLV VNAELSALLH ASCPYVSSWL
CLAMQPVRLA FDPGSLIYAK PNQHHHQQGS AAMRERRPRQ FDSFDTGGGA SGGHLSSSSS
SGSLHQKLFL GGAETVPLDA REKNVVRARC KAARMIGLLS RYLVLPAPGV TYTPETESPI
DCYTKVLIGY LQSRSALQRL ISSLVIAYWC SFDSTIQPGP PVLQDRLRAC LNEYVYYDEV
GILFTRLLQE CRDYMATLKQ HKVQLAEYEQ LKVLTLDQIY QIATAIGWSV DEMRLKYNLK
TKVADLLEER RRSLLGSHAT TALEQTTLHI STQSSVSGAV VSLRCLPDRL NPVVKPLMES
IKREECELLQ RLSAKYLSDL LDQVTARTPC PNSKIVTNLC TLLKSDAEFT PRVLCPDQEL
QHFDPANTED SNPYHGILTL AKQNQRCKES TSGAASSGGS VGGSSGGGGS SRGPGRPAAS
STVAAALDLS ASSSSGATSS STTTNAALDE LLGSSESEET QRKHARTQRL GATAAITTIC
AQFGAQLPQK LPILWQLLLD KIQSRVDEPF VDRLALDVIA QDETNDFMTS LQLLEVAAPF
LHDSLHKELF ELLPKLCLLL RHPLKGVRHM VGRCLATLAA VDAATVMTKV INEVVPLLSC
IESVIKRQGA AEAVTCIVNR LQFEIVPYVV LLVVPLLGRM SDPDQSVRLV STHCFATLIQ
LMPLDGLAAD SGSTRNLSED LRQRKMKDRR FLEYLFSPKT IPDFQMPVKI NAELRSYQQS
GVNWLWFLNR YKLHGILCDD MGLGKTLQAI CILAADHHQR SLDRSCAQLP SIVICPPTLT
GHWVYEVEKF LPSRFLRPLH YVGLPGNREQ LRQKLGTYNL VVASYDIVRK DIEFFGSVNW
NYCILDEGHI IKNGRTKSSK AIKQLVANHR LILSGTPIQN NVLELWSLFD FLMPGFLGTE
KQFSTRFSRP ILASRDPKSS PKEQEAGALA MESLHRQVLP FLLRRVKEDV LTDLPPKITQ
DLLCELSPLQ ERLYEDFSRM HLHSSDIREC LENIDGQMAG PANKKTHVFQ ALRYLQNVCN
HPKLVLSPSH PEYQMIVSEF TRNGSSMDDI EHSAKLPALK QLLLDCGIGT NEDVSVNQHR
ALIFCQLKAM LDIVENDLLK KHLPAVSYLR LDGSVPPSTR HHIVTKFNGD PSIDVGGLGL
NLTGADTVIF VEHDWNPMKD LQAMDRAHRI GQKKVVNVYR LITRKSLEEK IMGLQKFKLL
TANTVVSDEN ASMDTMGTDQ LLDLFTLADD SGKQRAAAAA ASTERRGSLR SASSNAGAVT
ATTADLEANG NSGATTAIKT VLENLPELWD DSQYHEEYDL SQFLEGLKKN RQS
//