ID A0A087W279_ECHMU Unreviewed; 550 AA.
AC A0A087W279;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Protein iws1 {ECO:0000313|EMBL:CDI98604.1};
GN ORFNames=EmuJ_000246900 {ECO:0000313|EMBL:CDI98604.1};
OS Echinococcus multilocularis (Fox tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus.
OX NCBI_TaxID=6211 {ECO:0000313|EMBL:CDI98604.1};
RN [1] {ECO:0000313|EMBL:CDI98604.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23485966; DOI=10.1038/nature12031;
RA Tsai I.J., Zarowiecki M., Holroyd N., Garciarrubio A., Sanchez-Flores A.,
RA Brooks K.L., Tracey A., Bobes R.J., Fragoso G., Sciutto E., Aslett M.,
RA Beasley H., Bennett H.M., Cai J., Camicia F., Clark R., Cucher M.,
RA De Silva N., Day T.A., Deplazes P., Estrada K., Fernandez C., Holland P.W.,
RA Hou J., Hu S., Huckvale T., Hung S.S., Kamenetzky L., Keane J.A., Kiss F.,
RA Koziol U., Lambert O., Liu K., Luo X., Luo Y., Macchiaroli N., Nichol S.,
RA Paps J., Parkinson J., Pouchkina-Stantcheva N., Riddiford N., Rosenzvit M.,
RA Salinas G., Wasmuth J.D., Zamanian M., Zheng Y., Cai X., Soberon X.,
RA Olson P.D., Laclette J.P., Brehm K., Berriman M., Garciarrubio A.,
RA Bobes R.J., Fragoso G., Sanchez-Flores A., Estrada K., Cevallos M.A.,
RA Morett E., Gonzalez V., Portillo T., Ochoa-Leyva A., Jose M.V., Sciutto E.,
RA Landa A., Jimenez L., Valdes V., Carrero J.C., Larralde C.,
RA Morales-Montor J., Limon-Lason J., Soberon X., Laclette J.P.;
RT "The genomes of four tapeworm species reveal adaptations to parasitism.";
RL Nature 496:57-63(2013).
RN [2] {ECO:0000313|EMBL:CDI98604.1}
RP NUCLEOTIDE SEQUENCE.
RA Zhang Y., Guo Z.;
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00649}.
CC -!- SIMILARITY: Belongs to the IWS1 family.
CC {ECO:0000256|ARBA:ARBA00037992}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LN902844; CDI98604.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A087W279; -.
DR STRING; 6211.A0A087W279; -.
DR eggNOG; KOG1793; Eukaryota.
DR OMA; RNNVQHA; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46010; PROTEIN IWS1 HOMOLOG; 1.
DR PANTHER; PTHR46010:SF1; PROTEIN IWS1 HOMOLOG; 1.
DR Pfam; PF08711; Med26; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}.
FT DOMAIN 314..392
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 32..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 420..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 464..515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..79
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 105..152
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 485..500
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 550 AA; 62410 MW; 4079F35A8B97AD4F CRC64;
MAVNRKTYEP EEGKLPIEID DSSFDLITVA KTPPGSRFDA AGSDNEYRVE PFMSPTTPTY
TPSRLAMSNS DSETSLSSSE CQTHEEKRMR VDSQAFAKGS ALSPEIFERN KERVFSSGDE
EERNERSDLE ENEKAKEEKA DADFEGFDGD EEGAKGMIAD IFGESDAEEE GDFEGFAENE
VEKEAAVSSA ASIAAVEVIK DEMHNHHQPR EQATSDNDDE VGEGFISDFD RFMSRRREET
RRRRRLGRDE EFLNDNDEII RETVSKMKSA ADEDRRLLSK NRPATKKLSM LNVVSNLLIR
AGMKPALIDN GILGAITEWL SPVSGHVLPS VTIRETLLRH LSEFHIQDPD LLRDSGIGKA
VMYLYKHPRE TRANKMLAGH LINEWSRPIF NLTSDYGSLT REERKQLDLE HLPKRRNLRK
EGVDNIDVNQ KSSDNQIPLR PGDPGWVNRA RVPQPSNKDY VIRPKWNVPT NSSTFEEGDA
DDAYDNEEGR ERDWARSARK RRLQQAQPPG GTSSRIERHI RTLAKNAAVR RKTRFVRAVP
MSVEGRNMSL
//