ID A0A1B0AT06_9MUSC Unreviewed; 914 AA.
AC A0A1B0AT06;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=TFIIS N-terminal domain-containing protein {ECO:0000259|PROSITE:PS51319};
OS Glossina palpalis gambiensis.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Hippoboscoidea;
OC Glossinidae; Glossina.
OX NCBI_TaxID=67801 {ECO:0000313|EnsemblMetazoa:GPPI007538-PA, ECO:0000313|Proteomes:UP000092460};
RN [1] {ECO:0000313|Proteomes:UP000092460}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IAEA {ECO:0000313|Proteomes:UP000092460};
RA Aksoy S., Warren W., Wilson R.K.;
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:GPPI007538-PA}
RP IDENTIFICATION.
RC STRAIN=IAEA {ECO:0000313|EnsemblMetazoa:GPPI007538-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00649}.
CC -!- SIMILARITY: Belongs to the IWS1 family.
CC {ECO:0000256|ARBA:ARBA00037992}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXJN01003090; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A1B0AT06; -.
DR STRING; 67801.A0A1B0AT06; -.
DR EnsemblMetazoa; GPPI007538-RA; GPPI007538-PA; GPPI007538.
DR VEuPathDB; VectorBase:GPPI007538; -.
DR Proteomes; UP000092460; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46010; PROTEIN IWS1 HOMOLOG; 1.
DR PANTHER; PTHR46010:SF1; PROTEIN IWS1 HOMOLOG; 1.
DR Pfam; PF08711; Med26; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}.
FT DOMAIN 708..786
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 1..617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 795..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..108
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..126
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..220
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..255
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 256..285
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 297..313
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..348
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..475
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 506..521
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 547..578
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 795..822
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 914 AA; 102816 MW; F0F96C2D52A46BC1 CRC64;
MPAVENGAVG YEDELRKDKS QNQEPLIPSL EEKTLKSRHR SRSVSPGNVS AELRHSKSRS
RSGSRSSEGE PPSPRAEKEK QRFRSRSRSV SLDSDGRIKE RNESPRSDSR SPSRSSNISS
EKPGLSKETK LPSRSRSRSS SRSSSESSRG QRELSKSRSR SRSHSSNARS ISPNVLTNQQ
EKTNSRSLND EPYTPILSTK EKSDSVNISH DEDAGLKARR RSRSPSQCSL GFNKDDEPKR
KSRSRSKEHT RGTPRSRSCS KSPSGIPSPS GKTPNDSVAR SRSGSRRSRS GSKRNNSRSR
SRSISPLSRP ASRHRSRSYS KSASRSRSRS SRNRSRSGSK HSRSASKLSR SSRRSYSGSK
HSRSGSERSR SGSRRSYSGS SDSRRSRSGS RRSRSGSRRS RSGSRRSRSG SRRSRSGSRR
SRSGSRRSRS GSRRSRSGSR RSRSGSRRSR SGLRRSRSGS RRSKSGSRHS RSGSRRSRAS
GSASRSRSRN SSRKSRSRSV RSHSKSADSR SGSLARSVSR SSRSHSRSRS KSHHSSISPT
SKGQKGHKRN ILSDSEGEDG PSSAQKKRPK ITDTDNEDEA DKSEHEIVEE GGAQAASKNI
GDSSDDENLP IDNDREPDNF ISDFDAMLLR KKEEKRIRRR KRDIDLINDN DDLIDQLIMN
MKNASDEDRQ LNVEGKPATK KISMLKQVMS QLIKKDLQLA FLEHNILNVL TDWLAPLPNK
SLPCLQIRES ILKLLSDFPT IEKSYLKQSG IGKAVMYLYK HPKETKQNRD RAGRLISEWA
RPIFNLSCDF KALTKEERQE RDMRQVPKSR RKSPEPEPST SKKKRGLNSA FGNTEEKPLR
PGDPGWVARA RVPRPSNKDY LIRPKSKIDG EVSTTNKRKP NRYEKHMKKF LDAKRLKSSR
RAVEISIEGR KMAL
//