ID A0A0T6B0K5_9SCAR Unreviewed; 835 AA.
AC A0A0T6B0K5;
DT 17-FEB-2016, integrated into UniProtKB/TrEMBL.
DT 17-FEB-2016, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=TFIIS N-terminal domain-containing protein {ECO:0000259|PROSITE:PS51319};
DE Flags: Fragment;
GN ORFNames=AMK59_5434 {ECO:0000313|EMBL:KRT80857.1};
OS Oryctes borbonicus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Scarabaeiformia;
OC Scarabaeidae; Dynastinae; Oryctes.
OX NCBI_TaxID=1629725 {ECO:0000313|EMBL:KRT80857.1, ECO:0000313|Proteomes:UP000051574};
RN [1] {ECO:0000313|EMBL:KRT80857.1, ECO:0000313|Proteomes:UP000051574}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OB123 {ECO:0000313|EMBL:KRT80857.1};
RC TISSUE=Whole animal {ECO:0000313|EMBL:KRT80857.1};
RA Meyer J.M., Markov G.V., Baskaran P., Herrmann M., Sommer R.J.,
RA Roedelsperger C.;
RT "Draft genome of the scarab beetle Oryctes borbonicus.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00649}.
CC -!- SIMILARITY: Belongs to the IWS1 family.
CC {ECO:0000256|ARBA:ARBA00037992}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRT80857.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIG01016353; KRT80857.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0T6B0K5; -.
DR OrthoDB; 12240at2759; -.
DR Proteomes; UP000051574; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46010; PROTEIN IWS1 HOMOLOG; 1.
DR PANTHER; PTHR46010:SF1; PROTEIN IWS1 HOMOLOG; 1.
DR Pfam; PF08711; Med26; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000051574}.
FT DOMAIN 647..725
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 1..580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 806..835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..160
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 270..302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 303..354
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 386..462
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..552
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..580
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 835
FT /evidence="ECO:0000313|EMBL:KRT80857.1"
SQ SEQUENCE 835 AA; 93221 MW; 2F97E85531343A68 CRC64;
MDVITENSKA GVNTASSPIE NRKSRSSSIE SNHSRKSNSL SPTQKSNDRS SPISDSGSMS
IDVSPVKSGD QNGSTSSYEP IKSPREKSSS PNNARISRPR SKSNDSMKSR SGSNSRSRSR
SVSNKSRSRS RSKSPHTPES RSGSGKGFRS KSNSRSRSRS ISGSRKSRSR SGSRSRSGSR
GSRSDSGHSR SASRRSIRNS RSRSGSNDGS RRTSRSRSKS NSPSHRSRSV SSLRRSRSGS
KHSRSRSGSL TSKISVSERA RKSRSGSRRS KSGSIHSGSG SRRSRSGSRQ SRSPSQRSKS
GSRRSRSGSR RSRSGSQRSR SGSRRSRSGS IRSRSGSRRS RSGSRSRSGS RRSRSGSHGS
RSGSQRSRSG SRNSRSGSAR SRSGSRRSRS GSRRSRSRSR SRSRSAKSRS KSRSRSRSRS
RSGSRSKSRS RSRSKNRSRS ASRSRSRSRS KSRSRSKNRS RSGSRRSRSD SGRSKSRSRS
RSLPKSEETA NRKRTISESE DEGTASVSKA RKPIESDDEE AGPSEVKIPE GEENEPAKEA
ASDSDDGVDP NLENRGSDGL SDFEIMLQRK KDEKSGKRKR KDIDIINDND DIIAQLLADM
RIAAEEDRKL NQASQPATKK IAMLPKVMSQ LKKHDLQLAF IEHNVLSVLT DWLAPMPDRS
MPSLQVRESI LRLLREFPRI DQQTLKQSGI GKAVMYLYKH PKETKENKER AGKLISEWAR
PIFNLSADFK AMSREERLQR DIEQMPKKRR ESDAASYLEK KDISKALNEE GKPLRPGDKG
WVYRARVPVP SNKDYVVRPK WSTEVDISRS SKKQMNRFEK HYKNFLDSKR NKQTR
//