ID G7PLG6_MACFA Unreviewed; 819 AA.
AC G7PLG6;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE RecName: Full=TFIIS N-terminal domain-containing protein {ECO:0000259|PROSITE:PS51319};
GN ORFNames=EGM_05219 {ECO:0000313|EMBL:EHH55917.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1] {ECO:0000313|EMBL:EHH55917.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH55917.1};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00649}.
CC -!- SIMILARITY: Belongs to the IWS1 family.
CC {ECO:0000256|ARBA:ARBA00037992}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001288; EHH55917.1; -; Genomic_DNA.
DR AlphaFoldDB; G7PLG6; -.
DR eggNOG; KOG1793; Eukaryota.
DR Proteomes; UP000009130; Chromosome 13.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR46010; PROTEIN IWS1 HOMOLOG; 1.
DR PANTHER; PTHR46010:SF1; PROTEIN IWS1 HOMOLOG; 1.
DR Pfam; PF08711; Med26; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}.
FT DOMAIN 614..692
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT REGION 1..520
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 696..728
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..71
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 82..463
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 475..493
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 494..520
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 698..714
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 819 AA; 91925 MW; 525ACCC4C608D1B6 CRC64;
MDSEYYSGDQ SDDGGATPVQ DERDSGSDGE DDVNEQHSGS DTGSVERHSE NEPSDREDGL
TKGHHVIDSE NDEPINLNAS DSESEDLHRQ KDSDSESEER AEPPASDSEN EDVNQHGSDS
ESEETRKLPG SDSENEELLN GHASDSENED VGKHPASDSE IEELQKSPAS DSETEDALKP
QISDSESEEP PRHQASDSEN EEPPKPRMSD SESEELPKPQ VSDSESEEPP RHQASDSENE
ELPKPRISDS ESEDPPRHQA TDSENEELPK PRISDSESED PPRNQASDSE NEELPKPRVS
DSESEGPRKG PASDSETEDA SRHKQKPESD DDSDRENKGE DTEMQNDSFH SDSHMDRKKF
HSSDSEEEEP KKQKLDSDED EKEGEEEKVA KRKAAVLSDS EDGEKASAKK SRVVSDADDS
DSDAVSDKSG KREKTIASDS EEEAGKELSD KKNEEKDLFG SDSESGNEEE NLIADIFGES
GDEEEEEFTG FNQEDLEEEK SETQVKEAED SDSDDNIKRG KHMDFLSDFE MMLQRKKSMS
GKRRRNRDGG TFISDADDVV SAMIVKMNEA AEEDRQLNNQ KKPALKKLTL LPTVVMHLKK
QDLKETFIDS GVMSAIKEWL SPLPDRSLPA LKIREELLKI LQELPSVSQE TLKHSGIGRA
VMYLYKHPKE SRSNKDMAGK LINEWSRPIF GLTSNYKGMT REEREQRDLE QMPQRRRMNS
TGGQTPRRDL EKVLTGEEKA LRPGDPGFCA RARVPMPSNK DYVVRPKWNV EMESSRFQAT
SKKGISRLDK QMRKFTDIRK KSRSAHAVKI SIEGNKMPL
//