ID G3VL45_SARHA Unreviewed; 714 AA.
AC G3VL45;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=TFIIS central domain-containing protein {ECO:0000259|PROSITE:PS51321};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000003900.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000003900.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000003900.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VL45; -.
DR Ensembl; ENSSHAT00000003939.2; ENSSHAP00000003900.2; ENSSHAG00000003434.2.
DR GeneTree; ENSGT00940000162194; -.
DR HOGENOM; CLU_029996_0_0_1; -.
DR InParanoid; G3VL45; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR CDD; cd21540; SPOC_SPOCD1; 1.
DR Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR InterPro; IPR012921; SPOC_C.
DR InterPro; IPR003618; TFIIS_cen_dom.
DR InterPro; IPR036575; TFIIS_cen_dom_sf.
DR PANTHER; PTHR11477:SF18; SPOC DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR11477; TRANSCRIPTION FACTOR S-II ZINC FINGER DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF07744; SPOC; 1.
DR Pfam; PF07500; TFIIS_M; 1.
DR SMART; SM00510; TFS2M; 1.
DR SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR PROSITE; PS51321; TFIIS_CENTRAL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 1..112
FT /note="TFIIS central"
FT /evidence="ECO:0000259|PROSITE:PS51321"
FT REGION 146..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 190..239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 660..714
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..216
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 696..714
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 714 AA; 76974 MW; 0F076FEA6308D180 CRC64;
MASASPRLQE PANLTVGEEA VRGIAANIEA AIFDLMQCTD YRYKTKYRSL VFNLRDPRNK
DLFLQVIRGD ITPQGLVRMS ATELASQELA QWRDREVKHG LEIIEKQQRE APSCRVTKLT
HKGEIEIHPD VDQTLTLEDL TEPASHMDLN LRHQPGTAES QSGRDTTEQH ESHFLDPDCR
VCMGWEAPRG GRGFNAPRST PLSRSKITST PQKLPGPTSL PRVDTPLPGM SKSRSGPWTQ
LQDRPELAFC PRQALKPKAL QGRPLWEGAL EMFSIKRFDT KAFLVHGYSS QLIQTLPKVI
RSAGCVLPEA VWDYLDSIWS TEAKDISVVR LCPLRAHDAQ NYNMLYSYLN NKQRYGMAAS
KHLDMFLVPL PAFQPVPPKL RPLGGPGLEV THCSLVLGLI LPKAPPGDLR LNGTSPLQEK
KRKTVTFKET VETKCYSPGF RRPAGLSKPG PSWAPGASLE GPSAWDSISA DLLEEVAGHV
EQEQSVGGRG PYSLPFCCQQ EVAGPCPDGP CPLGSLAPQE PLPGWTQLSA GGRIPSQRAQ
PFSDGVGPGL VPLGAPLHPE CGGLAPELGA LSLLQLAGLF HFGSPPAVAL APAPSLLSHP
PAADCFHFPE HAATNCPHAA AGAACPPPHP SLLPTGEALT LIQHLEALVK MNSQLQASLQ
MAGPDPSLPG PGAGGAPGPV GMERPGREAE SQECPPPPLF LTPFCGSEQP PPGY
//