ID A0A0R3W3Q0_TAEAS Unreviewed; 860 AA.
AC A0A0R3W3Q0;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 03-MAY-2023, entry version 31.
DE SubName: Full=ETS domain-containing protein {ECO:0000313|WBParaSite:TASK_0000457001-mRNA-1};
GN ORFNames=TASK_LOCUS4571 {ECO:0000313|EMBL:VDK33649.1};
OS Taenia asiatica (Asian tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Taenia.
OX NCBI_TaxID=60517 {ECO:0000313|Proteomes:UP000046400, ECO:0000313|WBParaSite:TASK_0000457001-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:TASK_0000457001-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (FEB-2017) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDK33649.1, ECO:0000313|Proteomes:UP000282613}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU004019}.
CC -!- SIMILARITY: Belongs to the ETS family. {ECO:0000256|ARBA:ARBA00005562,
CC ECO:0000256|RuleBase:RU004019}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYRS01018357; VDK33649.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0R3W3Q0; -.
DR STRING; 60517.A0A0R3W3Q0; -.
DR WBParaSite; TASK_0000457001-mRNA-1; TASK_0000457001-mRNA-1; TASK_0000457001.
DR Proteomes; UP000046400; Unplaced.
DR Proteomes; UP000282613; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000418; Ets_dom.
DR InterPro; IPR046328; ETS_fam.
DR InterPro; IPR024668; GABP_asu_N.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR11849; ETS; 1.
DR PANTHER; PTHR11849:SF195; GA-BINDING PROTEIN ALPHA CHAIN; 1.
DR Pfam; PF00178; Ets; 1.
DR Pfam; PF11620; GABP-alpha; 1.
DR PRINTS; PR00454; ETSDOMAIN.
DR SMART; SM00413; ETS; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF46785; "Winged helix" DNA-binding domain; 1.
DR PROSITE; PS00345; ETS_DOMAIN_1; 1.
DR PROSITE; PS00346; ETS_DOMAIN_2; 1.
DR PROSITE; PS50061; ETS_DOMAIN_3; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU004019};
KW Nucleus {ECO:0000256|RuleBase:RU004019};
KW Reference proteome {ECO:0000313|Proteomes:UP000282613}.
FT DOMAIN 639..719
FT /note="ETS"
FT /evidence="ECO:0000259|PROSITE:PS50061"
FT REGION 129..153
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 397..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 860 AA; 93496 MW; FFFD7B2A484E54A3 CRC64;
MSRIFKSRNT SDSPLVNSNR LSHSGSRFAK NTCSSHKRIS LIVSLDRTLH EIAGILQRRF
GLDLKNCQYY LNDRIRLHGN WPLSAHCIQP SGMVQLLLEI KSMPTHISSY RFTRLNVVDI
LSPDTEGDYT ESSALNSKTG MSEGGVGQSH RSTSMASLNG ISSALSISNV RSDGPLNSEG
SSADQESRLI SSSAVFEAFT VANRAIENIE RLSTATTSNN VASQSGVALP TGFYSTPRVG
TSHYGPNYAI IMERLAGGGN DVGGRWTIDE HYRRLMRMNS IPSDPGRWNA TQVVMWINWA
SKQFKLVNNG SSNSHYCNNG GGDGGKLAKA FEGLLGEDLL GLSLSDLSAR FNFSGSTATP
NLAFFTHFEL LKNCREVCVP YKPQVQPYTV SQTSQRVYRQ MGRSRVRSSR NGGGEYTSNG
PSSSTYSTNN GNNVITRFPG GTGTNHPISP AQSCANAATY QALGYLLANG VTGLNGAVFS
PSLPLTSNLR LNGGGFQPRI SQPSASSVQF SYGRLASSSP ASTPSYFNSV PASTQPAFTA
HGGSFYGAGD GNSSVPVTST VSPGASATVT AFRTDERIYS PFSSLSVDGS GTTSSVRKYR
VVSALSPPHH HDSGTITMSS IALGGGGTQS PAAGSGCQVH LWQFLLDLLT DWRHQDAIHW
VNRDGEFMLA NPERVAAMWG QRKNKPAMNY EKLSRALRYY YDGDMIAKVQ SKRFCYKFIC
DLKTLMGYSA GELHELVCFC AEKHGVNFRN GDLEIGRKRT MVDGEAWFGC PSPMRLRTMP
SSMPTILDSM ESDSTSAEVD VSPHFSRPEE EGEVIEEVDV DEDDDSFEYC GVDDIDKVTA
VLRMEADECC KIAEQKSRRK
//