ID A0A158R6S0_TAEAS Unreviewed; 816 AA.
AC A0A158R6S0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE SubName: Full=ARID domain-containing protein {ECO:0000313|WBParaSite:TASK_0000076401-mRNA-1};
GN ORFNames=TASK_LOCUS765 {ECO:0000313|EMBL:VDK21755.1};
OS Taenia asiatica (Asian tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Taenia.
OX NCBI_TaxID=60517 {ECO:0000313|Proteomes:UP000046400, ECO:0000313|WBParaSite:TASK_0000076401-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:TASK_0000076401-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (APR-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDK21755.1, ECO:0000313|Proteomes:UP000282613}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYRS01000118; VDK21755.1; -; Genomic_DNA.
DR STRING; 60517.A0A158R6S0; -.
DR WBParaSite; TASK_0000076401-mRNA-1; TASK_0000076401-mRNA-1; TASK_0000076401.
DR Proteomes; UP000046400; Unplaced.
DR Proteomes; UP000282613; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd16881; ARID_Dri-like; 1.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR045147; ARI3A/B/C.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR023334; REKLES_domain.
DR PANTHER; PTHR15348; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN ARID DOMAIN- CONTAINING PROTEIN DEAD RINGER PROTEIN B-CELL REGULATOR OF IGH TRANSCRIPTION BRIGHT; 1.
DR PANTHER; PTHR15348:SF0; PROTEIN DEAD RINGER; 1.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR PROSITE; PS51011; ARID; 1.
DR PROSITE; PS51486; REKLES; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000282613}.
FT DOMAIN 335..427
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT DOMAIN 725..815
FT /note="REKLES"
FT /evidence="ECO:0000259|PROSITE:PS51486"
FT REGION 1..164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 264..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 462..511
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 585..632
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 648..678
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 19..45
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..115
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..151
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 481..511
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 816 AA; 87252 MW; 74522FF55D2E748F CRC64;
MAGSPIRAPL PNLLFPQSGE KSPLITGSGV RNSSPLLSDI VTGSPTYSRF PEGPFCPNNV
GSASPQPPAK KRCLEPQQQS TSGEGEGLAA PSQPTHECSS DSPHTSASSF SPKGEQQPQP
PQPLPHLVQN CAEESHPTKP QQPQQQPQES ESLKDLPHHP PLPLWGDILP DSTYSRISMF
PKNFPPMFAS GCTAALTPGI PAITTTTNGD PVPLGASALA AAAAMAMGPL FQQPPGSSLV
PGSTAPPWNP MLAAAAAAFA SGRFPPPGLP SRLQGDAEGP EEKDGDSSSR SGSPIPLSGA
GSSVEGMEVM PDANQSGHHW TFQEQFKQLY ELSSDPKRKE FLDDLFNFMQ KRGSPVNRIP
IMAKQVLDLY ELYRLVVTRG GLVEVINKKL WREITKGLQL PSSITSAAFT LRTQYMKYLY
PYECEKLALS TPSELQAAID GNRREARRSS YSFEYPMLMG PSSSAGAPSG ATGQPLPNAP
LGPTFSHPPP PHHPLLAPPP PPPPSVSASC GPSLLPPGLL LPPGFPNSNS GRNGSIFPDA
PFFGFPSLVV PTSTAVVPPP TPTPPSAFDA KSYLDDDQLI KQKRTKCPQR CLSSASTTMV
TVAASPKPRP NSKPNYSSPA TLESDPKSEA NRQLKPVKFS TLSKCASAFD LDQRHSPRES
PSPRPSGNAE VPDEGQQSAA TRFSEFLKQR LPPNDFETSI GCNFSKGDPM QARFPICRHN
NAYQQFMGNT VQKLGVAEKK DSMNGCAEDR RQTKPTAIPR ENCDFPASSI QLVPNLRIST
QAGNQLGLPE NTLVVCMEVT GVVYQGVLFG RIKPPS
//