ID A0A094ZCS8_SCHHA Unreviewed; 1470 AA.
AC A0A094ZCS8;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE RecName: Full=ARID domain-containing protein {ECO:0000259|PROSITE:PS51011};
GN ORFNames=MS3_00209 {ECO:0000313|EMBL:KGB32070.1};
OS Schistosoma haematobium (Blood fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6185 {ECO:0000313|EMBL:KGB32070.1};
RN [1] {ECO:0000313|EMBL:KGB32070.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22246508; DOI=10.1038/ng.1065;
RA Young N.D., Jex A.R., Li B., Liu S., Yang L., Xiong Z., Li Y.,
RA Cantacessi C., Hall R.S., Xu X., Chen F., Wu X., Zerlotini A., Oliveira G.,
RA Hofmann A., Zhang G., Fang X., Kang Y., Campbell B.E., Loukas A.,
RA Ranganathan S., Rollinson D., Rinaldi G., Brindley P.J., Yang H., Wang J.,
RA Wang J., Gasser R.B.;
RT "Whole-genome sequence of Schistosoma haematobium.";
RL Nat. Genet. 44:221-225(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL250491; KGB32070.1; -; Genomic_DNA.
DR RefSeq; XP_012791861.1; XM_012936407.1.
DR STRING; 6185.A0A094ZCS8; -.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR CDD; cd16100; ARID; 1.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR22970; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR PANTHER; PTHR22970:SF14; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR PROSITE; PS51011; ARID; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 4: Predicted;
FT DOMAIN 1..101
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 544..577
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 930..963
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1039..1148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1083..1117
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1118..1136
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1470 AA; 163038 MW; 8D9C7788BB91E906 CRC64;
MLDIEASRQF LGIFVYHFGD TLSMWSRQYH RRLPRLLGKP IKLSDLFVAV VSRGGYKRVC
DHRCWLEVAR ELKLPSECTN ASVGLRRIYY QFLSHFEFKE YPSLSEQFLV RTLVEPDVTP
EDVTLPLSSV ERNLGIGTSC NHAHMTSVQS ADYNYSNKEI LKTGQDNDIF DASPNADEFP
DVFEPSQLRL VESALCSGLP NEIEVALNSL LVLSITPSSG SSTSVRLAHC TNLLSLLVAS
VGIYGEEYTI DEIGTNEEKV MHELFTPYSN QYFGTHWPME GFEGSRVLLI ATILVNLVTP
PPVAYASVVE GDDDELDFGP IIPCNLFPNG LPLVTWRENA RVIAGSPNAL RFIFLCAYAH
HSGLRSLGLQ LLSSIRYPLD PPGLSLPLDC PEYWTPLIDN GCRLGQLTLA FLTRCILESN
DRCDLIAGLM FLANLTKVRE KMNLSSLLVG LPQTVWPRLA QLLCLPDLAV VCATLEALRC
LTNLDATMCL TCWESCCLNV NSFDQENIPF ILLQPLLALL TLEGQAMGSQ SLHRIRLLPR
TPQAQNVQPQ LPHGTGLRIP ALSRPSPNVN ELNRSRDAPA YTNGQVFKSY LRPKLPVVHT
VPSDSFDNNN IHNSSTSSFR NLKFESSTHL KQSPISPTST CISHPVYRPQ SGSTACSGLI
NLLSSAPPAS SSPNINMPVI NSTSPNSPKV VTPSLSELTD RLQMPPPSLP PPSAMRRSIK
RTHSRTSLIS SPTQAADAVK VNISSPCRTN HSSGNVTVTD RISPTSLLLA NSCNLRPQPQ
GDSVPLGVSI NPKEIQQTEK HEVCSENFPD ESLNQINNSR NLLPHLRPIV NYDTCNKKTS
NNSNNKDELD DSEIKPIVNG EKTIDPIMDG ELKSPVNTFH TNGLRTGILQ EAVLALQDRK
LKQTSEQCKS TIKCVNGVLE TKRELILSTK DNSNTDTNSN NGKSTPFDVC ENSNKKSSKT
SLSNGYLTDK IHENEKDYNS LCENNPLYIA SSGNGNVNND DVNNNKKTDE LEKNSKFRFR
GVHSSHRFLK RRRKWGSKNR LFTPKRRRDL TSKHIDSSPT LHVPLTFNPV PGSVLDPTWK
QPTDPVSVES PPDTSTSSSS SKAIVEPPSS NDNKNSSISK DVMDKSSKHD CDVSVTSESS
ERNSESGNTL YNASLTCSED PSKTVNIEKP VLYLCEWECC TATFELKHQV AFHVYSTHLP
SKESWKVDGS HHKCSVQLVR RCCRWRDCQS ASLARAPYAL MTHVLDMHCS PRELEARRSN
HAKSKTEYRT RNEHFSDVNS SIGDTHPEYP LVSDRTGWGI IRSVEAKNMQ TELLAAQHHF
LLTNPNCHSN ISINPLVHSG MNNPPREGPV TKHLRVTSAL ILKNLAVHVD QARRWLLKES
QVLSEIAFGC TPSETGLKTN NASHIIAQCL SICTLKQNST NCVLSHPLER SLNIPLLSTD
ACSSLKIQPN ESTLYQLKES TVDLDSDTCN
//