ID G7Y8A6_CLOSI Unreviewed; 1040 AA.
AC G7Y8A6;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
GN ORFNames=CLF_102651 {ECO:0000313|EMBL:GAA49191.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA49191.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA49191.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA49191.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF142938; GAA49191.1; -; Genomic_DNA.
DR AlphaFoldDB; G7Y8A6; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR003150; DNA-bd_RFX.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR22970; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR PANTHER; PTHR22970:SF14; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR Pfam; PF02257; RFX_DNA_binding; 1.
DR SMART; SM00355; ZnF_C2H2; 2.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 785..810
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 246..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 554..574
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 877..926
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 877..918
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1040 AA; 112863 MW; 9AE6D1DE76628F6E CRC64;
MAYGDDEGFR VQLVATVLRN LAIEGGLSAV EIGRDPQALR FTFLCIYAKH SSLRQLGLEI
LSSLHFPVVS CLRPILSRLL RILLTSCDRN DQIRGLQLVR RLCESPTYLS ALDRIRAASA
IQTQSATRDR AMLTTDRNSE FLSMLPSVVF RCIISRLCLR DLHLVVLALD TIYSLSSLGS
VLCERMLKSH VDLEQPHEGV YRCSQVAGLT NILEMLVALL RLEAQSMGSE SLVRVRVMQA
LGAGTQKTSS VPTGSQPSTT SHGSDFIRPS TQPSSFTRIP AFRSVTDDVQ AAPVCSVQPQ
LSLSINTSLC SPMSTVVPRI TIVAAPVPVS EPQQLSCVQS YAVSSVNSLP TSTQTVSKVP
STEVLILSDI KPFRLPHQQV VDVRTTQTCA TPPRPSPPIR SALPALKAST ARPLRASSEA
PTTVSHATRR AYMFEWLRKN YVVHSHSTVP RIQIYTDYQQ AHQRRFGFNN GGAISPIEFH
AELKSIFPGI EQIKVQTPGG HVEIHYHHLR HLNSPEPKST QGSSGLFNIS CFLPRCERYN
GSCALSQAKH HSRTAVDSQN HSAAVSLDSD NGPPAKTVPK PIDTPLNGHT EYLEAVNNRT
ASPMNGLNKD VARVITNPHN DKIAILPNGF QPILSSVASS APQTGATATL SNCPKLNELV
SKETGLVLLP VFTTLNFHPT VSNIKGQHPV IHNPGNAQKQ MGMTVCISPV QQRVGAPSNH
PTNCSSAPTT TTTTVKLNLS DNHSATCLLL PHGGIDDIGK PTETNLQRKP ELSPNGQSAV
PSTTFVCLWD HCGRTFDSPE CLKEHVFESH FVNSTTEIFC RWDGCSRRML QTSRDVFVTH
LMDDHLASVL STTPASVCTT KTAGKPAITS ISALLQADSP NPENSNSPIV IPSSPENSNS
VEFSLQSSGS SNHTNEPGAP IHNLPHVLPQ PSIPEPSVQL TEEESIAGPT RSISPNAMSD
EQPVPIQPFL EPLSVTQLQH ALWSPPPVIS AECRKALAGG IPNPPFTPHP QREGPVTKHI
RLTAALALKN FLQYSATARR
//