ID G7YTU1_CLOSI Unreviewed; 992 AA.
AC G7YTU1;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Kanadaptin {ECO:0000313|EMBL:GAA56371.1};
GN ORFNames=CLF_110728 {ECO:0000313|EMBL:GAA56371.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA56371.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA56371.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA56371.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF144235; GAA56371.1; -; Genomic_DNA.
DR AlphaFoldDB; G7YTU1; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0016891; F:RNA endonuclease activity, producing 5'-phosphomonoesters; IEA:InterPro.
DR CDD; cd19856; DSRM_Kanadaptin; 1.
DR Gene3D; 2.60.200.20; -; 1.
DR InterPro; IPR005034; Dicer_dimerisation_dom.
DR InterPro; IPR000253; FHA_dom.
DR InterPro; IPR008984; SMAD_FHA_dom_sf.
DR PANTHER; PTHR23308:SF53; KANADAPTIN-RELATED; 1.
DR PANTHER; PTHR23308; NUCLEAR INHIBITOR OF PROTEIN PHOSPHATASE-1; 1.
DR Pfam; PF03368; Dicer_dimer; 1.
DR Pfam; PF00498; FHA; 1.
DR SMART; SM00240; FHA; 1.
DR SUPFAM; SSF49879; SMAD/FHA domain; 1.
DR PROSITE; PS50006; FHA_DOMAIN; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 116..191
FT /note="FHA"
FT /evidence="ECO:0000259|PROSITE:PS50006"
FT REGION 1..37
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 816..924
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 939..979
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 455..482
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 899..914
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 962..979
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 992 AA; 111126 MW; F0D455EB741B0B98 CRC64;
MSSSPTPIES NPCTEATPNE ESETPNTQNQ LSSPISECTG KDEVSDYLCE DKTQVFVQPK
LQPAGNTYKP PVWAMPCPAD LGYRFEVIKN GTPLSECTVT LSEAGDSGDA TELSFCLFGR
QPQPFYAPYN RLHGQCVALA HPSISRLHAV LQYGRPPPSI AKTSLAQPEA AGWYIQDLES
THGTFVNKRR LPSGRFVRIH VGHVVRFGGS TRLNVLQGPE DDTERESTLS WTELKQVHFA
KKAVAKERTV DKTPETTVDF GCDWGLAAED AAADVPSFLR DINGAACLSH ENLYQDDPKR
ALRTYFEREG IDPAPEFEFV EASFGKQHCK IDLPLSSGTI TAEAIVAGKR KEAIAQCALE
ACRLLDRLGE FDPNKDQSTA AKRIRTKAYW EEHDYYSSDE DTFTDRTGHV ERKRLSRIRQ
LGVEGREAEE AERRAAEHAT ATCPYSAQRL DNATLLTVLA ELEKVGEEIV SLEEKLEKIN
KEFTPQGQNP SELDELEAYM KALKSGQARV ILVMLIKPTV TSKIAAESFP GSLEHRIPGR
TVAVLKCRRN VPAGPEHHLT RRVVRYQMKV SVRADREVWW TRKVKEMEEA QRASNARRLF
QLIRVTGHRK PHRSIVPKSI FKPRRPVCLA AFNIGTLKPT GSLEHRIPGR TVAVLKCRRN
VPAGPEHHLT RRVVRYQMKV SVRADREVWW TRKVKEMEEA QRASNARRLF QLIRVTGHRK
PHTSARVRHL WRQSGISLDL KGGAPSRKER LKLRSQLFTL RQREMRLFQQ AGLPQPRQRV
KLITTGGEEG EERVPVVEKT VRADAAAAVR AAKRKLQEDT GAADSHEVAP EQGPKNARLL
RNEIKRSLAG HRTVQQSDRT RHVSEDQPFE VEEDDDEEVA APTEQSTLRD RPDPNSPAVP
SETDLTTSNK DELESITGPK MPTEHRIDGV VVIEHKPAVE TASKPMGDNK GDLKSGHDSP
DEHITASSPT QLSYSFCTNG RSNIRTSSII SL
//