ID H2KNG5_CLOSI Unreviewed; 967 AA.
AC H2KNG5;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE SubName: Full=Ubiquitin carboxyl-terminal hydrolase 20/33 {ECO:0000313|EMBL:GAA30975.2};
GN ORFNames=CLF_100105 {ECO:0000313|EMBL:GAA30975.2};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA30975.2, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA30975.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA30975.2};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF142831; GAA30975.2; -; Genomic_DNA.
DR AlphaFoldDB; H2KNG5; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:UniProtKB-EC.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR Gene3D; 3.30.2230.10; DUSP-like; 2.
DR InterPro; IPR035927; DUSP-like_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR006615; Pept_C19_DUSP.
DR InterPro; IPR001394; Peptidase_C19_UCH.
DR InterPro; IPR018200; USP_CS.
DR InterPro; IPR028889; USP_dom.
DR PANTHER; PTHR21646; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE; 1.
DR PANTHER; PTHR21646:SF86; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE; 1.
DR Pfam; PF06337; DUSP; 2.
DR Pfam; PF00443; UCH; 1.
DR SMART; SM00695; DUSP; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF143791; DUSP-like; 2.
DR PROSITE; PS51283; DUSP; 2.
DR PROSITE; PS00973; USP_2; 1.
DR PROSITE; PS50235; USP_3; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:GAA30975.2};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 1..492
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT DOMAIN 495..589
FT /note="DUSP"
FT /evidence="ECO:0000259|PROSITE:PS51283"
FT DOMAIN 595..714
FT /note="DUSP"
FT /evidence="ECO:0000259|PROSITE:PS51283"
FT REGION 91..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 236..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 748..791
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..117
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..132
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..168
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 967 AA; 108105 MW; 2DAD6642A8292A66 CRC64;
MIFSPQVTEH FLQCSGIDFD CRFHLASYYL DFLANVWLNS RSMIVSPSSI LREIKYWYPM
FSGYAQHDSQ EFLRVFLNRL HDELKRADLP SLTASSKAPP ETKSTICDCQ SKPSTLTHKG
KTNKRHRGKL KQSQSKLPSC PVCSSSLSAD PSTGGDSGNT TASSKPKPVG SDGHSIVTDV
FQGKLISAVR CLSCKNVSCR EEVFLDLSLS LRRPFPTEHP STKGFRRGSN IRPFDCLQPQ
ADADSNQSSA RSYSGLRPAR GEINDGHTIS AMFLNNSISD TTNIGSTSRF STNPLQQLIK
FRDSMPFIRR MLAPLFGFIA LSISWIFARL SDVQDWMTRP NLSLDDCLSD FFSQAELNGE
NKYYCERCKK LCNGLNQTAL VSLPEVLCIH LKRFRSHCMD SSKINSPVTF PLEGLELGPY
LHTDCTDKVT TFDLISVICH RGGYGGGHYV TYSLNAYTQS WYEFDDDHVT QVPAFHVEQL
TSDAYILFYR KRDTALVPLR REAALMLQKE VVHESSRIVF LSRAWFVRFS TWAEPGPIDN
TEFLCEHGQF KPTQWNHRDS LLVPIPHELW TKLLSAFGGG PVLRNPSPCS VCHEAIIMRQ
RNELQTFEKL FKQCEPVMES CGIFAVSNAW FDLWKDFVEG KSLIAPGPID NNDIVEYRSS
RSYFNNSLFS SRQTLSSQPR LKESVSYSEL FEECWVMLRD IYGGGPAVCV RAASGCISIT
PTDEDEPNGS LAVNQSVLTV SPSNVSLLSI SDRGDSPDSS RTQPPLTSDS SSHSWVHQGA
GDSAPSHTYT SPLLASKTDT KVLSINGDTE SGIYSNDRVS YTTLSIKEDY ADGDRSSFSD
SLSYLDVSYR SHSVENLLFI TEKTDWKQPL SPSTSNSING FSNSLHTHSM EHLLLSNQQQ
RVSSSVFGVA GDGPNKLSRS THIKGQRTNS NNNHLVTDVC NNTDTVRSNV SCHNNVYHKR
PRLSVRP
//