ID A0A1A6GCR4_NEOLE Unreviewed; 882 AA.
AC A0A1A6GCR4;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=ubiquitinyl hydrolase 1 {ECO:0000256|ARBA:ARBA00012759};
DE EC=3.4.19.12 {ECO:0000256|ARBA:ARBA00012759};
DE Flags: Fragment;
GN ORFNames=A6R68_07435 {ECO:0000313|EMBL:OBS64031.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS64031.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS64031.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS64031.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS64031.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Thiol-dependent hydrolysis of ester, thioester, amide, peptide
CC and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-
CC residue protein attached to proteins as an intracellular targeting
CC signal).; EC=3.4.19.12; Evidence={ECO:0000256|ARBA:ARBA00000707};
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS64031.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01097466; OBS64031.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GCR4; -.
DR STRING; 56216.A0A1A6GCR4; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:UniProtKB-EC.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02659; peptidase_C19C; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001394; Peptidase_C19_UCH.
DR InterPro; IPR024729; USP7_ICP0-binding_dom.
DR InterPro; IPR029346; USP_C.
DR InterPro; IPR018200; USP_CS.
DR InterPro; IPR028889; USP_dom.
DR PANTHER; PTHR24006; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE; 1.
DR PANTHER; PTHR24006:SF644; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 7; 1.
DR Pfam; PF00443; UCH; 1.
DR Pfam; PF14533; USP7_C2; 2.
DR Pfam; PF12436; USP7_ICP0_bdg; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00973; USP_2; 1.
DR PROSITE; PS50235; USP_3; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786}.
FT DOMAIN 1..281
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS64031.1"
SQ SEQUENCE 882 AA; 102715 MW; 50E0E63A4A3FEEF6 CRC64;
AVYMMPTEGD DSSKSVPLAL QRVFYELQHS DKPVGTKKLT KSFGWETLDS FMQHDVQELC
RVLLDNVENK MKGTCVEGTI PKLFRGKMVS YIQCKEVDYR SDRREDYYDI QLSIKGKKNI
FESFVDYVAV EQLDGDNKYD AGEHGLQEAE KGVKFLTLPP VLHLQLMRFM YDPQTDQNIK
INDRFEFPEQ LPLDEFLQKT DPKDPANYIL HAVLVHSGDN HGGHYVVYLN PKGDGKWCKF
DDDVVSRCTK EEAIEHNYGG HDDDLSVRHC TNAYMLVYIR ESKLSEVLQA VTDHDIPQQL
VERLQEEKRI EAQKRKERQE AHLYMQVQIV AEDQFCGHQG NDMYDEEKVK YTVFKVLKNS
SLAEFVQSLS QTMGFPQDQI RLWPMQARSN GTKRPAMLDN EADGNKTMIE LSDNENPWTI
FLETVDPELA ASGATLPKFD KDHDVMLFLK MYDPKTRSLN YCGHIYTPIS CKIRDLLPVM
CDRAGFIQDT SLILYEEVKP NLTERIQDYD VSLDKALDEL MDGDIIVFQK DDPENDNSEL
PTAKEYFRDL YHRVDVIFCD KTIPNDPGFV VTLSNRMNYF QVAKTVAQRL NTDPMLLQFF
KSQGYRDGPG NPLRHNYEGT LRDLLQFFKP RQPKKLYYQQ LKMKITDFEN RRSFKCIWLN
SQFREEEITL YPDKHGCVRD LLEECKKAVE LEIVSYKIIG VHQEDELLEC LSPATSRTFR
IEEIPLDQVD IDKENEMLIT VAHFHKEVFG TFGIPFLLRI HQNSVPLKDL LISAAGAGNP
GFDFYHTALH ARVGEHFREV MKRIQSLLDI QEKEFEKFKF AIVMMGRHQY INEDEYEVNL
KDFEPQPGNM SHPRPWLGLD HFNKAPKRSR YTYLEKAIKI HN
//