ID F6TVI6_HORSE Unreviewed; 3503 AA.
AC F6TVI6;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Ubiquitin specific peptidase 34 {ECO:0000313|Ensembl:ENSECAP00000020288.3};
GN Name=USP34 {ECO:0000313|Ensembl:ENSECAP00000020288.3,
GN ECO:0000313|VGNC:VGNC:51423};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000020288.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000020288.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000020288.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000020288.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000020288.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000020288; -.
DR PaxDb; 9796-ENSECAP00000020288; -.
DR Ensembl; ENSECAT00000024422.3; ENSECAP00000020288.3; ENSECAG00000022272.4.
DR VGNC; VGNC:51423; USP34.
DR GeneTree; ENSGT00940000158659; -.
DR InParanoid; F6TVI6; -.
DR OMA; KLMYSLY; -.
DR TreeFam; TF323966; -.
DR Proteomes; UP000002281; Chromosome 15.
DR Bgee; ENSECAG00000022272; Expressed in brainstem and 23 other cell types or tissues.
DR GO; GO:0005829; C:cytosol; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IBA:GO_Central.
DR CDD; cd02659; peptidase_C19C; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR021905; DUF3517.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001394; Peptidase_C19_UCH.
DR InterPro; IPR018200; USP_CS.
DR InterPro; IPR028889; USP_dom.
DR PANTHER; PTHR24006; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE; 1.
DR PANTHER; PTHR24006:SF948; UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 34; 1.
DR Pfam; PF12030; DUF3517; 1.
DR Pfam; PF00443; UCH; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00972; USP_1; 1.
DR PROSITE; PS00973; USP_2; 1.
DR PROSITE; PS50235; USP_3; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 1855..2200
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT REGION 464..496
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 511..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1420..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3293..3401
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 477..496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 517..569
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 621..640
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3295..3334
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3335..3370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3503 AA; 399033 MW; 928A9B74065C9A2A CRC64;
MHSNWQCLCC FKEYKHLEIF NQVVCALINL VIAQVQVLRD QLCKHCTTVN IDSTWQDENN
QVEEPLNIER QCNEGSTERQ KSIEKKSNST RICNLTEEES SKSSDPFSLW STDEKEKLLL
CVAKIFQIQF PLYTAYKHNT HPTIEDISTQ ESNILGAFCD MNDVEVPLHL LRYVCLFCGK
NGLSLMKDCF EYGTPETLPF LIAHAFITVV SNIRIWLHIP AVMQHIIPFR TYVIRYLCKL
SDQELRQSAA RNMADLMWST VKEPLDTTLC FDKESLDLAF KYFMSPTLTM RLAGLSQITN
QLHTFNDVCN NESLVSDTET SIAKELADWL ISNNVVEHIF GPNLHIEIIK QCQVILNFLA
AEGRLSTQHI DCIWAAAQLK HCSRYIHDLF PSLIKNLDPV PLRHLLNLVS ALEPSVHTEQ
TLYLASMLIK ALWNNALAAK AQLSKQSSFA SLLNTNLPIG NKKEEEELRR AAPSPWSPAA
SPQSSDNSDT HQSGGSDIEM DEQLINRTKH VQQRLSDTEE SMQGSSDETA NSAEDGSSGP
GSSSGHSDGS SNEVNSSHAS QSAGSPGSEV QSEDIADIEA LKEEDEDDDH GHNPPKSSCG
TDLRNRKLES QAGICLGDSQ GPSERSGTSN GTGKDLVFNT ESLPSVDNRI RMLDACSHSE
DPENDISGEM NAAHIAQASQ ESCITRTGDF LGETIGNELF NCRQFIGPQH HHHHHHHHHH
HDGHMVDDML SADDVSCSSS QVSAKSEKNM ADFDGEESGC EEELVQINSH AELTSHLQQH
LPNLASIYHE HLSQGPAVHK HQFNSNAVTD INLDNVCKKG NTLLWDIVQD DDAVNLSEGL
INEAEKLLCS LVCWFTDRQI RMRFIEGCLE NLGNNRSVVI SLRLLPKLFG TFQQFGSSYD
THWITMWAEK ELNMMKLFFD NLVYYIQAVR EGRQKHALYS HSAEVQVRLQ FLTCVFSTLG
SPDHFRLSLE QVDILWHCLV EDSECYDDAL HWFLNQVRSK DQHAMGMETY KHLFLEKMPQ
LKPETISMTG LNLFQHLCNL ARLATSAYDG GSNSELCGMD QFWGIALRAQ SGDVSRAAIQ
YINSYYINGK TGLEKEQEFI SKCMESLMIA SSSLEQESHS SLTVIERGLL MLKTHLEAFR
RRFAYHLRQW QIEGTGISSH LKALSDKQSL PLRVVCQPAG LPDKMTIEMY PSDQVADLRA
EVTHWYENLQ KEQINQQAQL QEFGQSSRKG EFPGGLMGPV RMISSGHELT TDYDEKALHE
LGFKDMQMVF VSLGAPRRER KGEGVQLPAS CLPPPQKDNI PMLLLLQEPH LTTLFDLLEM
LASFKPPSGK VAVEDSESLR CEELHLHAEN LSRRVWELLM LLPTCPNMLM AFQNISEEQS
NDGLNWKELL KIKSAHKLLY ALEIIEALGK PNRRIRREST GSYSDLYPDS DDSSEDQVEN
SKNSWSCKFV AAGGLQQLLE IFNSGILEPK EQESWTVWQL DCLACLLKLI CQFAVDPSDL
DLAYHDVFAW SGIAESHRKR TWPGKSRKAA GDHAKGLHIP RLTEVFLVLV QGTSLIQRLM
SVAYTYDNLA PRVLKAQSDH RSRHEVSHYS MWLLVSWAHC CSLVKSSLAD SDHLQDWLRK
LTLLIPETAV RHESCNGLYK LSLSGLDGGD SINRSFLLLA ASTLLKFLPD AQALKPIRID
DYEEEPMLKP GCKEYFWLLC KLVDNIHIKD ASQTTLLDLD ALARHLADCI RSREILDHQD
GNIEDDGLTG LLRLATSVIK HKPPFKFSRE GQEFLRDIFN LLFLLPSLKD RQQPKCKSHS
SRAAAYDLLV EMVKGSVENY RLIHNWVMAQ HMQSHAPYKW DYWPHEDVRA ECRFVGLTNL
GATCYLASTI QQLYMIPEAR QAVFTAKYSE DMKHKTTLLE LQKMFTYLME SECKAYNPRP
FCKTYTMDKQ PLNTGEQKDM TEFFTDLITK IEEMSPELKN TVKSLFGGVI TNNVVSLDCE
HVSQTAEEFY TVRCQVADMK NIYESLDEVT IKDTLEGDNM YTCSHCGKKV RAEKRACFKK
LPRILSFNTM RYTFNMVTMM KEKVNTHFSF PLRLDMTPYT EDFLMGKSDR KEGFKEVSDH
SKDTESYEYD LIGVTVHTGT ADGGHYYSFI RDIVNPLAYK NNKWYLFNDA EVKPFDSAQL
ASECFGGEMT TKTYDSVTDK FMDFSFEKTH SAYMLFYKRM EPEEENGKDY KFDVSSELLE
WIWHDNMQFL QDKNIFEHTY FGFMWQLCSC IPSTLPDPKA VSLMTAKLST SFVLETFIHS
KEKPTMLQWI ELLTKQFNNS QAACEWFLDR MADDDWWPMQ ILIKCPNQIV RQMFQRLCIH
VIQRLRPVHA HLYLQPGMED GSDDMDASVE DIGGRSCVTR FVRTLLLIME HGVKPHSKHL
TEYFAFLYEF AKMGEEESQF LLSLQAISTM VHFYMGTKGP ENPQVEVLSE EEGEEEEEEE
DILSLAEEKY RPAALEKMIA LVALLVEQSR SERHLTLSQT DMAALTGGKG FPFLFQHIRD
GINIRQTCNL IFSLCRYNNR LAEHIVSMLF TSIAKLTPEA ANPFFKLLTM LMEFAGGPPG
MPPFASYILQ RIWEVIEYNP SQCLDWLAVQ TPRNKLAHSW VLQNMENWVE RFLLAHNYPR
VRTSAAYLLV SLIPSNSFRQ MFRSTRSLHI PTRDLPLSPD TTVVLHQVYN VLLGLLSRAK
LYVDAAVHGT TKLVPYFSFM TYCLISKTEK LMFSTYFMDL WNLFQPKLSE PAIATNHNKQ
ALLSFWYNVC ADCPENIRLI VQNPVVTKNI AFNYILADHD DQDVVLFNRG MLPAYYGILR
LCCEQSPAFT RQLASHQNIQ WAFKNLTPHA SQYPGAVEEL FNLMQLFIAQ RPDMREEELE
DIKQFKKTTI SCYLRCLDGR SCWTTLISAF RILLESDEDR LLVVFNRGLI LMTESFNTLH
MMYHEATACH VTGDLVELLS IFLSVLKSTR PYLQRKDVKQ ALIQWQERIE FAHKLLTLLN
SYSPPELRNA CIDVLKELVL LSPHDFLHTL VPFLQHNHCT YHHSNIPMSL GPYFPCRENI
KLIGGKSNIR PPRPELNMCL LPTMVETSKG KDDVYDRMLL DYFFSYHQFI HLLCRVAINC
EKFTETLVKL SVLVAYEGLP LHLALFPKLW TELCQTQSAM SKNCIKLLCE DPVFAEYIKC
ILMDERTFLN NNIVYTFMTH FLLKVQGQVF SEANCANLIS TLITNLINQY QNLQSDFTNR
IEISKASASL NGDLRALALL LSVHTPKQLN PALIPTLQEL LSKCRTCLQQ RNSLQEQEAK
ERKTKDDEGA TPVKRRRVSS DEEHTVDSCI SDLKTETREA LTPTSTSDNE TRDSSIIDPG
TEQDLPSPEN SSVKEYRMEV PSSFSEDIRS QHAEQSNNGR FEDCKEFKDL PCSKDPNLAE
EESEFPSTSI SAVLSDLADL RSCDGPALPS QDPEAALSLS CGHSRGLFSH MQQHDILDTL
CRTIESTIHV VTRISGKGNQ AAS
//