ID A0A4U5QSK2_POPAL Unreviewed; 2116 AA.
AC A0A4U5QSK2;
DT 31-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 31-JUL-2019, sequence version 1.
DT 08-NOV-2023, entry version 17.
DE RecName: Full=USP domain-containing protein {ECO:0000259|PROSITE:PS50235};
GN ORFNames=D5086_0000063660 {ECO:0000313|EMBL:TKS12437.1};
OS Populus alba (White poplar).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus.
OX NCBI_TaxID=43335 {ECO:0000313|EMBL:TKS12437.1, ECO:0000313|Proteomes:UP000309997};
RN [1] {ECO:0000313|Proteomes:UP000309997}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. PAL-ZL1 {ECO:0000313|Proteomes:UP000309997};
RX PubMed=30661181; DOI=10.1007/s11427-018-9455-2;
RA Liu Y.-J., Wang X.-R., Zeng Q.-Y.;
RT "De novo assembly of white poplar genome and genetic diversity of white
RT poplar population in Irtysh River basin in China.";
RL Sci. China Life Sci. 62:609-618(2019).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TKS12437.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RCHU01000171; TKS12437.1; -; Genomic_DNA.
DR STRING; 43335.A0A4U5QSK2; -.
DR Proteomes; UP000309997; Unassembled WGS sequence.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:InterPro.
DR CDD; cd02257; Peptidase_C19; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 3.
DR InterPro; IPR006866; DUF627_N.
DR InterPro; IPR006865; DUF629.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001394; Peptidase_C19_UCH.
DR InterPro; IPR028889; USP_dom.
DR PANTHER; PTHR22975:SF9; ECHINUS SPLICE FORM 3; 1.
DR PANTHER; PTHR22975; UBIQUITIN SPECIFIC PROTEINASE; 1.
DR Pfam; PF04781; DUF627; 1.
DR Pfam; PF04780; DUF629; 1.
DR Pfam; PF00443; UCH; 3.
DR SUPFAM; SSF54001; Cysteine proteinases; 3.
DR PROSITE; PS50235; USP_3; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000309997}.
FT DOMAIN 945..1276
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT DOMAIN 1403..1744
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT DOMAIN 1784..2115
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 829..871
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 908..939
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1755..1774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..27
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 908..932
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2116 AA; 241664 MW; BE6B86B5801768AF CRC64;
MGKTKKPSSA APRPKQAQTT ARPLQSTVDD FSSEIIRSAC EQALRDVSSN PQKSLKRIKD
LISKHPNSAA VHHTQSFVHF KIYSQTSSSF STLKQKYLNN AADSAKKSLS LFPNSITLNY
LNARIFIKMA KYSSDYQRII DHCWKALKAL PSGLGPGEDI IASQPEMGGK SQESRIQRLK
QVLLDVMDKA KIESLLIGKD VSEKMKESAC LRIKARFPKD EELPCKYERN EKRKFIKKEI
KFLRDCEKRV CAGFVVKDPQ SILNKVKVDT SNITIVSNYW KEMSCEETRG LLQVSIDEIT
GYYKKHDRLV ADYLLEAVDF ARKTNKWKCL KCFCCALFFF DWKELRSHVF LKHLGGLSEQ
QMELVPFGLE DSYVEEIENG VWKPVDVDNM AQELSTLRHC KNDVYQEKCK IYSDQKKWMF
CDDAQRQELL KKIHRLLKLF LKNQCLAPRI LSWMWDYTIQ ELEESMELGF KDLVPILEQT
QTPLCICFLW LEQLEVVFDF LEELSNDCDL EDNISDDGGD SKEEFCDYEP IHFNSDSSCL
ILDKKFIRSM LDAGEHSNIV ADEGTAVIPF VDDPEKDIQF DRYRFVNWLF VSDKIQELLN
SWINLRKLDK ELAGMVFQFV DTDLSLLKYF CERKCRLLEY QETFTDVENI CLEEYKRREE
ISEYKEQNFA SLLVERQDEL VDAQSDIIGD EHACILNVLR FAQYVGHKKF GLDETLISTY
TQFPDLECHE DKANRDILVD VCVKEAIKME KQNVVREFCE NDVLIMKNVA SIKRMEPKFV
LLSALDYQFI LFYMVKSFIR AHLEDLANKD AVKKANTAEM LLTLADLAPE PSKNMTKGSD
NSRKTRKKSK LKKHKNQGKA MDEGAGGSQE HLPLDKENAE KVSCHSIASD GGCSDFGIVV
STSTEDLKQM EDEHRSKIGS KVPQKSFLKE EDSGKSPSGS GIFGVVLTND IGENNCFLNV
IIQCLWNIQL VRNELCSITD SGHEHFGDPC IVCELAEIFG ELSEASTRTR REIVSTTSLR
LAISKCFAHR DSFQEGQMND ADEVLQNILV ILHQSFTSCP APDASSESEK SKRVDCQQWT
SHKCLAHRLF GMDIYGYCDS CGLEWRHQTF SDFSHYICSS QLREKKNKNQ ASSFDELMKL
MLMDDCLTCN RDAGGCGKPN RIQFILRTPP QVFVCVLSMQ TARERREDIR DTLTALDTEV
DIGDVFLGRG PGNRYCLASM VCYDELHYVC FCYSHEGKRW TMYDGAHVEV IGFWHNLLDK
CVDELLQPQI LFFEAGVIKI FICSIVQGPQ FDDLRKLSGS SSMLCNLGVG EVKQAEDTPQ
ESTLHNDWQT AKDEGGNLQS HVQNEMKFLH QDKLQDAFRV QKHVPLEMDQ KLLQNLFLKG
EETGKCPTDS KVDYMDGSEI LGSGLKNDIG KNYSSLNVII QSLWHIPQFR NELACKTAPE
HKHVGDPCIV CGLAEIFDKL SAAIINPSRE IVYPTSLSIA IDKLSPCGDL FQKEKMNNAF
EVLWIILDSL HHSLTSVEDF SLHESEKRHC VGSLECTTDT CLVHTLFGMT VYKSVNYDSC
GLESRQQKHT FFFHTISAFE LRKQAESSFL DKVSTLRQGT SSFDELLKPM LVDYHLTLEP
DADGCRENHF KYFLQTPSHV FTSVIEWTTI WVAREDIRET LAALATEIDV GILYQGLGKG
KKYRLVSVVC YRGLLYSCFI YSDECKRWMM YNDTHVEVIG SWDFLCKKCV EKHFQPRILF
FVECAPTEID QNMPPKSFFE EEESGKSQSG SKVNYEDSSG IFGAGLKNDI DENSCFLNTI
IQCLWNVQLV RNELCSITDS GHEHVGDPCI VCGLSEVFGE LSEASTRTRR EIVSTASLRL
AISKNSPCGD SFQEGQMNDV DEVLQNILVI LHQSFTSCPA PDASSESEKS KRVDCQWWTS
NKCLAHRLFG MDIYGYCDSC GLEWRHQTFS DFSHYIRSSQ LREKKANKNQ ASSFDELMKL
MLMDDCSTCN RDAGGCGKHN RIQFILRTPP LVFICVLVQT AHESREDTRK TLTALGTELD
IGVVYQGLGP GKKYCLVSVV CYHHQHNVCF SYSHEHKRWT MFNDANVEVV GCWDDLLSKC
SHEQFQPQIL FFEAVQ
//