ID A0A3B6HRP0_WHEAT Unreviewed; 1578 AA.
AC A0A3B6HRP0;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=USP domain-containing protein {ECO:0000259|PROSITE:PS50235};
GN ORFNames=CFC21_051000 {ECO:0000313|EMBL:KAF7041176.1};
OS Triticum aestivum (Wheat).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Triticum.
OX NCBI_TaxID=4565 {ECO:0000313|EnsemblPlants:TraesCS4A02G022000.2};
RN [1] {ECO:0000313|EMBL:KAF7041176.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7041176.1};
RX PubMed=29069494;
RA Zimin A.V., Puiu D., Hall R., Kingan S., Clavijo B.J., Salzberg S.L.;
RT "The first near-complete assembly of the hexaploid bread wheat genome,
RT Triticum aestivum.";
RL Gigascience 6:1-7(2017).
RN [2] {ECO:0000313|EnsemblPlants:TraesCS4A02G022000.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Chinese Spring
RC {ECO:0000313|EnsemblPlants:TraesCS4A02G022000.2};
RX PubMed=30115783; DOI=10.1126/science.aar7191;
RG International wheat genome sequencing consortium (IWGSC);
RT "Shifting the limits in wheat research and breeding using a fully annotated
RT reference genome.";
RL Science 361:EAAR7191-EAAR7191(2018).
RN [3] {ECO:0000313|EnsemblPlants:TraesCS4A02G022000.2}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (OCT-2018) to UniProtKB.
RN [4] {ECO:0000313|EMBL:KAF7041176.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7041176.1};
RA Zimin A.V., Puiu D., Shumante A., Alonge M., Salzberg S.L.;
RT "The second near-complete assembly of the hexaploid bread wheat (Triticum
RT aestivum) genome.";
RL Submitted (MAR-2020) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM022220; KAF7041176.1; -; Genomic_DNA.
DR STRING; 4565.A0A3B6HRP0; -.
DR PaxDb; 4565-Traes_4AS_4D53BE158-1; -.
DR EnsemblPlants; TraesCS4A02G022000.2; TraesCS4A02G022000.2; TraesCS4A02G022000.
DR Gramene; TraesCS4A02G022000.2; TraesCS4A02G022000.2; TraesCS4A02G022000.
DR Proteomes; UP000019116; Chromosome 4A.
DR Proteomes; UP000815260; Chromosome 4A.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:InterPro.
DR CDD; cd02257; Peptidase_C19; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR006866; DUF627_N.
DR InterPro; IPR006865; DUF629.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001394; Peptidase_C19_UCH.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR028889; USP_dom.
DR PANTHER; PTHR22975:SF9; ECHINUS SPLICE FORM 3; 1.
DR PANTHER; PTHR22975; UBIQUITIN SPECIFIC PROTEINASE; 1.
DR Pfam; PF04781; DUF627; 1.
DR Pfam; PF04780; DUF629; 1.
DR Pfam; PF00443; UCH; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50235; USP_3; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000019116}.
FT DOMAIN 1251..1577
FT /note="USP"
FT /evidence="ECO:0000259|PROSITE:PS50235"
FT REGION 1..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 264..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 443..481
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 890..914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 997..1040
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1059..1107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1120..1207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 962..996
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 7..21
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 443..475
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1004..1040
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1143
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1150..1189
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1578 AA; 175500 MW; 98845C10692A58A2 CRC64;
MGRKKRSPPP NPTPPPHPPA GALLRLADAP AGAAQAGPDP AAVRAECDKA LACLQRGNQP
KALRLMKEAL ARHGDGSPLL LRAHGTVHSR AAAVLTDPAA RARHHQAALL AARRAVDLAP
DSLELAHFRA LLLYEAASDN RAYEEVIAEC ERGLRIDDPS DPEPHSLRLP APDPDQLRAE
LRNLVQKANL ASISTWVKNL GGSDDKLGFF RLADDPLELQ LLPAAPAPRR PNEIKKATKT
VEERRKEIEV QVAALRLLEQ QQQQNNAAAA LSSSPPQSQQ GDEPPSSSSQ STAAGHRADR
RKGGSKKAAA SSASDQRNQV RAFWAALPVD QRLALLKTSI SELKAYYKTH KDKDVAAAAP
AVLDEVVEFA TSHGCWEFWL CGICEERYPD VAHSLREHVS ALPRQVQAML PQEIDADWAA
MLIGSSWRPV DVSAALKALE DEQADNIGQD RDKDSMSSDN WSIKDKSDTS ESSASPHNEE
CDGFGVVVRE GARKWPLSDD DERAKILERI HSLFQILVKH KNISMNHLSR VIHFAMDELR
GMPSGSLLLN HSIDKSPLCI CFLDASSLKK VLKFLQDLMQ SCGLSRSSEK DGELGDGDCF
PKNNTILEGV TLDSVSSSLI LDGRVFCGKS KSGPENVDTD EFLSWLYAGS PPIGEQLSEW
NCMLVDRTSQ GMQILDMIDK EASALKNFCE MKHEQLNTEE GVLAVNNIIQ EEQRLRDRGG
RYSYQGYEDL LRNRQEELLE TRFRSSEYDA ISNILKEVRT SHFGYDEGFS GMTSRQCDFD
GAAIDDWRLH DFMHPSDSIV PTIVLRMKEH VATELGKIDA RIMRSVALMQ QLDLKLEPAA
FVDYRSILLP LLKSFLRNHL EELADKDARE RSDAARDAFL AELALDAEKN ANKGGDKKPS
HEKSKDKKRM KDSRRYKDLK DLSWSDQYIV RQDSADEETS EQAQTLVDCD DFDGKLSLSD
EYSNEQEEEH RHRVQLEAEE RKLEETLEYQ RRIEEEAKQK HLAEQSRSTS SAPDNWTNGY
STDVNSNVHQ DNHQSAPNNF SPAYLEGIKF GDFRFPKVPS REKNSSSDFC GVDLPQKTEN
NRREKPNGLR SPGAHALSSS NMDFTKPALK MNGVGKYAQN TKLSTNPLIQ RPKSSTSQPH
KKYIQGAVHN GDDSASSRQN GTTAPRWSSS GKVADFSSNS YQDGKQNELP PVLSSDDPWN
ANKAEEADKG AISPAIVCIK DDSDKRFEED LRKAVHQSLA GASNGKEVYG AGLKNAAGEY
NCFLNVIIQS LWHLKRFRHE FLKTSSLHKH IEDPCAVCAL YNIFVDLSKA SEGQGEAVAP
TSLRIALSKS YPNNRFFQEG QMNDASEVLG VIFECLHKSY TSQADCHAKS HESNSIGSWD
CASDFCIAHC LFGMDVYERM NCHSCGLESR RLKYTSFFHN INASSLRTAK MMCPDPFDDL
LKTVIMNDQL ACDPDVGGCG KPNHIHHILS SSPHVFTVVL GWQNSKESVG DIAATLAGIS
TEIDISVFYR GLDQGSKHFL VSVVCYYGQH YHCFAFEDEH WVMYDDQTVK VIGSWADVVI
MCEKGHLQPQ VLFFEAAN
//