ID A0A061E598_THECC Unreviewed; 467 AA.
AC A0A061E598;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 08-NOV-2023, entry version 33.
DE SubName: Full=Ubiquitin family protein isoform 3 {ECO:0000313|EMBL:EOX97453.1};
GN ORFNames=TCM_006457 {ECO:0000313|EMBL:EOX97453.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX97453.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX97453.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001880; EOX97453.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061E598; -.
DR EnsemblPlants; EOX97453; EOX97453; TCM_006457.
DR Gramene; EOX97453; EOX97453; TCM_006457.
DR HOGENOM; CLU_024293_4_0_1; -.
DR Proteomes; UP000026915; Chromosome 2.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0031593; F:polyubiquitin modification-dependent protein binding; IEA:UniProt.
DR CDD; cd14399; UBA_PLICs; 1.
DR CDD; cd16106; Ubl_Dsk2p_like; 1.
DR Gene3D; 1.10.260.100; -; 1.
DR Gene3D; 1.10.8.10; DNA helicase RuvA subunit, C-terminal domain; 1.
DR InterPro; IPR006636; STI1_HS-bd.
DR InterPro; IPR015940; UBA.
DR InterPro; IPR009060; UBA-like_sf.
DR InterPro; IPR015496; Ubiquilin.
DR InterPro; IPR000626; Ubiquitin-like_dom.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR PANTHER; PTHR10677:SF3; FI07626P-RELATED; 1.
DR PANTHER; PTHR10677; UBIQUILIN; 1.
DR Pfam; PF00627; UBA; 1.
DR Pfam; PF00240; ubiquitin; 1.
DR SMART; SM00727; STI1; 3.
DR SMART; SM00165; UBA; 1.
DR SMART; SM00213; UBQ; 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR SUPFAM; SSF54236; Ubiquitin-like; 1.
DR PROSITE; PS50030; UBA; 1.
DR PROSITE; PS50053; UBIQUITIN_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 23..92
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS50053"
FT DOMAIN 420..464
FT /note="UBA"
FT /evidence="ECO:0000259|PROSITE:PS50030"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 96..115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 276..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 371..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 99..115
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..390
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 467 AA; 49830 MW; FA8F98670911137C CRC64;
MGAEGDSSES RLGGNGGEEE EGVMVNIRCS NGTKFTVRTS LDSTVASFKA VLAQNCDIPA
DQQRLIYKGR ILKDDQTLQS YGLQADHSVH MVRGFAPSSS TPPAATTNVG TPNTTTGVTR
GVGSNEGAGL GAPLFPGLNP LGGGGGGGGG LGLFGAGLPE FEQVQQQLTQ NPNMMREIMN
TPAIQGLMNN PELMRSLIMS NPQMREIIDR NPELGHILND PSILRQTLEA ARNPELMREM
MRNTDRAMSN IESSPEGFNM LRRMYENVQE PFLNATTMAG NSGNAPGSNP FAALLGNQGG
SQARDSPNNT STAGSDTTQG QTAPNTNPLP MFDLNPQLRE MMQNPDILRQ MFSPETMQQM
LALQQSLLSH QLNRQQSTQD SAQPGATPGA PNTASLELLM NMFGGLGAGS LSVPNQPDVP
PEELYATQLS QLQEMGFYDT QENIRALRAT AGNVHAAVER LLGNSGQ
//