ID A0A061GIH1_THECC Unreviewed; 492 AA.
AC A0A061GIH1;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE SubName: Full=Smg-4/UPF3 family protein, putative isoform 5 {ECO:0000313|EMBL:EOY26874.1};
GN ORFNames=TCM_028841 {ECO:0000313|EMBL:EOY26874.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY26874.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY26874.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the RENT3 family.
CC {ECO:0000256|ARBA:ARBA00005991}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001884; EOY26874.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GIH1; -.
DR EnsemblPlants; EOY26874; EOY26874; TCM_028841.
DR Gramene; EOY26874; EOY26874; TCM_028841.
DR Proteomes; UP000026915; Chromosome 6.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0000184; P:nuclear-transcribed mRNA catabolic process, nonsense-mediated decay; IEA:UniProtKB-KW.
DR CDD; cd12455; RRM_like_Smg4_UPF3; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR039722; Upf3.
DR InterPro; IPR005120; UPF3_dom.
DR PANTHER; PTHR13112:SF5; SMG4_UPF3 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR13112; UPF3 REGULATOR OF NONSENSE TRANSCRIPTS-LIKE PROTEIN; 1.
DR Pfam; PF03467; Smg4_UPF3; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 7..168
FT /note="UPF3"
FT /evidence="ECO:0000259|Pfam:PF03467"
FT REGION 167..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 276..492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..227
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..309
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..349
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..383
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 393..417
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 452..468
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 492 AA; 54461 MW; 2DA54F99D22DEE44 CRC64;
MKEPLRRTKV VIRHLPPSVT QSFLFSQIDD RFSDRYNWFS FRLGKSSHKH QRYSRAYINF
KRPEDVFEFA EFFDGHVFVN EKGTQFKAIV EYAPSQRVPK PGTKKDGREG TIFKDPDYLE
FLKLIAKPVD NLPSAEIQLE RKEVELSGAP KETPVITPLM AFVRQKRAAE SGTQGPVTRR
KIGRKAGAAS TGKSGSSSKR GSEKKKYILK DSVKGTHHKD KSKFFVASKQ EDQPVPSVGK
EKRENGTVYG IDGPVTGITL TADSGKKKIL LLKPKDQEAP HVPQGASEQQ GSSSPVANSP
GSTAPKQSQR REAGGRLIRS ILLSNEASQN QPLAGVKPQQ KTQTMNLDNV KRPPRPANTR
LGSGSEKHEK RIRNKDRLDR GVWAPLRGSD VSQASEERFS PSMSQSAQAS SNSIEGEMKG
DIPNGRSGRN VPSENGSNRH FDRRSAAYNI KDDGSVISSE SKSSKRGATG SGAHERNKFG
FRSHLQVLRK ST
//