ID A0A061FG70_THECC Unreviewed; 1032 AA.
AC A0A061FG70;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE SubName: Full=Pre-mRNA-processing protein 40A isoform 1 {ECO:0000313|EMBL:EOY15662.1};
GN ORFNames=TCM_034659 {ECO:0000313|EMBL:EOY15662.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY15662.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY15662.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY15661.1; -; Genomic_DNA.
DR EMBL; CM001886; EOY15662.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061FG70; -.
DR EnsemblPlants; EOY15661; EOY15661; TCM_034659.
DR EnsemblPlants; EOY15662; EOY15662; TCM_034659.
DR Gramene; EOY15661; EOY15661; TCM_034659.
DR Gramene; EOY15662; EOY15662; TCM_034659.
DR HOGENOM; CLU_005825_2_0_1; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR PANTHER; PTHR11864:SF0; PRP40 PRE-MRNA PROCESSING FACTOR 40 HOMOLOG A (YEAST); 1.
DR Pfam; PF01846; FF; 5.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 5.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 5.
DR PROSITE; PS01159; WW_DOMAIN_1; 1.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 220..253
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 261..294
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 472..526
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 539..594
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 607..661
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 679..742
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 814..869
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 71..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 201..230
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 874..1032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 578..615
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 655..690
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 101..126
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..228
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 874..946
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 953..977
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 986..1002
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1032 AA; 116553 MW; 9B808CE34F4EF28A CRC64;
MANNSQPSSA QPHWPPAVGS LGPQSYGSPL SSQFRPVVPM QQGQHFVPAA SQQFRPVGQV
PSSNVGMPAV QNQQMQFSQP MQQFPPRPNQ PGLSAPSAQP MHVPFGQTNR PLTSGSPQSH
QTAPPLNSHM PGLGAPGMPP SSSYSYVPSS FGQPQNNVSA SSQFQPTSQV HASVAPVAGQ
PWLSSGNQSV SLAIPIQQTG QQPPLISSAD TAANAPIHTP PSASDWQEHT SADGRRYYYN
KKTRQSSWEK PLELMTPIER ADASTVWKEF TTPEGRKYYY NKVTKQSKWT IPEELKLARE
QAQVVASQGA PSDTGVASQA PVAGAVSSAE MPAAAIPVSS NTSQASSPVS VTPVAAVANP
SPTLVSGSTV VPVSQSAATN ASEVQSPAVA VTPLPAVSSG GSTTPVTSVN ANTTMIRSLE
STASQDSVHF TNGASAQDIE EAKKGMATAG KVNVTPVEEK VPDDEPLVYA NKQEAKNAFK
SLLESANVQS DWTWEQTMRE IINDKRYGAL KTLGERKQAF NEYLGQRKKL EAEERRMRQK
KAREEFTKML EESKELTSSM RWSKAQSLFE NDERFKAVER ARDREDLFEN YIVELERKER
ENAAEEKRRN IAEYRKFLES CDFIKANSQW RKVQDRLEDD ERCSRLEKID RLVMFQDYIH
DLEKEEEEKK KMQKEQLRRA ERKNRDAFRK LMDEHVVDGT LTAKTYWRDY CLKVKDLPPY
LAVASNTSGS TPKDLFEDVV EELEKQYQQD KTHIKDAMKS GKISMVSTWT VEDFKAAISE
DVGSLPISDI NLKLVYEELL KSAKEKEEKE AKKRQRLADD FTKLLHTYKE ITASSDWEDS
RPLFEESQEY RSIAEESLRR EIFEEYIAYL QEKAKEKERK REEEKAKKEK EREEKEKRKE
KERKEKERER EREKGKERTK KDETDSENLD ISDSHGHKED KKKEKEKDRK HRKRHQSGGD
DGSSDKDDRE ESKKSRRHGS DRKKSRKHAH SPESDNESRH KKHKRDHRDG SRRNSGYEEL
EDGEVGEDGE IQ
//