Database: UniProt
Entry: A0A061G7E9_THECC
ID   A0A061G7E9_THECC        Unreviewed;       332 AA.
AC   A0A061G7E9;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   SubName: Full=Uncharacterized protein isoform 2 {ECO:0000313|EMBL:EOY25333.1};
GN   ORFNames=TCM_016681 {ECO:0000313|EMBL:EOY25333.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY25333.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY25333.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001881; EOY25333.1; -; Genomic_DNA.
DR   RefSeq; XP_007040832.1; XM_007040770.2.
DR   AlphaFoldDB; A0A061G7E9; -.
DR   EnsemblPlants; EOY25333; EOY25333; TCM_016681.
DR   EnsemblPlants; Tc03v2_t026620.4; Tc03v2_p026620.4; Tc03v2_g026620.
DR   GeneID; 18606896; -.
DR   Gramene; EOY25333; EOY25333; TCM_016681.
DR   Gramene; Tc03v2_t026620.4; Tc03v2_p026620.4; Tc03v2_g026620.
DR   KEGG; tcc:18606896; -.
DR   HOGENOM; CLU_036960_0_0_1; -.
DR   OrthoDB; 1211328at2759; -.
DR   Proteomes; UP000026915; Chromosome 3.
DR   InterPro; IPR032698; SirB1_N.
DR   PANTHER; PTHR31350; SI:DKEY-261L7.2; 1.
DR   PANTHER; PTHR31350:SF30; TRANSGLUTAMINASE FAMILY PROTEIN; 1.
DR   Pfam; PF13369; Transglut_core2; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          43..195
FT                   /note="Protein SirB1 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF13369"
FT   REGION          1..29
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..28
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   332 AA;  37293 MW;  02ED16C1A44CD773 CRC64;
     MAFNQEMDTR SLLNERRNVS SPSDTKEWDS VEQMPLGGKT ISEWLSELDA IAKEVEAELV
     SRDIGCHLVE VLEAVNLVLF ELRGFKRSPV LVDSKHSYLH SILSSGCGSA ILLSIIYIEV
     CRRLGLTIVG SRVGGDFLIW PQTGYPEELF KVTSGHSLFA IVNGRCVEDP RSMASDLTGT
     SLLGLEIATN RDIIGIALAN LIRLHWKRAS RSNHGLMLTS PLRHVHNADE KPNKIDKSNV
     PLLRPQDLRL AIMASERLLI LQPHNWALRR DHGMMLYYNR EYGKAVQELS ICMAFAPEEE
     AEILEPFVEK LHLMRLELSW KSLGHAGRLA VP
//
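
The entry above uses the UniProtKB flat-file layout: each line opens with a two-letter line code (ID, AC, DE, FT, SQ, ...), the payload starts at column 6, the residues follow the SQ line in blocks of ten, and `//` terminates the record. A minimal sketch of reading that layout is given below. The names `parse_flat_entry`, `length_matches` and the `SAMPLE` record are hypothetical and introduced only for illustration; this is not an official UniProt parser, and it handles only the AC, SQ and sequence lines.

```python
import re


def parse_flat_entry(text):
    """Parse one UniProtKB-style flat-file entry into a small dict.

    Illustrative sketch: only AC, SQ and the sequence lines are read.
    """
    entry = {"accessions": [], "declared_length": None, "sequence": ""}
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):          # record terminator
            break
        code, payload = line[:2], line[5:].strip()
        if in_sequence:
            # Sequence lines are indented blocks of residues; drop all spaces.
            entry["sequence"] += "".join(line.split())
        elif code == "AC":
            # Accessions are separated and terminated by semicolons.
            entry["accessions"] += [a.strip() for a in payload.split(";") if a.strip()]
        elif code == "SQ":
            # e.g. "SEQUENCE   332 AA;  37293 MW;  02ED16C1A44CD773 CRC64;"
            match = re.search(r"(\d+)\s+AA;", payload)
            if match:
                entry["declared_length"] = int(match.group(1))
            in_sequence = True
    return entry


def length_matches(entry):
    """Return True when the residue count equals the length declared on SQ."""
    return entry["declared_length"] == len(entry["sequence"])


# Hypothetical, shortened record used only to exercise the functions.
SAMPLE = (
    "ID   EXAMPLE_ENTRY           Unreviewed;        12 AA.\n"
    "AC   X0X0X0;\n"
    "SQ   SEQUENCE   12 AA;  1000 MW;  0000000000000000 CRC64;\n"
    "     MAFNQEMDTR SL\n"
    "//\n"
)

if __name__ == "__main__":
    parsed = parse_flat_entry(SAMPLE)
    print(parsed["accessions"], parsed["declared_length"], length_matches(parsed))
```

Applied to the full entry text, the same functions would compare the number of residues read against the length declared on the SQ line. For production use, an established parser such as Biopython's `Bio.SwissProt` module is likely a better choice than this sketch.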