ID A0A061F0B5_THECC Unreviewed; 198 AA.
AC A0A061F0B5;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=Germin-like protein {ECO:0000256|RuleBase:RU366015};
GN ORFNames=TCM_026069 {ECO:0000313|EMBL:EOY10770.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY10770.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY10770.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, apoplast
CC {ECO:0000256|ARBA:ARBA00004271, ECO:0000256|RuleBase:RU366015}.
CC -!- SIMILARITY: Belongs to the germin family.
CC {ECO:0000256|ARBA:ARBA00007456, ECO:0000256|RuleBase:RU366015}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY10770.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F0B5; -.
DR STRING; 3641.A0A061F0B5; -.
DR EnsemblPlants; EOY10770; EOY10770; TCM_026069.
DR Gramene; EOY10770; EOY10770; TCM_026069.
DR HOGENOM; CLU_015790_0_0_1; -.
DR InParanoid; A0A061F0B5; -.
DR OMA; INDGRTT; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0048046; C:apoplast; IEA:UniProtKB-SubCell.
DR GO; GO:0030145; F:manganese ion binding; IEA:UniProtKB-UniRule.
DR CDD; cd02241; cupin_OxOx; 1.
DR Gene3D; 2.60.120.10; Jelly Rolls; 2.
DR InterPro; IPR006045; Cupin_1.
DR InterPro; IPR001929; Germin.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR PANTHER; PTHR31238:SF137; GERMIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR31238; GERMIN-LIKE PROTEIN SUBFAMILY 3 MEMBER 3; 1.
DR Pfam; PF00190; Cupin_1; 1.
DR PRINTS; PR00325; GERMIN.
DR SUPFAM; SSF51182; RmlC-like cupins; 1.
PE 3: Inferred from homology;
KW Apoplast {ECO:0000256|ARBA:ARBA00022523, ECO:0000256|RuleBase:RU366015};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Manganese {ECO:0000256|ARBA:ARBA00023211, ECO:0000256|PIRSR:PIRSR601929-1};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRSR:PIRSR601929-1};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Secreted {ECO:0000256|ARBA:ARBA00022525, ECO:0000256|RuleBase:RU366015};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|RuleBase:RU366015}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|RuleBase:RU366015"
FT CHAIN 23..198
FT /note="Germin-like protein"
FT /evidence="ECO:0000256|RuleBase:RU366015"
FT /id="PRO_5019617805"
FT DOMAIN 130..198
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|Pfam:PF00190"
FT BINDING 101
FT /ligand="oxalate"
FT /ligand_id="ChEBI:CHEBI:30623"
FT /evidence="ECO:0000256|PIRSR:PIRSR601929-1"
FT BINDING 151
FT /ligand="Mn(2+)"
FT /ligand_id="ChEBI:CHEBI:29035"
FT /evidence="ECO:0000256|PIRSR:PIRSR601929-2"
SQ SEQUENCE 198 AA; 21348 MW; 0C915EA88C1F60F5 CRC64;
MKGAHLLLAY SLLALSSSFA YASGPSPLQD FCVANGDVKD VLFGSDQMPK QGVFEYVMKH
AKRPNTWVAE DLINPKVFVN GKFCEDPKLA KAEDFTVSGL NVPRNTSNPV GSTVTPVNVA
QIPGLDTLDN RLITQILYLG DLFVFPVGLI HFQFNVGKTN VVAFAALRSQ NPGVITIANA
VFAANPPINP DVLVKAFQ
//