ID A0A1R3J929_9ROSI Unreviewed; 687 AA.
AC A0A1R3J929;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 08-NOV-2023, entry version 25.
DE RecName: Full=Cupin type-1 domain-containing protein {ECO:0000259|SMART:SM00835};
GN ORFNames=COLO4_18435 {ECO:0000313|EMBL:OMO91342.1};
OS Corchorus olitorius.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=93759 {ECO:0000313|EMBL:OMO91342.1, ECO:0000313|Proteomes:UP000187203};
RN [1] {ECO:0000313|Proteomes:UP000187203}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA Shommy N.S.;
RT "Corchorus olitorius genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the 11S seed storage protein (globulins) family.
CC {ECO:0000256|ARBA:ARBA00007178}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO91342.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWUE01016462; OMO91342.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3J929; -.
DR STRING; 93759.A0A1R3J929; -.
DR OrthoDB; 359714at2759; -.
DR Proteomes; UP000187203; Unassembled WGS sequence.
DR GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR GO; GO:0010431; P:seed maturation; IEA:UniProt.
DR CDD; cd02243; cupin_11S_legumin_C; 1.
DR CDD; cd02242; cupin_11S_legumin_N; 1.
DR Gene3D; 2.60.120.10; Jelly Rolls; 2.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR006044; 11S_seedstore_pln.
DR InterPro; IPR006045; Cupin_1.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 1.
DR PANTHER; PTHR31189:SF62; OS01G0976200 PROTEIN; 1.
DR PANTHER; PTHR31189; OS03G0336100 PROTEIN-RELATED; 1.
DR Pfam; PF00190; Cupin_1; 2.
DR Pfam; PF01535; PPR; 1.
DR PRINTS; PR00439; 11SGLOBULIN.
DR SMART; SM00835; Cupin_1; 2.
DR SUPFAM; SSF51182; RmlC-like cupins; 1.
DR PROSITE; PS51375; PPR; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000187203};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Seed storage protein {ECO:0000256|ARBA:ARBA00023129};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Storage protein {ECO:0000256|ARBA:ARBA00022761}.
FT DOMAIN 3..157
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT DOMAIN 199..339
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT REPEAT 384..418
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 687 AA; 73147 MW; 2772704CAC0D738F CRC64;
MDLDLSPKLA KKLYGESGGS YHAWCPDELP MLRQGNIGAA KLALEKNGFA LPRYSDSAKV
AYVLQGSGIA GIVLPESEEK VISIKKGDAL ALPFGVITWW FNKEDTELVV LFLGDTSKGH
KSGQFTDFFL TGPNGIFTGF TTEFVKRAWD VDDATVKSLV GNQTGKGIVK LDASVKMPEP
KAEHRTGMVL NCEEAPLDVD IKDAGNVVVL NTKNLPLVGQ VGLGADLVRL EGNAMCSPGF
SCDSALQVTY IVRGSGRLQV VGVDGKRVLE TIVKAGNLLI VPRFFVVSKI ADPDGLSWFS
IITTPNPIFT HLAGSIGAWK ALSPEVLQAS FNVDAETEKE YYQMGFKHDY ASCSSLIYKL
AKSRNFEAVE TLLGYLEDRN IRCQETLFNA LFQHYGKAHL IEKAVELFHK MPSFNCVRTV
QSLNSILNAL VDDDKFVDAK GIFDKSAEMG FRPNSFGKID GACFVLEEMQ KRKMSLDLEA
WGALVRDACG GDDDAALTIS RAAAARILVE EEKPIPGQQP VAGAAPVIAA SAKSNGAIVA
TVDPHHPLTF FMHDILGGSN PSARAVTGIV SNPVASGQVP FAKPNGANLP INSGVSVNSN
NNGIVNNNNV PFLTGLGGMN NAAGQNTGNN PINGGLGVAV LNGGQLPTGS TIQKFMFGTL
TAIDDELTEG HELGSGLIGK AQGDICG
//