Improving nutritional quality through rice polyploidization

We told our PI Professor Yuchi He about our idea of improving the endogenous nutritional quality of rice, and she affirmed our idea. Based on her research on polyploid rice breeding over the years, she told us that, like other polyploid crops that have been widely used, such as triploid watermelon, polyploid rice has the characteristics of generally improved nutrients, which can meet our expectations, and its shape in biomass, stress resistance and other aspects is better than that of conventional ploidy, that is, diploid rice. She suggested that we first double the genome of diploid rice to tetraploid, and then targeted to improve the nutrients we urgently want to improve, namely lysine. Because as a project widely beneficial to people, it is hoped that its product is not only a tool for the production of target products, but also can help the development of food nutrition balance and seed industry.
In terms of how to realize the technology of targeted improvement of lysine content, our initial consideration was to choose the way of adjusting the lysine metabolism pathway of rice, which is also a common method to improve the lysine content of rice through genetic engineering and metabolic engineering, but its strategy and technical means are contrary to our project (add a hyperlink: see iHP and plant synthetic biology for details, you can click to jump to these two pages respectively). We hope to edit the endogenous genes of rice in the project, rather than the exogenous genes introduced from other species. We sought the help of our PI, who suggested that we might start with gluten, the protein with the highest content of rice endosperm protein, and recommend Dr. Lu Gan, who is engaged in the research of rice storage protein, to us as our second pi to provide us with specific guidance in experiments.

Choose to edit gluten to increase lysine content in Rice

Glutenin is the most abundant protein in rice endosperm, accounting for about 80% of the storage protein in rice endosperm. It is encoded by four gene families, GluA, GluB, GluC and GluD. First, it synthesizes protein precursors in the rough endoplasmic reticulum, transports them into the lumen of the endoplasmic reticulum, and then carries them through the processing of the Golgi apparatus to the protein storage vacuole (PSV). The processed mature storage proteins form protein bodies (PB-Ⅱ) in the vacuole and accumulate. On the other hand, in theory, polyploidization will increase the content of gluten, which will also be verified in our experimental results.

How can the increase of gluten be associated with the increase of lysine? Our PIs suggests that there is often a dose compensation effect at the expression level among the coding genes of proteins encoded by multiple families in rice, that is, if one of the genes encoding less lysine content of glutamine is selected for site directed mutagenesis, then other synthetic genes in this subfamily can be expressed compensably, and correspondingly, the overall increase in lysine content is finally achieved. By constructing the evolutionary tree, we found that the GluB subfamily is the largest, with a total of eight synthetic genes (GluB1 includes two coding genes, GluB1-a and GluB1-b), and whether GluB1 is in the GluB subfamily or in four subfamilies, it the number of lysine residues is the least. It is advantageous to select it as the target gene of site directed mutagenesis to improve the overall lysine content of gluten by using the dose compensation effect.

Knockout of globulin to reduce rice allergenicity

In addition, during the investigation, we found that for some susceptible people, allergenic proteins in high protein foods have a greater risk of sensitization, which is one of the sources of greater uncertainty in food safety. (add a hyperlink: see iHP for details) through consulting the literature, we found that α- globulin is the main allergen in rice, accounting for about 2-8%. It is a 26kDa globulin, and its precursor is 19kDa globulin, which is encoded by a single gene Glb located on chromosome 5. Therefore, we also selected OsGlb as the target gene for site directed mutagenesis.

Gene editing technology and vector construction

In terms of gene editing technology, we chose CRISPR/Cas9 technology with relatively mature research and high specificity to construct PYLCRISPR/Cas9Pubi-H multi-target binary vector.
1. first, pCAMBIA-1300 was selected as the vector backbone, and the Cas9 protein expression cassette was integrated into the binary vector. The multi cloning site Bsa I used to load multiple sgRNA expression cassettes was located close to the binary vector RB. The sgRNA expression cassette elements were set on the intermediate plasmid vector, assembled by enzymatic ligation and PCR methods, and then assembled on the binary vector by Golden Gate or Gibson Assembly cloning methods.
2. using self-made isothermal in vitro recombination reaction mixture, multiple fragments of plasmid vector can be cloned and deleted simultaneously.
4. on the basis of constructing the vector of the first group of target (OsGluB1) sgRNA expression cassettes, add the second group of other target (OsGlb) sgRNA expression cassettes, change the Mlu I site in the primer Pgs-GGR by 1 base, so that the Mlu I upstream of LacZ becomes the only site. Mlu I was used to cut the vector into linear shape, and Gibson Assembly was used to clone the second group of target sgRNA expression cassettes.
6. after connecting the linker, amplify the sgRNA expression cassette fragment and directly use Golden Gate position specific primers for a round of PCR amplification.
7. the target OsGluB1 sequence is GGTAAGTCGCAGGGCAACC, and the target OsGlb sequence is TCGTCGTCGGAGTACTACGG. The molar ratio of vector to each insert is 1:4-6. Enzyme digestion and ligation were carried out with variable temperature cycles for about 10-15 cycles, and the unused ligation products were frozen for further amplification if necessary.

back to top