Model | GeorgiaState-SWJTU - iGEM 2024

Model

Explain your model's assumptions, data, parameters, and results in a way that anyone could understand.

A.DESCRIPTION OF OUR PROJCT AND MODELS



Plastic pollution has become a global environmental problem. It is very important to find an environmentally friendly plastic to help solve the problem of plastic pollution. An ideal target plastic is poly-3-hydroxybutyrate (PHB). PHB is a biodegradable plastic whose biosynthesis pathway and decomposition pathway will not harm the environment. The purpose of our project design is to use the symbiotic relationship between bacteria and algae to promote algae to produce more PHB as a metabolite through genetic engineering and co-cultivation, so as to obtain PHB. In this project, we selected Escherichia coli and Clostridium reinhardtii as two co-cultivated organisms. For Escherichia coli, we used genetic engineering to edit the genetic material of Escherichia coli so that Escherichia coli metabolizes compounds that are beneficial to the growth of algae, and at the same time installed a biosafety plasmid switch. When Escherichia coli is out of the laboratory culture environment, Escherichia coli will die and cannot reproduce in the external environment, thereby ensuring the environmental safety of this project to the greatest extent. For algae, it uses carbon dioxide and ethanol produced by Escherichia coli as carbon sources to synthesize the target product. When the two are in a co-culture environment, the principle of the symbiotic relationship between algae and Escherichia coli is shown in the figure below. Algae use carbon dioxide from the outside world and produced by E. coli as a carbon source and fix carbon through photosynthesis. This metabolic process will produce biomolecules such as carbohydrates, proteins, and lipids. These biomolecules will become biofuels in further metabolic reactions, including bio-oils, BIO-diesel, bio-ethanol, bio-hydrogen, and biogas. Our target product PHB is one of these substances.



However, how to regulate our culture material ratio in specific details and how to adjust the co-culture environment to maximize the PHB output rate are still the focus of our attention. We have carried out mathematical and computer modeling for this problem and simulated the co-culture environment to seek an optimal experimental environment condition.< /h4>



B.MODEL



Description of the models we make



Since our goal is to use the co-culture method of E. coli and algae to achieve the production of PHB under the optimal state of the system, we have developed two main mathematical models. These models simulate two small aspects of the biochemical regulation process of the synthesis of our target product: the kinetic model of PHA synthesis by cells under specific conditions and the model of the manipulation of PHA synthesis by cells under alternating carbon sources.

We retrieved their parameter values ​​from literature data and established the models, and used our growth data to revise and adjust the established models.

These models are intended to help future iGEM teams studying the co-culture of algae and bacteria by simulating the kinetics of PHA synthesis and the output results of the manipulation of cell states by alternating carbon sources. These models also benefited our team a lot when doing specific experimental design, because they helped us understand the synthetic kinetics of experimental bacterial metabolism and provided a model of manipulation metabolism, which provided a reliable guiding template for actual manipulation metabolism experiments. These models have made a significant contribution to our project.





Model 1: Equilibrium Model of PHB Molecular Weight Distribution and Synthetic Structural Dynamics



1.description



PHB belongs to the general class of polyhydroxyalkanoates (PHAs), which have some common biochemical and structural properties. Polyhydroxyalkanoates are a class of degradable polyesters synthesized by bacteria under imbalanced growth conditions. They serve as carbon and energy storage materials in bacteria. When the external carbon source is exhausted, PHA can be used by cells to support growth. Different carbon source inputs and culture conditions can affect the monomer composition of PHA, resulting in polymers of different chain lengths. Modeling the molecular weight distribution and dynamics of PHA polymer chains in bacterial cell populations can help us find an optimal experimental condition to obtain a target product with a specific structure.



2.parameter related and meanings of each mathematical symbol



Kinetic parameters of polymer chains:
Chain extension rate: refers to the rate at which PHA synthase adds new monomers to the polymer chain.
Chain termination rate(Rel(x)): refers to the rate at which the polymer chain stops growing and becomes an inactive chain.
Initiation rate(Rterm(x)): the rate at which the polymerization reaction starts.
x: the molecular mass of the polymers
A(x,t): the number density of the active chain when the time is t and the molecular mass is x.



3.Assumptions:



To simplify the model and highlight the most important parameters’ influence, we made some assumptions:
1. Homogeneous cell population:
It is assumed that the entire cell population is homogeneous, i.e. all cells behave identically during polymer synthesis. This means that the metabolic state of each cell, polymer synthesis rate, chain elongation and termination rates do not differ within the population.
2. Constant volume:
It is assumed that the cell volume remains constant throughout the aggregation process. This means that the synthesis of the polymer does not significantly increase the volume of the cells, or that the volume change is negligible. This assumption helps to simplify the volume-related parameters in the model so that the molecular weight distribution only depends on time and the growth process of the polymer chain and does not involve changes in cell volume.
3. Constant monomer concentration:
It is assumed that the concentration of monomers (such as 3-hydroxybutyric acid and 3-hydroxyvaleric acid) within the cells remains constant during the polymerization process. This means that the concentration change of the monomer in the cell is negligible. This assumption simplifies the complexity of the model so that the elongation rate depends only on the molecular weight of the polymer chain without taking into account changes in monomer concentration.
4. Expression form of chain elongation and termination rates:
It is assumed that the chain elongation rate is independent of molecular weight, that is, the elongation rate is constant. This means that the chain elongation process proceeds at the same rate regardless of the molecular weight of the polymer chain. The chain termination rate is assumed to be related to the molecular weight x and assumes that it exhibits an increasing and then decreasing behavior. This hypothesis is based on experimental observations and is consistent with the termination characteristics of polymer chains at different stages during PHA synthesis.
5. The relationship between the initial rate and the monomer production rate under steady-state conditions:
Under steady-state conditions, the onset rate of polymerization is assumed to be proportional to the rate of monomer production. This assumption helps determine the relative proportions of active and inactive chains in the system at steady state.
6. Assumptions for steady-state and transient analysis:
In steady-state analysis, it is assumed that the system has reached an equilibrium state, that is, the molecular weight distribution of the active chain no longer changes with time. In transient analysis, it is assumed that the system is in dynamic change within a certain period of time, and how the molecular weight distribution changes with time is calculated by solving time-dependent partial differential equations. 7. Ignorance of carbon source switching:
It is assumed that cells are supplied with a single carbon source during synthesis.



4.Mathematical Equations and their physics/chemistry meanings:



a.population balance equation(PBE):
We use this partial differential to track the evolution of polymer chain lngth(molecular weight)as the polymerization process progresses.
The general form of thie equation is:

Where:
A(x,t) is the concentration of active polymer chains with molecular weight xxx at time ttt,
Rel(x) is the chain elongation rate,
Rterm(x) is the chain termination rate.
This equation describes the synthesis and termination behavior of PHA chains as a function of time and molecular weight during synthesis.
A similar equation is set for the inactive polymer:

Similarly, the equation describes the inactive PHA polymer’s number varies as the time changes.
b. Steady-State and Transient Analysis:
①. Steady-State Analysis:
In the Steady-state , the system had reached the equilibrium state, and the elongation rate, termination rate and the initial rate have reached the steady value. In this state, the time derivation can be regarded as 0, so the equation is simplified as:

Using the numerical integration method to solve the steady state equation,we obtain the molecular distribution in the steady state. To find the concrete value of those coefficient in the equation, use the experimental data to calculate and regress.

②. Transient Analysis
Transient Analysis describes the systems’ behavior when it’s not in the steady-state. It is mainly used in analyzing the dynamic change of the polymer chains at different time.
The equation we used is Finite Difference Method to conduct the solution. In this way we can find the distribution of the polymer chains at a precise time point.



5.How this model guided our project:


This model uses PBE as the guiding equation and substitutes it into the actual experimental data for solution and fitting. We obtain the distribution of PHB polymer chains in cells under different experimental conditions, and have an accurate grasp of the synthesis and distribution of PHB polymer chains, which provides strong guidance and support for setting the feed ratio for our actual co-culture.



6.Model reference:


A population balance model describing the dynamics of molecular weight distributions and the structure of PHA copolymer chains (2002)
Nikos V. Mantzarisa;∗, Aaron S. Kelleyb, Prodromos Daoutidisc, Friedrich Sriencc, Chemical Engineering Science 57 (2002) 4643 – 4663



Model 2: Optimal carbon source switching strategy for PHA copolymer production



1.description of this model(reasons of choosing this model, advantages and shortcomings)


Model 2 starts from the synthesis of polyhydroxyalkanoates (PHA) in cells and studies how to manipulate cells to produce diblock copolymers with specific structures (such as PHB-PHBV and PHBV-PHB) by switching different carbon sources (such as fructose and valeric acid). These PHA copolymers have important industrial application value as degradable materials. In order to optimize the production of copolymers so that we can obtain PHB molecules with a single structure, we used a mixed integer nonlinear programming model (MINLP). The purpose of this model is to find the optimal number of carbon source switches and the time of each switch that can maximize the concentration of diblock PHA copolymers. The model takes into account the number of carbon source switches (integer variables) and the time of each switch (continuous variables), and uses numerical optimization to determine the best strategy to increase the yield of copolymers under experimental and industrial conditions.



2.parameter related and meanings of each mathematical symbol





3.Asumptions(Model boundaries determined by biochemistry)


The model is based on a series of mathematical and biochemical assumptions. These assumptions simplify the complexity of actual biological reactions.
a. Mathematical assumptions:
(1) The number and time of carbon source switching are limited: The model assumes that the number of carbon source switching (N) is limited and is limited to a certain range (e.g., between 2 and 65 times) in practical applications. In addition, the time of each switching also has a lower limit to ensure that enough polymer can be produced in each switching stage.
(2) The optimization problem is a mixed integer nonlinear programming (MINLP) problem: The model assumes that the optimization problem can be solved by mixed integer nonlinear programming.
(3) Objective function maximization: Assuming that the objective function is achieved by maximizing the concentration of diblock copolymers, the model finds the optimal carbon source switching strategy by solving the objective function.
(4) Assumption of linear growth stage: The model assumes that under certain conditions, the synthesis rate of polymer increases linearly with time until it reaches a maximum value.
b. Biochemical assumptions:
(1) Single cell population assumption: The model assumes that the cell population is uniform throughout the PHA production process, that is, each cell behaves the same way. Therefore, the model averages the polymer synthesis of all cells at the population level, simplifying the differences between individual cells.
(2) Constant monomer concentration: The model assumes that the monomer concentration (such as 3-hydroxybutyrate and 3-hydroxyvalerate) in the cell remains constant in each stage.
(3) Different carbon sources produce different monomers: It is assumed that when fructose is used as the carbon source, the cell mainly synthesizes 3-hydroxybutyrate (HB) monomer, while when valerate is used as the carbon source, the cell synthesizes 3-hydroxyvalerate (HV) monomer. This assumption makes the switch of carbon source directly affect the type of PHA synthesized, that is, two different types of polymers, PHB and PHBV, can be produced by switching carbon sources.
(4) Relative independence of metabolism and growth: It is assumed that the synthesis of PHA is separated from the growth of cells. That is, cells preferentially synthesize PHA under growth-restricted conditions rather than for growth or other metabolic processes. This means that changes in carbon source mainly affect the synthesis of PHA without significantly affecting cell proliferation and metabolism.



4.Mathematical Equations and their physics/chemistry meanings)


  1. Objective Function:
    The objective function aims to maximize the concentration of the diblock copolymer, expressed as:

    Where

    Is the deblock copolymer concentration equation. It represents the concentration of the diblock copolymer produced between two switching phases. It depends on the timing of the carbon source switches and helps evaluate how different switching strategies impact copolymer production.
    This function represents the concentration of the diblock copolymer C2, which depends on the switching times between the previous phase τi−1​ and the current phase τi​. By optimizing this function, the best carbon source switching times and number of switches can be identified to maximize the production of the target copolymer.

  2. Constraint on Polymer Production:
    A constraint limits the total polymer production in the system:

    This constraint ensures that the total amount of polymer produced PPP does not exceed 70% of the cell dry weight CDWCDWCDW. It prevents excessive PHA synthesis, which could overburden the metabolic capacity of the cells, ensuring that the cells remain viable throughout the process.

  3. witching Time Constraint:

    The minimum time for each carbon source switch is constrained by:

    This ensures that each switching period τi lasts longer than a predefined minimum time τmin ​, so enough polymer is produced during each phase. This prevents overly frequent switching, which could lead to insufficient polymer yield.



    5.How this model guided our project



    The model aims to solve the objective function equation and substitutes actual experimental data for solution and fitting. Based on the two major boundary conditions, it accurately predicts the synthesis of PHB polymer chains under alternating carbon sources, providing strong guidance and support for the setting of feed ratios for our actual co-cultivation.



    7.Model reference (passages)



    Optimal Carbon Source Switching Strategy for the Production of PHA Copolymers
    Nikolaos V. Mantzaris, Aaron S. Kelley, and Friedrich Srienc





COMPUTER PROCESSING




1.description of this model
2.assumptions(Model boundaries determined by biochemistry) parameter related and meanings of each symbol
The model is modified by our team like:

We assume that a = 2; b = 1; c = 3; d = 1; Ka = 100; Kb = 80; time range is [0,8], A(0)=1, B(0)=1
a:the efficiency of transporting the substrate of PHB which will convert to A from B
b:the component which is from A that can produce the substrate for B itself
c:the efficiency of transporting the nutrient from A to B
d:the component which is from B that can produce the nutrient for A itself
Ka:a parameter which is relative to the rate of generating of algae
Kb:a parameter which is relative to the rate of generating of e.coli
A(t):The amount of the algae respect to time
B(t):The amount of e.coli respect to time
3.Computer model images
4.How the parameters is adjusted
The parameters are adjusted by the refer from the paper which guide relative ways to adjust those parameters,also, we adjust them by the expected result of our experiments. Matlab and Python are the tools that we use to adjust them
5.How this model guided our project
This model creates a model of judging the generating rate of PHA by representing the mass of two important product during the polymerization process which will ingeniously reveal the effect of changing the carbon source that will affect the synthesis of the PHA,for our experiment, we try to change the ethanol pathway of the e.coli in order to offer a secondary carbon source for the algae to produce PHB,which is our target. By the refer passage,we modify those parameters in order to match our thoughts in generating our PHB pathways during the symbiosis between them.Once we get a relative parameter,we can guide our experiments and predict a quite wonderful result.
6.Model reference
Optimal Carbon Source Switching Strategy for the Production of PHACopolymers
Nikolaos V. Mantzaris, Aaron S. Kelley, and Friedrich Srienc





BIOINFO ANALYSIS



Bioinformatic analysis of the biochemical properties of alcohol dehydrogenase



In order to verify the validity of the experimental design and ensure that the adhB enzyme can function reasonably in E. coli and co-culture system, its basic biochemical properties must be analyzed.

Physicochemical Property Analysis



Online analysis of the physicochemical properties of Zymomonas mobilis adhB protein using ExPASy’s ProtParam showed that the total sequence length of Z. mobilis adhB protein was 383 amino acids, and the relative molecular mass was approximately 40.17 kDa. The proportion of each amino acid in Z. mobilis adhB protein is shown in the figure. The three most common amino acids are alanine (A), leucine (L), and glycine (G), and the top four most common amino acids are all hydrophobic amino acids. This may be due to the widespread existence of hydrophobic effects in the process of life activities. Hydrophobic amino acid residues are widely involved in the binding between oligomeric protein subunits, enzyme catalysis and activity regulation, and other physiological processes within proteins. The number of negatively charged residues (Asp + Glu) in Z. mobilis adhB protein is 41, the number of positively charged residues (Arg + Lys) is 32, and the half-life is approximately 30 hours. The instability index is 23.70. Usually, a value less than 40 indicates a stable protein, and a value greater than 40 indicates an unstable protein. Therefore, Z. mobilis adhB protein is determined to be a stable protein. The total average hydrophobicity index (GRAVY) is 0.114. Generally, a negative value of the total average hydrophobicity index indicates that the protein is hydrophilic, and a positive value indicates that the protein is hydrophobic. Therefore, it is judged that Z. mobilis adhB protein is a hydrophobic protein. The theoretical isoelectric point of Z. mobilis adhB is 5.49, indicating that it is an acidic protein.

Table.The amino acid composition of Z. mobilis adhB

Conserved Motif AnalysisA total of 10 conserved motifs were found using the MEME online website (Figure 3-3). All 10 motifs were present in all 20 adhB sequences, and the order of all sequences was consistent, indicating that the adhB protein sequence is a highly conserved protein sequence.

Transmembrane Domain Analysis and Signal Peptide Analysis


Subcellular localization results show that adhB protein is localized in the cytoplasmic membrane.

Figure. Subcellular localization of adhB protein
The signal peptide is generally located at the N-terminus of the protein and is a short amino acid sequence that guides the newly synthesized protein to transfer across the membrane. From the Figure, it appears that the analyzed protein is predicted not to have a signal peptide because the likelihood for all signal peptide types (Sec/SPI, Lipoprotein signal peptide Sec/SPII, TAT signal peptide Tat/SPI, Lipoprotein signal peptide Tat/SPII, Pilin-like signal peptide Sec/SPIII) is 0. The probability graph across the sequence shows a flat line with very low probability scores, consistently below the threshold for predicting a signal peptide presence, which is typically indicated by a peak that crosses a predefined probability threshold.
Therefore, the analysis would suggest that this protein does not contain a signal peptide and is unlikely to be secreted via the classical Sec or Tat pathway. It might be a cytoplasmic protein, an intracellular membrane protein, or secreted through non-classical pathways that do not involve a signal peptide.

Figure The signal peptide analysis of adhB protein
As shown in Figure 3.2.3c, the TMHMM result shows the posterior probabilities for the protein being inside, outside, or within the membrane. Since the line representing “transmembrane” probability remains at the baseline with no peaks above the threshold, this supports the conclusion that there are no transmembrane regions within this protein.