This page details our dry lab work on the G-quad-haemin complex. To download our various resources, see the resources page.
Introduction
In our project, G-quadruplex DNAzyme (G-quad) is the last step in our detection mechanism, which produces a visible colour change when the target gene is detected. The colour change is due to the peroxidase-mimicking activity when the G-quadruplex DNAzyme is bound with haemin (G-quad-haemin), which catalyses the oxidation of TMB (colourless) to TMB⁺ (blue) in the presence of H₂O₂, as shown in Equation 1 [1]. The literature has shown that the binding of haemin to G-quad is crucial to the catalytic activity of G-quad [2], and G-quadruplex is more stable in a mixture of water and dimethyl sulphoxide (DMSO) [3].
Since two of the reagents in the reaction, haemin and TMB, also have better solubility in DMSO than in water [4], [5], it is postulated that adding more DMSO to the reaction mixture may help G-quad work faster. To investigate this, a molecular dynamics (MD) approach has been developed to calculate the binding energy of haemin to G-quad at different concentrations of DMSO in water.
![]()
Equation 1: Oxidation of TMB into TMB⁺ by G-quad-haemin complex
Methodology
To obtain the free energy of binding of haemin to G-quad, alchemical transformation of the haemin is performed, and the free energy of binding is calculated according to the thermodynamic cycle, as shown in Figure 1. The software being used is GROMACS [6].
![]()
![]()
Figure 1: Thermodynamic cycle for calculating the binding energy of haemin to G-quad
Topology Preparation
The topology is prepared using the AlphaFold 3 model
[7], with
one DNA input (5'-GGGTGGGTGGGTGGGTC-3'
) and
one ligand input (HEM - Haem). It is noted that haemin and
haem are different, with
haemin having an extra chloride ion coordinated with the
iron ion (Figure 2). Since haemin and haem have very similar structures, we believe that
haem is a sufficient approximation to haemin. The model
generated by AlphaFold 3 with the highest confidence is
taken as the topology for simulation.
![]()
![]()
Figure 2: Structure of haemin (left) and structure of haem (right)
Topology Verification
We have visually inspected the model generated to ensure that AlphaFold 3 provided a reasonable starting topology for simulation. It was observed that the generated model follows the end-stacking binding mode reported in the literature [8], as shown in Figure 3 and Figure 4.
Figure 3: Folded G-quadruplex DNAzyme generated by AlphaFold 3
Note that the G-quadruplex structure in the middle of the DNAzyme could be clearly seen.
Figure 4: Folded G-quadruplex DNAzyme bound with haem
Note that the haem stacks on top of the G-quadruplex, which is consistent with the end-stacking binding mode.
Choice of Force Field
The amber14sb_OL15 force field [9], [10] is chosen as the force field to be used in the simulation, as this force field has been found to better resemble experimental measurements than other force fields, which would provide a more realistic simulation [11].
Since DMSO and haem are not "native" molecular species to the force field, they have to be parametrised before the simulation could proceed.
DMSO Parametrisation
The DMSO molecule was parametrised by the
ACPYPE server [12],
[13]. The topology of DMSO was submitted to
the server as a .pdb
file, and the server produces the
.itp
and .gro
file required
for building the topology.
Haem Parametrisation
Since the center of haem contains an Fe³⁺ ion, haem
cannot be parametrised by the ACPYPE server as the
Generalised Amber Force Field (GAFF) does not contain the Fe³⁺
ion
[14]. To parametrise haem in a way that is
compatible with amber14sb_OL15 force field, the
Metal Center Parameter Builder (MCPB.py
)
is used for preparing the topology [15].
First, the haem molecule is passed to MCPB.py
to obtain
input files for GAMESS-US
[16]. Then, quantum calculations were done
using GAMESS-US in order to optimise the geometry and
compute the charge density. The wave function is
computed using B3LYP/6-31G level of theory*, which is
sufficient for haem [17].
Restrained Electrostatic Potential (RESP)
[18] was applied to compute the
partial charges on different atoms, and the
Seminario method [19] was
used to compute the bonded parameters. Lastly, the
.frcmod
file were converted to .itp
and
.gro
using the LEAP program and ACPYPE.
Topology Assembly
While DMSO and haem were parametrised in previous steps, they are not a part of the residue database. Therefore, they have to be handled separately from the G-quad.
First, the .pdb
file generated by AlphaFold 3 was
separated into two .pdb
files, one for G-quad
and one for haem. The G-quad is
converted into GROMACS topology using the
pdb2gmx
command. For the haem, the
editconf
command is used to convert .pdb
to
.gro
. The .gro
coordinates were manually copied to
the .gro
file of the G-quad .gro
file. The
.top
file was accordingly modified to
include the topology for haem and the restraints for it.
After that, if DMSO is used as a co-solvent, the
insert-molecules
command is invoked to
add DMSO molecules into the topology.
solvate
is used to add water to the system.
The SPC/E water model [20] is
used. To neutralise charges, genion
is used to
add ions to the topology.
K⁺ ions are added to balance negative charges, and
Cl⁻ ions are added to balance positive charges, as K⁺ ions
are present in the experimental setup and were also shown to have
the best effect on stabilising G-quad structures
[21].
System Equilibration
To arrive at a reasonable starting structure and to have correct sampling during the production MD, the system must be well-equilibrated before the production MD runs. Energy minimisation was performed on the solvated structure using the steepest gradient descent method [22]. NVT equilibration was performed on the energy-minimised structure. No additional thermostats were necessary since the Langevin dynamics integrator was used for the simulation, which could also regulate the system temperature. The system temperature was set at 298K. NPT equilibration was performed on the NVT ensemble, using the Parrinello-Rahman barostat [23] to regulate the system pressure. The pressure is set to be 1 bar. The NVT and NPT equilibration were run for 100ps. The potential, temperature, pressure, and density over time were plotted, and convergence was observed in the graphs.
Production MD
A total of 41 production MDs were run for each solvent configuration, in which the charge interaction of haem is gradually decoupled first, and then the Lennard-Jones potentials are also gradually decoupled. This allows us to obtain the ∂H/∂λ curve, which can then be used to obtain ΔG using the Bennett Acceptance Ratio (BAR) method [24].
The charge lambdas are first decoupled in steps of 0.05 per iteration, and
after the charge interactions are decoupled, the Lennard-Jones potentials
are decoupled in steps of 0.05 per iteration. To
ensure sufficient ensemble sampling at the specified lambda
values,
2.5ns of the simulation was run for each combination of lambda, which is around two times what is deemed adequate
[25]. The
free energy was obtained from ∂H/∂λ using
bar
.
Calculation of ΔG of G-quad-haem Bond
To calculate the ΔG of haem binding to G-quad, one may refer to the thermodynamic cycle outlined in Figure 1. Two separate simulations were set up, one with G-quad bounded with haem and another with haem only. Two systems were solvated in the same DMSO/water mixture concentration, and haem was alchemically transformed through the process outlined above.
Results
The simulation results are summarised below. ΔG are given in kJ/mol and the +/- values represent the standard deviation of ΔG.
DMSO concentration (v/v) % | ΔG in G-quad-haem | ΔG in haem only | ΔG binding |
---|---|---|---|
0 | 2598.74 +/- 15.45 | 2633.05 +/- 5.10 | 34.41 +/- 16.27 |
5 | 2577.23 +/- 3.00 | 2592.05 +/- 7.48 | 14.82 +/- 8.06 |
10 | 2577.19 +/- 11.07 | 2615.68 +/- 2.05 | 38.49 +/- 11.26 |
15 | 2562.85 +/- 4.34 | 2604.41 +/- 2.51 | 41.56 +/- 5.01 |
20 | 2510.37 +/- 17.08 | 2626.85 +/- 1.66 | 116.48 +/- 17.16 |
25 | 2543.50 +/- 11.67 | 2568.09 +/- 2.83 | 24.59 +/- 12.01 |
30 | 2482.45 +/- 9.99 | 2600.48 +/- 3.56 | 118.03 +/- 10.61 |
35 | 2492.22 +/- 13.88 | 2594.15 +/- 5.82 | 101.93 +/- 15.05 |
40 | 2564.71 +/- 22.22 | 2586.21 +/- 3.90 | 21.50 +/- 22.56 |
45 | 2479.50 +/- 18.14 | 2558.66 +/- 3.21 | 79.16 +/- 18.42 |
50 | 2509.47 +/- 15.19 | 2524.21 +/- 9.77 | 14.74 +/- 18.06 |
To visualise the results, the free energies are plotted against DMSO concentration (v/v) %, as shown in Figure 5 and Figure 6.
![]()
Figure 5: Change in free energy (ΔG) of decoupling of haem in G-quad-haem and haem only
![]()
Figure 6: Change in free energy (ΔG) of binding of haem in G-quad-haem
Discussions
From Figure 5, we could observe that the free energy change of decoupling of haem only in solvent decreases as DMSO concentration increases. This is in agreement with the literature, where the solubility of haemin increases as DMSO concentration increases. In Figure 6, we observed a general trend of increasing binding energy of haem to G-quad, indicating a prediction of decreasing binding affinity to G-quad as the DMSO concentration increases.
Some data points do not follow the trend: the binding energies at 25% and 40% DMSO concentration (v/v). We believe this is due to the change in G-quad conformation during the simulation, which skewed the free energy values. Through the inspection of RMSD value over time of simulations (see Supporting Information) and comparing with the non-perturbed simulation (λ = 0), we conclude that the G-quadruplex DNAzyme has changed conformation in some of the simulations, leading to inaccurate free energy values. Potential solutions are discussed below.
We have also noticed that the
standard deviations of the change in free energy for G-quad-haem decoupling
are large. Upon inspection of the gmx bar
output, we noticed that the
relative entropies between some of the lambdas were significant, indicating inadequate phase space sampling. This suggests
the need for more sampling in those intervals.
Limitations and Uncertainties
While the simulations have provided valuable insights in terms of the affinity of haem to G-quad, we are also aware of the following limitations and uncertainties.
Inability to Simulate Peroxidase Activity of G-quad-haem in Various Solvent Mixtures
While molecular dynamics simulations may provide information about the binding affinity between G-quad and haem, they cannot provide information on the peroxidase activity of G-quad-haemin due to their inability to simulate bond breaking and forming, which are the basis of chemical reactions. Such processes would lead to topology modifications, which is impossible in pure molecular mechanics simulations. Also, in molecular mechanics simulations, quantum effects on the electrons are neglected, and molecules are assumed to be at the ground state. In contrast, most reactions require electrons to be excited or exchange electrons between different species. Resolving such issues would demand the model to take quantum effects into account. We suggest that quantum mechanics/ molecular mechanics (QM/MM) hybrid simulations may be able to fill this gap.
Unrestrained G-quad Skews Free Energy Values
Upon inspection of the G-quad-haem structure generated by the AlphaFold 3 model, we noticed that there are hand-like structures formed by the last two nucleotides of the G-quad sequence (TC-3') that seem to hold onto the haem ligand and keep it in place. However, as the interaction between haem and the surroundings was decoupled, the interactions between haem and the hand region weakened, which may cause the hand region to wiggle freely. This increases the number of possible microstates in the system, increasing the entropy and leading to skewed free energy values. This may be solved by "confine and release" free energy simulations, which have been proposed in literature [26].
Binding Affinity may not be a Universal Indicator to G-quad-haemin Peroxidase Activity
While there was literature suggesting that the binding affinity of haemin to G-quad may be a predictor for the peroxidase activity of the G-quad-haemin DNAzyme [8], we have also found literature supporting the opposing view [27]. Hence, the hypothesis that the free energy of binding between G-quad and haemin is a predictor for peroxidase activity may not be universally true.
G-quad-haem Activity may not be the Most Crucial Factor Affecting Reaction Rate
Although we focused on evaluating the stability of the G-quad-haem complex in different solvent mixtures, one must remember that the ultimate goal for optimising solvent mixture is to maximise the reaction rates, such that the test kit develops a more evident colour change more rapidly. The stability of the G-quad-haem complex only evaluated the catalyst in the reaction, and the effects of solvents on the reactants' stability may need to be accounted for in the simulations. The solvent may increase the reaction rate by affecting the stability of intermediate and final products and vice versa [28]. It is suggested that the whole reaction, including its intermediary products, may be qualitatively evaluated in terms of their stability before confirming whether the increased or decreased stability of G-quad-haem matters for the reaction.
Conclusion
Despite the presence of some inaccuracies due to the factors outlined above, our preliminary modelling results indicate a promising potential that the binding energy of haem to G-quad is the lowest in fully aqueous solutions. This exciting discovery could pave the way for a new method to predict G-quad-haem complex catalytic capability in different solutions, a possibility that can be further explored in the lab.
To help future iGEM teams in modelling molecular systems involving the haem residue, we have uploaded the parametrised haem topology for other teams to use.
References
- Z. Zhang, W. Zhao, C. Hu, Y. Cao, Y. Liu, and Q. Liu, "A Convenient and Label-Free Colourimetric Detection for L-Histidine Based on Inhibition of Oxidation of 3,3′,5,5′-Tetramethylbenzidine-H₂O₂ System Triggered by Copper Ions", vol. 9, p. 773519, Nov. 2021, doi: 10.3389/fchem.2021.773519.
- Y. Cao et al., "Investigation and improvement of catalytic activity of G-quadruplex/haemin DNAzymes using designed terminal G-tetrads with deoxyadenosine caps", vol. 11, no. 26, pp. 6896–696, Jul. 2020, doi: 10.1039/d0sc01905d.
- N. Tariq, T. Kume, L. Luo, Z. Cai, S. Dong, and R. B. Macgregor, "Dimethyl sulphoxide (DMSO) is a stabilising co-solvent for G-quadruplex DNA", vol. 282, p. 106741, Mar. 2022, doi: 10.1016/j.bpc.2021.106741.
- R. Stiebler, A. N. Hoang, T. J. Egan, D. W. Wright, and M. F. Oliveira, "Increase on the Initial Soluble Haem Levels in Acidic Conditions Is an Important Mechanism for Spontaneous Haem Crystallisation In Vitro", vol. 5, no. 9, p. e12694, Sep. 2010, doi: 10.1371/journal.pone.0012694.
- A. Frey, B. Meckelein, D. Externest, and M. A. Schmidt, "A stable and highly sensitive 3,3′,5,5′-tetramethylbenzidine-based substrate reagent for enzyme-linked immunosorbent assays", vol. 233, no. 1, pp. 47–56, Jan. 2000, doi: 10.1016/S0022-1759(99)00166-0.
- M. Abraham et al., "GROMACS 2024.3 Source code". Zenodo, 29 Aug, 2024. doi: 10.5281/zenodo.13456374.
- J. Abramson et al., "Accurate structure prediction of biomolecular interactions with AlphaFold 3", vol. 630, no. 8016, pp. 493–500, Jun. 2024, doi: 10.1038/s41586-024-07487-w.
- N. Alizadeh, A. Salimi, and R. Hallaj, Haemin/G-Quadruplex Horseradish Peroxidase-Mimicking DNAzyme: Principle and Biosensing Application, vol. 170. Switzerland: Springer International Publishing AG, 2017, pp. 85–106.
- M. Zgarbová, J. Šponer, M. Otyepka, T. E. Cheatham, R. Galindo-Murillo, and P. Jurečka, "Refinement of the Sugar–Phosphate Backbone Torsion Beta for AMBER Force Fields Improves the Description of Z- and B‑DNA", vol. 11, no. 12, pp. 5723–5736, Dec. 2015, doi: 10.1021/acs.jctc.5b00716.
- M. Zgarbová et al., "Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles", vol. 7, no. 9, pp. 2886–2902, Sep. 2011, doi: 10.1021/ct200162x.
- Li N., Gao Y., Qiu F., and Zhu T., "Benchmark Force Fields for the Molecular Dynamic Simulation of G-Quadruplexes", vol. 26, no. 17, pp. 5379-, Sep. 2021, doi: 10.3390/molecules26175379.
- Sousa da Silva A. W. and Vranken W. F., "ACPYPE - AnteChamber PYthon Parser interfacE", vol. 5, no. 1, pp. 367--367, Jul. 2012, doi: 10.1186/1756-0500-5-367.
- L. Kagami, A. Wilter, A. Diaz, and W. Vranken, "The ACPYPE web server for small-molecule MD topology generation", vol. 39, no. 6, Jun. 2023, doi: 10.1093/bioinformatics/btad350.
- J. Wang, R. M. Wolf, J. W. Caldwell, P. A. Kollman, and D. A. Case, "Development and testing of a general amber force field", vol. 25, no. 9, pp. 1157–1174, Jul. 2004, doi: 10.1002/jcc.20035.
- P. Li and K. M. Merz, "MCPB.py: A Python Based Metal Center Parameter Builder", vol. 56, no. 4, pp. 599–604, Apr. 2016, doi: 10.1021/acs.jcim.5b00674.
- Barca G. M. J. et al., "Recent developments in the general atomic and molecular electronic structure system", vol. 152, no. 15, p. 154102, Apr. 2020, doi: 10.1063/5.0005188.
- Autenrieth F., Tajkhorshid E., Baudry J., and Luthey‐Schulten Z., "Classical force field parameters for the haem prosthetic group of cytochrome c", vol. 25, no. 13, pp. 1613–1622, Oct. 2004, doi: 10.1002/jcc.20079.
- C. I. Bayly, P. Cieplak, W. Cornell, and P. A. Kollman, "A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model", vol. 97, no. 40, pp. 10269–10280, Oct. 1993, doi: 10.1021/j100142a004.
- J. M. Seminario, "Calculation of intramolecular force fields from second-derivative tensors", vol. 60, no. 7, pp. 1271–1277, 1996, doi: 10.1002/(SICI)1097-461X(1996)60:7<1271::AID-QUA8>3.0.CO;2-W.
- H. J. C. Berendsen, J. R. Grigera, and T. P. Straatsma, "The missing term in effective pair potentials", vol. 91, no. 24, pp. 6269–6271, Nov. 1987, doi: 10.1021/j100308a038.
- S. Haider, "Computational Methods to Study G-Quadruplex–Ligand Complexes", vol. 98, no. 3, pp. 325–339, Sep. 2018, doi: 10.1007/s41745-018-0083-3.
- E. J. Haug, J. S. Arora, and K. Matsui, "A steepest-descent method for optimisation of mechanical systems", vol. 19, no. 3, pp. 401–424, Jul. 1976, doi: 10.1007/BF00941484.
- Parrinello M. and Rahman A., "Polymorphic transitions in single crystals: A new molecular dynamics method", vol. 52, no. 12, pp. 7182–7190, Dec. 1981, doi: 10.1063/1.328693.
- C. H. Bennett, "Efficient estimation of free energy differences from Monte Carlo data", vol. 22, no. 2, pp. 245–268, Oct. 1976, doi: 10.1016/0021-9991(76)90078-4.
- R. S. Rathore, P. Aparoy, P. Reddanna, A. K. Kondapi, and M. Rami Reddy, "Minimum MD simulation length required to achieve reliable results in free energy perturbation calculations: Case study of relative binding free energies of fructose-1,6-bisphosphatase inhibitors", vol. 32, no. 10, pp. 2097–2103, Jul. 2011, doi: 10.1002/jcc.21791.
- D. L. Mobley, J. D. Chodera, and K. A. Dill, "Confine-and-Release Method: Obtaining Correct Binding Free Energies in the Presence of Protein Conformational Change", vol. 3, no. 4, pp. 1231–1235, Jul. 2007, doi: 10.1021/ct700032n.
- J. Chen et al., "How Proximal Nucleobases Regulate the Catalytic Activity of G‑Quadruplex/Haemin DNAzymes", vol. 8, no. 12, pp. 11352–11361, Dec. 2018, doi: 10.1021/acscatal.8b03811.
- J. T. Hynes, "Chemical Reaction Dynamics in Solution", vol. 36, no. 1, pp. 573–597, Oct. 1985, doi: 10.1146/annurev.pc.36.100185.003041.