RESULTS
Vilnius-Lithuania iGEM 2024 team set out on a challenging journey to develop a recombinant manufacturing system in E. coli for the efficient and tunable production of the strongest naturally found glue - holdfast.
Caulobacter crescentus and Hirschia baltica naturally produce holdfast, a powerful biological adhesive. However, these organisms are notoriously difficult to work with due to their slow growth rates, making industrial-scale production impractical. Optimizing holdfast production in these native hosts is currently unfeasible, as the synthesis system and its regulation are still poorly understood. To address this challenge, we have developed the world's first recombinant system for holdfast production using the widely industrially-utilized E. Coli (see Engineering).
By transferring the complex, multi-gene pathway responsible for adhesive production into a new host, we have enabled bacterial holdfast to:
We have created a controllable, cost-effective, and scalable system for holdfast production. This system serves as a robust foundation for further development and scale-up by research teams worldwide (see Implementation).
Explore our detailed results below to learn more about this innovative system.
As a first step, we needed to conduct an analysis of the holdfast synthesis pathway found in Caulobacter crescentus and Hirschia baltica. This holdfast synthesis pathway consists of 12 proteins responsible for the polysaccharide assembly, polymerization, and extracellular export[1][2].
Goal: Clone synthesis pathway genes into plasmids suitable for expression in E. coli.
Strategy: The pathway of 12 genes is divided by function into two plasmids and subdivided between two T7 promoters based on their arrangement in the natural operons, cloned using Golden Gate assembly (Fig. 1).
Results: All target genes were successfully cloned into expression vectors and validated by doing MSA from the results obtained from Nanopore sequencing (done by SeqVision). This resulted in six fully assembled plasmids ready for the following experimental applications.
We designed a strategy to maximize the success of plasmid assembly by first assembling plasmids with 3 genes, and after verifying the sequences, integrating 3 left genes into that backbone (Fig. 2). In this way, we prevented Golden Gate assembly errors by trying to construct plasmids from 8 or more fragments.
Firstly, to amplify target genes from the genome, we first had to purify the genomic DNA. It took us 5 days to get visible colonies of C. crescentus and H. baltica on agar plates before we were able to inoculate liquid cultures for genomic DNA purification. Nonetheless, we successfully acquired it and verified its presence with agarose gel (Fig. 3.). The obtained gDNA was above the ladder, indicating the presence of very large DNA products, which are common for genomic fragments.
Genes of interest were then amplified via PCR directly from the purified genomes of the strains (C. crescentus CB2/CB2A/Hirschia baltica) while incorporating IIS restriction enzyme recognition sites and unique sticky ends produced during Golden Gate assembly (Fig. 4).
Protein name | Size, bp |
---|---|
hfsA | 1506 |
hfsB | 702 |
hfsC | 1269 |
hfsD | 741 |
hfsE | 1539 |
hfsF | 1443 |
hfsG | 930 |
hfsH | 774 |
hfsI | 1323 |
hfsJ | 951 |
hfsK | 1086 |
hfsL | 921 |
Protein name | Size, bp |
---|---|
hfsA | 1506 |
hfsB | 703 |
hfsC | 1269 |
hfsD | 741 |
hfsE | 1539 |
hfsF | 1443 |
hfsG | 930 |
hfsH | 774 |
hfsI | 1323 |
hfsJ | 951 |
hfsK | 1086 |
hfsL | 921 |
Protein name | Size, bp |
---|---|
hfsA | 1488 |
hfsB | 711 |
hfsC | 1362 |
hfsD | 717 |
hfsE | 1554 |
hfsF | 1446 |
hfsG | 993 |
hfsH | 768 |
hfsI | 1305 |
hfsJ | 780 |
hfsK | 1113 |
hfsL | 987 |
Since the initial Golden Gate assembly with PCR fragments from the genome failed (see Engineering), the procedure was repeated using PCR fragments amplified from purified PCR fragments. This minimized the non-specific product formation during PCR amplification (Fig. 5) and increased their purity.
Subsequently, Golden Gate assembly (GG) was performed in two stages for each plasmid, as described in Fig. 2. After the reaction, the GG mixture was transformed into Mach1 competent cells (Thermo Scientific), and resulting colonies were screened using colony PCR (Fig. 6.1-9).
Positive colonies were then used for plasmid purification and restriction digestion analysis (Fig. 7.1-11).
Selected positive clones were verified using NanoPore sequencing (done by SeqVision) and only then used for the second round of cloning (cPCR and restriction analysis are shown in figures 6.1-9 and 7.1-11 above). The procedure was successfully repeated for each plasmid resulting in full assembly of 36 genes from 3 organisms into 6 plasmids, all verified by NanoPore sequencing (see Parts).
The intricate holdfast synthesis pathway involves numerous proteins that must be efficiently co-expressed in Escherichia coli. After obtaining plasmids used for full holdfast synthesis pathway assembly, we had to optimize the expression of the whole system in E. coli. Previously, only three studies have tried to recombinantly express C. crescentus proteins in E. coli for unassociated studies with our project's goal [3][4][5]. Since the E. coli strains and protein expression conditions were unrelated to each other and before our project, no one in iGEM besides the 2009 iGEM ULB-Brussels team ever tried expressing more than two C. crescentus proteins in E. coli at the same time, we had no solid foundation for expression and chose to experiment with different E. coli strains and conditions. Therefore, it was essential to optimize the conditions for simultaneous protein expression.
Goal: Find the conditions most suited for co-expression of holdfast synthesis pathway proteins. Verify their expression levels in E. coli.
Strategy: Optimize holdfast synthesis conditions, by testing different media, temperatures, IPTG concentrations, and expression duration on multiple E. coli strains. Use SDS-PAGE analysis of cell lysates and HPLC-MC proteomics to verify the expression results.
Results: Optimal C. crescentus protein expression was achieved in the BL21 (DE3) strain cultivated in the LB medium. The most favorable conditions included an incubation temperature of 37°C, induction with 0.5 mM IPTG, and an expression duration of 3 hours. However, we could not find the optimal conditions for H. baltica proteins that would result in similar expression levels.
To determine the best conditions for the whole system expression, we first used E. coli KRX (DE3) strain. We tried expressing separate plasmids pRSF-(1)HfsE-hsfJ-hfsG-(2)hfsH-hfsK-hfsL and pACYC-(3)hfsA-hfsB-hfsD-(4)hfsF-hfsC-hfsI with different IPTG concentrations - 0.1, 0.25, 0.5, 0.75 and 1 mM - and 0.1% rhamnose with protein expression for 3h at 37°C after induction. As we saw, some bands, corresponding to our protein sizes, were appearing in pACYC-(3)hfsA-hfsB-hfsD-(4)hfsF-hfsC-hfsI expression (Fig. 8.1), but we were not sure if they were our system proteins, therefore for pRSF-(1)HfsE-hsfJ-hfsG-(2)hfsH-hfsK-hfsL we used negative control with empty pRSF vector and expressed the proteins in similar conditions. We saw that pRSF-(1)HfsE-hsfJ-hfsG-(2)hfsH-hfsK-hfsL operon proteins were also expressed (Fig. 8.2) with minimal IPTG concentration impact on protein amount.
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
We decided to use lower IPTG concentrations - 0.25, 0.5, and 0.75 mM - for gene expression induction of the full system, as it is more cost-effective for upscale in the future. But, unfortunately, full system expression at different temperatures and expression duration did not provide clear bands of proteins in SDS-PAGE gel analysis (Fig. 9. 1-5).
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Only some proteins, in the size range of 30-50 kDa, were appearing, but in general, the results of expression of the whole system were inconclusive, leading to the need to test another E. coli strain.
The next E. coli strain we tested was BL21 (DE3). Since the IPTG concentration appeared not to make that big of an impact on the expression, we settled on IPTG concentration of - 0.25, 0.5, 0.75 mM - in this way covering a wide range of them and accelerating the optimisation effort, if the system would be expressed. We also decided to yet again test different expression temperatures - 37°C, 30°C, 16°C - before and after gene expression induction.
Initially, we tested pACYC-(3)hfsA-hfsB-hfsD-(4)hfsF-hfsC-hfsI operon expression at 37°C for 3h, which did not give promising results (Fig. 10.2), as we could not see distinguishable differences before and after induction. Nevertheless, we proceeded with pRSF-(1)HfsE-hsfJ-hfsG-(2)hfsH-hfsK-hfsL operon expression by applying the same conditions. Since stark differences between empty E. coli and our operon lysates were obtained, it was proven that previously mentioned conditions are optimal for this operon expression. (Fig. 10.1). As with the expression in the KRX strain we could not see many differences between the IPTG concentrations.
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
We proceeded with the whole system expression. After analyzing expression at 37°C for 3h conditions, we saw that there were significant differences between the uninduced system and the system after 3 hours (Fig. 11.1). As with KRX(DE3) expression, we saw that IPTG concentration used for gene expression induction did not make a big impact for overall expression.
Since the CB2 system was expressing, we went on to try different temperatures with the aim of further optimizing protein expression. Remarkably, decreasing the expression temperature to 30°C and expression overnight did not make a significant difference as the proteins were still expressed in similar amounts to that of 37°C (Fig. 11.2). Expression of 16°C overnight produced some of the expected bands but not in the same yield as expression at higher temperatures (Fig.11.3).
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Proteomic analysis (performed by Thermo Fisher Scientific Baltics) of samples induced by 0.5 mM IPTG from separate parts and the whole system, revealed that proteins responsible for holdfast polymerization - hfsC and hfsI - were not expressed (Fig. 12 (a),(b)). In addition, protein levels during full system expression dropped notably compared to separate part expression. However, proteins were still expressed in slightly higher quantities than in control (Fig. 12. (c)).
Nevertheless, as later experiments showed, these proteins were probably substituted by paralogous proteins found in E. coli as the system without 2 parts was still producing a polysaccharide (see BBa_K5246003 and BBa_K5246009). Since we obtained these results a week before the freeze-day deadline, we were unable to reconsider our approach to the optimization in time. However, we reason that in the future, we should first test whether the separate proteins - hfsC and hfsI - are expressed and at what conditions before assembling new plasmids with different operon orders or additional promoters. T7/lac could serve as a good starting point, other considerations could involve separately inducible or constitutive promoters available in iGEM Parts Registry.
Once the system was successfully expressed in BL21(DE3) strain, we proceeded to optimize the expression further by testing another E. coli strain - C41(DE3). We decided to test separate system parts and the whole CB2 system with different IPTG concentrations - 0.25, 0.5, and 0.75 mM. Since we saw that the proteins were best expressed at 37°C in KRX(DE3) and BL21(DE3) strains, we settled on testing only this temperature.
SDS-PAGE analysis of cell lysates before and after gene expression induction revealed that proteins were expressed in separate parts of the system and the whole system (Fig. 13. 1-3). Regrettably, the quantity was visibly less than that seen in BL21(DE3) strain indicating that this strain is not suitable for efficient system expression.
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Even though to this point we found E. coli strain with, in our perspective, sufficient system expression, we were determined to improve the expression even more. A study was conducted, where C. crescentus hfsJ gene was expressed in E. coli to determine its interaction with hfiA (inhibitor of holdfast development), we investigated the strain they used for expression - Rosetta (DE3) pLysS - for our own purpose [4]. Since we decided to use this strain, we had to carefully reconsidering our CB2 systems’ design, as the pRARE plasmid, native to Rosetta, contains the same antibiotic resistance - chloramphenicol - and the same origin of replication - p15A - as one of our system’s plasmids - pACYC-(3)hfsA-hfsB-hfsD-(4)hfsF-hfsC-hfsI. For this we decided to change the backbone’s antibiotic resistance and origin of replication (ori) (Fig.14).
As a donor of new antibiotic resistance and origin of replication we choose pBAD-PhoCl2f plasmid, which was kindly gifted to us by VU LSC Institute of Biotechnology. This particular plasmid contains the ampicillin resistance gene and ColE1 origin of replication compatible with our pRSF operon and pRARE plasmid.
To save time we decided to utilize Golden Gate assembly for resistance/ori switching. We successfully acquired plasmids with new resistance/ori, which, after testing them with colony PCR, restriction analysis (Fig. 15), were sequenced by whole plasmid sequencing by SeqVision. Since after transformation into electrocompetent Rosetta cells they grew without any problems on LB agar plates with 3 different antibiotics, we came to a conclusion that the resistance/ori switch worked.
To test the expression we used different IPTG concentrations - 0.25, 0.5 and 0.75 mM - and our standard expression temperature of 37°C with checking the total expressed protein amount after 3 hours by SDS-PAGE analysis. Unfortunately, we saw very low or almost no protein expression in separate plasmids, leading to the same happening when we tried expressing the whole system (Fig. 16. 1-3). We reason that this might be due to increased metabolic strain on E. coli due to a whole additional plasmid introduced into the system during expression. This E. coli strain might be suitable for specific C. crescentus protein expression but, sadly, is not fitting for our needs.
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
Since one of the strains used for more efficient polysaccharide production (see down below) was E. coli HMS174(DE3), we also expressed CB2 and an empty system in previously optimized conditions. We also analyzed protein production after overnight incubation with 1% glucose. After performing SDS-PAGE gel analysis, we saw that this strain also expresses some of the proteins of the CB2 system (Fig. 17). Remarkably, after overnight incubation with 1% glucose, more protein bands appear, corresponding to, e.g., hfsH or hfsD, proteins, indicating that it is likely that prolonged incubation does not negatively impact protein production. This might explain why some of the strains, previously expressing few proteins after 3 hours, were still forming rings after overnight incubation, as this strain also formed rings.
Protein name | Size, kDa |
---|---|
hfsA | 55 |
hfsE | 54 |
hfsF | 50 |
hfsI | 48 |
hfsC | 46 |
hfsK | 43 |
hfsJ | 35 |
hfsG | 34 |
hfsL | 33 |
hfsH | 28 |
hfsD | 26 |
hfsB | 25 |
As with C. crescentus we tested different IPTG concentrations (0.25, 0.5 and 0.75 mM). Since the 37°C temperature appeared to be optimal we proceeded with this temperature as our starting point. For expressed protein analysis we used SDS-PAGE on cell lysates before and after induction.
Due to time constrains we decided to start with expressing the system first and then proceed with separate system parts expression (see parts BBa_K5246049 and BBa_K5246052) . After numerous attempts to grow cultures for protein system expression we were only able to grow one culture to appropriate optical density for gene expression induction. Since it took much more time than we expected to reach the required optical density, we decided to leave the culture overnight at 16°C and analyze the lysates from overnight expression.
Unfortunately, all of the cultures with the full H. baltica system either did not grow overnight or were of almost non existent optical density indicating that the culture died. Nonetheless, we prepared samples for SDS-PAGE gel analysis of before and after induction. Surprisingly, there were some bands corresponding to recombinant protein sizes appearing in the after induction lanes suggesting that parts of the system were expressing (Fig. 18).
Protein name | Size, kDa |
---|---|
hfsE | 58 |
hfsA | 55 |
hfsF | 52 |
hfsC | 50 |
hfsI | 49 |
hfsK | 41 |
hfsG | 37 |
hfsL | 36 |
hfsJ | 29 |
hfsH | 28 |
hfsD | 27 |
hfsB | 25 |
However, due to the unpredictable nature of the culture growth and weak expression of some proteins, we concluded that this was definitely not the right condition for this system’s expression in E. coli. Because of the natural high salinity environment of the system’s proteins, we theorize, it might be impossible to express the system in E. coli as it seems that the system is toxic to the cell. Although, it is plausible to try expressing this system in other host organisms used for expressing halophilic proteins [6][7].
Following the optimization of protein expression, we examined the conditions required for successful holdfast polysaccharide production. Previous experiments demonstrated that the presence of the proteins does not guarantee holdfast synthesis, prompting further investigation into the underlying factors.
Goal: Finding conditions optimal for holdfast production in E. coli. Purifying holdfast polysaccharides and testing bacterial glue physicochemical properties.
Strategy: Testing substrates for holdfast production, using dot blot assays and Wheat germ Agglutinin (WGA) for holdfast identification, purifying polysaccharides using methods proposed by earlier research [8].
Results: The addition of 1% glucose and incubation overnight at 30°C are required for polysaccharide synthesis. E. coli cultures produce ring-like structures after incubation. Holdfast is primarily produced in the ring cells, detectable by WGA lectin, and contains considerable amounts of N-acetyl-D-glucosamine. Cells producing the holdfast are morphologically different, forming biofilm-like structures. The best substrate for polysaccharide production is glucose, with mannose as a second choice.
Multiple experiments revealed that having the proteins alone is insufficient for holdfast synthesis. We attempted to purify the holdfast from the media (see Experiments), but the results were not as expected (Fig. 19). No holdfast was produced under the given conditions, despite the presence of all the proteins. No differences were observed between the control and target samples in the dot blot assay of the purification samples.
So we considered adding precursors for the biosynthesis pathway to facilitate its’ activation. We hypothesized that we should be able to adapt methods, suitable for polymer production in E. coli [9][10]. Holdfast synthesis begins from UDP-D-glucose transfer to an UndP lipid carrier, so having an excess of available glucose could solve the problem. We designed a protocol based on previous studies[11][12][13]. Adding 1% (w/v) glucose after target protein expression, as the addition of glucose lowers expression levels by targeting lac operator, and incubating overnight at 30°C with a shaking speed lowered to 170 rpm resulted in distinct rings forming around the edges of the shaking media (Fig.20) [25][27].
In addition, for further experiments, e.g. holdfast purification, we expressed and synthesized ring-like formations, later deemed to be the holdfast, in larger flasks. This increase in size - from 500 mL to 1 L flask did not diminish the ability to form the rings (Fig. 21). Not only that but after multiple day incubation and consistently lowering the shaking speed, we were able to get multiple rings to form around the 1 L flask walls. This information was later used to produce rings for other experimental applications where larger amounts of starting material were required.
Wheat Germ Agglutinin (WGA) specifically binds to N-acetyl-D-glucosamine - a characteristic component of the holdfast, found in abundance in the polysaccharide [14]. This allowed us to determine the localization of the holdfast. We tested various samples, including cells from the media, the supernatant, material from the ring, sonicated cells, and the supernatant of the sonicated cells. After performing the dot blot assay (see Experiments), we saw that the holdfast is produced (indicated by considerable presence of N-acetyl-D-glucosamine) but is localized exclusively within the ring material (Fig. 22).
We made the conclusion that holdfast was located in the ring, but we still did not know the nature of the material, so we investigated if there were any living bacteria in it. Following the expression, after the rings formed, we resuspended the samples in LB media and streaked them on LB agar plates under four different conditions: (1) with appropriate antibiotics, (2) antibiotics + 0.5mM IPTG, (3) antibiotics + 1% glucose, and (4) antibiotics + 0.5mM IPTG + 1% glucose. The plates were then incubated overnight at 30°C. Variety of plates was supposed to show if there are any visible morphological differences between living bacterial cells from the ring, bacterial cells that express proteins of interest, and bacterial cells that not only express proteins but also, in theory, produce holdfast.
It seems like ring formation consists of living bacteria that can grow on agar plates. Although they do not have any morphological differences (Fig. 23), they do appear to be more of a liquid-like appearance.
Following the discovery, we hypothesized that only part of the bacterial population is able to effectively express all 12 proteins, so we used plated bacteria from rings for standard protein expression and ran a SDS-PAGE analysis (Fig. 24). It is clearly noticeable that bacteria from the ring expresses proteins much better than the fresh ones.
Surprisingly, after incubating overnight, we did not notice any significant difference in the quantity of holdfast produced, based on ring appearance.
After understanding that the ring contains our target polysaccharides with bacteria, we further investigated the exact location of the holdfast. We expanded the dot blot assay method, including more samples from the ring: lysed ring cells, lysed ring cells’ supernatant, and lysed ring cells’ sediment. This lets us pinpoint the location of the polysaccharides and confirm where the holdfast is produced. Dot blot assays showed that holdfast is produced only in the ring cells mostly located in the soluble fraction of the lysate. These results were replicable for multiple biological repetitions (Fig.25.1-3).
The addition of glucose is required to activate the holdfast synthesis pathway. Even after the activation, only part of the bacterial population is capable of synthesizing polysaccharides, forming ring-like structures on surfaces. After the synthesis, most of the holdfast is attached to the cells.
After successful holdfast biosynthesis, we aimed to purify the polysaccharides for further research and analysis. We adapted the only available C. crescentus holdfast chemical purification method and verified the results using dot blot assays with WGA [8].
During the purification process, it was noticed that holdfast-producing cells stick together and can not be homogenized via pipetting or vortexing. Unfortunately, despite multiple attempts, we could not obtain pure holdfast material (Fig. 26.1-3.). None of the 3 attempts yielded significant differences between the CB2 system sample and the empty (control) sample. As the other experiments have showed, the E. coli with the CB2 system produces polysaccharides with distinct chemical features of holdfast attached to the cells, purification protocols specific for exopolysaccharide purification from the cell envelope could be tried upon the ring-formed cells [15][16][17].
Understanding that holdfast stays attached to the producing cells led us to the hypothesis that they ought to have some noticeable morphological differences. SEM analysis revealed that the polysaccharide-producing bacteria form tight conglomerates and differ morphologically compared to the control group. SEM pictures clearly show that holdfast-producing bacteria have altered cell wall morphology appearing bigger, and more “puffy” with uneven envelope topology .Moreover, the CB2 system containing bacteria seems to form large aggregates, sticking together into difficult-to-disrupt lumps. This characteristic was noticed beforehand in holdfast purification experiments, but only in SEM, it became clear that the underlying cause of the phenomenon is biofilm-like structures (Fig. 27. 1-3., Fig. 28. 1-3.) [18][19].
Similar results were seen in flow cell bright-field microscopy, where it is noticeable that the CB2 sample has a considerable amount of big bacterial aggregates (Fig.29). Biofilm-like clumps noticeable in the video can not be homogenized by pipetting or vortexing, suggesting strong associations between cells. This could explain why CB2 sample bacteria were not evenly coating the glass slide, as the aggregates prevented enough bacteria from covering the bottom of the flow cell.
SEM and bright field microscopy analysis revealed that the ring material of E. coli cells producing the holdfast are morphologically different and produce biofilm-like structures.
We decided to test E. coli strain testing, we used C41(DE3), Rosetta(DE3) pLysS, and HMS174(DE3), and repeated the expression conditions with different IPTG concentrations (0.25, 0.5, and 0.75 mM) with protein expression for 3h at 37°C. After the expression, we added 1% glucose and left the flasks to incubate overnight at 30°C, 170 rpm. All of the flasks, after incubation, had prominent rings in CB2 system flasks compared to the control (Fig. 30.1-3.). However, the formed rings were not as prominent as the ones formed in BL21(DE3).
This indicated that even though these strains were not producing visible amounts of CB2 system proteins, the low amount of the expressed system was still sufficient enough to promote ring formation. As we ran out of time, we did not perform dot blot assays of the formed rings, but it is likely that they would also contain holdfast with N-acetyl-D-glucosamine as they appear during the same conditions used for BL21(DE3) rings production.
We chose sugars that could be relevant to the polysaccharide composition: glucose, mannose, fructose, saccharose, N-acetyl-D-mannosamine, D-glucuronic acid, D-glucosamine, and xylose. After performing holdfast synthesis with standard conditions (in BL21(DE3)) optimized in earlier iterations, we observed the formation of rings with some substrates (Fig. 31.1-8). Notably, D-glucosamine substrate - a direct component of holdfast polysaccharide does not produce prominent rings, which indicates that it is incorporated into the holdfast as a N-acetyl-D-glucosamine that, after deacetylation with hfsH, only then becomes D-glucosamine (Fig. 31.7.) [1].
Then, we examined the amount of holdfast produced with each monosaccharide substrate by dot blot assay (Fig. 32). As with visual representation of glucose formed rings (Fig. 31.1.), it appears that glucose is still the best choice as a substrate for holdfast production, with mannose coming second and D-glucosamine - third.
Additionally, crystal violet staining (see Experiments) experiments yielded similar results (repeated in 3 biological replicates), with glucose and mannose samples producing the largest amount of “biofilm”. This data was used in the model construction to find and verify if glucose is indeed the most cost-effective and efficient holdfast production substrate (see Model).
We aimed to reduce the metabolic burden on the cell membrane for polysaccharide production. We decided to deactivate one of the metabolic pathways in E. coli that produces polysaccharides, allowing substrates to be redirected towards our desired polysaccharide synthesis.
Goal: Inactivate the ECA pathway in E. coli, which produces polysaccharides, and evaluate its impact on polysaccharide production.
Strategy 1: Multiple Stepwise Gene Knockout Using CRISPR/Cas9 and λ Red Machinery
Strategy 2: Homology recombineering and usingthe Red/ET recombination system
Strategy 3: Test protein expression and polysaccharide production in knockedout strains.
Results: One of the essential genes, wecA was knocked out from E.coli, and the ECA pathway was eliminated from these expression strains: BL21(DE3), Rosetta(DE3), and HMS174(DE3). The HMS174(DE3)ΔWecA strain, expressing the CB2 polysaccharide production system, exhibited slower growth than the wild-type strain, but it still produced a sufficient quantity of polysaccharides.
After multiple attempts to eliminate the ECA pathway from E. coli expression strains KRX and BL21(DE3) by using CRISPR/Cas9 and λ Red system, we could not achieve it. We could transform cells with the plasmids and dsDNA templates required for efficient editing, but the few colonies we obtained showed false positive results.
The ECA pathway was eliminated by deleting the wecA gene in these E. coli expression strains: BL21(DE3), Rosetta(DE3), and HMS174(DE3). We achieved it by changing the genome editing strategy and choosing a homologous Red/ET recombination system. Our edited colonies had kanamycin-resistance cassettes as markers in the edited region, and we tested them by colony PCR. We were able to distinguish edited and non-edited colonies by data: PCR’ed non-edited region 1544 bp and with edited region 1840 bp. From the results it could be seen that gene deletion happened in most of the cells in three different E. coli expression strains (Fig.33.a-c). Some colonies obtained 2 bands because there was a mix of edited and dead cells on the plate.
CB2 holdfast system proteins are expressed, and polysaccharides were produced in HMS174(DE3)ΔwecA. We discovered that wecA gene deletion did not interfere with CB2 system protein production. Additionally, we compared it to the not edited HMS174(DE3) strain (Fig 34.).
We analyzed polysaccharide and ring production after overnight incubation with 1% glucose in an ECA-deficient HMS174(DE3)ΔwecA strain (Fig. 35). Although cells with the CB2 system grew slower than an empty control, they still produced rings with the polysaccharides, with thicker bands of the ring visible in figure 35.
We wanted to inactivate WecB - the enzyme that catalyzes the initial reaction in the production of the substrate UDP-N-acetyl mannosaminuronic acid (UDP-ManNAc), which is thought to be in the composition of our polysaccharide.
Goal: Knock out and test inactivation of WecB - enzyme producing one of the substrates for our polysaccharide biosynthesis pathway.
Strategy: Homology recombineering and usingtheRed/ET recombination system.
Results: The wecB gene was knocked out from E. coli expression strainsBL21(DE3), Rosetta(DE3), and HMS174(DE3).
The UDP-N-acetyl mannosaminuronic acid (UDP-ManNAc) substrate was withdrawn by deleting the wecB gene in these E. coli expression strains: BL21(DE3), Rosetta(DE3), and HMS174(DE3). We achieved it by using a homologous Red/ET recombination system. Our edited colonies had kanamycin-resistance cassettes as markers in the edited region, and we tested them by colony PCR. We were able to distinguish edited and non-edited colonies by data: PCR’ed non-edited region 1577 bp and with edited region 1846 bp. From the results it could be seen that gene deletion happened in most of the cells in three different E. coli expression strains(Fig.36 a-c). Some colonies obtained 2 bands because there was a mix of edited and dead cells on the plate.
CB2 holdfast system proteins are expressed but polysaccharides are not produced in HMS174(DE3)ΔwecB. We discovered that wecB gene deletion did not interfere with CB2 system protein production (Fig. 37).
We analyzed polysaccharide and ring production after overnight incubation with 1% glucose in the HMS174(DE3)ΔwecB strain. Cells with the CB2 system grew faster than unedited HMS174(DE3) and did not produce rings with polysaccharides. The flask with the E.coli containing the CB2 system looked the same as the E. coli without (Fig 38).
Additionally, we comparedHMS174(DE3)ΔwecA with HMS174(DE3)ΔwecB containing CB2 systemto see the difference of polysaccharide and ring production (Fig. 39).
These experiments were done by our colleagues at the Vilnius University Center of Physical Sciences And Technology. Dr. Martynas Talaikis and Dr. Ilja Ignatjev. FTIR analysis reveals chemical properties of our holdfast.
A shows the ATR-FTIR spectra of intact cell samples between the control and CB2 system samples (Fig.40.1.A). Since they are very similar, a difference spectrum was constructed by subtracting the control spectrum from the sample spectrum (Fig. 40.1.B) to highlight the differences. In the difference spectrum, the vibrational modes directed upwards belong to the CB2 sample. Vibrational modes of the amide functional group appear at 1618 cm-1 (Amide-I, C=O stretching), 1517 cm-1 (Amide-II, N-H in-plane bending), and 1235 cm-1 (Amide-III, N-H in-plane bending coupled with C-N stretching, and C-C stretching) were related to the amide fragment of N-Acetyl glucosamine. Another important vibration mode observed at 1169 cm-1 belongs to the C-O stretching vibration mode in glycosidic linkage, COH stretching, and C-H in-plane bending. Also, glycosidic linkage was observed at 978 cm-1 (symmetric C-O stretching), 931 cm-1 (C‑O stretching), and 833 cm-1 (C-C stretching of α-glycosidic linkage). All assignments were done according to literature [20][21]. All vibrational modes individually and collectively indicate the presence of polysaccharides.
Figure 40.2 represents the control and sample spectra of cell debris with their difference spectra. The spectral assignment is similar to that presented in Figure 40.1. In both cases, the positive intensities in the difference spectra are related to sample-contained N-Acetyl glucosamine fragment and polysaccharides.
In hopes for further research, applications, and ideas for future iGEM teams, we investigated if individual proteins from the system can be expressed and purified.
Tetrad assembly proteins of the holdfast synthesis system (HfsE,HfsG, HfsH, HfsJ, HfsK, HfsL) build a short chain of sugar monomers in a specific sequence. They create a glycolipid — a phospholipid with a covalently attached carbohydrate molecule.
Usually, glycolipids are predominantly located on the extracellular surface of eukaryotic cell membranes and are responsible for various functions [20][21]. They can act as receptors for viruses and other pathogens, allowing them to enter a specific host cell, because every type of cell in the human body has distinct glycolipid markers [22][23][24]. This feature will let us, in the future, use said glycolipids as labels for a precise and targeted liposome distribution throughout the body, delivering anything from cancer drugs to gene editing systems directly to the target cells, serving as an alternative for holdfast system proteins usage or as an idea for future iGEM teams. We chose to exclude HfsE protein because of the previous existing research on its purification [5].
Goal: Expression and purification of CB2/CB2A/HB HfsG, HfsH, HfsJ, HfsK, HfsL proteins.
Strategy: Adding 6x histidine tags to one of the terminus of each protein, purifying using immobilized metal ion affinity chromatography, and analyzing expression and purification using SDS-PAGE gels.
Results: Successful expression of all 5 proteins from C. crescentus CB2 and CB2A. Effective expression of all except for HfsJ proteins from Hirschia baltica. Successful purification of CB2 and CB2A HfsG, HfsH, hfsJ, HfsK, HfsL. Only hfsH and HfsK of H. baltica proteins could be purified (see Parts).
We chose BL21(DE3) strain for adjustable and efficient expression of target proteins since systems’ proteins were the best expressed in this strain. Given the lack of time, we went with conditions optimized beforehand in earlier experiments for the whole pathway expression: temperature of 37°C, induction with 0.5 mM IPTG concentration, and expression for 3 hours.
After SDS-PAGE gel analysis, we concluded that we successfully expressed all 5 proteins from C. crescentus CB2 and CB2A strains (Fig. 41.1-2). We noticed, that HfsJ and HfsL glycosyltransferases are visible in lower quantities in comparison with the other proteins. Both of these protein expression conditions need to be further investigated and optimized.
Protein name | Size, kDa |
---|---|
hfsG | 34 |
hfsH | 27.9 |
hfsJ | 34.7 |
hfsK | 43.3 |
hfsL | 33.3 |
Protein name | Size, kDa |
---|---|
hfsG | 34 |
hfsH | 27.9 |
hfsJ | 34.7 |
hfsK | 43.3 |
hfsL | 33.3 |
Surprisingly, proteins of H. baltica wereexpressed in even higher quantities than C. crescentus in the same conditions (Fig. 51). Looking back at the holdfast synthesis system expressions we were unable to succeed in the expression optimization of the H. baltica pathway, however, proteins are clearly synthesized in the same conditions when expressed on their own. HfsJ protein is, similar to C. crescentus, not visible (Fig. 41-42).
Protein name | Size, kDa |
---|---|
hfsG | 37 |
hfsJ | 29 |
hfsK | 41 |
hfsH | 28 |
hfsL | 36 |
After successful expression, we proceeded to work on the purification of his-tagged proteins. We went with immobilized metal ion affinity chromatography (IMAC). Adapting protocols from the little research that was available, we used HisPur Ni-NTA Spin Columns (Thermo Scientific) [25][5][4]. Equilibration, wash, and elution buffers contained 10mM Tris pH 7.4, 150mM NaCl, and 10mM, 75mM and 500mM imidazole, respectively. All proteins were successfully purified, unfortunately, in different quantities. HfsG, HfsH, and HfsK are purifiable in substantial quantities in the conditions used that the purification conditions are optimal (Fig. 43.1, 2, 4). HfsL is also clearly visible, although in lesser amounts (Fig. 43.5). HfsJ eluted earlier than expected at 75mM imidazole, suggesting that either the position of the 6xHis-Tag is on the wrong terminus or the tag is flexible, making weak interactions between Ni-NTA resin (Fig. 43.3).
Similar results were expected from the purification of C. crescentus CB2A proteins. HfsG, HfsH, and HfsK were purified in significant quantities (Fig. 44.1, 2, 4). Surprisingly, in this case, HfsJ was better purified and eluted in the same fraction as the other proteins — at 500 mM imidazole (Fig. 44.3). Another unexpected result was observed during HfsL protein purification (Fig. 44.5), where it eluted earlier than in the case of CB2, at 75 mM imidazole. This may be because proteins are dynamic structures that can fold in multiple ways. Our His-tagged end might have folded into the globular region of the protein, making it inaccessible for purification and causing weak interactions and premature elution.
H. baltica proteins were much trickier to purify. Only two of them were present after analyzing SDS-PAGE gel after purification: HfsH and HfsK, despite most of them being well-expressed (Fig. 45.1, 2). HfsK protein eluded at lower imidazole concentration - 75 mM. These results are likely to be explained by the natural environment of the host organism - Hirschia baltica. Its’ natural marine environment is high in salt concentration, thus proteins must be adapted to that specific ionic strength. Our purification buffers do not contain large amounts of salt and it might cause misfolding and aggregation of said proteins into insoluble inclusion bodies making them difficult to purify. More optimisation is required in order to purify these proteins, focusing more on salt composition of the buffers.
In hopes for further research, applications, and ideas for future iGEM teams, we showed that individual proteins of the holdfast synthesis system can be successfully expressed and purified. More testing on their enzymatic activity is needed, but we give a strong foundation for their future application for liposome labeling or any other implementation of choice.
A lot of foundational work was done, but the scientific journey is never truly over. We optimized holdfast synthesis pathway protein expression in E. coli, found suitable and cost effective substrates for its production and verified polysaccharide adhesive synthesis. There are multiple paths that need to be explored further than we had the time to.
E. coli cells produced holdfast polysaccharides that were not excreted into the environment but stayed within the cells.There are multiple reasons why that could be the case and every one of them is a possibility that needs more investigation.
After successful protein expression and holdfast production another big riddle has to be solved - holdfast purification. There is only one established protocol for holdfast purification for C. crescentus polysaccharides that proved to not work for our application [8]. There is a need for further investigation in this area. Some techniques used for different polysaccharide purification can be useful in this case [43].
After purification from E. coli, bacterial adhesive physiochemical properties have to be studied. This could include atomic force microscopy measurements as it was done in previous C. crescentus holdfast properties research [1]. On top of that, HPLC-MC analysis would help us identify the exact composition of the polysaccharide produced.
It is also important to test the biocompatibility of the polysaccharides using, starters, ELISA assay with blood serum of people who are allergic to the materials in the chemical adhesives. This way we can start a long way of medical trials to get our product to the consumers.
After figuring out the problem with co-polymerases HfsC and HfsI and ensuring holdfast shedding and its biocompatibility it is necessary to make the synthesis stable and suitable for the industry. Antibiotic usage does not only add to the global antibiotics resistance problem but also is simply expensive and not profitable for the industry. To solve the issue we need to eliminate the usage of antibiotics in the process of holdfast biosynthesis. Not only that but IPTG - a standard inductor - is also not cost friendly, so avoiding the usage would also be beneficial. This could be achieved in multiple ways.
Depending on the outcome, if sufficient expression under constitutive promoters is not possible, similar procedures, just using T7 promoter are also viable. In this case we would still have the benefit of not using antibiotics and IPTG would let us have a tunable level of expression in E. coli.
After the clinical trials and industrial upscale optimization we will be able to help millions of medical patch users to have a better quality of life while reducing environmental concerns from the industry.