During the need-scoping process of our integrated human practices, we engaged with a variety of professionals and literature, to isolate 3 areas of focus we could develop a solution for
Build: We attempted to build a novel Fab using OptMAVEn-2.0 (2).
Test: When using OptMAVEn-2.0, we ran into an impassable error since we were trying to build the antibody off non-protein residues (the PFOA molecule)
Learn: We learned from further literature review that antibodies don’t typically bind to small molecules. We were not going to be able to use a traditional Fab. From our literature review, we learned that albumin is a natural protein that has the ability to bind to PFOA with a respectable binding affinity in the micromolar range. We then sought to complex the pocket of Albumin that binds to PFOA to a Fc domain region of antibody, since the Fc region of the antibody is where the J chain attaches and how pIgR mediates transcytosis.
Build: We designed a protein sequence for a pseudo-antibody that had a N→C continuous pocket of albumin that was found to bind to PFOA. The pocket cutoff parameters was simply that the secondary structures which contained the PFOA binding residues (which were identified in the paper which did crystallized the Albumin-PFOA complex) At the C terminus of this pocket we appended the hinge and Fc domain from a human IgA antibody. To build this model, we created the sequence and then predicted its structure with Alpahfold 2.
Test: Before we ordered the gene sequence, we discussed it with our scientific advisors, and realized it would be extremely difficult to express IgA proteins in E.coli due to E.coli’s lack of post-translational protein machinery, the protein’s large size which could stress E. coli, and abundance of disulfide bonds, which are very difficult to form in the reducing environment of the E.coli. cytoplasm.
Learn: We did further literature review. One option we considered was trying to express the chimeric protein in a new, more complex, expression system like yeast– but this would require considerable upfront costs. Fortunately, we found a paper describing an attempt to harness the transcytosis ability of pIgR through the attachment of a small peptide that could be attached to the C-terminus of an IgG antibody. We contacted the authors of the paper and set up a meeting with him. He told us how based on the results, we could get the transcytosis effect we desired with the minimal peptide sequence his lab used, as compared to the Fc domain.
Build: We removed the hinge and Fc domain from Design 2 and replaced it with the peptide QRNPKLKLIRRHPTLRIPPI. The original moniker for this sequence was QRN, but for our project we have called it PASR (Peptide thAt triggerS pIgR-mediated transcytosis). The PASR peptide has been shown to bind to human and mouse pIgR at comparable affinities in-vitro. The PASR sequence has been shown to bind to pIgR and facilitate transcytosis when complexed to c-terminus of IgG in mice. The peptide has been optimized to express well and not aggregate.
Test: We used the linker sequence used in the original paper, GGGGS, to link the PFOA-binding pocket to the PASR peptide. We created the sequence and appended it to both the N and C termini of the pocket and then folded the sequences with Alpha Fold 2. Learn: Likely due to its small size and abundance of charged residues, alpahfold2 predicted the PASR to have no secondary structure. For this reason, for further designs we decided to append the PASR to the N-terminus, as we did not want it to destabilize the C-terminus alpha helix (which has some residues in the binding pocket.) The N-terminus is further away from the binding pocket than the C-terminus.Build: We isolated the correct, high-affinity pocket of Albumin, which luckily enough was continuous from N→C. We could modify our design by simply replacing the old pocket with the new pocket. Like the first design, we defined our pocket to be the minimal number of residues needed to not disturb any secondary structures which the binding amino acids were contained in.
Test: We tested appending the PASR peptide to both the N and C terminus. By this time, Alpha Fold 3 had been released, so we used Alpha Fold 3 to fold designs with the PASR sequence appended to either the N or C terminus with the GGGGS linker.
Learn: In this pocket, both the N-terminus and C-terminus were predicted to be unstable when the PASR peptide was appended. This would be problematic since the N and C termini were both very close to the binding pocket, and an intrinsically disordered region (IDR) near this could cause the binding pocket to be unavailable as the IDR flailed around.
Build: We could either expand the N or C terminus to add more space for the PASR peptide to move around without blocking the binding pocket. We decided to expand the C terminus for multiple reasons:
Test: We tested multiple lengths of an expanded c-terminus based on the same principle of keeping secondary structures intact. We predicted stability of the proteins by predicting the structure and assessing them in Alpha Fold 3. We predicted the structure of both the binding pocket with the PASR and linker on the N and the newly expanded C terminus.
Learn: The pocket was most stable when we expanded the pocket structure by adding 2 alpha helices on the c-terminus side from the real albumin. Importantly, these residues have not been shown to bind to any other ligands, and resulted in a PASR peptide that was ~110% more further away from the PFOA ligand (~11 Angstrom vs ~23 Angstroms.)
However, an old issue came up in our new design: there were methionines, cysteines and a disulfide bond in the pocket, which as previously mentioned, can cause folding issues when E.coli is used as the expression system.
Build: We wanted to see if LigandMPNN could redesign the PFOA pocket and increase its binding affinity and also optimize for protein stability by removing cysteine and methionine. We input Design 5 into LigandMPNN with the default parameters, but restricted cysteines and methionines from being in our designed peptide sequences.
Test: Before we ordered the designs, we realized we would not have the wet lab resources needed to properly utilize LigandMPNN, which requires considerable wet-lab based optimization after an initial screening is done
Learn: Although we could not use LigandMPNN to increase binding affinity, we could still use it to redesign the non-binding residues for stability by removing the cysteines and methonines, which as previously mentioned cause protein folding issues in E.coli expression systems. The proof of concept for this approach has already been tested and requires only a fraction of the wet lab testing, which was a workload suitable for our lab.
Build: We put Design 5 through the LigandMPNN algorithm, but fixed all residues within 7 angstroms of the PFOA ligand (except two cysteines that formed a disulfide bridge). We once again restricted cysteine and methionine from being incorporated into the generated protein sequences and took the fixed residues as context to build the rest of the protein.
Test: LigandMPNN is a stochastic algorithm, meaning it does not converge on just one solution, but rather generates many sequences that could work. We folded 120 of these designs with Alpha Fold 3 and selected the top 4 designs with the lowest RMSD, highest pTm and highest plDDT scores to order for wet lab testing.
Final Design Scheme: Residues colored green represent amino acids that were kept fixed during the LigandMPNN procedure.
Build: Our initial plan involved using a sandwich ELISA to assess the binding affinity between our engineered binder, containing the FA4 pocket and PASR sequence, and PFOA/PFOSA. PFOA/PFOSA would be immobilized onto the ELISA plate (blue box) via physical absorption. The binder, which included a his-tag (black) from the plasmid for detection, would then be introduced. We intended to detect the binding event using an anti-his-tag antibody (yellow), followed by an anti-mouse IgG (brown) conjugated to an enzyme (red) that would catalyze a fluorescent reaction upon binding with substrate (purple). This fluorescence would provide a quantitative measurement of the binder's ability to interact with PFOA/PFOSA.
Test: We planned to perform the ELISA in a sandwich format, where PFOA/PFOSA would be immobilized on the plate, and the engineered binder, with its his-tag, would bind to these molecules. The detection mechanism would involve a secondary antibody system to amplify the signal, providing measurable fluorescence when successful binding occurred. A control group using unmodified albumin would establish a baseline for comparison.
Learn: However, through advice from a mentor, we quickly realized that the ELISA assay was not suitable for our purpose. The main issue was the size of the PFOA/PFOS molecules, which are significantly smaller than typical antigens used in ELISAs. For example, insulin (a small protein) is around 6 kDa, while PFOA is roughly 200 Da. ELISA is more appropriate for larger proteins, and the size of perfluoroalkyl compounds like PFOA makes it unlikely they could be effectively immobilized for a sandwich assay. Additionally, Biolayer Interferometry (BLI) faced similar challenges due to the small size of the molecules and the expense and availability of BLI tips. Based on this insight, we abandoned the ELISA and BLI approaches.
Instead, we shifted our focus to Differential Scanning Fluorimetry (DSF), which allows us to assess the thermal stability of the binder in the presence of PFOA. In DSF, we monitor tryptophan fluorescence as the protein is heated, tracking the transition from folded to unfolded states. A higher melting temperature (Tm) indicates greater stability, meaning better binding to PFOA. Our goal is to see a high Tm for PFOA, showing strong interaction with the binder, and a lower Tm for myristic acid, serving as a control to ensure specificity.