Overview
To screen patients with cardiovascular diseases for their risk of
cancer, our project aims to develop a LIRA-based system to target miRNA
that are both highly expressed in CVD patients and capable of promoting
cancer. LIRA is a designed RNA sequence with stem-loop structures, which
could be activated by its target RNA to initiate the expression of
reporter gene. To establish an ideal LIRA system, our modeling group
mainly need to revolve the following four questions.
Question 1:
Which miRNA to target?
Question 2:
How to design LIRA to target miRNA?
Question 3:
How to improve the design of miRNA?
Question 4:
How to tailor LIRA-mediated reaction for samples with miRNA at
different concentrations?
To answer
Question 1
, we applied differential analysis in GEO database to identify miRNA
that are highly expressed in patients post myocardial infarction (MI) or
with heart failure. After differential analysis of miRNA in cancer
database from TCGA, we identified miR-142-3p and miR-210-3p as ideal
targets, as they are both highly expressed in multiple types of cancer
and in post MI patients. Additionally, we also verified that these two
miRNAs could promote migration and invasion of cancer cells with wet lab
experiments.
To address
Question 2
and
Question 3
, we designed single-arm LIRAs to target miR-142-3p or miR-210-3p
initially. Next, we designed double-arm LIRAs to target miR-142-3p and
miR-210-3p, which could only be activated in the presence of both
miRNAs. To optimize the design of LIRA, we analyzed multiple indicators
of LIRA, and used Multinomial Logistic Regression to evaluate the
potential contribution of these indicators to the expected performance
to LIRA. To improve model performance, we introduced Random Forest to
identify key indicators that could influence the stability and
functionality of LIRA.
To answer
Question 4
, we conducted kinetic modeling to simulate the reaction of
LIRA-mediated expression of reporter gene to obtain predicted time and
concentration curves of the reaction. Furthermore, using 3D curves, we
simulated the reaction with miR-142-3p and miR-210-3p at different
concentrations, and estimated suitable concentration of LIRA for
detecting miRNAs at a variety of concentration.
In summary, our modeling work laid the ground for our wet lab
experiments that generates the LIRA to target miR-142-3p and miR-210-3p,
and paved the way for the future application of the LIRA in the
screening of cancer risk in post-MI patients.
ldentify target miRNA for cancerscreening in CVD patients
Epidemiology studies have revealed that patients with CVDs, such as
myocardial infarction (MI), heart failure (HF) and atherosclerosis, have
increased risk of getting cancer [1-4]. For instance, MI
patients exhibit increased prevalence of a variety of cancers and
elevated mortality that are caused by cancer [1]. HF patients
also show higher cancer risk compared to healthy individuals
[5]. Experiments using animal models reveal that MI enhances
tumor progression through circulating factors and immune reprogramming
[6,7]. Especially, MI leads to aberrant miRNA secretion from
the inflicted heart into blood, which could circulate to distant organs
and contribute to cancer progression at distant organs [8].
To find good targets for our cancer screening kit for CVD patients, we
analyzed database of CVD, including MI and HF, and databases of multiple
cancers, to identify miRNA that show increased expression in both CVD
and cancer, and then verified the function of the identified miRNA on
cancer progression.
Analysis of miRNAs in MI and cancer databases
The whole process of our analysis of MI and cancer database is shown in
Figure 1. First, we searched Gene Expression Omnibus (GEO) database
using myocardial infarction and miRNA as key words and collected 318
datasets. Since our aim is to identify highly expressed miRNA in the
blood of MI patients, we manually excluded 305 irrelevant datasets based
on the sample sources, species and disease descriptions. To the
remaining 13 datasets, we calculated the log2FC value of the
miRNAs based on the ratio of the mean values of the patient samples and
control and applied logarithmic transformation to the ratio
[9]. We used Student's t-test to obtain P-values for each
miRNA. Furthermore, to reduce the occurrence of false positives, we
adjusted the P-values by comparing these P-values with the threshold
selected for an acceptable false discovery rate (FDR) [10],
and used adj.P value to refer FDR. We used the log2FC > 1
and adj.P Value < 0.05 as our screening criteria [11], and
got 9 datasets that contained significantly upregulated miRNA.
Figure 1. Flowchart of miRNA analysis in MI and cancer database
Second, we summarized multiple epidemiology studies and identified 33
types of cancers showing faster progression or higher risk in CVD
[12-16]. We conducted differential analysis of miRNA
expression of these 33 types of cancer using the Cancer Genome Atlas
Program (TCGA) database and obtained 22 cancer datasets that show
significantly upregulated expression of miRNA using our screening
criteria (log2FC > 1 and FDR < 0.05). The names of
these 22 types of cancers are listed in Table 1.
Table 1. The 22 types of cancer that show significantly upregulated
expression of miRNA
Abbreviation | Disease Name | Abbreviation | Disease Name |
---|---|---|---|
BLCA | Bladder Urothelial Carcinoma | LUAD | Lung Adenocarcinoma |
BRCA | Breast Invasive Carcinoma | LUSC | Lung Squamous Cell Carcinoma |
CESC | Cervical Squamous Cell Carcinoma & Endocervical Adenocarcinoma | PAAD | Pancreatic Adenocarcinoma |
CHOL | Cholangiocarcinoma | PCPG | Pheochromocytoma & Paraganglioma |
COAD | Colon Adenocarcinoma | PRAD | Prostate Adenocarcinoma |
ESCA | Esophageal Carcinoma | READ | Rectum Adenocarcinoma |
HNSC | Head And Neck Squamous Cell Carcinoma | SKCM | Skin Cutaneous Melanoma |
KICH | Kidney Chromophobe | STAD | Stomach Adenocarcinoma |
KIRC | Kidney Renal Clear Cell Carcinoma | THCA | Thyroid Carcinoma |
KIRP | Kidney Renal Papillary Cell Carcinoma | THYM | Thymoma |
LIHC | Liver Hepatocellular Carcinoma | UCEC | Uterine Corpus Endometrial Carcinoma |
We compared the significantly upregulated miRNAs in MI and cancers to
identify miRNAs that are significantly upregulated in both diseases and
obtained 24 miRNAs. We think that two criteria are especially important
for the potential effects on cancer promotion of these 24 miRNAs during
CVD:
1. log2FC values in MI. A high log2FC value of the
miRNA in MI demonstrates a large increased amount of this miRNA in the
blood of MI patients, which indicates that a large amount of this miRNA
is likely to reach distant organs through circulation system and exerts
its potential cancer-promoting effects at distant organs.
2.Types of cancers that show significantly upregulated expression of
these miRNAs. miRNAs that are highly expressed in multiple types of
cancers could potentially influence the progression of multiple types of
cancers. We would like to design a screening kit to identify MI patients
with increased risk of getting of a variety of cancers, instead of a
certain type of cancer.
We summarized these two key information of the 24 miRNAs in Table 2.
Table 2. miRNAs significantly upregulated in both MI and cancers
miRNA ID | log2FC Value in MI | Types of Cancers | miRNA ID | log2FC Value in MI | Types of Cancers |
---|---|---|---|---|---|
miR-210-3p | 6.18 | 13 | miR-320b | 1.49 | 1 |
miR-142-3p | 3.51 | 10 | miR-199a-3p | 1.48 | 3 |
miR-455-5p | 3.09 | 1 | miR-362-3p | 1.39 | 1 |
miR-483-3p | 2.79 | 1 | miR-582-3p | 1.31 | 1 |
miR-199b-5p | 2.31 | 3 | miR-146b-5p | 1.26 | 9 |
miR-629-3p | 2.19 | 1 | miR-455-3p | 1.23 | 5 |
miR-330-5p | 2.14 | 4 | miR-486-5p | 1.17 | 2 |
miR-483-5p | 2.02 | 1 | miR-28-5p | 1.15 | 1 |
miR-576-5p | 1.95 | 4 | miR-671-5p | 1.06 | 3 |
miR-107 | 1.82 | 1 | miR-628-5p | 1.05 | 1 |
miR-450b-5p | 1.58 | 3 | miR-142-5p | 1.03 | 4 |
miR-199a-5p | 1.52 | 1 | miR-127-5p | 1.01 | 4 |
To further evaluate the log2FC in MI and types of cancers of
these 24 miRNAs, we applied a scatter dots graph to visualize these data
(Figure 2). MiR-210-3p and miR-142-3p distinguish from other miRNAs in
this graph, as they show both much higher log2FC values in MI
and much more types of cancers that showed significantly upregulated
expression of them than other miRNAs.
Figure 2. Scatter dots graph of the 24 miRNA on log2FC in MI
and Types of Cancers
As shown in Table 2, miR-210-3p ranks the highest in log2FC
value in MI (6.18), which demonstrate 73 folds of increased expression
in MI patients, and show significantly upregulated expression in 13
types of cancers, with log2FC values in cancer ranged from
1.02 to 5.04. The log2FC values of miR-210-3p in 13 types of
cancers are shown in Table 3.
Table 3. Types of cancers with significantly upregulated expression of
miR-210-3p
Types of Cancers | log2FC values in Cancer |
---|---|
Cervical squamous cell carcinoma and endocervical adenocarcinoma |
5.04 |
Bladder Urothelial Carcinoma | 4.80 |
Lung Adenocarcinoma | 4.54 |
Lung Squamous Cell Carcinoma | 4.51 |
Uterine Corpus Endometrial Carcinoma | 3.14 |
Breast Invasive Carcinoma | 3.14 |
Kidney Renal Clear Cell Carcinoma | 3.07 |
Head and Neck Squamous Cell Carcinoma | 2.52 |
Kidney Renal Papillary Cell Carcinoma | 1.82 |
Esophageal Carcinoma | 1.58 |
Stomach Adenocarcinoma | 1.47 |
Prostate Adenocarcinoma | 1.30 |
Liver Hepatocellular Carcinoma | 1.02 |
miR-142-3p ranks the second highest in log2FC value in MI
(3.51), which demonstrated 11 folds of increased expression in MI
patients, and show increased expression in 10 types of cancers, with a
log2FC values in cancers range from 1.04 to 6.61. The
log2FC values of miR-142-3p in 10 types of cancers are shown
in Table 4.
Table 4. Types of cancers with significantly upregulated expression of
miR-142-3p
Types of Cancers | log2FC values in Cancer |
---|---|
Rectum Adenocarcinoma | 6.61 |
Colon Adenocarcinoma | 6.55 |
Cervical squamous cell carcinoma and endocervical adenocarcinoma |
3.58 |
Lung Adenocarcinoma | 2.29 |
Kidney Renal Clear Cell Carcinoma | 2.03 |
Breast Invasive Cancer | 1.87 |
Uterine Corpus Endometrial Carcinoma | 1.27 |
Bladder Urothelial Carcinoma | 1.24 |
Lung Squamous Cell Carcinoma | 1.09 |
Head and Neck Squamous Cell Carcinoma | 1.04 |
To summarize, our analysis shows that miR-142-3p and miR-210-3p are the
best target miRNAs to identify MI patients with high risk of getting
cancers.
Analysis of miRNA in HF and cancer databases
HF is a condition resulting from the impairment of cardiac contractility
and/or diastolic function[17].Current study also reveals that
HF can promote cancer [6]. Hence we also analyzed miRNAs that
are highly expressed in HF patients in GEO database. The analysis
process is shown in the flowchart (Figure 3).
Figure 3. Flowchart of miRNA analysis in HF and cancer database
We collected 158 datasets from GEO database using heart failure and
miRNA as key words. After we manually excluded 151 irrelevant datasets
based on the sample sources, species and disease descriptions, we got 7
datasets to analyze miRNA expression. Using log2FC > 1 and
adj.P Value < 0.05 as screening criteria, we got 6 datasets that
contain significantly upregulated miRNAs in HF. We compared the
significantly upregulated miRNAs in HF with those in cancers, which was
described in the previous section, we got 35 miRNAs that show
significantly upregulated expression in both diseases. We summarized the
log2FC in HF and types of cancers that show significantly
upregulated expression of these miRNAs in Table 5.
Table 5. miRNAs significantly upregulated in both HF and cancers
miRNA ID | log2FC Value in HF | Types of Cancers | miRNA ID | log2FC Value in HF | Types of Cancers |
---|---|---|---|---|---|
miR-944 | 5.60 | 1 | miR-331-5p | 1.75 | 1 |
miR-223-3p | 4.45 | 5 | miR-203a-3p | 1.70 | 6 |
let-7f-5p | 4.26 | 2 | miR-628-5p | 1.51 | 2 |
miR-15a-5p | 3.54 | 4 | miR-143-5p | 1.47 | 1 |
miR-16-5p | 3.46 | 5 | miR-18a-5p | 1.45 | 5 |
let-7g-5p | 3.25 | 2 | miR-15b-5p | 1.45 | 3 |
miR-126-3p | 3.20 | 2 | miR-301a-3p | 1.41 | 7 |
miR-19b-3p | 2.60 | 4 | miR-106a-5p | 1.30 | 8 |
miR-451a | 2.56 | 4 | miR-20b-5p | 1.24 | 4 |
miR-26b-5p | 2.52 | 2 | miR-17-5p | 1.23 | 13 |
miR-330-5p | 2.50 | 4 | miR-205-5p | 1.19 | 6 |
miR-4746-5p | 2.32 | 2 | miR-484 | 1.16 | 1 |
miR-455-3p | 2.19 | 5 | miR-642a-5p | 1.13 | 1 |
miR-9-5p | 2.19 | 7 | miR-944 | 1.12 | 1 |
miR-142-3p | 2.13 | 10 | miR-7-1-3p | 1.10 | 4 |
miR-1976 | 2.11 | 1 | miR-425-5p | 1.08 | 7 |
miR-200c-3p | 2.01 | 10 | miR-4662a-5p | 1.06 | 1 |
miR-125a-3p | 1.94 | 1 |
To further evaluate the log2FC in HF and types of cancers of
these 35 miRNAs, we applied a scatter dots graph to visualize these data
(Figure 4). Unlike the results of analysis in MI, these miRNAs all
clustered in the lower-left part of the graph, demonstrating that no
miRNA showed superior performances on both log2FC values in
HF and types of cancers that showed significantly upregulated
expression.
Figure 4. Scatter dots graph of the 24 miRNA on log2FC in HF
and Types of Cancers
In summary, we did not identify ideal miRNAs as targets for cancer
screening from our analysis of HF database.
Selection of miR-210-3p and miR-142-3p as target miRNAs for cancer
screening in MI patients
From our analysis of MI database, we found miR-210-3p and miR-142-3p are
two outstanding candidates targets to identify MI patients with high
risk of cancer. We used an ROC curve to evaluate the diagnostic
performance of miR-142-3p and miR-210-3p on our collected raw dataset
samples from MI (Figure 5). We found that combined ROC curve of
miR-210-3p and miR-142-3p (AUC=0.78) showed better results than ROC of
miR-210-3p alone (AUC=0.66) and ROC of miR-142-3p alone (AUC=0.60),
which demonstrates that using miR-210-3p and miR-142-3p as target miRNA
simultaneously could decrease the probability of false discovery rate of
detecting MI than using these 2 miRNAs as target miRNA separately.
Figure 5. ROC Curves with miR-210-3p and miR-142-3p in MI
We also revisited our HF screening dataset to specifically analyze
miR-142-3p and miR-210-3p. MiR-142-3p was identified from our analysis
of HF dataset with log2FC value (2.13), which demonstrated a
5 folds increased expression (Figure 4). MiR-210-3p failed to be
selected from our analysis of HF dataset due to the log2FC
value (0.67) did not pass our screening criteria.
From our analysis, we suggest using both miR-210-3p and miR-142-3p
together as targets to screen MI patients to identify those with high
risks of getting cancer.
Verification of miR-210-3p and miR-142-3p in cancer cells
Through bioinformatic analysis, we identified miR-210-3p and miR-142-3p
as potential targets for screening cancer risk in MI patients. We did
literature research of these two miRNA, and found that they are involved
in the regulation of migration and invasion of cancer cells
[18-21] .To verify whether miR-210-3p and miR-142-3p can
promote migration and invasion of cancer cells, mimics and inhibitors of
miR-210-3p or miR-142-3p were transfected into human renal clear cell
carcinoma cell line 786-o cells, and the transfected cells were tested
by Wound Healing and Transwell assays.
Ⅰ. miR-210-3p
To verify the function of miR-210-3p in cancer cells, we did Transwell
and Wound Healing assays with 786-o cells transfected with mimics of
miR-210-3p (mimics) and inhibitor of miR-210-3p (inhibitor). In Wound
Healing assay, we evaluated the relative migration levels through
measuring the wound area of 786-o cells transfected with control, mimics
and inhibitor of miR-210-3p. Statistical analysis of the Wound Healing
assays shows that 786-o cells transfected with mimics of miR-210-3p
migrated faster than control cells and 786-o cells transfected with
inhibitor of miR-210-3p migrated slower than control cells (Figure 6).
Next, we evaluate the invasion levels through counting the amounts of
786-o cells that colonized in the lower well of transwell assays.
Statistical analysis demonstrated that 786-o cells transfected with
mimics of miR-210-3p showed higher invasion capability than control
cells and 786-o cells transfected with inhibitor of miR-210-3p showed
lower invasion ability than control cells (Figure 7). To summarize,
786-o cells with overexpression of miR-210-3p have increased migration
and invasion capabilities, demonstrating that miR-210-3p could promote
the migration and invasion of cancer cells.
Figure 6. Wound Healing assays of 786-o cells transfected with control,
mimics-210-3p and inhibitor-210-3p. (A) Representative images of Wound
Healing assays. (B) Statistical analysis of relative migration levels in
the wound healing assays (n = 6). Data are presented as means ± SEM. *
indicates p < 0.05, and ** indicates p < 0.01. Standard error and
statistics are shown in graph.
Figure 7. Transwell assays of 786-o cells transfected with control,
mimics-210-3p and inhibitor-210-3p. (A) Representative images of
Transwell assays. (B) Statistical analysis of relative invasion levels
in the Transwell assay (n=5). Data are presented as means ± SEM. ***
indicates p < 0.001. Standard error and statistics are shown in
graph.
Ⅱ. miR-142-3p
Similarly, to verify the function of miR-142-3p in cancer cells, we did
Transwell and Wound Healing assays with 786-o cells transfected with
mimics of miR-142-3p (mimics) and inhibitor of miR-142-3p (inhibitor).
In Wound Healing assay, we evaluated the relative migration levels
through measuring the wound area of 786-o cells transfected with
control, mimics and inhibitor of miR-142-3p. Statistical analysis of the
Wound Healing assays shows that 786-o cells transfected with mimics of
miR-142-3p migrated faster than control cells and 786-o cells
transfected with inhibitor of miR-142-3p migrated slower than control
cells (Figure 8). Next, we evaluate the invasion levels through counting
the amounts of 786-o cells that colonized in the lower well of Transwell
assays. Statistical analysis demonstrated that 786-o cells transfected
with mimics of miR-142-3p showed higher invasion capability than control
cells and 786-o cells transfected with inhibitor of miR-142-3p showed
lower invasion ability than control cells (Figure 9). In summary, 786-o
cells with overexpression of miR-210-3p have increased migration and
invasion capabilities, demonstrating that miR-210-3p could promote the
migration and invasion of cancer cells.
Figure 8. Wound Healing assays of 786-o cells transfected with control,
mimics-142-3p and inhibitor-142-3p. (A) Representative images of Wound
Healing assays.(B) Statistical analysis of relative migration levels in
the wound healing assays (n = 6). Data are presented as means ± SEM. *
indicates p < 0.05. Standard error and statistics are shown in graph.
Figure 9. Transwell assays of 786-o cells transfected with control,
mimics-142-3p and inhibitor-142-3p. (A) Representative images of
Transwell assays. (B) Statistical analysis of relative invasion levels
in the Transwell assay (n=5). Data are presented as means ± SEM. ***
indicates p < 0.001. Standard error and statistics are shown in
graph.
Together, the results of transwell and Wound Healing assays indicated
that increased expression of miR-210-3p and miR-142-3p in MI patients
might enhance the invasion and migration of cancer cells, which supports
our idea of developing a screening kit that targets miR-210-3p and
miR-142-3p to MI patients with high risk of cancer.
Design LIRA to target miR-210-3p and miR-142-3p
To design a screening kit that can target miR-210-3p and miR-142-3p, we
adapted from an innovative approach named Loop-Initiated RNA Activator
(LIRA) that was used to detect viral RNA[18]. As shown in
Figure 10, LIRA forms a stem-loop structure that mainly contains a
recognition region, a ribosome binding sequence (RBS) and a start codon
AUG. Without interaction with target RNA, the RBS and start codon in
LIRA are kept in a close status in the stem structure, making them
impossible to initiate translation of downstream reporter gene. The
recognition region is a 31 nucleotides (nt) sequence that is reverse
complimentary to the target RNA of LIRA. While 21 nt of the reverse
complimentary sequence is designed to locate in the loop structure of
LIRA, which is named as a*, 10 nt of the reverse complimentary sequence
is designed to locate in the stem structure of LIRA, which is named as
b*. As a result, the target RNA of LIRA can bind to both a* and b*, and
the binding of the target RNA to b* destabilize the stem structure that
locks RBS and start codon, which makes them to be exposed in an open
status for the translation initiation of downstream reporter gene. Due
to the complexity of secondary structure and complicated hybridization
process of RNA, we used NUPACK software to design and analyze LIRA
sequences in the subsequent study.
Figure 10. Illustration of LIRA system[22]
Design LIRAs to target miR-210-3p and miR-142-3p separately
To design LIRA sequences that target miR-210-3p and miR-142-3p
separately, we first determined the composition and structure of each
part of LIRA, and replaced the recognition region with reverse
complementary sequence of miR-210-3p and miR-142-3p. We used the design
function of NUPACK to design the rest of LIRA sequences. Next, we used
the analysis function of NUPACK to simulate the hybridization process of
LIRA, and the nucleotide bases in the mismatch region of the LIRA with
abnormal structure were modified to achieve our design expectation,
which keep RBS and AUG at a closed status in a stem region without
interactions with target miRNA and release RBS and AUG to an open status
in a single strand RNA with interactions with target miRNA. The
procedure of our LIRA designing is listed below.
During the process of designing, we found that our situation differed
from the case reported in the literature in that the target RNA used in
the literature was 31nt, while the target miRNAs used by our team were
22nt and 23nt respectively[18]. Due to the difference in the
length of the target RNA, we were unable to use the way of LIRA design
reported in the literature directly. To determine the distribution of
the recognition region on our LIRA, we defined a concept of stem-loop
ratio: the ratio of the number of nucleotides of recognition region in
the stem structure to the number of nucleotides of recognition region in
the loop structure. We tried to design LIRA structure using different
stem-loop ratios. We found that when the number of nucleotides of the
recognition region in the stem is too large, it will partially overlap
with the complementary sequence of RBS, which will lead to the exposure
of the RBS sequence. Therefore, we designed LIRA to miR-210-3p with the
stem-loop ratios from 0/22 to 13/9 and LIRA to miR-142-3p from 0/23 to
13/10. The hybridization of LIRA to their target miRNAs was also
analyzed with NUPACK. The pairing results and related data are shown in
Table 6 and Table 7.
Table 6. Hybridization results of 14 LIRA structures of miR-210-3p and
related data
Stem-loop ratio | Single-arm sequence | Conformation of LIRA alone | Conformation of LIRA hybridized with miR-210-3p |
---|---|---|---|
0/22 | CUUGCUGGUGCUCCCUUUCCUGGGGUUCGGUCAUCAGCCGCUGUCACACGCACAGGAUAGGAGGAGAGGAGAGGAGACAUGUGUGAG | ||
1/21 | GAGUCGAAUCCUAAUUUUUCUGAGUACGAGUAUCAGCCGCUGUCACACGCACAGAUACGCGUACAGAGGAGAGAGAGGAUGAGGUUC | ||
2/20 | GUCUAGGAUGUGGGGUCUCUUAGCCGCCGGCUCAGCCGCUGUCACACGCACAGAGAGUGGGUGGAGAGGAGAUUAAAUAUGAUAGAC | ||
3/19 | CUCUCAGAUUUGAGCUCUUUUAGGUGUAGGUCAGCCGCUGUCACACGCACAGAAUGGCGUACACAGAGGAGAGGGAAAAUGAGAGAG | ||
4/18 | GAGUCGAGUGCGACAUCUCCUGAUGUCCGUCAGCCGCUGUCACACGCACAGAAACUGAGGGGCAAGAGGAGAUGAAGCAUGAGGGAC | ||
5/17 | GUCCGGGGUCAUGUCUCUUCUAACUUGCUCAGCCGCUGUCACACGCACAGAAAUGGGGUGUAGGAGAGGAGAGGGUUGAUGACGGAC | ||
6/16 | CCUGCGGAUCGUUGCUCUCCUGAUGUUUCAGCCGCUGUCACACGCACAGAAUGAGGGGCGGGUAAGAGGAGAGUGGGGAUGGGGGGG | ||
7/15 | CGGUGGAAUUGAACGUCUUCUGAUGGUCAGCCGCUGUCACACGCACAGGAGCUCCGGCGGCGGAAGAGGAGAUGCGCGAUGACACUG | ||
8/14 | CCUCGAGAUCUUUCAUUUCCUAACGUCAGCCGCUGUCACACGCACAGAAAAUGAGUGGGUGGUGAGAGGAGAUGGGAGAUGGUGGGG | ||
9/13 | GUGUAAAGUUUAGCUUUUUCUAAUUCAGCCGCUGUCACACGCACAGAAUGGCUAGGCGACUGGAAGAGGAGAGGAGAGAUGAUGUAC | ||
10/12 | CUGUCGGAUAUCUCAUCUUUUACUCAGCCGCUGUCACACGCACAGAUAAAAAGUCAGUCGCUGAAGAGGAGAUGGUAUAUGAGAUAG | ||
11/11 | CUACCAGGUAACAAGUUUUCUGUCAGCCGCUGUCACACGCACAGGAUAACACACACAGGGGUUGAGAGGAGAUUAUUUAUGAGGUAG | ||
12/10 | GUAUGGGAUGAAACAUUUUCUUCAGCCGCUGUCACACGCACAGGAUUUAAAGAAGGCGACGGUUAGAGGAGAUGGAUUAUGGUAUAC | ||
13/9 | GGCGGGAAUACAUAUUUUUCUCAGCCGCUGUCACACGCACAGUAAAAAAUAAAUUGACGGGGGCAGAGGAGAGGGGGGAUGAUCGUU |
Table 7. Hybridization results of 14 LIRA structures of miR-142-3p and
related data
Stem-loop ratio | Single-arm sequence | Conformation of LIRA alone | Conformation of LIRA hybridized with miR-210-3p |
---|---|---|---|
0/23 | GCUGCUGAUGAAACAUCUCCUAACAGCGUCAGUCCAUAAAGUAGGAAACACUACAGAGUGGGGGAGAGGAGAUGGCAAAUGUGUAGC | ||
1/22 | CUAAGAAGUCUCUGUUCUUCUAAGCAGGACGUUCCAUAAAGUAGGAAACACUACAGCGCGGUGCAGAGGAGAACGGAGAUGACGGGG | ||
2/21 | CCUACGGAUUGGGCGUCUUCUAAUCUGUCGGUCCAUAAAGUAGGAAACACUACAGACCAGCGGGAGAGGAGACGGAAAAUGCGAAGG | ||
3/20 | GUGGCUAAUCUGACGUCUUCUGAGAUCGCGUCCAUAAAGUAGGAAACACUACACGGACACGGUGAGAGGAGAUGAAAGAUGGGAAAC | ||
4/19 | GUUCGUAAUUCUGACUCUCUUAGUUGCCAUCCAUAAAGUAGGAAACACUACACCUGGGUGGGGAAGAGGAGAGUGUGAAUGGGGGGC | ||
5/18 | GAUAAAGGUGGGGGUUCUUCUGGCCGUCUCCAUAAAGUAGGAAACACUACACGCGUGGCGGCGGAGAGGAGAAUGGGGAUGGUUAUC | ||
6/17 | GAAGGGAGUGGAAUUUUUCUUGGCAGGUCCAUAAAGUAGGAAACACUACACCGAGGGGGGGGUGAGAGGAGAGACGUUAUGAGGGGC | ||
7/16 | GCGCUGAAUUAGUGUUCUCUUGGUAUUCCAUAAAGUAGGAAACACUACAGGGGAGGAUAGAAUAAGAGGAGAGCUGUAAUGAAAUGC | ||
8/15 | GAAGCAAGUACUAGAUCUUCUAAGAUCCAUAAAGUAGGAAACACUACACGAAGCGGGAAGGGUUAGAGGAGAUUGGGUAUGAGGUUC | ||
9/14 | GUAGUGAGUUAUCCGUCUCCUGAAUCCAUAAAGUAGGAAACACUACACACCCCGGGGGCUGGAUAGAGGAGACGUUUAAUGAACUGC | ||
10/13 | GGUCGGGAUUUAGCUUCUUCUGAUCCAUAAAGUAGGAAACACUACAUAGGGACAACUUAGUGGAAGAGGAGAAGGGAAAUGGCGAUC | ||
11/12 | GUGUUAAAUGUAAUGUUUUCUGUCCAUAAAGUAGGAAACACUACAUACGACGAAUACUAGGGGGAGAGGAGAUGGAACAUGAGAUAC | ||
12/11 | CCACCGAAUGCCUAUUCUCCUUCCAUAAAGUAGGAAACACUACACUGGGCCACGCGGGGGUAGGAGAGGAGAAAAAGCAUGAGGUGG | ||
13/10 | GCGUGAGAUCACCGUUCUUCUCCAUAAAGUAGGAAACACUACAAGGAUGUGAUACGAAGUGUAUAGAGGAGAGUCGGGAUGAGGGGU |
We think that the greater the free energy of a LIRA, the easier it is
for LIRA structures to be paired with miRNAs for translation; the
smaller the free energy of a LIRA, the more stable the structure of LIRA
is. Therefore, we chose free energy as an important index to analyze the
hybridization of LIRAs to their target miRNAs. We calculated the changes
of free energies of the LIRA before and after hybridization and fitted
the changes of free energy to stem-loop ratio. From the fitting curve,
it seems that there is no regularity between the changes of free energy
and stem-loop ratio (Figure 11 and Figure 12).
Figure 11. Fitting curves of the free energy of LIRA to miR-210-3p
Figure 12. Fitting curves of the free energy of LIRA to miR-142-3p
Since the LIRA sequence designed by NUPACK algorithm in the unrestricted
region is random, which causes large variations among these designed
LIRA. Therefore, we designed 10 sequences for each stem-loop ratio and
averaged the free energy values of these LIRA. A new fitting was
performed with the averaged free energy and stem-loop ratio (Figure 13
and Figure 14). The new fitting curve demonstrates that there is no
linear relationship between LIRA free energy and stem-loop ratio,
indicating that LIRA would not become more or less stable as stem-loop
ratio increase or decrease.
Figure 13. New fitting curve of miR210-3p
Figure 14. New fitting curve of miR142-3p
Next, we calculated the changes of free energy as the differences of
free energy of LIRA before and after hybridization with its target
miRNA, and analyzed the relation between LIRA free energy and the
changes of free energy. We observed that the data had a clear linear
relationship, so we linearly fitted it. The fitting results are shown in
Figure 15 and Figure 16. The slope of the fitted curve is approximately
equal to -1, which indicates that the free energy of LIRA after
hybridization with its target miRNA is stable at a constant value. The
free energy of LIRA hybridized with miR-210-3p is around -54 kcal/mol,
and the free energy of LIRA hybridized with miR-142-3p is around -44
kcal/mol. These results show that free energy of LIRA before
hybridization might be a more important indicator to determine the
quality of a LIRA.
Figure 15. Linear fitting results of miR-210-3p
Figure 16. Linear fitting results of miR-142-3p
Furthermore, we averaged 10 LIRA free energies at the same stem-loop
ratio, which could represent the LIRA free energy and hybridization free
energy at certain stem-loop ratio. Subsequently, we performed a
clustering with these 28 points, and observed a linear relationship
between LIRA free energy and the averaged changes of free energy (Figure
17 and Figure 18).
Figure 17. miR210-3p clustering results
Figure 18. miR142-3p clustering results
From our analysis of LIRA to miR-210-3p and miR-142-3p, we concluded
that stem-loop ratio has little effect on the free energy of LIRA. For
our subsequent wet lab experiments, we chose 3 LIRA sequences for
miR-210-3p with stem-loop ratios of 2/20, 7/15, and 10/12, and 3 LIRA
sequences for miR-142-3p with stem-loop ratios of 3/20, 8/15, and 11/12.
We selected these 6 LIRA sequences to verify whether stem-loop ratio
indeed had no effect on the performance of LIRA with wet lab
experiments. Meanwhile, since the free energies of these 6 LIRAs are
very different, the influence of the LIRA free energy on the functions
of LIRA can also be investigated (Table 8). In this regard, we expect
that a stable LIRA structure is often accompanied by a lower free
energy.
Table 8. 6 LIRA data
miRNA | Stem-loop ratio |
Free energy of LIRA before hybridization(kcal/mol) |
---|---|---|
miR210-3p | 2/20 | -66.61 |
miR210-3p | 7/15 | -62.10 |
miR210-3p | 10/12 | -56.54 |
miR142-3p | 3/20 | -54.83 |
miR142-3p | 8/15 | -47.89 |
miR142-3p | 11/12 | -41.10 |
Design double-arm LIRA to target miR-210-3p and miR-142-3p
simultaneously
To target miR-210-3p and miR-142-3p simultaneously, we designed LIRAs
with double-arm structure that can target two target RNA[22].
As shown in Figure 19, double-arm LIRA has two recognition regions, A*
and B*. While A* is located on the short arm of the double-arm
structure, B* is located on the long arm of the double-arm structure,
which is connected to RBS and AUG. We think that putting one miRNA
reverse complimentary sequence in A* and the other miRNA reverse
complimentary sequence in B* would make our LIRA targeting two miRNA.
Our expectation is that only when the LIRA binds to the target miRNA A
and then bind to the target miRNA B would expose the RBS and AUG in an
open status to initiate the translation of the downstream reporter gene,
and otherwise the RBS and AUG would be locked to at a closed status in
stem region.
Figure 19. Double-arm LIRA structure
We identified the composition of each part of double-arm LIRA and
replaced the recognition region with sequences that were reverse
complimentary to miR-210-3p and miR-142-3p. Utilizing the design
function of NUPACK, we designed a primary double-arm LIRA structure and
obtained the sequence. The following is the design procedures of LIRA
sequences to detect miR210-3p and miR142-3p in NUPACK.
During the process of simulation, we utilized NUPACK to generate 100
double-arm LIRAs. First, we hybridized each double-arm LIRA to both
miR-142-3p and miR-210-3p, and found that 32 double-arm LIRAs could only
hybridize to a single miRNA. We analyzed the results of the remaining 68
LIRA that could hybridize to both miRNAs, and 21 LIRAs did not form a
correct base-pairing structure between RBS and AUG. We then modified the
bases between RBS and AUG in these 21 LIRAs according to the
hybridization results to ensure that RBS and AUG could be successfully
exposed after hybridization. Finally, we analyzed the sequences at other
regions of these 21 LIRAs, and found that 15 of them would destabilize
AUG and RBS when LIRA interacts with miR-210-3p alone. We finally get 6
ideal double-arm LIRA that meet the design expectation, e.g., the RBS
and AUG remain in a close status without interaction with miR-210-3p and
miR-142-3p, and keep in an open status with interactions with miR-210-3p
and miR-142-3p. The simulation results of these 6 LIRAs are shown in
Table 9.
Table 9. Hybridization results of 6 double-arm LIRA sequences to detect
2 miRNAs
Number | sequence | Conformation of LIRA alone |
Conformation of LIRA hybridized with miR-210-3p and miR-142-3p |
---|---|---|---|
1 | AGCCUCAGCGGUCGACCCGCGGUCCAUAAAGUAGGAAACACUACAUUUGUGGGGAGCGGGGUAUAUCAUCCUCUCCUAGUCAGCCGCUGUCACACGCACAGCGGCUGGAGAGGAGAAUAUGAUAUACGGGGCCGAGGGGGCU | ||
2 | GUCCACGUUGAGCGUCCAGUCCAUAAAGUAGGAAACACUACAUUUGUGGAAAGGAUGGGUGCCUGUCCGAUCUCAGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUGAUAUGGGCGCCGACAACGCAGAU | ||
3 | CGAUUUUCUCUUGCAGGAAUCCAUAAAGUAGGAAACACUACAUUUAUGGACGCCUGCCGGUCCCGUACAUUCUCAACUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAAUACAUGGGGCUGUCGAGAACUUCG | ||
4 | GCACGACGCCUUUCGAGUUUCCAAAAAGUAGGAAACACUACAUUUGUGGACUUUCGACUUAGUCAUCAACUCUCAACUUCAGCCGCUGUCACACGCACAGUGGUAGAAGAGGAGAGAGAAUGACUAAGCUGGUGGCGUGC | ||
5 | CGGUUCCGCUAAGGCAUCAUCCAUAAAGUAGGAAACACUACAUUUAUGGAACAUGCCCAUGUUUAUCCGUUCUCGACUUCAGCCGCUGUCACACGCACAGUGGCUGAAGAGGAGAAUAUAUGGAUAUGAUGGUGGCCCCG | ||
6 | GUCCACGUUGAGCGUCCAGUCCAUAAAGUAGGAAACACUACAUUUGUGGAAAGGAUGGGUGCCUGUCCGAUCUCAGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUAAAAUGGGCGCCGACAACGCAGAU |
Optimization of double-arm LIRA to miR-210-3p and miR-142-3p
Setting indicators by the characteristic of LIRA
To generate an ideal double-arm LIRA sequence to miR-210-3p and
miR-142-3p, we would like to optimize the LIRA sequences that we
designed in the previous section. To facilitate the optimization
process, we need to identify the indicators that could predict the
performances of LIRAs. For our analysis of the indicators, we focus on
the bases between RBS and AUG, as we think that the status of these
bases determines the status of LIRA mostly. In the case that these bases
are paired with other bases to form stem structure, LIRA should be in a
close status. On the contrary, in the case that these bases are
unpaired, LIRA should be in an open status to initiate transcription of
reporter genes. From NUPACK, we could get the "unpaired probability" for
any base, which calculates the probability that such base is unpaired at
equilibrium, and the "paired probability" for any base, which calculates
the probability that the base is paired with another base at
equilibrium. We define indicator "unlock" for each product for example
complexes of LIRA and its target miRNA, by averaging the "paired
probability" of bases between RBS and AUG in the product and indicator
"lock" for LIRA only by averaging the "unpaired probability" of bases
between RBS and AUG in LIRA.
Figure 20. Diagram of "lock" and "unlock" calculation process
1. Unlock
2. Lock
NUPACK could simulate the LIRA-mediated reaction and calculate the
amount of each product from such reaction. The reaction with LIRA,
miR-210-3p, miR-142-3p could generate multiple complexes, including
"LIRA/miR-210-3p/miR-142-3p/complex",
"LIRA/miR-142-3p/miR-210-3p/complex", "LIRA/miR-142-3p/complex", and
"LIRA/miR-210-3p/complex", which we label as product 1 to 4 accordingly
and calculated their "unlock" values. Among all these products,
LIRA/miR-210-3p/miR-142-3p/complex is by far the dominant one, so we
treat it as the main product.
Subsequently, we define indicator "unlock divide lock" and "unlock minus
lock" using the "unlock" value of the main product, i.e., the
LIRA/miR-210-3p/miR-142-3p/complex.
3. Unlock divide lock
4. Unlock minus lock
"unlock level average" is the average of "unlock" values of all
products. "unlock level weighted" is the weighted average of "unlock"
value of all products, weighted by their corresponding concentrations
given by NUPACK.
5. Unlock level average
6. Unlock level weighted
We define the following four metrics using subtraction and division,
mean and weighted mean respectively.
7. Unlock minus lock average
8. Unlock minus lock weighted
9. Unlock divide lock average
10. Unlock divide lock weighted
It should be noted that some indicators differ in how many products are
included. Take "unlock", "unlock average" and "unlock weighted" as an
example: "unlock" includes the main product only, "unlock average"
includes the average "unlock" value of all products, "unlock weighted"
count the weighted average "unlock" value of all products.
Select better performance LIRA by indicators
To explore the influence of the above 10 indicators on the simulation
results of LIRA from NUPACK, which shows that whether LIRA meets the
design expectation (See part 2, design of double-arm LIRA), we analyzed
a variety of double-arm LIRA. In addition to the 6 LIRAs designed in
previous section that meet the design expectation (Table 10), we also
included 6 LIRAs that did not meet the design expectation as well.
Sequences and conformation of the 6 LIRA which did not meet expectations
are shown in the table below.
Table 10. Sequences and conformation of 6 LIRA that did not meet the
design expectation
Number | sequence | Conformation of LIRA alone |
Conformation of LIRA hybridized with miR-210-3p and miR-142-3p |
---|---|---|---|
7 | AGGACACAAGUAGCGGCACUCCAUAAAGUAGGAAACACUACAUUUAUGGAAAGCCGCGAGCUCCGUCUCGUCUCGGCUUCAGCCGCUGUCACACGCACAGCGGCUGGAGAGGAGACGUCAUGGAGCUCCCCUUGUAACCU | ||
8 | GCAUUGAUUCCAGUACCCCUCCAUAAAGUAGGAAACACUACAUUUAUGGACAGGUACCAAUCGCAUAAUCUUUCGGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAGAACAUGCGAUUGACGAAUCUUUGC | ||
9 | UCGGAGCGUGAAGAGGGUAUCCAUAAAGUAGGAAACACUACAUUUGUGGAGCCCCUCGUCCCGCGUUGGGUCUCGACUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGACCAUAUGCGGGACAACGCGUAGCGA | ||
10 | UGCGAGGCGGUUACAGAAAUCCAUAAAGUAGGAAACACUACAUUUGUGGACCUUUGUCGGUCGCAUAGCAUCUCGACUUCAGCCGCUGUCACACGCACAGUGGUUGAAGAGGAGAUGGGAUGUGACUGCUCCGUUGAGCG | ||
11 | CCGAAGACAAAACUCUCAAUCCAUAAAGUAGGAAACACUACAUUUAUGGAACGAGAGGGAGGUCAUUUCCUUUUGGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAGGCCAUGACCUCCGCUUGUCGACGG | ||
12 | CCCACCUUCCUCGUGCGUGUCCAUAAAGUAGGAAACACUACAUUUGUGGAGUCGCACCGCCGGUAUCAGAUCUCGGUUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUCCAAUGCCGGUGCUGGGAGCCGGG |
We call results that meet expectations as 1, and fail to meet
expectations as 0. Multivariate regression is applied to analyze the
effects of these 10 indicators on the predicted performance of LIRA,
i.e., meeting or not meeting the design expectation. The results of
multiple regression are shown in Table 11.
Table 11. Multiple regression result
Chi-square | Pseudo R2 | significance | AIC | BIC |
---|---|---|---|---|
16.636 | 0.75 | 0.119 | 24.000 | 29.819 |
The multiple regression model has a high R2 value as 0.75,
which means that the model has a good fitting effect. However, the
P-value is 0.119, which means that the multiple regression model does
not pass the significance test. We think that the large p-value is most
likely caused by strong collinearity. To verify such collinearity, we
used the Pearson correlation coefficient matrix to analyze the
correlation among 10 indicators, which demonstrates that the
collinearity is indeed very large (Table 12).
Table 12. Pearson correlation coefficient matrix for 10 indicators
Note:* means p<0.05 and ** means p<0.01. We highlighted the
correlation coefficient value >0.7 with colors.
We find that the correlation between the group of indicators "unlock",
"unlock level average" and "unlock level weighted" is stronger than the
correlation between these 3 and other indicators. The same strong
correlation exists in the group of indicators "unlock minus lock",
"unlock minus lock average", and "unlock minus lock weighted", and in
the group of indicators "unlock divide lock", "unlock divide lock
average", and "unlock divide lock weighted".
We explored the reason for the strong correlation within different
groups. Take the group of indicators "unlock", "unlock level average"
and "unlock level weighted" as an example, these 3 indicators basically
measure the same thing "unlock", and only differ in the way on how to
include "unlock" values from different products. Since the main product,
i.e., the LIRA/miR-210-3p/miR-142-3p/complex, comprises the vast
majority proportion of all the products, which results in almost equal
values for the three indicators, it could explain why the independent
variants in model have strong collinearity.
To overcome the problem of collinearity, we use random forest which is
not sensitive to collinearity to train the 10 metrics and results
associated with unlock and lock. We believe that the indicators with
higher R2 value in testing datasets could better help us t
evaluate the performance of LIRA. The ratio of training set to test set
is 0.8:0.2. Results are shown in Table 13.
Table 13. Random Forest result
Dependent Variable | Dataset | R2 | MAE | MSE | RMSE | MAD | MAPE | EVS | MSLE |
---|---|---|---|---|---|---|---|---|---|
Unlock | training datasets | 0.804 | 0.067 | 0.006 | 0.078 | 0.075 | 0.014 | 0.819 | 0.003 |
testing datasets | 0.758 | 0.022 | 0.001 | 0.025 | 0.017 | 0.001 | 0.939 | 0 | |
Lock | training datasets | 0.814 | 0.045 | 0.003 | 0.058 | 0.051 | NULL | 0.825 | 0.002 |
testing datasets | -0.711 | 0.094 | 0.009 | 0.095 | 0.086 | 0.016 | -0.392 | 0.006 | |
Unlock level average | training datasets | 0.827 | 0.050 | 0.004 | 0.061 | 0.046 | 0.018 | 0.843 | 0.002 |
testing datasets | -0.983 | 0.028 | 0.001 | 0.037 | 0.022 | 0.002 | -0.792 | 0.001 | |
unlock level weighted | training datasets | 0.811 | 0.094 | 0.014 | 0.117 | 0.07 | 0.052 | 0.827 | 0.008 |
testing datasets | 0.231 | 0.227 | 0.068 | 0.26 | 0.308 | 0.021 | 0.232 | 0.03 | |
Unlock divide lock | training datasets | 0.804 | 1.964 | 10.705 | 3.272 | 0.876 | 0.13 | 0.804 | 0.31 |
testing datasets | -19.792 | 1.643 | 2.854 | 1.689 | 1.701 | 0.05 | -0.119 | 0.343 | |
unlock minus lock | training datasets | 0.817 | 0.103 | 0.016 | 0.128 | 0.097 | 0.136 | 0.835 | NULL |
testing datasets | -0.192 | 0.079 | 0.009 | 0.097 | 0.08 | 0.007 | 0.466 | 0.005 | |
unlock minus lock average | training datasets | 0.825 | 0.041 | 0.002 | 0.05 | 0.036 | 0.142 | 0.844 | NULL |
testing datasets | -0.219 | 0.034 | 0.002 | 0.039 | 0.029 | 0.007 | 0.311 | 0.001 | |
unlock minus lock weighted | training datasets | 0.803 | 0.072 | 0.008 | 0.088 | 0.062 | 0.309 | 0.815 | NULL |
testing datasets | -0.146 | 0.12 | 0.032 | 0.179 | 0.041 | 0.011 | 0.111 | 0.019 | |
Unlock divide lock average | training datasets | 0.912 | 1.011 | 1.484 | 1.218 | 0.854 | 0.089 | 0.923 | 0.196 |
testing datasets | 0.911 | 1.028 | 1.452 | 1.205 | 1.028 | 0.006 | 0.935 | 0.034 | |
Unlock divide lock weighted | training datasets | 0.779 | 0.458 | 0.317 | 0.563 | 0.566 | NULL | 0.781 | 0.075 |
testing datasets | -68.13 | 1.56 | 2.61 | 1.616 | 1.361 | 0.299 | -3.65 | 0.649 |
As can be seen from the table, on the training set, the fitting effect
of the model is significant, and the R2 value is generally
high (>0.8), indicating that the model has achieved good training
effect on each index on the training set. For most indicators, MAE, MSE,
RMSE, MAD and MAPE values are low, so it can be considered that the
deviation between the predicted value and the actual value of the model
is small.
On the test set, indicator "unlock", "unlock level weighted" and "unlock
divide lock level weighted" have R2 above zero, which
indicates that these indicators have a higher accuracy in predicting the
result of open and close. Particularly, the R2
of "unlock divide lock weighted" reached 0.911, thus we can get the
conclusion that "unlock divide lock weighted" performs best among all
the indicators.
We get weights of indicators on the performance of LIRA on NUPACK from
random forest analysis, and list them in Table 14 below.
Table 14 The weight of the indicator to the result
Indicators | unlock | lock | Unlock divide lock | Unlock minus lock | Unlock level average |
---|---|---|---|---|---|
weights | 0.050 | 0.110 | 0.140 | 0.080 | 0.090 |
Indicators | Unlock level weighted | Unlock minus lock average | Unlock minus lock weighted | Unlock divide lock average | Unlock divide lock weighted |
---|---|---|---|---|---|
weights | 0.130 | 0.130 | 0.060 | 0.080 | 0.130 |
The weight of "unlock divide lock" is larger than that of other
indicators, which is followed closely by "unlock level weighted" and
"unlock divide lock weighted". To measure the expected performance of
LIRA quantitively, we calculated "expectation" for all LIRA according to
the formula shown as follows, and listed in Table 15.
Figure 21. Calculation method for indicators' weight for expectation
Table 15. The weight of the indicator to the result
numbers | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
expectation | 1.47 | 1.71 | 1.07 | 1.54 | 1.89 | 1.64 |
numbers | 7 | 8 | 9 | 10 | 11 | 12 |
---|---|---|---|---|---|---|
expectation | 0.58 | 0.44 | 0.13 | 0.23 | 0.59 | 0.47 |
Therefore, we sort all LIRAs according to "expectation" from high to
low, and find that LIRAs 2 and 5 have highest ranking score. Thus, we
conclude that LIRA sequences 2 and 5 might meet our design expectation
well, and should have high expression and low leakage in the wet lab
experiments.
Outlook
In the subsequent analysis, we attempt to explore the effects of some of
the LIRA's own parameters, such as base content, on indicators and
whether such parameters could be used to predict the performance of LIRA
on NUPACK, i.e., meeting or not meeting design expectation.
Next, we investigate the weights of parameters to indicators. We
collected the parameters of LIRA sequence itself from NUPACK software,
including "A","C","G","U" base content, and "A-U", "C-G", "G-U"
base-pair content. Then the random forest model is used for 10
indicators. The weights of different parameters on different indicators
are obtained as shown in Table 16 and Table 17 below.
Table 16. The weight of bases to the indicators
A | C | G | U | |
---|---|---|---|---|
unlock | 0.111 | 0.474 | 0.095 | 0.319 |
lock | 0.095 | 0.569 | 0.109 | 0.227 |
Unlock divide lock | 0.098 | 0.304 | 0.493 | 0.104 |
Unlock minus lock | 0.116 | 0.511 | 0.078 | 0.295 |
Unlock level average | 0.154 | 0.530 | 0.124 | 0.193 |
Unlock level weighted | 0.096 | 0.297 | 0.167 | 0.440 |
Unlock minus lock average | 0.088 | 0.569 | 0.096 | 0.247 |
Unlock minus lock weighted | 0.128 | 0.475 | 0.126 | 0.270 |
Unlock divide lock average | 0.116 | 0.287 | 0.473 | 0.124 |
Unlock divide lock weighted | 0.107 | 0.231 | 0.536 | 0.126 |
Table 17. The weight of the base pairs to the indicators
A-U | C-G | G-U | |
---|---|---|---|
unlock | 0.342 | 0.368 | 0.289 |
lock | 0.443 | 0.215 | 0.342 |
Unlock divide lock | 0.430 | 0.231 | 0.339 |
Unlock minus lock | 0.364 | 0.268 | 0.368 |
Unlock level average | 0.387 | 0.240 | 0.373 |
Unlock level weighted | 0.400 | 0.355 | 0.244 |
To judge the effect of different parameters on the result of open or
close accurately, we calculate by the following formula, multiply the
weights respectively, and obtain the weights of the parameters for the
result of unlock and lock.
Figure 22. sCalculation method for parameters' weight for indicators
Table 18. The weight of the parameter to the result of unlock and lock
Paraments | A% | C% | G% | U% |
---|---|---|---|---|
weights | 0.108 | 0.419 | 0.248 | 0.226 |
Table 19. The weight of the parameter to the result of unlock and lock
Paraments | A-U% | C-G% | G-U% |
---|---|---|---|
weights | 0.302 | 0.393 | 0.305 |
C% has weight of 0.1976, which is obviously higher than other
parameters, and in Table 19, C-G% has weight of 0.393, which also
confirmed that C% matters most in influencing the result.
Analysis of LlRA-mediated reaction
Since the core of our project is to use LIRA to detect miR-142-3p and
miR-210-3p, a comprehensive and detailed understanding of the
LIRA-mediated reaction is essential for the success of our project.
Through simulation of the process and analysis of the kinetics of the
LIRA-mediated reaction, we can get crucial information of this reaction,
such as the trend of changes of substrates and peak time of the
expression of the reporter gene, which could provide critical
information for designing and optimizing of our project.
We simulated reactions mediated by the single-arm and the double-arm
LIRA under both intracellular and extracellular conditions, and our
simulation includes the following steps:
a. Write down the chemical equations of LIRA system;
b. Transform chemical equations into ODEs;
c. Collect parameter values;
d. Solve the equations using MATLAB;
e. Analyze results;
Simulation and analysis of intracellular reactions mediated by LIRA
The interactions between LIRA and its target RNA could change the
conformation of LIRA, and initiate translation of downstream reporter
gene. Since the reaction between single-arm LIRA and miR-142-3p has the
same reaction process and parameters as the reaction between single-arm
LIRA and miR-210-3p, we think that these two reactions have the same
concentration-time curve and peak time of expression of reporter gene.
Without loss of generality, we take miR-210-3p as an example for kinetic
simulation below.
Schematic presentation of the intracellular reaction mediated by
single-arm LIRA is shown in Figure 23
Figure 23. Intracellular single-arm LIRA reaction process
Reaction process of single-arm LIRA is listed in Figure 24
Figure 24. Equations of intracellular single-arm LIRA reaction
Since the transition from 210-CSIL to OSIL is rapid, we treated the
product of the combination of CSIL210 and miR210 as OSIL210. Therefore,
we carried out a reasonable simplification of the simulation, and the
simplified reaction is shown in Figure 25.
Figure 25. Simplified equations of intracellular single-arm LIRA
reaction
Based on the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed in Figure 26.
Figure 26. ODEs of intracellular single-arm LIRA reaction
In the simulation, we set the initial concentration of PLMg and PLM210
as 0.01mol/L, and other substances as 0. We use MATLAB and ode45
function to get the numerical solution. As shown in Figure 27, the
concentration of EGFP reaches the maximum value at 83 minutes and 1/2 of
the maximum value at 11.5 minutes. The concentration of OSIL210
increases rapidly and reaches the maximum value of 0.00054mol/L at 160s.
Due to the low transcription rate of PLMg and PLM210 plasmids, and the
high reaction rate of OSIL210 to EGFP, the concentration of OSIL210
gradually decreases after 160s.
Figure 27. simulation of intracellular single-arm LIRA reaction
Next, we simulated the intracellular reaction mediated by double-arm
LIRA, and the schematic presentation of this simulated reaction is shown
in Figure 28.
Figure 28. intracellular double-arm LIRA reaction process
For the reactions with double-arm LIRA, we assume that miR-210 cannot
bind to CTIL when miR-142 is not present. According to these reaction
processes, we can list the reaction equation of the two-arm LIRA, as
shown in Figure 29.
Figure 29. equations of intracellular double-arm LIRA reaction
According to the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed below in Figure 30.
Figure 30. equations of intracellular double-arm LIRA reaction
In the simulation, we set the initial concentration of PLMag, PLM210 and
PLM142 as 0.01mol/L, and other substances as 0. Similarly, we use MATLAB
and ode45 function to get the numerical solution. As shown in Figure 31,
the concentration of EGFP reaches the maximum value of 0.00118mol/L at 3
minutes. The concentration of OTIL210 increases and reaches a maximum
value of 0.00064mol/L at 95s. Due the low transcription rate of PLMg and
PLM210 plasmids and the high reaction rate of OTIL210 to EGFP, the
concentration of OTIL210 gradually decreases after 95s.
Figure 31. Simulation of intracellular double-arm LIRA reaction
Simulation and analysis of extracellular reactions mediated by LIRA
Compared to the intracellular kinetic simulation, the extracellular
reaction process does not have the step of transcription of miR-142-3p
and miR-210-3p from plasmids. Since the intracellular and extracellular
conditions are different, the modeling parameters are altered
accordingly. The reaction between single-arm LIRA and miR-142-3p has the
same reaction process and parameters as the reaction between single-arm
LIRA and miR-210-3p, these two reactions have the same
concentration-time curve and the same peak time. Without loss of
generality, we take miR210-3p as an example for kinetic simulation
below.
Schematic presentation of extracellular reaction mediated by single-arm
LIRA is shown in Figure 32.
Figure 32. Extracellular single-arm LIRA reaction process
Reaction process of single-arm LIRA are listed in Figure 33
Figure 33. Equations of extracellular single-arm LIRA reaction
Since the transition from 210-CSIL to OSIL is rapid, we treat the
product of the combination of CSIL210 and miR210 as OSIL210. Therefore,
we carry out a reasonable simplification of this simulation, and the
simplified reaction is shown in Figure 34.
Figure 34. simplified equations of extracellular single-arm LIRA
reaction
According to the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed below.
Figure 35. ODEs of extracellular single-arm LIRA reaction
In the simulation, we set the initial concentration of PLMg, PLM210 and
PLM142 as 0.01mol/L, and other substances as 0. Similarly, we use MATLAB
and ode45 function to get the numerical solution. As shown in Figure 36,
the concentration of fluorescent protein EGFP reaches the maximum value
of 0.0083mol/L at 30 minutes and 1/2 of the maximum value at 9 minutes.
The concentration of OSIL210 increases rapidly and reached a maximum
value of 0.00054mol/L at 160s. The concentration of OSIL210 gradually
decreases after 160s.
Figure 36. Simulation of extracellular single-arm LIRA reaction
Next, we simulated the extracellular reaction mediated by double-arm
LIRA, and the schematic presentation of this simulated reaction is shown
in Figure 37.
Figure 37. Extracellular double-arm LIRA reaction process
The process of reaction mediated by single-arm LIRA is listed in Figure
38
Figure 38. equations of extracellular double-arm LIRA reaction
According to the chemical reaction expression above, the equations of
ordinary differential equations with specific parameter values are
listed below.
Figure 39. ODEs of extracellular double-arm LIRA reaction
In the simulation, we set the initial concentration of PLMag, miR210-3p
and miR142-3p as 0.01mol/L, and other substances as 0. Similarly, we use
MATLAB and ode45 function to get the numerical solution. As shown in
Figure 40, the concentration of EGFP reaches the maximum value of
0.0083mol/L at 30 minutes and 1/2 of the maximum value at 9 minutes. The
concentration of OTIL210 increases rapidly and reaches a maximum value
of 0.00054mol/L at 160s. The concentration of OTIL210 gradually
decreases after 160s.
Figure 40. Simulation of extracellular double-arm LIRA reaction
Table 20. Explanation of abbreviation
Abbreviation | Description |
---|---|
PLMg | gate plasmid of CSIL |
PLMag | AND gate plasmid of CTIL |
PLM210 | input plasmid of miR-210 |
PLM142 | input plasmid of miR-142 |
CSIL | RBS closed single-input LIRA |
CTIL | RBS closed two-input LIRA |
OTIL | RBS open two-input LIRA |
210-CSIL | miR-210 & CSIL complex |
210-CTIL | miR-210 & CTIL complex |
142-CSIL | miR-142 & CSIL complex |
142-CTIL | miR-142 & CTIL complex |
OSIL210 | RBS open single-input LIRA formed by 210-CSIL |
OSIL142 | RBS open single-input LIRA formed by 142-CSIL |
GFP | green fluorescent protein |
EGFP | enhanced green fluorescent protein |
∅ | degradated substance |
Table 21. Parameters used in single-arm LIRA reaction process
Abbreviation | Description | Value(unit) | Reference |
---|---|---|---|
ktrg | gate plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
ktr210 | miR-210 input plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
ktr142 | miR-142 input plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
kcb210 | combine rate constant of miR-210 and CSIL | 1×105 (M-1 ·s-1) | Baabu et al., 2021 |
kcb142 | combine rate constant of miR-210 and CSIL | 1×105 (M-1 ·s-1) | Baabu et al., 2021 |
ktl210 | OSIL210 translation rate constant | 0.3361 (s-1) | https://2023.igem.wiki/ugm-indonesia/ |
ktl142 | OSIL142 translation rate constant | 0.3361 (s-1) | https://2023.igem.wiki/ugm-indonesia/ |
kdeGFP | GFP degradation rate constant | 5×10-5 (s-1) | Baabu et al., 2021 |
kde210 | miR-210 degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kde142 | miR-142 degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kdeCSIL | CSIL degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kdeOSIL | OSIL degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
Table 22. Parameters used in double-arm LIRA reaction process
Symbol | Description | Value(unit) intracellular | Value(unit) cell-free |
---|---|---|---|
ktrag | AND gate plasmid transcription rate constant | 1.1×10-3 (s-1) | 1.1×10-3 (s-1) |
ktlOTIL | OTIL translation rate constant | 0.03361 (s-1) | 0.0168 (s-1) |
kcb1 | combine rate constant of miR-210 and CTIL | 1×105 (m-1·s-1) | 1×105 (m-1·s-1) |
kcb2 | combine rate constant of miR-210 and 142-CTIL | 1×105 (m-1·s-1) | 1×105 (m-1·s-1) |
kdeCTIL | CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kdeOTIL | OTIL degradation rate constant | 1.3×10-5 (s-1) | 3×10-4 (s-1) |
kde210CTIL | 210-CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kde142CTIL | 142-CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kdeEGFP | EGFP degradation rate constant | 0 (s-1)* | 0 (s-1)* |
3-Dimensional simulation graph
After conducting kinetic analysis of the reactions mediated by our
double-armed LIRA, we started to think how to apply LIRA as a testing
kit in practical situations. We encountered an issue which is that the
concentrations of the two highly expressed target miRNAs in patient
blood often vary, whereas in our kinetic simulation, we assumed the
concentrations for both target miRNAs are equal. Consequently, it is
questionable to simply take the kinetic simulation results as our
reference for LIRA's application in real-world situation. To address
this issue, we aim to introduce different concentrations of our target
miRNAs in our LIRA reaction system to observe the corresponding dynamic
changes in EGFP expression.
Firstly, we set the concentrations of miR-210-3p and miR-142-3p. We
selected concentrations of 0.0025, 0.005, 0.01, 0.02, and 0.04 mol/L for
our 2 target miRNAs, while keeping the LIRA concentration at 0.01 mol/L.
Subsequently, we simulate the reactions between 2 miRNA and LIRA based
on our extracellular kinetic model of reaction mediated by double-armed
LIRA. From our simulation of extracellular reaction in Figure 40, we can
see that the value of EGFP reaches its maximum in our reaction at
approximately 30 minutes. Hence, we collected the EGFP values at 40
minutes in our reaction after changing the concentrations of our target
miRNA and presented the results as scatter points in three-dimensional
space, as shown in Figure 41.
Figure 41. The influence of the concentrations of miR-210-3p and
miR-142-3p on EGFP expression in LIRA system
From Figure 41, we can see that the EGFP expression increases to high
values as the concentrations of miR-210-3p and miR-142-3p increase.
Additionally, the scatter points in the graph tend to cluster more
densely at lower concentrations of miR-210-3p and miR-142-3p. To better
understand the relationship between the concentrations of miR-210-3p and
miR-142-3p and their corresponding EGFP values, we attempted to fit
these scatter points.
A Quadratic Regression is commonly used for surface fitting models
[23]. Therefore, we employed the fitting function as Z = a +
bX + cY + dX² + eY² + fXY, where Z represents the dependent variable
(EGFP), and X and Y are the independent variables (X: [miR-210-3p], Y:
[miR-142-3p]). By substituting the data into this equation, we can
obtain the values of each parameter. The specific parameter
interpretations and results are presented in Table 23 below.
Table 23. Explanations and Values of Second-Order Polynomial Regression
Parameters
Parameter | Description | Value |
---|---|---|
a | constant term | -0.000118 |
b | coefficient of the linear term for X | 0.3505 |
c | coefficient of the linear term for Y | 0.3288 |
d | coefficient of the quadratic term for X | - 7.42 |
e | coefficient of the quadratic term for Y | - 6.84 |
f | Interaction coefficient between X and Y | 3.76 |
Based on the calculations above, we obtained the regression equation: Z
= -0.000118 + 0.3505X + 0.3288Y + 3.76XY - 7.42X² - 6.84Y². To
understand the performance of our fitted equation, we conducted an
Analysis of Variance (ANOVA) at a 95% confidence level (α = 0.05) for
the terms in our regression. This analysis could assess the significance
of our model terms and determine the terms that have statistically
significant impacts on EGFP value. Tables 25 and 26 present the results
of this analysis.
Table 24. ANOVA Results Summary for Regression Model Terms
Source | DF | Adj SS | Adj MS | F-Value | P-Value |
---|---|---|---|---|---|
Model | 5 | 0.000137 | 0.000027 | 12.78 | 0.000 |
Linear | 2 | 0.000094 | 0.000047 | 22.01 | 0.000 |
[miR-210-3p] | 1 | 0.000053 | 0.000053 | 24.64 | 0.000 |
[miR-142-3p] | 1 | 0.000056 | 0.000056 | 25.99 | 0.000 |
Square | 2 | 0.000047 | 0.000024 | 11.04 | 0.001 |
[miR-210-3p]*[ miR-210-3p] | 1 | 0.000026 | 0.000026 | 11.95 | 0.003 |
[miR-142-3p]*[ miR-142-3p] | 1 | 0.000022 | 0.000022 | 10.13 | 0.005 |
2-Way Interaction | 1 | 0.000013 | 0.000013 | 5.88 | 0.025 |
[miR-210-3p]*[ miR-142-3p] | 1 | 0.000013 | 0.000013 | 5.88 | 0.025 |
Error | 19 | 0.000041 | 0.000002 | ||
Total | 24 | 0.000177 |
Table 25. Regression model fit summary
Standard deviation | R2 | R2(adj) |
---|---|---|
0.0007532 | 96.44% | 91.99% |
Based on the results of the ANOVA, we can observe that the R² value is
high, indicating a good fit of the model. Additionally, the coefficients
for the constant term, the interaction term ([miR-210-3p] * [142-3p]),
the two linear terms ([miR-210-3p] and [miR-142-3p]), and the two
quadratic terms ([miR-210-3p] * [miR-210-3p] and [miR-142-3p] *
[miR-142-3p]) are all statistically significant. This suggests, in our
quadratic regression equation, the terms are not redundant or causing
interference in the regression results. Finally, we simulate the surface
fitting based on our equation and the result is visualized in Figure 42.
Figure 42. Surface Plot of EGFP expression
From Figure 42 we can see that, as the concentrations of the two miRNAs
continue to increase, the reaction rate accelerates, and the observed
EGFP levels also rise continuously. When the concentration of one miRNA
remains constant, as the concentration of the other miRNA increases, the
EGFP value first rise and then decline. To have a more detailed
observation of the changes of EGFP value in specific concentration
ranges, we have plotted a contour map as shown in Figure 43.
Figure 43. Contour Plot of EGFP expression
Figure 43 is a contour plot of EGFP levels at 30 minutes for the
reaction between LIRA and two target miRNAs. The color intensity in the
figure represents different numerical ranges of EGFP. Specifically, the
gradient from light green to dark green indicates an increasing trend of
the EGFP variable from low to high. It can be observed that there is a
distinct high-value region in the upper right corner of the plot. In
contrast, as we move towards the lower left corner, the EGFP levels
continuously decrease, and a low-value region appears. This may be
attributed to the incomplete reaction of LIRA due to lower
concentrations of miRNAs, resulting in lower EGFP value. Furthermore, we
can observe in the plot that the EGFP value lies within the range of
0.008-0.01 when miR-210-3p is within 0.025-0.04 mol/L and miR-142-3p is
within 0.0275-0.04 mol/L.
We can apply this conclusion into practical scenarios. Assuming the LIRA
testing kit we use has a LIRA concentration of 0.01 mol/L, and only as
the EGFP value reaches between 0.008 and 0.01 will it be considered as
positive and identified by colors. We can simulate two scenarios:
First, we can use LIRA to detect unknown concentrations of miRNA in
blood samples. When the concentrations of miR-210-3p and miR-142-3p in
the blood sample we are testing fall within the ranges of 0.025-0.04
mol/Land 0.0275-0.04 mol/L respectively, the LIRA testing kit will
successfully show colors and indicating after 30 minutes.
Second, we can use miRNA with known concentrations to validate the
quality of the LIRA testing kit. Suppose we have miR-210-3p and
miR-142-3p with concentrations as a mol/L and b mol/L, respectively, and
both a and b are within the concentration range of 0-0.04 mol/L. Given
the regression equation above, we can get Z = -0.000118 + 0.3505X +
0.3288Y + 3.76XY - 7.42X² - 6.84Y².When we add both a and b to the LIRA
testing kit and allow the reaction to proceed for 30 minutes, the EGFP
value that are detected should be close to the value of -0.000118 +
0.3505a + 0.3288b + 3.76ab - 7.42a² - 6.84b².With the premises of
disregarding error interference, If there is a significant deviation
between the two results, it indicates that there may be an quality issue
with the LIRA testing kit.
In summary, in the future application of LIRA detection, it is necessary
to take whether the concentration of LIRA and the concentrations of
miRNA are falling in matching ranges as considerations and adjusting the
LIRA concentration based on the concentration of miRNA in actual blood
samples. Due to the limitation of lacking the information on the
concentration of miRNA in blood samples, we can not adjust the
concentration of our LIRA in our test kit. However, we believe that
along with the advancement of miRNA detection technology and through
continuous adjustment and optimization of our LIRA, we can finally
obtain an ideal screening kit to detect expression of miRNA in real
samples.
Reference
Han H, Guo W, Shi W, et al. Hypertension and breast cancer risk: a
systematic review and meta analysis[J]. 2017, 7: 44877.
Rasmussen Torvik LJ, Shay CM, Abramson JG, et al. Ideal cardiovascular
health is inversely associated with incident cancer: the
Atherosclerosis Risk In Communities study [J]. Circulation, 2013, 127
(12): 1270 1275.
Solomon SD, Wang D, Finn P, et al. Effect of candesartan on cause
specific mortality in heart failure patients: the Candesartan in Heart
failure Assessment of Reduction in Mortality and morbidity (CHARM)
program [J]. Circulation, 2004, 110 (15): 2180 2183.
Malmborg M, Christiansen CB, Schmiegelow MD, et al. Incidence of new
onset cancer in patients with a myocardial infarction a nationwide
cohort study [J]. BMC Cardiovasc Disord, 2018, 18 (1): 198.
Hasin T, Iakobishvili Z, Weisz G. Associated risk of malignancy in
patients with cardiovascular disease: evidence and possible mechanism
[J]. Am J Med, 2017, 130 (7): 780 785.
Meijers WC, Maglione M, Bakker SJL, et al. Heart failure stimulates
tumor growth by circulating factors [J]. Circulation, 2018, 138 (7):
678 691.
Yang M, Li T, Guo S, et al. CVD phenotyping in oncologic disorders:
cardio-miRNAs as a potential target to improve individual outcomes in
revers cardio-oncology. J Transl Med. 2024;22(1):50. Published 2024
Jan 12.
Yuan Y, Mei Z, Qu Z, et al. Exosomes secreted from cardiomyocytes
suppress the sensitivity of tumor ferroptosis in ischemic heart
failure [J]. Signal Transduct Target Ther,2023,8(1):121.
Dembélé D, Kastner P. Fold change rank ordering statistics: a new
method for detecting differentially expressed genes. BMC
Bioinformatics. 2014;15:14. Published 2014 Jan 15.
Dai R, Zheng C. False discovery rate-controlled multiple testing for
union null hypotheses: a knockoff-based approach. Biometrics.
2023;79(4):3497-3509.
Yang G, Yang L, Wang W, Wang J, Wang J, Xu Z. Discovery and validation
of extracellular/circulating microRNAs during idiopathic pulmonary
fibrosis disease progression. Gene. 2015;562(1):138-144.
Stocks T, Van Hemelrijck M, Manjer J, Bjorge T, Ulmer H, Hallmans G,
Lindkvist B, Selmer R, Nagel G, Tretli S, Concin H, Engeland A,
Jonsson H, Stattin P. Blood pressure and risk of cancer incidence and
mortality in the Metabolic Syndrome and Cancer Project. Hypertension.
2012;59:802–810
Vinter N, Christesen AMS, Fenger-Gron M, Tjonneland A, Frost L. Atrial
fibrillation and risk of cancer: a Danish population-based cohort
study. J Am Heart Assoc. 2018;7:e009543
Conen D, Wong JA, Sandhu RK, Cook NR, Lee IM, Buring JE, Albert CM.
Risk of malignant cancer among women with new-onset atrial
fibrillation. JAMA Cardiol. 2016;1:389–396.
Ridker PM, Cook NR, Lee IM, Gordon D, Gaziano JM, Manson JE, Hennekens
CH, Buring JE. A randomized trial of low-dose aspirin in the primary
prevention of cardiovascular disease in women. N Engl J Med.
2005;352:1293–1304.
Rasmussen-Torvik LJ, Shay CM, Abramson JG, Friedrich CA, Nettleton JA,
Prizment AE, Folsom AR. Ideal cardiovascular health is inversely
associated with incident cancer: the Atherosclerosis Risk In
Communities study. Circulation. 2013;127:1270–1275.
Banke A, Schou M, Videbaek L, et al. Incidence of cancer in patients
with chronic heart failure: a long-term follow-up study. Eur J Heart
Fail. 2016;18(3):260-266.
Zheng J, Cheng C, Xu J, Gao P, Wang J, Chen L. miR-142-3p Regulates
Tumor Cell Autophagy and Promotes Colon Cancer Progression by
Targeting TP53INP2. Chemotherapy. 2022;67(2):57-66.
Zhang Y, Ma S, Zhang J, et al. MicroRNA-142-3p promotes renal cell
carcinoma progression by targeting RhoBTB3 to regulate HIF-1 signaling
and GGT/GSH pathways. Sci Rep. 2023;13(1):5935. Published 2023 Apr 12.
Zhang X, Sai B, Wang F, et al. Hypoxic BMSC-derived exosomal miRNAs
promote metastasis of lung cancer cells via STAT3-induced EMT. Mol
Cancer. 2019;18(1):40. Published 2019 Mar 13.
Ren D, Yang Q, Dai Y, et al. Oncogenic miR-210-3p promotes prostate
cancer cell EMT and bone metastasis via NF-κB signaling pathway. Mol
Cancer. 2017;16(1):117. Published 2017 Jul 10.
Ma D, Li Y, Wu K, et al. Multi-arm RNA junctions encoding molecular
logic unconstrained by input sequence for versatile cell-free
diagnostics. Nat Biomed Eng. 2022;6(3):298-309.
Yong Z, Siyu Ye, Xianqi C, et al. Polynomial Response Surface based on
basis function selection by multitask optimization and ensemble
modeling[J].Complex & Intelligent Systems,2022,Vol.8(2): 1015-1034