Overview
To screen patients with cardiovascular diseases for their risk of
cancer, our project aims to develop a LIRA-based system to target miRNA
that are both highly expressed in CVD patients and capable of promoting
cancer. LIRA is a designed RNA sequence with stem-loop structures, which
could be activated by its target RNA to initiate the expression of
reporter gene. To establish an ideal LIRA system, our model group mainly
need to solve the following four questions.
Question 1:
Which miRNA to target?
Question 2:
How to design LIRA to target miRNA?
Question 3:
How to improve the design of LIRA?
Question 4:
How to tailor LIRA-mediated reaction for samples with miRNA at
different concentrations?
To answer
Question 1
, we applied differential expression analysis in GEO database to
identify miRNA that are highly expressed in patients post myocardial
infarction (MI) or with heart failure. After differential analysis of
miRNA in cancer database from TCGA, we identified miR-210-3p and
miR-142-3p as ideal targets, as they are both highly expressed in
multiple types of cancer and in post MI patients. Additionally, we also
verified that these two miRNAs could promote migration and invasion of
cancer cells with wet lab experiments.
To address
Question 2
and
Question 3
, we designed single-arm LIRAs to target miR-210-3p or miR-142-3p
initially. Next, we designed double-arm LIRAs to target miR-210-3p and
miR-142-3p, which could only be activated in the presence of both
miRNAs. To optimize the design of LIRA, we analyzed multiple indicators
of LIRA, and used Multinomial Logistic Regression to evaluate the
potential contribution of these indicators to the expected performance
to LIRA. To improve model performance, we introduced Random Forest to
identify key indicators that could affect the stability and function of
LIRA.
To answer
Question 4
, we conducted kinetic modeling to simulate the reaction of
LIRA-mediated expression of reporter gene to obtain predicted time and
concentration curves of the reaction. Furthermore, using 3D curves, we
simulated the reaction with miR-210-3p and miR-142-3p at different
concentrations, and estimated suitable concentration of LIRA for
detecting miRNAs at a variety of concentration.
In summary, our model work laid the ground for our wet lab experiments
that generates the LIRA to target miR-210-3p and miR-142-3p, and paved
the way for the future application of the LIRA in the screening of
cancer risk in post-MI patients.
ldentify target miRNA for cancer screening in CVD patients
Epidemiology studies have revealed that patients with CVDs, such as
myocardial infarction (MI), heart failure (HF) and atherosclerosis, have
increased risk of getting cancer[1-4]. For instance, MI
patients exhibit increased prevalence of a variety of cancers and also
elevated mortality that are caused by cancer[1]. HF patients
also show higher cancer risk compared to healthy
individuals[5]. Experiments using animal models reveal that
MI enhances tumor progression through circulating factors and immune
reprogramming[6,7]. Especially, MI leads to aberrant miRNA
secretion from the inflicted heart into blood, which could circulate to
distant organs and contribute to cancer progression at distant
organs[8]. To find good targets for our cancer screening for
CVD patients, we analyzed database of CVD, including MI and HF, and
databases of multiple cancers, to identify miRNA that show increased
expression in both CVD and cancer, and then verified the function of the
identified miRNA on cancer progression.
Analysis of miRNAs in MI and cancer databases
The whole process of our analysis of MI and cancer database is shown in
Figure 1. First, we searched Gene Expression Omnibus (GEO) database
using myocardial infarction and miRNA as key words and collected 318
datasets. Since our aim is to identify highly expressed miRNA in the
blood of MI patients, we manually excluded 305 irrelevant datasets based
on the sample sources, species and disease descriptions. To the
remaining 13 datasets, we calculated the log2FC value of the
miRNAs based on the ratio of the mean values of the patient samples and
control and applied logarithmic transformation to the
ratio[9]. We used Student's t-test to obtain P-values for
each miRNA. Furthermore, to reduce the occurrence of false positives, we
adjusted the P-values by comparing these P-values with the threshold
selected for an acceptable false discovery rate (FDR)[10],
and used adj.P value to refer FDR. We used the log2FC > 1
and adj.P Value < 0.05 as our screening criteria[11], and
got 9 datasets that contained significantly upregulated miRNA.
Figure 1. Flowchart of miRNA analysis in MI and cancer database
Second, we summarized multiple epidemiology studies and identified 33
types of cancers showing faster progression or higher risk in
CVD[12-16]. We conducted differential expression analysis of
miRNA expression of these 33 types of cancer using the Cancer Genome
Atlas Program (TCGA) database and obtained 22 cancer datasets that show
significantly upregulated expression of miRNA using our screening
criteria (log2FC > 1 and adj.P Value < 0.05). The names
of these 22 types of cancers are listed in Table 1.
Table 1. The 22 types of cancer that show significantly upregulated
expression of miRNA
Abbreviation | Disease Name | Abbreviation | Disease Name |
---|---|---|---|
BLCA | Bladder Urothelial Carcinoma | LUAD | Lung Adenocarcinoma |
BRCA | Breast Invasive Carcinoma | LUSC | Lung Squamous Cell Carcinoma |
CESC | Cervical Squamous Cell Carcinoma & Endocervical Adenocarcinoma | PAAD | Pancreatic Adenocarcinoma |
CHOL | Cholangiocarcinoma | PCPG | Pheochromocytoma & Paraganglioma |
COAD | Colon Adenocarcinoma | PRAD | Prostate Adenocarcinoma |
ESCA | Esophageal Carcinoma | READ | Rectum Adenocarcinoma |
HNSC | Head And Neck Squamous Cell Carcinoma | SKCM | Skin Cutaneous Melanoma |
KICH | Kidney Chromophobe | STAD | Stomach Adenocarcinoma |
KIRC | Kidney Renal Clear Cell Carcinoma | THCA | Thyroid Carcinoma |
KIRP | Kidney Renal Papillary Cell Carcinoma | THYM | Thymoma |
LIHC | Liver Hepatocellular Carcinoma | UCEC | Uterine Corpus Endometrial Carcinoma |
We compared the significantly upregulated miRNAs in MI and cancers to
identify miRNAs that are significantly upregulated in both diseases and
obtained 24 miRNAs. We think that two criteria are especially important
for the potential effects on cancer promotion of these 24 miRNAs during
CVD:
1. log2FC values in MI. A high log2FC value
of the miRNA in MI demonstrates a large increased amount of this miRNA
in the blood of MI patients, which indicates that a large amount of this
miRNA is likely to reach distant organs through circulation system and
exerts its potential cancer-promoting effects at distant organs.
2.
Types of cancers that show significantly upregulated expression of
these miRNAs.
MiRNAs that are highly expressed in multiple types of cancers could
potentially influence the progression of multiple types of cancers. We
would like to design a screening kit to identify MI patients with
increased risk of getting of a variety of cancers, instead of a certain
type of cancer.
We summarized this two key information of the 24 miRNAs in Table 2.
Table 2. miRNAs significantly upregulated in both MI and cancers
miRNA ID | log2FC Value in MI | Types of Cancers | miRNA ID | log2FC Value in MI | Types of Cancers |
---|---|---|---|---|---|
miR-210-3p | 6.18 | 13 | miR-320b | 1.49 | 1 |
miR-142-3p | 3.51 | 10 | miR-199a-3p | 1.48 | 3 |
miR-455-5p | 3.09 | 1 | miR-362-3p | 1.39 | 1 |
miR-483-3p | 2.79 | 1 | miR-582-3p | 1.31 | 1 |
miR-199b-5p | 2.31 | 3 | miR-146b-5p | 1.26 | 9 |
miR-629-3p | 2.19 | 1 | miR-455-3p | 1.23 | 5 |
miR-330-5p | 2.14 | 4 | miR-486-5p | 1.17 | 2 |
miR-483-5p | 2.02 | 1 | miR-28-5p | 1.15 | 1 |
miR-576-5p | 1.95 | 4 | miR-671-5p | 1.06 | 3 |
miR-107 | 1.82 | 1 | miR-628-5p | 1.05 | 1 |
miR-450b-5p | 1.58 | 3 | miR-142-5p | 1.03 | 4 |
miR-199a-5p | 1.52 | 1 | miR-127-5p | 1.01 | 4 |
To further evaluate the log2FC in MI and types of cancers of
these 24 miRNAs, we applied a scatter dots graph to visualize these data
(Figure 2). MiR-210-3p and miR-142-3p distinguish from other miRNAs in
this graph, as they show both much higher log2FC values in MI
and much more types of cancers that showed significantly upregulated
expression of them than other miRNAs.
Figure 2. Scatter dots graph of the 24 miRNA on log2FC in MI
and Types of Cancers
As shown in Table 2, miR-210-3p ranks the highest in log2FC
value in MI (6.18), which demonstrate 73 folds of increased expression
in MI patients, and show significantly upregulated expression in 13
types of cancers, with log2FC values in cancer ranged from
1.02 to 5.04. The log2FC values of miR-210-3p in 13 types of
cancers are shown in Table 3.
Table 3. Types of cancers with significantly upregulated expression of
miR-210-3p
Types of Cancers | log2FC Value in Cancer |
---|---|
Cervical squamous cell carcinoma and endocervical adenocarcinoma |
5.04 |
Bladder Urothelial Carcinoma | 4.80 |
Lung Adenocarcinoma | 4.54 |
Lung Squamous Cell Carcinoma | 4.51 |
Uterine Corpus Endometrial Carcinoma | 3.14 |
Breast Invasive Carcinoma | 3.14 |
Kidney Renal Clear Cell Carcinoma | 3.07 |
Head and Neck Squamous Cell Carcinoma | 2.52 |
Kidney Renal Papillary Cell Carcinoma | 1.82 |
Esophageal Carcinoma | 1.58 |
Stomach Adenocarcinoma | 1.47 |
Prostate Adenocarcinoma | 1.30 |
Liver Hepatocellular Carcinoma | 1.02 |
miR-142-3p ranks the second highest in log2FC value in MI
(3.51), which demonstrated 11 folds of increased expression in MI
patients, and show increased expression in 10 types of cancers, with
log2FC values in cancers range from 1.04 to 6.61. The
log2FC values of miR-142-3p in 10 types of cancers are shown
in Table 4.
Table 4. Types of cancers with significantly upregulated expression of
miR-142-3p
Types of Cancers | log2FC Value in Cancer |
---|---|
Rectum Adenocarcinoma | 6.61 |
Colon Adenocarcinoma | 6.55 |
Cervical squamous cell carcinoma and endocervical adenocarcinoma |
3.58 |
Lung Adenocarcinoma | 2.29 |
Kidney Renal Clear Cell Carcinoma | 2.03 |
Breast Invasive Cancer | 1.87 |
Uterine Corpus Endometrial Carcinoma | 1.27 |
Bladder Urothelial Carcinoma | 1.24 |
Lung Squamous Cell Carcinoma | 1.09 |
Head and Neck Squamous Cell Carcinoma | 1.04 |
To summarize, our analysis shows that miR-210-3p and miR-142-3p are the
best target miRNAs to identify MI patients with high risk of getting
cancers.
Analysis of miRNA in HF and cancer databases
HF is a condition resulting from the impairment of cardiac contractility
and/or diastolic function[17].Current study also reveals that
HF can promote cancer[6]. Hence we also analyzed miRNAs that
are highly expressed in HF patients in GEO database. The analysis
process is shown in the flowchart (Figure 3).
Figure 3. Flowchart of miRNA analysis in HF and cancer database
We collected 158 datasets from GEO database using heart failure and
miRNA as key words. After we manually excluded 151 irrelevant datasets
based on the sample sources, species and disease descriptions, we got 7
datasets to analyze miRNA expression. Using log2FC > 1 and
adj.P Value < 0.05 as screening criteria, we got 6 datasets that
contain significantly upregulated miRNAs in HF. We compared the
significantly upregulated miRNAs in HF with those in cancers, which was
described in the previous section, we got 35 miRNAs that show
significantly upregulated expression in both diseases. We summarized the
log2FC in HF and types of cancers that show significantly
upregulated expression of these miRNAs in Table 5.
Table 5. miRNAs significantly upregulated in both HF and cancers
miRNA ID | log2FC Value in HF | Types of Cancers | miRNA ID | log2FC Value in HF | Types of Cancers |
---|---|---|---|---|---|
miR-944 | 5.60 | 1 | miR-331-5p | 1.75 | 1 |
miR-223-3p | 4.45 | 5 | miR-203a-3p | 1.70 | 6 |
let-7f-5p | 4.26 | 2 | miR-628-5p | 1.51 | 2 |
miR-15a-5p | 3.54 | 4 | miR-143-5p | 1.47 | 1 |
miR-16-5p | 3.46 | 5 | miR-18a-5p | 1.45 | 5 |
let-7g-5p | 3.25 | 2 | miR-15b-5p | 1.45 | 3 |
miR-126-3p | 3.20 | 2 | miR-301a-3p | 1.41 | 7 |
miR-19b-3p | 2.60 | 4 | miR-106a-5p | 1.30 | 8 |
miR-451a | 2.56 | 4 | miR-20b-5p | 1.24 | 4 |
miR-26b-5p | 2.52 | 2 | miR-17-5p | 1.23 | 13 |
miR-330-5p | 2.50 | 4 | miR-205-5p | 1.19 | 6 |
miR-4746-5p | 2.32 | 2 | miR-484 | 1.16 | 1 |
miR-455-3p | 2.19 | 5 | miR-642a-5p | 1.13 | 1 |
miR-9-5p | 2.19 | 7 | miR-944 | 1.12 | 1 |
miR-142-3p | 2.13 | 10 | miR-7-1-3p | 1.10 | 4 |
miR-1976 | 2.11 | 1 | miR-425-5p | 1.08 | 7 |
miR-200c-3p | 2.01 | 10 | miR-4662a-5p | 1.06 | 1 |
miR-125a-3p | 1.94 | 1 |
To further evaluate the log2FC in HF and types of cancers of
these 35 miRNAs, we applied a scatter dots graph to visualize these data
(Figure 4). Unlike the results of analysis in MI, these miRNAs all
clustered in the lower-left part of the graph, demonstrating that no
miRNA showed superior performances on both log2FC values in
HF and types of cancers that showed significantly upregulated
expression.
Figure 4. Scatter dots graph of the 24 miRNAs on log2FC in HF
and Types of Cancers
In summary, we did not identify ideal miRNAs as targets for cancer
screening from our analysis of HF database.
Selection of miR-210-3p and miR-142-3p as target miRNAs for cancer
screening in MI patients
From our analysis of MI database, we found that miR-210-3p and
miR-142-3p are two outstanding candidates biomarkers to identify MI
patients with high risk of cancer. We used an ROC curve to evaluate the
diagnostic performance of miR-210-3p and miR-142-3p on our collected raw
dataset samples from MI (Figure 5). We found that combined ROC curve of
miR-210-3p and miR-142-3p (AUC=0.78) showed better results than ROC of
miR-210-3p alone (AUC=0.66) and ROC of miR-142-3p alone (AUC=0.60),
which demonstrates that using miR-210-3p and miR-142-3p as target miRNA
simultaneously could decrease the probability of false discovery rate of
detecting MI than using these 2 miRNAs as target miRNA separately.
Figure 5. ROC Curves with miR-210-3p and miR-142-3p in MI
We also revisited our HF screening dataset to specifically analyze
miR-142-3p and miR-210-3p. MiR-142-3p was identified from our analysis
of HF dataset with log2FC value (2.13), which demonstrated a
5 folds increased expression (Figure 4). MiR-210-3p failed to be
selected from our analysis of HF dataset due to the log2FC
value (0.67) did not pass our screening criteria.
From our analysis, we suggest using both miR-210-3p and miR-142-3p
together as targets to screen MI patients to identify those with high
risks of getting cancer.
Verification of miR-210-3p and miR-142-3p in cancer cells
Through bioinformatic analysis, we identified miR-210-3p and miR-142-3p
as potential targets for screening cancer risk in MI patients. We did
literature research of these two miRNA, and found that they are involved
in the regulation of migration and invasion of cancer cells[18-21]
.To verify whether miR-210-3p and miR-142-3p can promote migration and
invasion of cancer cells, mimics and inhibitors of miR-210-3p or
miR-142-3p were transfected into human renal clear cell carcinoma cell
line 786-o cells, and the transfected cells were tested by Wound Healing
and Transwell assays.
Ⅰ. miR-210-3p
To verify the function of miR-210-3p in cancer cells, we did Transwell
and Wound Healing assays with 786-o cells transfected with mimics of
miR-210-3p (mimics) and inhibitor of miR-210-3p (inhibitor). In Wound
Healing assay, we evaluated the relative migration levels through
measuring the wound area of 786-o cells transfected with control, mimics
and inhibitor of miR-210-3p. Statistical analysis of the Wound Healing
assays shows that 786-o cells transfected with mimics of miR-210-3p
migrated faster than control cells and 786-o cells transfected with
inhibitor of miR-210-3p migrated slower than control cells (Figure 6).
Next, we evaluate the invasion levels through counting the amounts of
786-o cells that colonized in the lower well of transwell assays.
Statistical analysis demonstrated that 786-o cells transfected with
mimics of miR-210-3p showed higher invasion capability than control
cells and 786-o cells transfected with inhibitor of miR-210-3p showed
lower invasion ability than control cells (Figure 7). To summarize,
786-o cells with overexpression of miR-210-3p have increased migration
and invasion capabilities, demonstrating that miR-210-3p could promote
the migration and invasion of cancer cells.
Figure 6. Wound Healing assays of 786-o cells transfected with control,
mimics-210-3p and inhibitor-210-3p. (A) Representative images of Wound
Healing assays. (B) Statistical analysis of relative migration levels in
the wound healing assays (n = 6). Data are presented as means ± SEM. *
indicates p < 0.05, and ** indicates p < 0.01. Standard error and
statistics are shown in graph.
Figure 7. Transwell assays of 786-o cells transfected with control,
mimics-210-3p and inhibitor-210-3p. (A) Representative images of
Transwell assays. (B) Statistical analysis of relative invasion levels
in the Transwell assay (n=5). Data are presented as means ± SEM. ***
indicates p < 0.001. Standard error and statistics are shown in
graph.
Ⅱ. miR-142-3p
Similarly, to verify the function of miR-142-3p in cancer cells, we did
Transwell and Wound Healing assays with 786-o cells transfected with
mimics of miR-142-3p (mimics) and inhibitor of miR-142-3p (inhibitor).
In Wound Healing assay, we evaluated the relative migration levels
through measuring the wound area of 786-o cells transfected with
control, mimics and inhibitor of miR-142-3p. Statistical analysis of the
Wound Healing assays shows that 786-o cells transfected with mimics of
miR-142-3p migrated faster than control cells and 786-o cells
transfected with inhibitor of miR-142-3p migrated slower than control
cells (Figure 8). Next, we evaluate the invasion levels through counting
the amounts of 786-o cells that colonized in the lower well of Transwell
assays. Statistical analysis demonstrated that 786-o cells transfected
with mimics of miR-142-3p showed higher invasion capability than control
cells and 786-o cells transfected with inhibitor of miR-142-3p showed
lower invasion ability than control cells (Figure 9). In summary, 786-o
cells with overexpression of miR-142-3p have increased migration and
invasion capabilities, demonstrating that miR-142-3p could promote the
migration and invasion of cancer cells.
Figure 8. Wound Healing assays of 786-o cells transfected with control,
mimics-142-3p and inhibitor-142-3p. (A) Representative images of Wound
Healing assays.(B) Statistical analysis of relative migration levels in
the wound healing assays (n = 6). Data are presented as means ± SEM. *
indicates p < 0.05. Standard error and statistics are shown in graph.
Figure 9. Transwell assays of 786-o cells transfected with control,
mimics-142-3p and inhibitor-142-3p. (A) Representative images of
Transwell assays. (B) Statistical analysis of relative invasion levels
in the Transwell assay (n=5). Data are presented as means ± SEM. ***
indicates p < 0.001. Standard error and statistics are shown in
graph.
Together, the results of transwell and Wound Healing assays demonstrated
that increased expression of miR-210-3p and miR-142-3p in MI patients
might enhance the invasion and migration of cancer cells, which supports
our idea of developing a screening kit that targets miR-210-3p and
miR-142-3p to MI patients with high risk of cancer.
Design LIRA to target miR-210-3p and miR-142-3p
To design a screening approach that can target miR-210-3p and
miR-142-3p, we adapted from an innovative approach named Loop-Initiated
RNA Activator (LIRA) that was used to detect viral RNA[22].
As shown in Figure 10, LIRA forms a stem-loop structure that mainly
contains a recognition region, a ribosome binding sequence (RBS) and a
start codon AUG. Without interaction with target RNA, the RBS and start
codon in LIRA are kept in a close status in the stem structure, making
them impossible to initiate translation of downstream reporter gene. The
recognition region is a 31 nucleotides (nt) sequence that is reverse
complimentary to the target RNA of LIRA. While 21 nt of the reverse
complimentary sequence is designed to locate in the loop structure of
LIRA, which is named as a*, 10 nt of the reverse complimentary sequence
is designed to locate in the stem structure of LIRA, which is named as
b*. As a result, the target RNA of LIRA can bind to both a* and b*, and
the binding of the target RNA to b* destabilize the stem structure that
locks RBS and start codon, which makes them to be exposed in an open
status for the translation initiation of downstream reporter gene. Due
to the complexity of secondary structure and complicated hybridization
process of RNA, we used NUPACK software to design and analyze LIRA
sequences in the subsequent study.
Figure 10. Illustration of LIRA system[22]
Design LIRAs to target miR-210-3p and miR-142-3p separately
To design LIRA sequences that target miR-210-3p and miR-142-3p
separately, we first determined the composition and structure of each
part of LIRA, and replaced the recognition region with reverse
complementary sequence of miR-210-3p and miR-142-3p. We used the design
function of NUPACK to design the rest of LIRA sequences. Next, we used
the analysis function of NUPACK to simulate the hybridization process of
LIRA, and the nucleotide bases in the mismatch region of the LIRA with
abnormal structure were modified to achieve our design expectation,
which is to keep RBS and AUG at a closed status in a stem region without
interactions with target miRNA and release RBS and AUG to an open status
in a single strand RNA with interactions with target miRNA. The
procedure of our LIRA designing is listed below.
Figure 11. Parameter settings for single-arm LIRA design. (A) Set Domian
LIRA Toehold With 1 Segment of Sub-Sequences. (B) Domian miRNA settings.
(C) Strands settings. (D) Complex and Target Structure settings.
During the process of designing, we found that our situation differed
from the case reported in the literature in that the target RNA used in
the literature was 31nt, while the target miRNAs used by our team were
22nt and 23nt respectively[18]. Due to the difference in the
length of the target RNA, we were unable to use the way of LIRA design
reported in the literature directly. To determine the distribution of
the recognition region on our LIRA, we defined a concept of stem-loop
ratio: the ratio of the number of nucleotides of recognition region in
the stem structure to the number of nucleotides of recognition region in
the loop structure. We tried to design LIRA structure using different
stem-loop ratios. We found that when the number of nucleotides of the
recognition region in the stem is too large, it will partially overlap
with the complementary sequence of RBS, which will lead to the exposure
of the RBS sequence. Therefore, we designed LIRA to miR-210-3p with the
stem-loop ratios from 0/22 to 13/9 and LIRA to miR-142-3p from 0/23 to
13/10. The hybridization of LIRA to their target miRNAs was also
analyzed with NUPACK. The pairing results and related data are shown in
Table 6 and Table 7.
Table 6. Hybridization results of 14 LIRA structures of miR-210-3p and
related data
Stem-loop ratio | Single-arm sequence | Conformation of LIRA alone | Conformation of LIRA hybridized with miR-210-3p |
---|---|---|---|
0/22 | CUUGCUGGUGCUCCCUUUCCUGGGGUUCGGUCAUCAGCCGCUGUCACACGCACAGGAUAGGAGGAGAGGAGAGGAGACAUGUGUGAG | ||
1/21 | GAGUCGAAUCCUAAUUUUUCUGAGUACGAGUAUCAGCCGCUGUCACACGCACAGAUACGCGUACAGAGGAGAGAGAGGAUGAGGUUC | ||
2/20 | GUCUAGGAUGUGGGGUCUCUUAGCCGCCGGCUCAGCCGCUGUCACACGCACAGAGAGUGGGUGGAGAGGAGAUUAAAUAUGAUAGAC | ||
3/19 | CUCUCAGAUUUGAGCUCUUUUAGGUGUAGGUCAGCCGCUGUCACACGCACAGAAUGGCGUACACAGAGGAGAGGGAAAAUGAGAGAG | ||
4/18 | GAGUCGAGUGCGACAUCUCCUGAUGUCCGUCAGCCGCUGUCACACGCACAGAAACUGAGGGGCAAGAGGAGAUGAAGCAUGAGGGAC | ||
5/17 | GUCCGGGGUCAUGUCUCUUCUAACUUGCUCAGCCGCUGUCACACGCACAGAAAUGGGGUGUAGGAGAGGAGAGGGUUGAUGACGGAC | ||
6/16 | CCUGCGGAUCGUUGCUCUCCUGAUGUUUCAGCCGCUGUCACACGCACAGAAUGAGGGGCGGGUAAGAGGAGAGUGGGGAUGGGGGGG | ||
7/15 | CGGUGGAAUUGAACGUCUUCUGAUGGUCAGCCGCUGUCACACGCACAGGAGCUCCGGCGGCGGAAGAGGAGAUGCGCGAUGACACUG | ||
8/14 | CCUCGAGAUCUUUCAUUUCCUAACGUCAGCCGCUGUCACACGCACAGAAAAUGAGUGGGUGGUGAGAGGAGAUGGGAGAUGGUGGGG | ||
9/13 | GUGUAAAGUUUAGCUUUUUCUAAUUCAGCCGCUGUCACACGCACAGAAUGGCUAGGCGACUGGAAGAGGAGAGGAGAGAUGAUGUAC | ||
10/12 | CUGUCGGAUAUCUCAUCUUUUACUCAGCCGCUGUCACACGCACAGAUAAAAAGUCAGUCGCUGAAGAGGAGAUGGUAUAUGAGAUAG | ||
11/11 | CUACCAGGUAACAAGUUUUCUGUCAGCCGCUGUCACACGCACAGGAUAACACACACAGGGGUUGAGAGGAGAUUAUUUAUGAGGUAG | ||
12/10 | GUAUGGGAUGAAACAUUUUCUUCAGCCGCUGUCACACGCACAGGAUUUAAAGAAGGCGACGGUUAGAGGAGAUGGAUUAUGGUAUAC | ||
13/9 | GGCGGGAAUACAUAUUUUUCUCAGCCGCUGUCACACGCACAGUAAAAAAUAAAUUGACGGGGGCAGAGGAGAGGGGGGAUGAUCGUU |
Table 7. Hybridization results of 14 LIRA structures of miR-142-3p and
related data
Stem-loop ratio | Single-arm sequence | Conformation of LIRA alone | Conformation of LIRA hybridized with miR-142-3p |
---|---|---|---|
0/23 | GCUGCUGAUGAAACAUCUCCUAACAGCGUCAGUCCAUAAAGUAGGAAACACUACAGAGUGGGGGAGAGGAGAUGGCAAAUGUGUAGC | ||
1/22 | CUAAGAAGUCUCUGUUCUUCUAAGCAGGACGUUCCAUAAAGUAGGAAACACUACAGCGCGGUGCAGAGGAGAACGGAGAUGACGGGG | ||
2/21 | CCUACGGAUUGGGCGUCUUCUAAUCUGUCGGUCCAUAAAGUAGGAAACACUACAGACCAGCGGGAGAGGAGACGGAAAAUGCGAAGG | ||
3/20 | GUGGCUAAUCUGACGUCUUCUGAGAUCGCGUCCAUAAAGUAGGAAACACUACACGGACACGGUGAGAGGAGAUGAAAGAUGGGAAAC | ||
4/19 | GUUCGUAAUUCUGACUCUCUUAGUUGCCAUCCAUAAAGUAGGAAACACUACACCUGGGUGGGGAAGAGGAGAGUGUGAAUGGGGGGC | ||
5/18 | GAUAAAGGUGGGGGUUCUUCUGGCCGUCUCCAUAAAGUAGGAAACACUACACGCGUGGCGGCGGAGAGGAGAAUGGGGAUGGUUAUC | ||
6/17 | GAAGGGAGUGGAAUUUUUCUUGGCAGGUCCAUAAAGUAGGAAACACUACACCGAGGGGGGGGUGAGAGGAGAGACGUUAUGAGGGGC | ||
7/16 | GCGCUGAAUUAGUGUUCUCUUGGUAUUCCAUAAAGUAGGAAACACUACAGGGGAGGAUAGAAUAAGAGGAGAGCUGUAAUGAAAUGC | ||
8/15 | GAAGCAAGUACUAGAUCUUCUAAGAUCCAUAAAGUAGGAAACACUACACGAAGCGGGAAGGGUUAGAGGAGAUUGGGUAUGAGGUUC | ||
9/14 | GUAGUGAGUUAUCCGUCUCCUGAAUCCAUAAAGUAGGAAACACUACACACCCCGGGGGCUGGAUAGAGGAGACGUUUAAUGAACUGC | ||
10/13 | GGUCGGGAUUUAGCUUCUUCUGAUCCAUAAAGUAGGAAACACUACAUAGGGACAACUUAGUGGAAGAGGAGAAGGGAAAUGGCGAUC | ||
11/12 | GUGUUAAAUGUAAUGUUUUCUGUCCAUAAAGUAGGAAACACUACAUACGACGAAUACUAGGGGGAGAGGAGAUGGAACAUGAGAUAC | ||
12/11 | CCACCGAAUGCCUAUUCUCCUUCCAUAAAGUAGGAAACACUACACUGGGCCACGCGGGGGUAGGAGAGGAGAAAAAGCAUGAGGUGG | ||
13/10 | GCGUGAGAUCACCGUUCUUCUCCAUAAAGUAGGAAACACUACAAGGAUGUGAUACGAAGUGUAUAGAGGAGAGUCGGGAUGAGGGGU |
We think that the greater the free energy of a LIRA, the easier it is
for LIRA structures to be paired with miRNAs for translation; the
smaller the free energy of a LIRA, the more stable the structure of LIRA
is. Therefore, we chose free energy as an important index to analyze the
hybridization of LIRAs to their target miRNAs. We calculated the changes
of free energies of the LIRA before and after hybridization and fitted
the changes of free energy to stem-loop ratio. From the fitting curve,
it seems that there is no regularity between the changes of free energy
and stem-loop ratio (Figure 12 and Figure 13).
Figure 12. Fitting curves of the free energy of LIRA to miR-210-3p
Figure 13. Fitting curves of the free energy of LIRA to miR-142-3p
Since the LIRA sequence designed by NUPACK algorithm in the unrestricted
region is random, which causes large variations among these designed
LIRA. Therefore, we designed 10 sequences for each stem-loop ratio and
averaged the free energy values of these LIRA. A new fitting was
performed with the averaged free energy and stem-loop ratio (Figure 14
and Figure 15). The new fitting curve demonstrates that there is no
linear relationship between LIRA free energy and stem-loop ratio,
indicating that LIRA would not become more or less stable as stem-loop
ratio increase or decrease.
Figure 14. New fitting curve of miR-210-3p
Figure 15. New fitting curve of miR-142-3p
Next, we calculated the changes of free energy as the differences of
free energy of LIRA before and after hybridization with its target
miRNA, and analyzed the relation between LIRA free energy and the
changes of free energy. We observed that the data had a clear linear
relationship, so we linearly fitted it. The fitting results are shown in
Figure 16 and Figure 17. The slope of the fitted curve is approximately
equal to -1, which indicates that the free energy of LIRA after
hybridization with its target miRNA is stable at a constant value. The
free energy of LIRA hybridized with miR-210-3p is around -54 kcal/mol,
and the free energy of LIRA hybridized with miR-142-3p is around -44
kcal/mol. These results show that free energy of LIRA before
hybridization might be a more important indicator to determine the
quality of a LIRA.
Figure 16. Linear fitting results of miR-210-3p
Figure 17. Linear fitting results of miR-142-3p
Furthermore, we averaged 10 LIRA free energies at the same stem-loop
ratio, which could represent the LIRA free energy and hybridization free
energy at certain stem-loop ratio. Subsequently, we performed a
clustering with these 28 points, and observed a linear relationship
between LIRA free energy and the averaged changes of free energy (Figure
18 and Figure 19).
Figure 18. miR-210-3p clustering results
Figure 19. miR-142-3p clustering results
From our analysis of LIRA to miR-210-3p and miR-142-3p, we concluded
that stem-loop ratio has little effect on the free energy of LIRA. For
our subsequent wet lab experiments, we chose 3 LIRA sequences for
miR-210-3p with stem-loop ratios of 2/20, 7/15, and 10/12, and 3 LIRA
sequences for miR-142-3p with stem-loop ratios of 3/20, 8/15, and 11/12.
We selected these 6 LIRA sequences to verify whether stem-loop ratio
indeed had no effect on the performance of LIRA. Meanwhile, since the
free energies of these 6 LIRAs are very different, the influence of the
LIRA free energy on the functions of LIRA can also be investigated
(Table 8). In this regard, we expect that a stable LIRA structure is
often accompanied by a lower free energy.
Table 8. 6 LIRA data
miRNA | Stem-loop ratio |
Free energy of LIRA before hybridization(kcal/mol) |
---|---|---|
miR-210-3p | 2/20 | -66.61 |
miR-210-3p | 7/15 | -62.10 |
miR-210-3p | 10/12 | -56.54 |
miR-142-3p | 3/20 | -54.83 |
miR-142-3p | 8/15 | -47.89 |
miR-142-3p | 11/12 | -41.10 |
Design double-arm LIRA to target miR-210-3p and miR-142-3p
simultaneously
To target miR-210-3p and miR-142-3p simultaneously, we designed LIRAs
with double-arm structure that can target two target RNA[22].
As shown in Figure 20, double-arm LIRA has two recognition regions, A*
and B*. While A* is located on the short arm of the double-arm
structure, B* is located on the long arm of the double-arm structure,
which is connected to RBS and AUG. We think that putting one miRNA
reverse complimentary sequence in A* and the other miRNA reverse
complimentary sequence in B* would make our LIRA targeting two miRNAs.
Our expectation is that only when the LIRA binds to the target miRNA A
and then bind to the target miRNA B, it would expose the RBS and AUG in
an open status to initiate the translation of the downstream reporter
gene, and otherwise the RBS and AUG would be locked to at a closed
status in stem region.
Figure 20. Double-arm LIRA structure[22]
We identified the composition of each part of double-arm LIRA and
replaced the recognition region with sequences that were reverse
complimentary to miR-210-3p and miR-142-3p. Utilizing the design
function of NUPACK, we designed a primary double-arm LIRA structure and
obtained the sequence. The following is the design procedures of LIRA
sequences to detect miR-210-3p and miR-142-3p in NUPACK.
Figure 21. Parameter settings for double-arm LIRA design. (A) Set Domian
LIRA Toehold With 1 Segment of Sub-Sequences. (B) Domian miRNA settings.
(C) Strands settings. (D) Complex and Target Structure settings.
During the process of simulation, we utilized NUPACK to generate 100
double-arm LIRAs. First, we hybridized each double-arm LIRA to both
miR-210-3p and miR-142-3p , and found that 32 double-arm LIRAs could
only hybridize to a single miRNA. We analyzed the results of the
remaining 68 LIRA that could hybridize to both miRNAs, and 21 LIRAs
did not form a correct base-pairing structure between RBS and AUG. We
then modified the bases between RBS and AUG in these 21 LIRAs
according to the hybridization results to ensure that RBS and AUG
could be successfully exposed after hybridization. Finally, we
analyzed the sequences at other regions of these 21 LIRAs, and found
that 15 of them would destabilize AUG and RBS when LIRA interacts with
miR-210-3p alone. We finally get 6 ideal double-arm LIRA that meet the
design expectation, e.g., the RBS and AUG remain in a close status
without interaction with miR-210-3p and miR-142-3p, and keep in an
open status with interactions with miR-210-3p and miR-142-3p. The
simulation results of these 6 LIRAs are shown in Table 9.
Table 9. Hybridization results of 6 double-arm LIRA sequences to
detect 2 miRNAs
Number | Sequence | Conformation of LIRA alone |
Conformation of LIRA hybridized with miR-210-3p and miR-142-3p |
---|---|---|---|
1 | AGCCUCAGCGGUCGACCCGCGGUCCAUAAAGUAGGAAACACUACAUUUGUGGGGAGCGGGGUAUAUCAUCCUCUCCUAGUCAGCCGCUGUCACACGCACAGCGGCUGGAGAGGAGAAUAUGAUAUACGGGGCCGAGGGGGCU | ||
2 | GUCCACGUUGAGCGUCCAGUCCAUAAAGUAGGAAACACUACAUUUGUGGAAAGGAUGGGUGCCUGUCCGAUCUCAGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUGAUAUGGGCGCCGACAACGCAGAU | ||
3 | CGAUUUUCUCUUGCAGGAAUCCAUAAAGUAGGAAACACUACAUUUAUGGACGCCUGCCGGUCCCGUACAUUCUCAACUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAAUACAUGGGGCUGUCGAGAACUUCG | ||
4 | GCACGACGCCUUUCGAGUUUCCAAAAAGUAGGAAACACUACAUUUGUGGACUUUCGACUUAGUCAUCAACUCUCAACUUCAGCCGCUGUCACACGCACAGUGGUAGAAGAGGAGAGAGAAUGACUAAGCUGGUGGCGUGC | ||
5 | CGGUUCCGCUAAGGCAUCAUCCAUAAAGUAGGAAACACUACAUUUAUGGAACAUGCCCAUGUUUAUCCGUUCUCGACUUCAGCCGCUGUCACACGCACAGUGGCUGAAGAGGAGAAUAUAUGGAUAUGAUGGUGGCCCCG | ||
6 | GUCCACGUUGAGCGUCCAGUCCAUAAAGUAGGAAACACUACAUUUGUGGAAAGGAUGGGUGCCUGUCCGAUCUCAGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUAAAAUGGGCGCCGACAACGCAGAU |
Optimization of double-arm LIRA to miR-210-3p and miR-142-3p
Setting indicators by the characteristic of LIRA
To generate an ideal double-arm LIRA sequence to miR-210-3p and
miR-142-3p, we would like to optimize the LIRA sequences that we
designed in the previous section. To facilitate the optimization
process, we need to identify the indicators that could predict the
performances of LIRAs. For our analysis of the indicators, we focus on
the bases between RBS and AUG, as we think that the status of these
bases determines the status of LIRA mostly. In the case that these
bases are paired with other bases to form stem structure, LIRA should
be in a close status. On the contrary, in the case that these bases
are unpaired, LIRA should be in an open status to initiate
transcription of reporter genes. From NUPACK, we could get the
"unpaired probability" for any base, which calculates the probability
that such base is unpaired at equilibrium, and the "paired
probability" for any base, which calculates the probability that the
base is paired with another base at equilibrium. We define indicator
"unlock" for each product for example complexes of LIRA and its target
miRNA, by averaging the "paired probability" of bases between RBS and
AUG in the product and indicator "lock" for LIRA only by averaging the
"unpaired probability" of bases between RBS and AUG in LIRA.
Figure 22. Diagram of "lock" and "unlock" calculation process
1. Unlock
2. Lock
NUPACK could simulate the LIRA-mediated reaction and calculate the
amount of each product from such reaction. The reaction with LIRA,
miR-210-3p, miR-142-3p could generate multiple complexes, including
"LIRA/miR-210-3p/miR-142-3p/complex",
"LIRA/miR-142-3p/miR-210-3p/complex", "LIRA/miR-142-3p/complex", and
"LIRA/miR-210-3p/complex", which we label as product 1 to 4
accordingly and calculated their "unlock" values. Among all these
products, LIRA/miR-210-3p/miR-142-3p/complex is by far the dominant
one, so we treat it as the main product.
Subsequently, we define indicator "unlock divide lock" and "unlock
minus lock" using the "unlock" value of the main product, i.e., the
LIRA/miR-210-3p/miR-142-3p/complex.
3. Unlock divide lock
4. Unlock minus lock
"unlock level average" is the average of "unlock" values of all
products. "unlock level weighted" is the weighted average of "unlock"
value of all products, weighted by their corresponding concentrations
given by NUPACK.
5. Unlock level average
6. Unlock level weighted
We define the following four metrics using subtraction and division,
mean and weighted mean respectively.
7. Unlock minus lock average
8. Unlock minus lock weighted
9. Unlock divide lock average
10. Unlock divide lock weighted
It should be noted that some indicators differ in how many products
are included. Take "unlock", "unlock average" and "unlock weighted" as
an example: "unlock" includes the main product only, "unlock average"
includes the average "unlock" value of all products, "unlock weighted"
count the weighted average "unlock" value of all products.
Select better performance LIRA by indicators
To explore the influence of the above 10 indicators on the simulation
results of LIRA from NUPACK, which shows that whether LIRA meets the
design expectation (See part 2, design of double-arm LIRA), we
analyzed a variety of double-arm LIRA. In addition to the 6 LIRAs
designed in previous section that meet the design expectation (Table
10), we also included 6 LIRAs that did not meet the design expectation
as well. Sequences and conformation of the 6 LIRA which did not meet
expectations are shown in the table below.
Table 10. Sequences and conformation of 6 LIRA that did not meet the
design expectation
Number | Sequence | Conformation of LIRA alone |
Conformation of LIRA hybridized with miR-210-3p and miR-142-3p |
---|---|---|---|
7 | AGGACACAAGUAGCGGCACUCCAUAAAGUAGGAAACACUACAUUUAUGGAAAGCCGCGAGCUCCGUCUCGUCUCGGCUUCAGCCGCUGUCACACGCACAGCGGCUGGAGAGGAGACGUCAUGGAGCUCCCCUUGUAACCU | ||
8 | GCAUUGAUUCCAGUACCCCUCCAUAAAGUAGGAAACACUACAUUUAUGGACAGGUACCAAUCGCAUAAUCUUUCGGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAGAACAUGCGAUUGACGAAUCUUUGC | ||
9 | UCGGAGCGUGAAGAGGGUAUCCAUAAAGUAGGAAACACUACAUUUGUGGAGCCCCUCGUCCCGCGUUGGGUCUCGACUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGACCAUAUGCGGGACAACGCGUAGCGA | ||
10 | UGCGAGGCGGUUACAGAAAUCCAUAAAGUAGGAAACACUACAUUUGUGGACCUUUGUCGGUCGCAUAGCAUCUCGACUUCAGCCGCUGUCACACGCACAGUGGUUGAAGAGGAGAUGGGAUGUGACUGCUCCGUUGAGCG | ||
11 | CCGAAGACAAAACUCUCAAUCCAUAAAGUAGGAAACACUACAUUUAUGGAACGAGAGGGAGGUCAUUUCCUUUUGGCUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAGGCCAUGACCUCCGCUUGUCGACGG | ||
12 | CCCACCUUCCUCGUGCGUGUCCAUAAAGUAGGAAACACUACAUUUGUGGAGUCGCACCGCCGGUAUCAGAUCUCGGUUUCAGCCGCUGUCACACGCACAGCGGCUGAAGAGGAGAUCCAAUGCCGGUGCUGGGAGCCGGG |
We call results that meet expectations as 1, and fail to meet
expectations as 0. Multivariate regression is applied to
analyze the effects of these 10 indicators on the predicted
performance of LIRA, i.e., meeting or not meeting the design
expectation. The results of multiple regression are shown in Table 11.
Table 11. Multiple regression result
Chi-square | Pseudo R2 | Significance | AIC | BIC |
---|---|---|---|---|
16.636 | 0.75 | 0.119 | 24.000 | 29.819 |
The multiple regression model has a high R2 value as 0.75,
which means that the model has a good fitting effect. However, the
P-value is 0.119, which means that the multiple regression model does
not pass the significance test. We think that the large p-value is
most likely caused by strong collinearity. To verify such
collinearity, we used the Pearson correlation coefficient matrix to
analyze the correlation among 10 indicators, which demonstrates that
the collinearity is indeed very large (Table 12).
Table 12. Pearson correlation coefficient matrix for 10 indicators
Note:* means p<0.05 and ** means p<0.01. We highlighted the
correlation coefficient value >0.7 with colors.
We find that the correlation between the group of indicators "unlock",
"unlock level average" and "unlock level weighted" is stronger than
the correlation between these 3 and other indicators. The same strong
correlation exists in the group of indicators "unlock minus lock",
"unlock minus lock average", and "unlock minus lock weighted", and in
the group of indicators "unlock divide lock", "unlock divide lock
average", and "unlock divide lock weighted".
We explored the reason for the strong correlation within different
groups. Take the group of indicators "unlock", "unlock level average"
and "unlock level weighted" as an example, these 3 indicators
basically measure the same thing "unlock", and only differ in the way
on how to include "unlock" values from different products. Since the
main product, i.e., the LIRA/miR-210-3p/miR-142-3p/complex, comprises
the vast majority proportion of all the products, which results in
almost equal values for the three indicators, it could explain why the
independent variants in model have strong collinearity.
To overcome the problem of collinearity, we use random forest which is
not sensitive to collinearity to train the 10 metrics and results
associated with unlock and lock. We believe that the indicators with
higher R2 value in testing datasets could better help us to
evaluate the performance of LIRA. The ratio of training set to test
set is 0.8:0.2. Results are shown in Table 13.
Table 13. Random Forest result
Dependent Variable | Dataset | R2 | MAE | MSE | RMSE | MAD | MAPE | EVS | MSLE |
---|---|---|---|---|---|---|---|---|---|
unlock | training datasets | 0.804 | 0.067 | 0.006 | 0.078 | 0.075 | 0.014 | 0.819 | 0.003 |
testing datasets | 0.758 | 0.022 | 0.001 | 0.025 | 0.017 | 0.001 | 0.939 | 0 | |
lock | training datasets | 0.814 | 0.045 | 0.003 | 0.058 | 0.051 | NULL | 0.825 | 0.002 |
testing datasets | -0.711 | 0.094 | 0.009 | 0.095 | 0.086 | 0.016 | -0.392 | 0.006 | |
unlock level average | training datasets | 0.827 | 0.050 | 0.004 | 0.061 | 0.046 | 0.018 | 0.843 | 0.002 |
testing datasets | -0.983 | 0.028 | 0.001 | 0.037 | 0.022 | 0.002 | -0.792 | 0.001 | |
unlock level weighted | training datasets | 0.811 | 0.094 | 0.014 | 0.117 | 0.07 | 0.052 | 0.827 | 0.008 |
testing datasets | 0.231 | 0.227 | 0.068 | 0.26 | 0.308 | 0.021 | 0.232 | 0.03 | |
unlock divide lock | training datasets | 0.804 | 1.964 | 10.705 | 3.272 | 0.876 | 0.13 | 0.804 | 0.31 |
testing datasets | -19.792 | 1.643 | 2.854 | 1.689 | 1.701 | 0.05 | -0.119 | 0.343 | |
unlock minus lock | training datasets | 0.817 | 0.103 | 0.016 | 0.128 | 0.097 | 0.136 | 0.835 | NULL |
testing datasets | -0.192 | 0.079 | 0.009 | 0.097 | 0.08 | 0.007 | 0.466 | 0.005 | |
unlock minus lock average | training datasets | 0.825 | 0.041 | 0.002 | 0.05 | 0.036 | 0.142 | 0.844 | NULL |
testing datasets | -0.219 | 0.034 | 0.002 | 0.039 | 0.029 | 0.007 | 0.311 | 0.001 | |
unlock minus lock weighted | training datasets | 0.803 | 0.072 | 0.008 | 0.088 | 0.062 | 0.309 | 0.815 | NULL |
testing datasets | -0.146 | 0.12 | 0.032 | 0.179 | 0.041 | 0.011 | 0.111 | 0.019 | |
unlock divide lock average | training datasets | 0.912 | 1.011 | 1.484 | 1.218 | 0.854 | 0.089 | 0.923 | 0.196 |
testing datasets | 0.911 | 1.028 | 1.452 | 1.205 | 1.028 | 0.006 | 0.935 | 0.034 | |
unlock divide lock weighted | training datasets | 0.779 | 0.458 | 0.317 | 0.563 | 0.566 | NULL | 0.781 | 0.075 |
testing datasets | -68.13 | 1.56 | 2.61 | 1.616 | 1.361 | 0.299 | -3.65 | 0.649 |
As can be seen from the table, on the training set, the fitting effect
of the model is significant, and the R2 value is generally
high (>0.8), indicating that the model has achieved good training
effect on each index on the training set. For most indicators, MAE,
MSE, RMSE, MAD and MAPE values are low, so it can be considered that
the deviation between the predicted value and the actual value of the
model is small.
On the test set, indicator "unlock", "unlock level weighted" and
"unlock divide lock level weighted" have R2 above zero,
which indicates that these indicators have a higher accuracy in
predicting the result of open and close. Particularly, the R2
of "unlock divide lock weighted" reached 0.911, thus we can get the
conclusion that "unlock divide lock weighted" performs best among all
the indicators.
We get weights of indicators on the performance of LIRA on NUPACK from
random forest analysis, and list them in Table 14 below.
Table 14 The weight of the indicator to the result
Indicators | unlock | lock | lnlock divide lock | unlock minus lock | unlock level average |
---|---|---|---|---|---|
weights | 0.050 | 0.110 | 0.140 | 0.080 | 0.090 |
Indicators | unlock level weighted | unlock minus lock average | unlock minus lock weighted | unlock divide lock average | unlock divide lock weighted |
---|---|---|---|---|---|
weights | 0.130 | 0.130 | 0.060 | 0.080 | 0.130 |
The weight of "unlock divide lock" is larger than that of other
indicators, which is followed closely by "unlock level weighted" and
"unlock divide lock weighted". To measure the expected performance of
LIRA quantitively, we calculated "expectation" for all LIRA according
to the formula shown as follows, and listed in Table 15.
Figure 23. Calculation method for indicators' weight for expectation
Table 15. The weight of the indicator to the result
numbers | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
expectation | 1.47 | 1.71 | 1.07 | 1.54 | 1.89 | 1.64 |
numbers | 7 | 8 | 9 | 10 | 11 | 12 |
---|---|---|---|---|---|---|
expectation | 0.58 | 0.44 | 0.13 | 0.23 | 0.59 | 0.47 |
Therefore, we sort all LIRAs according to "expectation" from high to
low, and find that LIRAs 2 and 5 have highest ranking score. Thus, we
conclude that LIRA sequences 2 and 5 might meet our design expectation
well, and should have high expression and low leakage in the wet lab
experiments.
Outlook
In the subsequent analysis, we attempt to explore the effects of some
of the LIRA's own parameters, such as base content, on indicators and
whether such parameters could be used to predict the performance of
LIRA on NUPACK, i.e., meeting or not meeting design expectation.
Next, we investigate the weights of parameters to indicators. We
collected the parameters of LIRA sequence itself from NUPACK software,
including "A","C","G","U" base content, and "A-U", "C-G", "G-U"
base-pair content. Then the random forest model is used for 10
indicators. The weights of different parameters on different
indicators are obtained as shown in Table 16 and Table 17 below.
Table 16. The weight of bases to the indicators
A | C | G | U | |
---|---|---|---|---|
unlock | 0.111 | 0.474 | 0.095 | 0.319 |
lock | 0.095 | 0.569 | 0.109 | 0.227 |
unlock divide lock | 0.098 | 0.304 | 0.493 | 0.104 |
unlock minus lock | 0.116 | 0.511 | 0.078 | 0.295 |
unlock level average | 0.154 | 0.530 | 0.124 | 0.193 |
unlock level weighted | 0.096 | 0.297 | 0.167 | 0.440 |
unlock minus lock average | 0.088 | 0.569 | 0.096 | 0.247 |
unlock minus lock weighted | 0.128 | 0.475 | 0.126 | 0.270 |
unlock divide lock average | 0.116 | 0.287 | 0.473 | 0.124 |
unlock divide lock weighted | 0.107 | 0.231 | 0.536 | 0.126 |
Table 17. The weight of the base pairs to the indicators
A-U | C-G | G-U | |
---|---|---|---|
unlock | 0.342 | 0.368 | 0.289 |
lock | 0.443 | 0.215 | 0.342 |
unlock divide lock | 0.430 | 0.231 | 0.339 |
unlock minus lock | 0.364 | 0.268 | 0.368 |
unlock level average | 0.387 | 0.240 | 0.373 |
unlock level weighted | 0.400 | 0.355 | 0.244 |
unlock minus lock average | 0.431 | 0.205 | 0.364 |
unlock minus lock weighted | 0.382 | 0.282 | 0.332 |
unlock divide lock average | 0.497 | 0.187 | 0.316 |
unlock divide lock weighted | 0.412 | 0.211 | 0.377 |
To judge the effect of different parameters on the result of open or
close accurately, we calculate by the following formula, multiply the
weights respectively, and obtain the weights of the parameters for the
result of unlock and lock.
Figure 24. Calculation method for parameters' weight for indicators
Table 18. The weight of the parameter to the result of unlock and lock
Paraments | A% | C% | G% | U% |
---|---|---|---|---|
weights | 0.108 | 0.419 | 0.248 | 0.226 |
Table 19. The weight of the parameter to the result of unlock and lock
Paraments | A-U% | C-G% | G-U% |
---|---|---|---|
weights | 0.302 | 0.393 | 0.305 |
C% has weight of 0.1976, which is obviously higher than other
parameters, and in Table 19, C-G% has weight of 0.393, which also
confirmed that C% matters most in influencing the result.
Analysis of LlRA-mediated reaction
Since the core of our project is to use LIRA to detect miR-210-3p and
miR-142-3p, a comprehensive and detailed understanding of the
LIRA-mediated reaction is essential for the success of our project.
Through simulation of the process and analysis of the kinetics of the
LIRA-mediated reaction, we can get crucial information of this
reaction, such as the trend of changes of substrates and peak time of
the expression of the reporter gene, which could provide critical
information for designing and optimizing of our project.
We simulated reactions mediated by the single-arm and the double-arm
LIRA under both intracellular and extracellular conditions, and our
simulation includes the following steps:
a. Write down the chemical equations of LIRA system;
b. Transform chemical equations into ODEs;
c. Collect parameter values;
d. Solve the equations using MATLAB;
e. Analyze results.
Simulation and analysis of intracellular reactions mediated by LIRA
The interactions between LIRA and its target RNA could change the
conformation of LIRA, and initiate translation of downstream reporter
gene. Since the reaction between single-arm LIRA and miR-142-3p has
the same reaction process and parameters as the reaction between
single-arm LIRA and miR-210-3p, we think that these two reactions have
the same concentration-time curve and peak time of expression of
reporter gene. Without loss of generality, we take miR-210-3p as an
example for kinetic simulation below.
Schematic presentation of the intracellular reaction mediated by
single-arm LIRA is shown in Figure 25
Figure 25. Intracellular single-arm LIRA reaction process
Reaction process of single-arm LIRA is listed in Figure 26
Figure 26. Equations of intracellular single-arm LIRA reaction
Since the transition from 210-CSIL to OSIL is rapid, we treated the
product of the combination of CSIL210 and miR210 as OSIL210.
Therefore, we carried out a reasonable simplification of the
simulation, and the simplified reaction is shown in Figure 27.
Figure 27. Simplified equations of intracellular single-arm LIRA
reaction
Based on the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed in Figure 28.
Figure 28. ODEs of intracellular single-arm LIRA reaction
In the simulation, we set the initial concentration of PLMg and PLM210
as 0.01mol/L, and other substances as 0. We use MATLAB and ode45
function to get the numerical solution. As shown in Figure 29, the
concentration of EGFP reaches the maximum value at 83 minutes and 1/2
of the maximum value at 11.5 minutes. The concentration of OSIL210
increases rapidly and reaches the maximum value of 0.00054mol/L at
160s. Due to the low transcription rate of PLMg and PLM210 plasmids,
and the high reaction rate of OSIL210 to EGFP, the concentration of
OSIL210 gradually decreases after 160s.
Figure 29. simulation of intracellular single-arm LIRA reaction
Next, we simulated the intracellular reaction mediated by double-arm
LIRA, and the schematic presentation of this simulated reaction is
shown in Figure 30.
Figure 30. intracellular double-arm LIRA reaction process
For the reactions with double-arm LIRA, we assume that miR-210-3p
cannot bind to CTIL when miR-142-3p is not present. According to these
reaction processes, we can list the reaction equation of the
double-arm LIRA, as shown in Figure 31.
Figure 31. equations of intracellular double-arm LIRA reaction
According to the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed below in Figure 32.
Figure 32. equations of intracellular double-arm LIRA reaction
In the simulation, we set the initial concentration of PLMag, PLM210
and PLM142 as 0.01mol/L, and other substances as 0. Similarly, we use
MATLAB and ode45 function to get the numerical solution. As shown in
Figure 33, the concentration of EGFP reaches the maximum value of
0.00118mol/L at 3 minutes. The concentration of OTIL210 increases and
reaches a maximum value of 0.00064mol/L at 95s. Due the low
transcription rate of PLMg and PLM210 plasmids and the high reaction
rate of OTIL210 to EGFP, the concentration of OTIL210 gradually
decreases after 95s.
Figure 33. Simulation of intracellular double-arm LIRA reaction
Simulation and analysis of extracellular reactions mediated by LIRA
Compared to the intracellular kinetic simulation, the extracellular
reaction process does not have the step of transcription of miR-142-3p
and miR-210-3p from plasmids. Since the intracellular and
extracellular conditions are different, the modeling parameters are
altered accordingly. The reaction between single-arm LIRA and
miR-142-3p has the same reaction process and parameters as the
reaction between single-arm LIRA and miR-210-3p, these two reactions
have the same concentration-time curve and the same peak time. Without
loss of generality, we take miR-210-3p as an example for kinetic
simulation below.
Schematic presentation of extracellular reaction mediated by
single-arm LIRA is shown in Figure 34.
Figure 34. Extracellular single-arm LIRA reaction process
Reaction process of single-arm LIRA are listed in Figure 35
Figure 35. Equations of extracellular single-arm LIRA reaction
Since the transition from 210-CSIL to OSIL is rapid, we treat the
product of the combination of CSIL210 and miR-210 as OSIL210.
Therefore, we carry out a reasonable simplification of this
simulation, and the simplified reaction is shown in Figure 36.
Figure 36. simplified equations of extracellular single-arm LIRA
reaction
According to the chemical reaction described above, the equations of
ordinary differential equations with specific parameter values are
listed below.
Figure 37. ODEs of extracellular single-arm LIRA reaction
In the simulation, we set the initial concentration of PLMg, PLM210
and PLM142 as 0.01mol/L, and other substances as 0. Similarly, we use
MATLAB and ode45 function to get the numerical solution. As shown in
Figure 38, the concentration of fluorescent protein EGFP reaches the
maximum value of 0.0083mol/L at 30 minutes and 1/2 of the maximum
value at 9 minutes. The concentration of OSIL210 increases rapidly and
reached a maximum value of 0.00054mol/L at 160s. The concentration of
OSIL210 gradually decreases after 160s.
Figure 38. Simulation of extracellular single-arm LIRA reaction
Next, we simulated the extracellular reaction mediated by double-arm
LIRA, and the schematic presentation of this simulated reaction is
shown in Figure 39.
Figure 39. Extracellular double-arm LIRA reaction process
The process of reaction mediated by single-arm LIRA is listed in
Figure 40
Figure 40. Equations of extracellular double-arm LIRA reaction
According to the chemical reaction expression above, the equations of
ordinary differential equations with specific parameter values are
listed below.
Figure 41. ODEs of extracellular double-arm LIRA reaction
In the simulation, we set the initial concentration of PLMag,
miR-210-3p and miR-142-3p as 0.01mol/L, and other substances as 0.
Similarly, we use MATLAB and ode45 function to get the numerical
solution. As shown in Figure 42, the concentration of EGFP reaches the
maximum value of 0.0083mol/L at 30 minutes and 1/2 of the maximum
value at 9 minutes. The concentration of OTIL210 increases rapidly and
reaches a maximum value of 0.00054mol/L at 160s. The concentration of
OTIL210 gradually decreases after 160s.
Figure 42. Simulation of extracellular double-arm LIRA reaction
Table 20. Explanation of abbreviation
Abbreviation | Description |
---|---|
PLMg | gate plasmid of CSIL |
PLMag | AND gate plasmid of CTIL |
PLM210 | input plasmid of miR-210 |
PLM142 | input plasmid of miR-142 |
CSIL | RBS closed single-input LIRA |
CTIL | RBS closed two-input LIRA |
OTIL | RBS open two-input LIRA |
210-CSIL | miR-210 & CSIL complex |
210-CTIL | miR-210 & CTIL complex |
142-CSIL | miR-142 & CSIL complex |
142-CTIL | miR-142 & CTIL complex |
OSIL210 | RBS open single-input LIRA formed by 210-CSIL |
OSIL142 | RBS open single-input LIRA formed by 142-CSIL |
EGFP | enhanced green fluorescent protein |
∅ | degradated substance |
Table 21. Parameters used in single-arm LIRA reaction process
Abbreviation | Description | Value(unit) | Reference |
---|---|---|---|
ktrg | gate plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
ktr210 | miR-210 input plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
ktr142 | miR-142 input plasmid transcription rate constant | 1.1×10-3 (s-1) | Baabu et al., 2021 |
kcb210 | combine rate constant of miR-210 and CSIL | 1×105 (M-1 ·s-1) | Baabu et al., 2021 |
kcb142 | combine rate constant of miR-210 and CSIL | 1×105 (M-1 ·s-1) | Baabu et al., 2021 |
ktl210 | OSIL210 translation rate constant | 0.3361 (s-1) | https://2023.igem.wiki/ugm-indonesia/ |
ktl142 | OSIL142 translation rate constant | 0.3361 (s-1) | https://2023.igem.wiki/ugm-indonesia/ |
kde210 | miR-210 degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kde142 | miR-142 degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kdeCSIL | CSIL degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
kdeOSIL | OSIL degradation rate constant | 3×10-4 (s-1) | Baabu et al., 2021 |
Table 22. Parameters used in double-arm LIRA reaction process
Symbol | Description | Value(unit) intracellular |
Value(unit) cell-free |
---|---|---|---|
ktrag | AND gate plasmid transcription rate constant | 1.1×10-3 (s-1) | 1.1×10-3 (s-1) |
ktlOTIL | OTIL translation rate constant | 0.03361 (s-1) | 0.0168 (s-1) |
kcb1 | combine rate constant of miR-210 and CTIL | 1×105 (m-1·s-1) | 1×105 (m-1·s-1) |
kcb2 | combine rate constant of miR-210 and 142-CTIL | 1×105 (m-1·s-1) | 1×105 (m-1·s-1) |
kdeCTIL | CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kdeOTIL | OTIL degradation rate constant | 1.3×10-5 (s-1) | 3×10-4 (s-1) |
kde210CTIL | 210-CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kde142CTIL | 142-CTIL degradation rate constant | 3×10-4 (s-1) | 3×10-4 (s-1) |
kdeEGFP | EGFP degradation rate constant | 0 (s-1)* | 0 (s-1)* |
3-Dimensional simulation graph
After conducting kinetic analysis of the reactions mediated by our
double-arm LIRA, we started to think how to apply LIRA as a testing
kit in practical situations. We encountered an issue which is that the
concentrations of the two highly expressed target miRNAs in patient
blood often vary, whereas in our kinetic simulation, we assumed the
concentrations for both target miRNAs are equal. Consequently, it is
questionable to simply take the kinetic simulation results as our
reference for LIRA's application in real-world situation. To address
this issue, we aim to introduce different concentrations of our target
miRNAs in our LIRA reaction system to observe the corresponding
dynamic changes in EGFP expression.
Firstly, we set the concentrations of miR-210-3p and miR-142-3p. We
selected concentrations of 0.0025, 0.005, 0.01, 0.02, and 0.04 mol/L
for our 2 target miRNAs, while keeping the LIRA concentration at 0.01
mol/L. Subsequently, we simulate the reactions between 2 miRNA and
LIRA based on our extracellular kinetic model of reaction mediated by
double-arm LIRA. From our simulation of extracellular reaction in
Figure 40, we can see that the value of EGFP reaches its maximum in
our reaction at approximately 30 minutes. Hence, we collected the EGFP
values at 42 minutes in our reaction after changing the concentrations
of our target miRNA and presented the results as scatter points in
three-dimensional space, as shown in Figure 43.
Figure 43. The influence of the concentrations of miR-210-3p and
miR-142-3p on EGFP expression in LIRA system
From Figure 43, we can see that the EGFP expression increases to high
values as the concentrations of miR-210-3p and miR-142-3p increase.
Additionally, the scatter points in the graph tend to cluster more
densely at lower concentrations of miR-210-3p and miR-142-3p. To
better understand the relationship between the concentrations of
miR-210-3p and miR-142-3p and their corresponding EGFP values, we
attempted to fit these scatter points.
A Quadratic Regression is commonly used for surface fitting
models[23]. Therefore, we employed the fitting function as
Z = a + bX + cY + dX² + eY² + fXY, where Z represents the dependent
variable (EGFP), and X and Y are the independent variables (X:
[miR-210-3p], Y: [miR-142-3p]). By substituting the data into this
equation, we can obtain the values of each parameter. The specific
parameter interpretations and results are presented in Table 23 below.
Table 23. Explanations and Values of Second-Order Polynomial
Regression Parameters
Parameter | Description | Value |
---|---|---|
a | constant term | -0.000118 |
b | coefficient of the linear term for X | 0.3505 |
c | coefficient of the linear term for Y | 0.3288 |
d | coefficient of the quadratic term for X | - 7.42 |
e | coefficient of the quadratic term for Y | - 6.84 |
f | Interaction coefficient between X and Y | 3.76 |
Based on the calculations above, we obtained the regression equation:
Z = -0.000118 + 0.3505X + 0.3288Y + 3.76XY - 7.42X² - 6.84Y². To
understand the performance of our fitted equation, we conducted an
Analysis of Variance (ANOVA) at a 95% confidence level (α = 0.05) for
the terms in our regression. This analysis could assess the
significance of our model terms and determine the terms that have
statistically significant impacts on EGFP value. Tables 25 and 26
present the results of this analysis.
Table 24. ANOVA Results Summary for Regression Model Terms
Source | DF | Adj SS | Adj MS | F-Value | P-Value |
---|---|---|---|---|---|
Model | 5 | 0.000137 | 0.000027 | 12.78 | 0.000 |
Linear | 2 | 0.000094 | 0.000047 | 22.01 | 0.000 |
[miR-210-3p] | 1 | 0.000053 | 0.000053 | 24.64 | 0.000 |
[miR-142-3p] | 1 | 0.000056 | 0.000056 | 25.99 | 0.000 |
Square | 2 | 0.000047 | 0.000024 | 11.04 | 0.001 |
[miR-210-3p]*[ miR-210-3p] | 1 | 0.000026 | 0.000026 | 11.95 | 0.003 |
[miR-142-3p]*[ miR-142-3p] | 1 | 0.000022 | 0.000022 | 10.13 | 0.005 |
2-Way Interaction | 1 | 0.000013 | 0.000013 | 5.88 | 0.025 |
[miR-210-3p]*[ miR-142-3p] | 1 | 0.000013 | 0.000013 | 5.88 | 0.025 |
Error | 19 | 0.000041 | 0.000002 | ||
Total | 24 | 0.000177 |
Table 25. Regression model fit summary
Standard deviation | R2 | R2(adj) |
---|---|---|
0.0007532 | 96.44% | 91.99% |
Based on the results of the ANOVA, we can observe that the R² value is
high, indicating a good fit of the model. Additionally, the
coefficients for the constant term, the interaction term ([miR-210-3p]
* [142-3p]), the two linear terms ([miR-210-3p] and [miR-142-3p]), and
the two quadratic terms ([miR-210-3p] * [miR-210-3p] and [miR-142-3p]
* [miR-142-3p]) are all statistically significant. This suggests, in
our quadratic regression equation, the terms are not redundant or
causing interference in the regression results. Finally, we simulate
the surface fitting based on our equation and the result is visualized
in Figure 44.
Figure 44. Surface Plot of EGFP expression
From Figure 44 we can see that, as the concentrations of the two
miRNAs continue to increase, the reaction rate accelerates, and the
observed EGFP levels also rise continuously. When the concentration of
one miRNA remains constant, as the concentration of the other miRNA
increases, the EGFP value first rise and then decline. To have a more
detailed observation of the changes of EGFP value in specific
concentration ranges, we have plotted a contour map as shown in Figure
45.
Figure 45. Contour Plot of EGFP expression
Figure 45 is a contour plot of EGFP levels at 30 minutes for the
reaction between LIRA and two target miRNAs. The color intensity in
the figure represents different numerical ranges of EGFP.
Specifically, the gradient from light green to dark green indicates an
increasing trend of the EGFP variable from low to high. It can be
observed that there is a distinct high-value region in the upper right
corner of the plot. In contrast, as we move towards the lower left
corner, the EGFP levels continuously decrease, and a low-value region
appears. This may be attributed to the incomplete reaction of LIRA due
to lower concentrations of miRNAs, resulting in lower EGFP value.
Furthermore, we can observe in the plot that the EGFP value lies
within the range of 0.008-0.01 when miR-210-3p is within 0.025-0.04
mol/L and miR-142-3p is within 0.0275-0.04 mol/L.
We can apply this conclusion into practical scenarios. Assuming the
LIRA testing kit we use has a LIRA concentration of 0.01 mol/L, and
only as the EGFP value reaches between 0.008 and 0.01 will it be
considered as positive and identified by colors. We can simulate two
scenarios:
First, we can use LIRA to detect unknown concentrations of miRNA in
blood samples. When the concentrations of miR-210-3p and miR-142-3p in
the blood sample we are testing fall within the ranges of 0.025-0.04
mol/Land 0.0275-0.04 mol/L respectively, the LIRA testing kit will
successfully show colors and indicating after 30 minutes.
Second, we can use miRNA with known concentrations to validate the
quality of the LIRA testing kit. Suppose we have miR-210-3p and
miR-142-3p with concentrations as a mol/L and b mol/L, respectively,
and both a and b are within the concentration range of 0-0.04 mol/L.
Given the regression equation above, we can get Z = -0.000118 +
0.3505X + 0.3288Y + 3.76XY - 7.42X² - 6.84Y².When we add both a and b
to the LIRA testing kit and allow the reaction to proceed for 30
minutes, the EGFP value that are detected should be close to the value
of -0.000118 + 0.3505a + 0.3288b + 3.76ab - 7.42a² - 6.84b².With the
premises of disregarding error interference, If there is a significant
deviation between the two results, it indicates that there may be an
quality issue with the LIRA testing kit.
In summary, in the future application of LIRA detection, it is
necessary to take whether the concentration of LIRA and the
concentrations of miRNA are falling in matching ranges as
considerations and adjusting the LIRA concentration based on the
concentration of miRNA in actual blood samples. Due to the limitation
of lacking the information on the concentration of miRNA in blood
samples, we can not adjust the concentration of our LIRA in our test
kit. However, we believe that along with the advancement of miRNA
detection technology and through continuous adjustment and
optimization of our LIRA, we can finally obtain an ideal screening kit
to detect expression of miRNA in real samples.
Reference
Han H, Guo W, Shi W, et al. Hypertension and breast cancer risk: a
systematic review and meta analysis[J]. 2017, 7: 44877.
Rasmussen Torvik LJ, Shay CM, Abramson JG, et al. Ideal
cardiovascular health is inversely associated with incident cancer:
the Atherosclerosis Risk In Communities study [J]. Circulation,
2013, 127 (12): 1270 1275.
Solomon SD, Wang D, Finn P, et al. Effect of candesartan on cause
specific mortality in heart failure patients: the Candesartan in
Heart failure Assessment of Reduction in Mortality and morbidity
(CHARM) program [J]. Circulation, 2004, 110 (15): 2180 2183.
Malmborg M, Christiansen CB, Schmiegelow MD, et al. Incidence of new
onset cancer in patients with a myocardial infarction a nationwide
cohort study [J]. BMC Cardiovasc Disord, 2018, 18 (1): 198.
Hasin T, Iakobishvili Z, Weisz G. Associated risk of malignancy in
patients with cardiovascular disease: evidence and possible
mechanism [J]. Am J Med, 2017, 130 (7): 780 785.
Meijers WC, Maglione M, Bakker SJL, et al. Heart failure stimulates
tumor growth by circulating factors [J]. Circulation, 2018, 138 (7):
678 691.
Yang M, Li T, Guo S, et al. CVD phenotyping in oncologic disorders:
cardio-miRNAs as a potential target to improve individual outcomes
in revers cardio-oncology. J Transl Med. 2024;22(1):50. Published
2024 Jan 12.
Yuan Y, Mei Z, Qu Z, et al. Exosomes secreted from cardiomyocytes
suppress the sensitivity of tumor ferroptosis in ischemic heart
failure [J]. Signal Transduct Target Ther,2023,8(1):121.
Dembélé D, Kastner P. Fold change rank ordering statistics: a new
method for detecting differentially expressed genes. BMC
Bioinformatics. 2014;15:14. Published 2014 Jan 15.
Dai R, Zheng C. False discovery rate-controlled multiple testing for
union null hypotheses: a knockoff-based approach. Biometrics.
2023;79(4):3497-3509.
Yang G, Yang L, Wang W, Wang J, Wang J, Xu Z. Discovery and
validation of extracellular/circulating microRNAs during idiopathic
pulmonary fibrosis disease progression. Gene. 2015;562(1):138-144.
Stocks T, Van Hemelrijck M, Manjer J, Bjorge T, Ulmer H, Hallmans G,
Lindkvist B, Selmer R, Nagel G, Tretli S, Concin H, Engeland A,
Jonsson H, Stattin P. Blood pressure and risk of cancer incidence
and mortality in the Metabolic Syndrome and Cancer Project.
Hypertension. 2012;59:802–810
Vinter N, Christesen AMS, Fenger-Gron M, Tjonneland A, Frost L.
Atrial fibrillation and risk of cancer: a Danish population-based
cohort study. J Am Heart Assoc. 2018;7:e009543
Conen D, Wong JA, Sandhu RK, Cook NR, Lee IM, Buring JE, Albert CM.
Risk of malignant cancer among women with new-onset atrial
fibrillation. JAMA Cardiol. 2016;1:389–396.
Ridker PM, Cook NR, Lee IM, Gordon D, Gaziano JM, Manson JE,
Hennekens CH, Buring JE. A randomized trial of low-dose aspirin in
the primary prevention of cardiovascular disease in women. N Engl J
Med. 2005;352:1293–1304.
Rasmussen-Torvik LJ, Shay CM, Abramson JG, Friedrich CA, Nettleton
JA, Prizment AE, Folsom AR. Ideal cardiovascular health is inversely
associated with incident cancer: the Atherosclerosis Risk In
Communities study. Circulation. 2013;127:1270–1275.
Banke A, Schou M, Videbaek L, et al. Incidence of cancer in patients
with chronic heart failure: a long-term follow-up study. Eur J Heart
Fail. 2016;18(3):260-266.
Zheng J, Cheng C, Xu J, Gao P, Wang J, Chen L. miR-142-3p Regulates
Tumor Cell Autophagy and Promotes Colon Cancer Progression by
Targeting TP53INP2. Chemotherapy. 2022;67(2):57-66.
Zhang Y, Ma S, Zhang J, et al. MicroRNA-142-3p promotes renal cell
carcinoma progression by targeting RhoBTB3 to regulate HIF-1
signaling and GGT/GSH pathways. Sci Rep. 2023;13(1):5935. Published
2023 Apr 12.
Zhang X, Sai B, Wang F, et al. Hypoxic BMSC-derived exosomal miRNAs
promote metastasis of lung cancer cells via STAT3-induced EMT. Mol
Cancer. 2019;18(1):40. Published 2019 Mar 13.
Ren D, Yang Q, Dai Y, et al. Oncogenic miR-210-3p promotes prostate
cancer cell EMT and bone metastasis via NF-κB signaling pathway. Mol
Cancer. 2017;16(1):117. Published 2017 Jul 10.
Ma D, Li Y, Wu K, et al. Multi-arm RNA junctions encoding molecular
logic unconstrained by input sequence for versatile cell-free
diagnostics. Nat Biomed Eng. 2022;6(3):298-309.
Yong Z, Siyu Ye, Xianqi C, et al. Polynomial Response Surface based
on basis function selection by multitask optimization and ensemble
modeling[J].Complex & Intelligent Systems,2022,Vol.8(2):
1015-1034