NETWORK META-ANALYSIS USING DATA FROM PUBLISHED … · NETWORK META-ANALYSIS USING DATA FROM...
Transcript of NETWORK META-ANALYSIS USING DATA FROM PUBLISHED … · NETWORK META-ANALYSIS USING DATA FROM...
NETWORK META-ANALYSIS USING DATA FROM PUBLISHED TRIALS
AND DATA FROM THE FOOD AND DRUG ADMINISTRATION
MEDICAL REVIEWS: A CASE EXAMPLE OF FIRST LINE MEDICATIONS
FOR OPEN ANGLE GLAUCOMA
by
Qiyuan Shi
A thesis submitted to Johns Hopkins University in conformity with the
requirements for the degree of Master of Health Science
Baltimore, Maryland
May, 2015
ii
Abstracts
Background
Network meta-analysis (NMA) is a method that incorporates both direct evidence
within trials and indirect evidence across trials. The impact of different data sources
on NMAs is not widely examined.
Objectives
The objective of this study was to conduct and compare the results from NMAs using
clinical trial data available in the Food and Drug Administration (FDA) medical
reviews (FDA trials) and those published in the medical literature (published trials) on
first line medications for open angle glaucoma.
Methods
We searched the Cochrane Central Register of Controlled Trials, MEDLINE,
EMBASE in March 2014 and the US Food and Drug Administration’s website in
April 2014 for randomized, parallel group trials on the first line topical medications
for primary open-angle glaucoma and ocular hypertension. Eligible trials had to
compare a single active treatment with no treatment/placebo or another single active
topical medical treatment. Two individuals independently assessed trial eligibility,
abstracted data, and assessed the risk of bias. Using the FDA trials and the published
trials, we performed pair-wise meta-analyses and Bayesian network meta-analyses on
intraocular pressure at 3 months, our pre-specified primary outcome. We compared
the results from these analyses.
Results
iii
We included 16 FDA trials and 105 published trials. The network based on FDA trials
had 10 nodes (nine active drugs and placebo/vehicle) with a total sample size of 6183.
The network based on published trials had 14 nodes (13 active drugs and
placebo/vehicle) with a total sample size of 16898. There were 9 common treatments
between the FDA trials and the published trials in the network meta-analysis, resulting
in 36 comparisons. We estimated mean difference in IOP of 36 pairs of these
treatments. The median relative difference of the two networks was 14% (interquartile
range: 7% to 42%). The relative differences were greater than 25% in 14 pairs. Point
estimates from two pairs showed opposite directions. Generally, the results of the
published trials had better precision than the results of the FDA trials because of the
larger sample size of the published trials. In terms of relative rankings, bimatoprost,
travoprost, and latanoprost are the best ranking drugs regardless of the data source,
although 4/9 (44%) drugs had different ranking.
Conclusions
Systematic reviewers should consider FDA medical reviews because of the amount of
the information they can provide. The data source did not seem to change the
inference of the relative effectiveness of most of the drugs studied. The relative
rankings changed for the middle-ranked drugs but not for the top-ranked drugs.
Whether reporting bias has a role in the differences we observed needs to be further
evaluated.
Advisor: Tianjing Li, MD, MHS, PhD
Second reader: Mark L. Van Natta, MHS
iv
Acknowledgements
Before proceeding to the contents of this study, I would like to make some
acknowledgements to thank the people who contributed to this study or gave me help
of all kinds. Foremost, I would like to express my sincere gratitude to my advisor Dr.
Tiangjing Li for the continuous support of this study and research, for her patience,
motivation, enthusiasm, and immense knowledge. Her guidance helped me in all the
time of research and writing of this thesis. Second, I want to thank Mark L. Van Natta
for being my second reader and for his great advice and instructions. I would also like
to thank my dear colleague Benjamin Rouse for his support and help.
I would like to thank Hwanhee Hong, Sueko Ng, Nan Guo Cesar Ugarte-Gil, Manuele
Michelessi, Lijuan Zeng, and Kexin Jin for their help with screening, data abstraction,
and analysis.
Finally I would like to thank my parents for their support and love for all the time.
v
Table of Contents
1 Introduction .............................................................................................. 1
2 Methods ................................................................................................... 5
2.1 Eligibility Criteria ................................................................................................ 5
2.2 Retrieving Medical Reviews from the FDA Website .......................................... 5
2.3 Retrieving Published Trials .................................................................................. 6
2.4 Data Abstraction and Management ...................................................................... 6
2.5 Outcomes .............................................................................................................. 6
2.6 Pairwise and Network Meta-analysis ................................................................... 6
2.7 Evaluation of Clinical and Methodological Heterogeneity .................................. 8
2.8 Evaluation of Statistical Heterogeneity and Inconsistency .................................. 8
2.9 Comparison between Results from Different Sources ......................................... 9
3 Results .................................................................................................... 11
3.1 Identification of Trials ........................................................................................ 11
3.2 Analysis of FDA trials........................................................................................ 11
3.3 Analysis of Published Trials .............................................................................. 12
3.4 Comparison between FDA Trials and Published Trials ..................................... 14
3.5 Heterogeneity ..................................................................................................... 16
3.6 Inconsistency ...................................................................................................... 17
4 Discussions ............................................................................................ 18
5 References: ............................................................................................ 22
6 Appendices ............................................................................................ 25
7 Tables ..................................................................................................... 31
Table 1. Summary estimates for intraocular pressure at 3 months derived from
pair-wise meta-analysis based on 13 direct comparisons from 16 FDA trials ......... 31
vi
Table 2. Summary estimates for intraocular pressure at 3 months derived from
network meta-analysis of 16 FDA trials .................................................................. 32
Table 3. Summary estimates for intraocular pressure at 3 months derived from
pair-wise meta-analysis based on 36 direct comparisons from 105 published trials33
Table 4. Summary estimates for intraocular pressure at 3 months derived from
network meta-analysis of 105 published trials ......................................................... 34
Table 5. Comparisons between the results of direct pairwise meta-analyses based on
FDA trials and published trials................................................................................. 35
8 Figures ................................................................................................... 36
Figure 1. Identification of trials................................................................................ 36
Figure 2. Network graphs ......................................................................................... 37
Figure 3. Ranking probabilities for any drug at any position in the FDA trials network
.................................................................................................................................. 38
Figure 4. Ranking probabilities for any drug at any position in the published trials
network ..................................................................................................................... 39
Figure 5. Scatterplot of relative effect size between two treatments based on both
FDA and published trials.......................................................................................... 40
Figure 6. Relative effect size based on FDA trials and published trials .................. 41
Figure 7. Relative effect size between active drugs and placebo/timolol based on
FDA trials and published trials................................................................................. 42
Figure 8. Percentage ‘SUCRA’ of drugs in the two networks ................................. 43
1
1 Introduction
Since the concept of evidence-base medicine was introduced, systematic reviews are
often regarded as the highest-level of evidence [1, 2]. Because systematic reviews use
‘explicit and systematic methods to minimize bias’, they are thought to provide more
reliable findings than individual research studies from which conclusions and decision
can be made [3]. However, a typical systematic review focuses on the synthesis of
direct, pair-wise comparisons that are available from at least one study. If no such
study is available, for example, if no study has examined the comparative
effectiveness of treatment A and treatment B, it becomes a limitation for conventional
systematic reviews. Such needs lead to the development of network meta-analysis
(NMA).
NMA, as is indicated by its name, aims to build a network where nodes represent
interventions while the edges represent direct comparisons (that is available from at
least one included study) [4]. If two interventions A and B have one intermediate
comparator C, A and B can be compared indirectly generating indirect evidence. By
weighting both direct and indirect evidence, a mixed effect can be estimated. Thus,
NMAs can accomplish the task of “all-way” comparisons with more information [4].
A unique feature of NMAs is that it also can generate ranking probabilities of all the
interventions in terms of their efficacy and/or potential for harm[5]. Conclusions on
the comparative effectiveness of all interventions included in a network could be
drawn based on both the estimates of relative effects and ranking probabilities.
2
Systematic reviews are often subject to reporting bias, that is “systematic differences
between reported and unreported findings” [3]. Two common types of reporting bias
are publication bias and selective outcome reporting bias. Publication bias refers to
whether a study is published is dependent on the direction and magnitude of the
results; selective outcome reporting bias refers to bias in published studies due to
partial or distorted reporting of the outcomes [3]. Numerous studies have shown that
reporting bias is prevalent in clinical trials and has become a threat to the validity of
systematic reviews [6-8].
Because NMA is a newly developed methodology, the impact of reporting bias on
NMA is not well understood. One study by Trinquart et al., examining an
antidepressants network based on clinical trial data obtained from the Food and Drug
Administration (FDA), found that reporting bias affecting one drug could change the
relative rankings of drugs in the whole network [9]. In this study, medical reviews of
trials prepared by the FDA (we will use FDA trials for simplicity) were considered as
the reference (‘truth’). After conducting a sensitivity analysis on each drug, the
authors found that if the FDA trials were partially reported in the scientific journals,
the relative rankings of drugs based on the incomplete data set would differ from the
‘truth’, where all FDA trials were used.
In another study, Song et al. found that if all placebo-controlled trials were biased in
3
favor of “new” or “sponsored” treatments to the same extent, the ‘indirect comparison
will counterbalance such bias’ [10]. This finding suggests that NMAs might be more
robust to reporting bias under certain circumstances than pair-wise meta-analysis. The
same phenomenon was observed in a study of citalopram and escitalopram in which
the indirect evidence shows no evidence of difference while the direct evidence favors
the new, sponsored drug (in this case escitalopram)[11]. The authors speculated that
there was bias in the direct evidence.
Investigating reporting bias requires unique methods and sources of data. Registry
data such as data submitted to the regulatory agencies for gaining marketing approval
have become a popular source. Using the FDA in the United States as an example,
manufactures must submit clinical trial data including protocols and raw data from the
trials to the FDA before a drug can be approved. Scientists at the FDA review these
data and prepare reports that summarize their evaluation and decisions. Most of these
documents prepared by the FDA, especially the more recent ones are posted on the
FDA’s website for public viewing and use
(http://www.accessdata.fda.gov/scripts/cder/drugsatfda/).
These same trials sponsored by drug companies also may be published in scientific
journals, although not all of them are published. Trials with negative and null results
are less likely to be published than trials with positive results [12]. Studies also have
shown that the trial data submitted to the FDA and those published in the literature
4
sometimes disagree in sample size, effect size, and in the analysis plan [13].
To further investigate the impact of reporting bias on NMA, in particular, the value of
using FDA trials for NMA, we need to know the effect estimates and rankings of
interventions based on (1) the FDA trials alone; (2) published trials alone; (3) all
trials.
For the purpose of this project, we focused on analyses (1) and (2). We built upon an
ongoing systematic review and network meta-analysis [14] on the comparative
effectiveness and safety of first-line intraocular pressure (IOP) lowering drugs for
patients with primary open angle glaucoma (POAG) or ocular hypertension (OH).
Our study is a case example of examining the impact of reporting bias on NMA.
The objectives of this study were:
1. To identify and abstract data from FDA reviews of trials on first-line IOP lowering
drugs for patients with POAG or OH.
2. To conduct pairwise meta-analyses and NMAs using (1) data abstracted from the
FDA trials; (2) data abstracted from all published trials.
3. To compare the results from the two analyses above and evaluate the impact of
different data sources on the findings from NMA.
5
2 Methods
2.1 Eligibility Criteria
Trials were eligible for our NMA if they (1) recruited more than 60% patients
diagnosed with POAG or OH, using any definition specified in the trial; (2) compared
a first-line topical IOP lowering drug to placebo/no treatment or another IOP lowering
drug; (3) randomized controlled trial (RCT) that used a parallel design.
Trials were excluded if it (1) enrolled fewer than 10 patients in any study arm; (2)
evaluated combination medical therapies for POAG or OH; (3) had a follow-up
duration shorter than 28 days after randomization.
2.2 Retrieving Medical Reviews from the FDA Website
As described previously by Turner et al.[15], FDA medical reviews of approved drugs
are available from the FDA website
(http://www.accessdata.fda.gov/scripts/cder/drugsatfda/). We searched drug names
(active ingredients) and downloaded medical reviews in electronic form, including
‘Approval History’, ‘Letters’, ‘Reviews’, and ‘Related Documents’. For those
medical reviews that are not available from the FDA website, which are usually
medical reviews for drugs approved before 1996, we filed a request to FDA. We did
not receive any response from the FDA by the timeline allowed for this project. We
proceeded with medical reviews available for download from the FDA website. The
list of drugs we searched is available in Appendix 1.
6
2.3 Retrieving Published Trials
We used trials identified from an ongoing systematic review and NMA [14]. In brief,
Li et al. searched MEDLINE, EMBASE and Cochrane Register of Controlled Trials
(CENTRAL) for eligible trials up to March 2014. The search strategies are available
in Appendix 1.
2.4 Data Abstraction and Management
Two individuals working independently abstracted data from included trials into an
online database developed in the Systematic Review Data Repository
(http://srdr.ahrq.gov/) [16]. We adjudicated discrepancies through discussion.
2.5 Outcomes
IOP was typically used as the primary outcome in both published trials and FDA trials
because it is the endpoint upon which the hypotensive glaucoma drugs is approved
[17]. Thus, for all of the analyses in this study, we used mean IOP at 3 months as the
outcome measurement. If the 3 months IOP data were not available, we used the time
point that was closest to 3 months.
2.6 Pairwise and Network Meta-analysis
We conducted both pairwise meta-analyses and NMAs using data from the above
described sources (i.e., FDA trials and published trials). For pairwise meta-analysis,
7
we performed a random-effects meta-analysis for all comparisons with two or more
trials using the DerSimonian and Laird’s method [18], implemented in STATA
package ‘metan’ [19-21]. We assumed both comparison-specific heterogeneity and a
common heterogeneity across all comparisons [18].The latter assumption in theory
allows a larger degree of heterogeneity than the comparison-specific heterogeneity
approach and would results in less precise estimates.
We used Lu and Ade’s model under a Bayesian hierarchical framework with
random-effects for NMA [22, 23]. We conducted the analysis using the R package
‘gemtc’ and JAGS for Monte Carlo Markov Chain sampling [24, 25]. The analysis
estimated the mean difference in IOP for each pair of interventions. We also estimated
the probability for each intervention being ranked at one of the possible positions.
Such probabilities were used for plotting the ‘SUCRA’ (surface under the cumulative
ranking curve) values [19]. SUCRA value is the cumulative ranking probability,
which in theory accounts for the uncertainly in ranking better than a crude ranking
probability [19, 26]. Because the numbers of treatments (nodes) in each network were
different, the absolute ‘SCURA’ values from the two networks would have different
ranges. We therefore used the percentage of ‘SUCRA’, which was defined as ‘SUCRA’
values divided by maximum possible ‘SUCRA’ value in a specific network [26]. A
percentage ‘SUCRA’ of 100% means that the treatment has the best overall rankings.
After rescaling the ‘SUCRA’ values, we were able to compare the relative rankings of
treatments in the two networks regardless of the numbers of treatments in each
8
network. The larger the SUCRA value the better ranking a drug.
2.7 Evaluation of Clinical and Methodological Heterogeneity
Heterogeneity referred to any variability in the studies included in systematic reviews
[3]. We qualitatively evaluate clinical and methodological heterogeneity by
comparing the differences in characteristics of the patients, interventions, and the
nature of study designs.
2.8 Evaluation of Statistical Heterogeneity and Inconsistency
Statistical heterogeneity is a quantitative measurement of the effect size differences
due to variability. It is often measured by I2, tau2 and Q statistics in conventional
pairwise meta-analysis [27]. Q statistics are based on chi2 tests for differences
between observed data and expected data. I2 represents the “the percentage of the
variability in effect estimates that is due to heterogeneity rather than sampling
error”[28]. Tau2 is the parameter that captures the between-study variance when using
random-effect models.
In the pairwise meta-analysis, we first assumed comparison-specific heterogeneity
and estimated the tau2 for each comparison. Then we assumed that there was a
common heterogeneity across all the comparisons, regardless of the interventions
compared. Thus, a common tau2 was estimated to present the heterogeneity of all the
studies. In the NMA, we assumed common heterogeneity across the entire network
9
and estimated the tau2.
In the NMA, inconsistency refers to the disagreement in the estimated effect size
derived from direct and indirect comparisons. Several methods have been developed
to address inconsistency. We used ‘loop-specific’ method and ‘node-splitting’ method
to explore inconsistency [29-32]. ‘Loop-specific’ approach checks the triangular or
quadratic loops formed by three or four comparators. Inconsistency factors, defined as
the difference between estimates derived from the direct and indirect evidence, and
their 95% confidence intervals are commonly used to indicate the significance of
local inconsistency under the ‘loop-specific’ approach [23, 33]. Another approach,
‘node-splitting’ method, takes out direct comparisons one at a time and conducts a
NMA based on the remaining studies to give an indirect estimate for the same
comparison. Then the direct and indirect estimates are compared to quantify the
amount of inconsistency [29].
2.9 Comparison between Results from Different Sources
We conducted the above described analyses using the FDA trials and trials published
in the literature (referred to as published trials) separately. We compared the following
results obtained from these two data sources:
1. The point estimates and 95% confidence intervals based on pairwise
meta-analyses;
2. The point estimates and 95% credible intervals (intervals where the true parameter
10
lies in with a probability of 0.95 based on the posterior distribution in Bayesian
statistics) based on NMAs;
3. The percentage ‘SUCRA’ values based on NMAs.
11
3 Results
3.1 Identification of Trials
We identified 26 FDA medical reviews describing results from 72 studies (some are
not RCTs). Thirty (42%) RCTs were eligible; however only 16 of 30 (53%) trials
reported sufficient data (point estimates and confidence intervals) for our analysis
(Figure 1). The network based on FDA trials had 10 nodes (nine active drugs and
placebo/vehicle) with a total sample size of 6183 (Figure 2). Timolol was the most
commonly used comparator in these trials and was used as the reference group in our
analysis.
We identified 105 published trials eligible for analysis from the search results of
10936 studies. The reasons of exclusions are shown in Figure 1. The network based
on published trials had 14 nodes (13 active drugs and placebo/vehicle) with a total
sample size of 16898 (Figure 2). Similar to the network described above, timolol was
the most commonly used comparators (Figure 2).
3.2 Analysis of FDA trials
Table 1 shows the results from direct, pairwise meta-analyses of FDA trials.
Dorzolamide, bimatoprost, unoprostone, levobetaxolol, travoprost were compared to
timolol in at least two trials. Timolol lowered IOP more than dorzolamide,
unoprostone, and levobetaxolol; the mean differences (95% confidence intervals) in
IOP were 1.32 (0.63; 2.00), 2.00 (1.59; 2.41), and 1.25 (0.27; 2.23), respectively.
Bimatoprost and travoprost, the two prostaglandins, on the other hand, lowered IOP
12
more than timolol; the mean differences (95% confidence intervals) were 2.26 (1.70;
2.82) and 0.98 (0.52; 1.45), respectively.
Table 2 shows the results from the NMA of FDA trials. All active treatments except
unoprostone and betaxolol were more effective than placebo/vehicle (i.e., 95%
credible interval does not include 0) in lowering IOP at 3 months. Compared to
placebo, the mean differences (95% credible intervals) in IOP at 3 months, ordered by
the magnitude of IOP reduction, were: bimatoprost 6.03 (4.17; 7.85), travoprost 4.75
(2.95; 6.50), latanoprost 4.60 (2.61; 6.56), timolol 3.76 (2.09; 5.40), brinzolamide
2.66 (0.67; 4.59), levobetaxolol 2.63 (0.75, 3.98), unoprostone 1.77 (-0.06; 3.55), and
betaxolol 1.01(-0.96; 2.95).
Figure 3 shows the probability of each drug being ranked at every possible position
and the cumulative ranking probabilities. The ranking results were consistent with the
effect estimates obtained from the NMA. Bimatoprost was most likely to be ranked as
the best (probability=0.97). Placebo was most likely to be ranked as the worst
(probability=0.83). The descending order of ranking, in terms of effectiveness in
lowering IOP at 3 months, based on percentage SUCRA value was bimatoprost (9.97),
travoprost (8.64), latanoprost (8.31), timolol (7.03), brinzolamide (5.35), dorzolamide
(4.70), levobetaxolol (4.57), unoprostone (3.02), betaxolol (2.05) and placebo (0.35).
3.3 Analysis of Published Trials
13
Table 3 shows the results from direct, pairwise meta-analyses of published trials.
Apraclonidine, brimonidine, betaxolol, carteolol, levobunolol, brinzolamide,
dorzolamide, bimatoprost, latanoprost, travoprost and unoprostone were compared to
timolol in at least two trials. Among these comparisons, six were statistically
significant (i.e., 95% confidence interval does not include 0). Timolol lowered IOP
more than betaxolol, brinzolamide, dorzolamide; the mean differences (95%
confidence intervals) in IOP were 1.58 (0.87; 2.29), 1.10 (0.50; 1.70) and 0.76 (0.13;
1.39), respectively. Bimatoprost, latanoprost and travoprost, the three prostaglandins,
on the other hand, lowered IOP more than timolol; the mean differences (95%
confidence intervals) were 2.07 (1.49; 2.64), 1.32 (0.88; 1.77) and 1.22 (0.24; 2.20),
respectively.
Table 4 shows the results from the NMA of published trials. All active treatments
were more effective than placebo/vehicle (i.e., 95% credible interval does not include
0). Compared to placebo, the mean differences (95% credible intervals) in IOP at 3
months, ordered by the magnitude of reduction, were: bimatoprost 5.65 (4.89; 6.40),
travoprost 5.02 (4.23; 5.80), latanoprost 4.88 (4.22; 5.54), levobunolol 4.54 (3.77;
5.31), tafluprost 4.41 (2.88; 5.97), timolol 3.70 (3.11; 4.29), brimonidine 3.62 (2.87;
4.37), carteolol 3.44 (2.36; 4.52), dorzolamide 2.56 (1.86; 3.27), apraclonidine 2.55
(0.89; 4.23), brinzomalide 2.46 (1.61; 3.31), unoprostone 1.98 (1.13; 2.84), and
betaxolol 2.34 (1.63; 3.05).
14
Figure 3 shows the probability of each drug being ranked at every possible position
and the cumulative ranking probabilities. The ranking results were consistent with the
effect estimates gained from the NMA. Bimatoprost was most likely to be ranked as
the best (probability=0.924). Placebo was most likely to be ranked as the worst
(probability=0.999). The descending order in ranking, in terms of effectiveness in
lowering IOP at 3 months, based on percentage SUCRA value was bimatoprost
(13.92), travoprost (12.35), latanoprost (11.89), levobunolol (10.82), tafluprost
(10.46), timolol (8.42), brimonidine (8.07), carteolol (7.50), dorzolamide (4.79),
apraclonidine (4.78), brinzolamide (4.35), betaxolol (3.92), unoprostone (2.74), and
placebo (0.00).
3.4 Comparison between FDA Trials and Published Trials
We first compared the results of direct, pairwise meta-analyses based on the two data
sources. There were 10 common comparisons between the FDA trials and the
published trials. Using results from FDA trials as the reference, we found that,
compared to timolol, the results from published trials increased the point estimates of
the relative effectiveness of five drugs and decreased the point estimate of the relative
effectiveness of one drug. The percentage differences, defined as (estimate of
published trials minus estimate of FDA trials) divided by estimate of FDA trials, were
42% for dorzolamide, 8% for unoprostone, 33% for travoprost, 13% for latanoprost, 7%
for betaxolol, and -8% for bimatoprost.
15
We then compared the results derived from NMAs based on the two data sources.
There were 9 common treatments between the FDA trials and the published trials,
resulting in 36 comparisons. We estimated mean difference in IOP of 36 pairs of these
treatments. Figure 5 presents the point estimates from the two data sources. Point
estimates from two pairs (light blue dots in Figure 5) showed opposite directions,
although the credible intervals all included the null value. Analysis of FDA trials
revealed that unoprostone was better than betaxolol by 0.76 (-1.09; 2.57) and
brinzolamide was better than dorzolamide by 0.21 (-0.55; 0.97). Analysis of published
trials revealed that betaxolol was better than unoprostone by 0.36 (-0.48; 1.20) and
dorzolamide was better than brinzolamide by 0.1 (-0.95; 0.74).
The median relative difference of the two networks was 14% (interquartile range: 7%
to 42%). The relative differences were greater than 25% in 14 pairs. To account for
uncertainty, we also compared the credible intervals (Figure 6). Generally, the results
of the published trials had better precision than the results of the FDA trials because
of the larger sample size of the published trials (Figure 2). Three comparisons were
statistically different based on published trials: unoprostone was better than placebo
by 1.98 (1.13; 2.84); latanoprost was better than timolol by 1.18 (0.79; 1.57), and
betaxolol was better than placebo by 2.34 (1.63; 3.05). The same three comparisons
were not statistically significant based on FDA trials. The relative estimates were 1.77
(-0.06; 3.55) between unoprostone and placebo; 0.84 (-0.23; 1.94) between
latanoprost and timolol, and 1.01 (-0.93; 2.95) between betaxolol and placebo.
16
Figure 7 shows a subset of the 36 comparisons derived from NMAs based on the two
data sources using either placebo or timolol as the comparator. Compared to placebo,
the point estimates are similar regardless of the data source, although two of the
comparisons based on the FDA trials did not reach statistical significance. Compared
to timolol, the point estimates are also similar regardless of the data sources, and one
comparison based on the FDA trials did not reach statistical significance.
Figure 8 illustrates the differences in percentage ‘SUCRA’ values between two
networks. We found that the relative rankings changed for dorzolamide, brinzolamide
unoprostone, and betaxolol. The percentage ‘SUCRA’ values, sorted in descending
order, for the treatments in the network of FDA trials were: bimatoprost 99.69%,
travoprost 86.45%, latanoprost 83.10%, timolol 70.34%, brinzolamide 53.52%,
dorzolamide 47.00%, levobetaxolol 45.69%, unoprostone 30.21%, betaxolol 20.48,
and placebo 3.54%. The percentage ‘SUCRA’ values, sorted in descending order, for
the treatments in the network of published trials were: bimatoprost 99.42%, travoprost
88.18%, latanoprost 84.95%, levobunolol 77.23, tafluprost 74.72%, timolol 60.17%,
brimonidine 57.67%, carteolol 53.55%, dorzolamide 34.24%, apraclonidine 34.12%,
brinzolamide 31.05%, betaxolol 27.97%, unoprostone 19.56%, and placebo 0.02%.
3.5 Heterogeneity
In general, FDA trials are more homogenous than published trials. The I2 values for
comparisons based on FDA trials were less than 30%, indicating low level of
17
statistical heterogeneity [3]. For published trials, we identified two comparisons
(betaxolol and placebo, unoprostone and timolol) in which the I2 was greater than
75%, suggesting considerable statistical heterogeneity [3].
3.6 Inconsistency
The network of FDA trials had a “start” shape and did not include comparisons that
could be both directly and indirectly estimated. Thus, there was no estimable
inconsistency. As for the published trials, using the ‘node-splitting’ approach, we
found one node with evidence of statistical inconsistency. Using the ‘loop-specific’
approach, we found 6 out of 34 triangle loops (17.6%) had significant inconsistency
(p value < 0.05). We speculated that funding source and extreme effect size could be
the reasons introducing inconsistency.
18
4 Discussions
In this study, we conducted conventional pairwise meta-analyses and NMAs using
data from FDA trials and published trials. We found that FDA medical reviews are an
important source of clinical trial data that could be used for NMA. The data source
does not seem to change the inference about the drug’s relative effectiveness. The
relative rankings changed for the middle-ranked drugs but not for the top-ranked
drugs.
Several studies have examined the impact of reporting bias using the FDA trials.
Turner et al. used the FDA trials and their published versions to describe the selective
reporting bias in antidepressants and antipsychotics trials. Based on Turner’s work,
Trinquart et al. further examined the reporting bias on NMAs. In both Turner and
Trinquart’ studies, they used all FDA trials as one data source and published FDA
trials as the second data source, and compared the relative effectiveness and rankings
of drugs. However, it is worth noting that non-FDA trials published in the medical
literature, the traditional source of trials used for systematic review and meta-analysis,
could also be biased. Thus, we framed our question in a broader context than what has
been done previously. We used the FDA trials as the ‘reference’ and examined the
extent of discrepancies by using all published trials. Because published trials still
serve as the main source of data, we answered a pragmatic question: is it worth to put
resources in looking for FDA trials when conducting NMA?
19
Based on our case example, we believe that FDA medical reviews serve as an
important source of trial data; systematic reviewers should look for these data for
inclusion in their analysis. However, the FDA trial data were incomplete through the
public portal. Medical reviews of older drugs are not always available and for one half
of the available ones, no precision estimates were provided in the medical reviews.
The impact of incomplete FDA trial data on our analysis was threefold. First, the
numbers of nodes in the two networks were different. Second, the sample size of
some treatments was small in the FDA network (e.g., betaxolol and placebo),
resulting in less precise estimates and lower power to detect the differences between
the two data sources if there were any. Third, the ‘star’ shape of the FDA network
limits the ability to assess inconsistency.
As treatment decisions are increasingly relying on the synthesis of all trials through
systematic reviews and meta-analyses, the value of trial data could not be fully
realized if there were missing parts, either missing the entire trial or missing
information on effect size. We strongly urge the FDA to consider the downstream
products of trials and how public is already using the medical reviews to address
medical and public health questions. This implies that medical reviews for all
approved drugs should be made available. Furthermore, the medical reviews should
be prepared following a consistent format and the data on treatment effect should be
reported completely. In particular, the precision estimates for the primary and
secondary outcomes (or endpoints) should be provided.
20
We found that the point estimates are similar but the relative rankings are different for
a small number of drugs included in our networks. The estimates based on the FDA
trials are less precise, which could contribute to the unstable rankings we observed. It
is important to interpret the two types of measures (relative effectiveness and rankings)
jointly in the context of NMA.
A subset of the FDA trials included in our analyses was also published. One might be
interested in comparing the details of the FDA trials with its matching publication to
explore disagreement, a traditional approach to examine selective outcome reporting.
For the same reason, when we compared the point estimates and relative rankings
between the two data sources, we did not account for the precisions because the two
data sources are not independent. Future work could focus on developing statistical
tests to facilitate such comparisons.
Our study has several strengths. First, this study provided an informative comparison
between the results based on published trials and FDA trials in the context of NMA,
which was not empirically examined before. We included head-to-head as well as
placebo-controlled trials. We answered a pragmatic question of how different the
results would be by using different data sources. In addition, NMA allowed the
examinations of the impact of data sources and the likelihood of reporting bias
between any two treatments in the networks regardless of whether the treatments were
21
compared directly or indirectly in individual studies.
In conclusion, FDA medical reviews provide a good amount of trial data that should
be considered for NMA. The point estimates for the relative effectiveness were
similar for most drugs regardless of whether FDA or published trial data are used in
the NMA, although the relative rankings changed for some drugs. Whether reporting
bias has a role in the differences we observed needs to be further evaluated.
22
5 References:
1. Guyatt, G., et al., Evidence-based medicine: A new approach to teaching the practice of
medicine. JAMA, 1992. 268(17): p. 2420-2425.
2. Guyatt, G.H., et al., Users' guides to the medical literature: Ix. a method for grading
health care recommendations. JAMA, 1995. 274(22): p. 1800-1804.
3. Higgins JPT, G.S.e., Cochrane Handbook for Systematic Reviews of Interventions. The
Cochrane Collaboration, 2011.
4. Cipriani, A., et al., Conceptual and technical challenges in network meta-analysis. Ann Intern
Med, 2013. 159(2): p. 130-7.
5. Salanti, G., Indirect and mixed-treatment comparison, network, or multiple-treatments
meta-analysis: many names, many benefits, many concerns for the next generation evidence
synthesis tool. Research Synthesis Methods, 2012. 3(2): p. 80-97.
6. Dwan, K., et al., Comparison of protocols and registry entries to published reports for
randomised controlled trials. Cochrane Database Syst Rev, 2011(1): p. Mr000031.
7. Hopewell, S., et al., Publication bias in clinical trials due to statistical significance or
direction of trial results. Cochrane Database Syst Rev, 2009(1): p. Mr000006.
8. Scherer, R.W., P. Langenberg, and E. von Elm, Full publication of results initially presented in
abstracts. Cochrane Database Syst Rev, 2007(2): p. Mr000005.
9. Trinquart, L., A. Abbe, and P. Ravaud, Impact of reporting bias in network meta-analysis of
antidepressant placebo-controlled trials. PLoS One, 2012. 7(4): p. e35219.
10. Song, F., I. Harvey, and R. Lilford, Adjusted indirect comparison may be less biased than
direct comparison for evaluating new pharmaceutical interventions. J Clin Epidemiol, 2008.
61(5): p. 455-63.
11. Alkhafaji, A., et al., Impact of evergreening on patients and health insurance: a meta analysis
and reimbursement cost analysis of citalopram/escitalopram antidepressants. BMC Medicine,
2012. 10(1): p. 142.
12. Turner, E.H., et al., Selective Publication of Antidepressant Trials and Its Influence on
Apparent Efficacy. New England Journal of Medicine, 2008. 358(3): p. 252-260.
13. Turner, E.H., D. Knoepflmacher, and L. Shapley, Publication bias in antipsychotic trials: an
analysis of efficacy comparing the published literature to the US Food and Drug
Administration database. PLoS Med, 2012. 9(3): p. e1001189.
14. Li, T., et al., Comparative Effectiveness of First-line Medications for Patients with Primary
Open Angle Glaucoma or Ocular Hypertension – A Systematic Review and Network
Meta-analysis (in process).
15. Turner, E.H., How to access and process FDA drug approval packages for use in research.
BMJ, 2013. 347: p. f5992.
16. Li, T., et al., Innovations in data collection, management, and archiving for systematic reviews.
Ann Intern Med, 2015. 162(4): p. 287-94.
17. Weinreb, R.N. and P.L. Kaufman, Glaucoma Research Community and FDA Look to the
Future, II: NEI/FDA Glaucoma Clinical Trial Design and Endpoints Symposium: Measures of
Structural Change and Visual Function. Investigative Ophthalmology & Visual Science, 2011.
52(11): p. 7842-7851.
23
18. DerSimonian, R. and N. Laird, Meta-analysis in clinical trials. Control Clin Trials, 1986. 7(3):
p. 177-88.
19. Chaimani, A., et al., Graphical tools for network meta-analysis in STATA. PLoS One, 2013.
8(10): p. e76654.
20. Harris, R., et al., metan: fixed- and random-effects meta-analysis. Stata Journal, 2008. 8(1): p.
3-28.
21. White, I.R., Multivariate random-effects meta-regression: Updates to mvmeta. Stata Journal,
2011. 11(2): p. 255-270.
22. Lu, G. and A.E. Ades, Combination of direct and indirect evidence in mixed treatment
comparisons. Statistics in Medicine, 2004. 23(20): p. 3105-3124.
23. Lu, G. and A.E. Ades, Assessing Evidence Inconsistency in Mixed Treatment Comparisons.
Journal of the American Statistical Association, 2006. 101(474): p. 447-459.
24. van Valkenhoef, G., et al., Algorithmic parameterization of mixed treatment comparisons.
Statistics and Computing, 2012. 22(5): p. 1099-1111.
25. van Valkenhoef, G., et al., Automating network meta-analysis. Research Synthesis Methods,
2012. 3(4): p. 285-299.
26. Salanti, G., A.E. Ades, and J.P. Ioannidis, Graphical methods and numerical summaries for
presenting results from multiple-treatment meta-analysis: an overview and tutorial. J Clin
Epidemiol, 2011. 64(2): p. 163-71.
27. Higgins, J.P., et al., Measuring inconsistency in meta-analyses. Bmj, 2003. 327(7414): p.
557-60.
28. Higgins, J.P. and S.G. Thompson, Quantifying heterogeneity in a meta-analysis. Stat Med,
2002. 21(11): p. 1539-58.
29. Dias, S., et al., Checking consistency in mixed treatment comparison meta-analysis. Stat Med,
2010. 29(7-8): p. 932-44.
30. Dias, S., et al., Evidence synthesis for decision making 4: inconsistency in networks of
evidence based on randomized controlled trials. Med Decis Making, 2013. 33(5): p. 641-56.
31. Higgins, J.P.T., et al., Consistency and inconsistency in network meta-analysis: concepts and
models for multi-arm studies. Research Synthesis Methods, 2012. 3(2): p. 98-110.
32. Lumley, T., Network meta-analysis for indirect treatment comparisons. Stat Med, 2002.
21(16): p. 2313-24.
33. Dias, S., et al., Evidence Synthesis for Decision Making 4: Inconsistency in Networks of
Evidence Based on Randomized Controlled Trials. Medical Decision Making, 2013. 33(5): p.
641-656.
34. Tianjing Li, P.K.L., MS; Benjamin Rouse; Hwanhee Hong, PhD; Qiyuan Shi; David S.
Friedman, MD; Richard Wormald, FRCOphth; Kay Dickersin, PhD, Comparative
Effectiveness of First-line Medications for Patients with Primary Open Angle Glaucoma or
Ocular Hypertension – A Systematic Review and Network Meta-analysis (in process).
24
25
6 Appendices
Appendix 1. Search strategies
Cochrane Library #1 MeSH descriptor: [Glaucoma, Open-Angle] explode all trees
#2 MeSH descriptor: [Ocular Hypertension] explode all trees
#3 (open near/2 angle near/2 glaucoma*)
#4 (POAG or OHT)
#5 (((increas* or elevat* or high*) near/3 (ocular or intra-ocular)) and pressure)
#6 [34-#5]
#7 MeSH descriptor: [Adrenergic beta-Antagonists] explode all trees
#8 MeSH descriptor: [Timolol] explode all trees
#9 Timolol*
#10 MeSH descriptor: [Metipranolol] explode all trees
#11 Metipranolol*
#12 MeSH descriptor: [Carteolol] explode all trees
#13 Carteolol*
#14 MeSH descriptor: [Levobunolol] explode all trees
#15 Levobunolol*
#16 MeSH descriptor: [Betaxolol] explode all trees
#17 Betaxolol*
#18 MeSH descriptor: [Carbonic Anhydrase Inhibitors] explode all trees
#19 (Carbonic near/2 Anhydrase near/2 Inhibitor*)
#20 MeSH descriptor: [Acetazolamide] explode all trees
#21 Acetazolam*
#22 Brinzolamide*
#23 Dorzolamide*
#24 MeSH descriptor: [Prostaglandins, Synthetic] explode all trees
#25 latanoprost*
#26 travoprost*
#27 bimatoprost*
#28 unoprostone*
#29 tafluprost*
#30 MeSH descriptor: [Antihypertensive Agents] explode all trees
#31 MeSH descriptor: [Pilocarpine] explode all trees
#32 Pilocarpin*
#33 MeSH descriptor: [Epinephrine] explode all trees
#34 epinephrine*
#35 dipivefrin*
#36 MeSH descriptor: [Adrenergic alpha-2 Receptor Agonists] explode all trees
#37 (adrenergic near/2 alpha* near/3 agonist*)
#38 apraclonidin*
26
#39 brimonidine*
#40 (drug* or medic* or pharmacologic*) near/3 (treat* or therap* or intervent*)
#41 {or #7-#40}
#42 #6 and #41
MEDLINE (OVID) 1. exp clinical trial/ [publication type]
2. (randomized or randomised).ab,ti.
3. placebo.ab,ti.
4. dt.fs.
5. randomly.ab,ti.
6. trial.ab,ti.
7. groups.ab,ti.
8. or/1-7
9. exp animals/
10. exp humans/
11. 9 not (9 and 10)
12. 8 not 11
13. exp glaucoma open angle/
14. exp ocular hypertension/
15. (open adj2 angle adj2 glaucoma$).tw.
16. (POAG or OHT).tw.
17. (((increas$ or elevat$ or high$) adj3 (ocular or intra-ocular)) and pressure).tw.
18. or/13-17
19. exp adrenergic beta antagonists/
20. exp timolol/
21. timolol$.tw.
22. exp metipranolol/
23. metipranolol$.tw.
24. exp carteolol/
25. carteolol$.tw.
26. exp levobunolol/
27. levobunolol$.tw.
28. exp betaxolol/
29. betaxolol$.tw.
30. exp carbonic anhydrase inhibitors/
31. (carbonic adj2 anhydrase adj2 inhibitor$).tw.
32. exp Acetazolamide/
33. acetazolamide$.tw.
34. brinzolamide$.tw.
35. dorzolamide$.tw.
36. exp Prostaglandins, Synthetic/
37. latanoprost$.tw.
38. travoprost$.tw.
27
39. bimatoprost$.tw.
40. unoprostone$.tw.
41. brimonidine$.tw.
42. exp antihypertensive agents/
43. exp pilocarpine/
44. pilocarpin$.tw.
45. exp epinephrine/
46. epinephrin$.tw.
47. dipivefrin$.tw.
48. exp Adrenergic alpha-2 Receptor Agonists/
49. ((adrenergic adj2 alpha$ adj2 receptor$) or (adrenergic adj2 alpha$ adj2 agonist$)).tw.
50. apraclonidin$.tw.
51. tafluprost$.tw.
52. ((drug$ or medic$ or pharmacologic$) adj3 (treat$ or therap$ or intervent$)).tw.
53. or/19-52
54. 18 and 53
55. 12 and 54
Embase.com #1 'randomized controlled trial'/exp
#2 'randomization'/exp
#3 'double blind procedure'/exp
#4 'single blind procedure'/exp
#5 random*:ab,ti
#6 #1 OR #2 OR #3 OR #4 OR #5
#7 'animal'/exp OR 'animal experiment'/exp
#8 'human'/exp
#9 #7 AND #8
#10 #7 NOT #9
#11 #6 NOT #10
#12 'clinical trial'/exp
#13 (clin* NEAR/3 trial*):ab,ti
#14 ((singl* OR doubl* OR trebl* OR tripl*) NEAR/3 (blind* OR mask*)):ab,ti
#15 'placebo'/exp
#16 placebo*:ab,ti
#17 random*:ab,ti
#18 'experimental design'/exp
#19 'crossover procedure'/exp
#20 'control group'/exp
#21 'latin square design'/exp
#22 #12 OR #13 OR #14 OR #15 OR #16 OR #17 OR #18 OR #19 OR #20 OR #21
#23 #22 NOT #10
#24 #23 NOT #11
#25 'comparative study'/exp
28
#26 'evaluation'/exp
#27 'prospective study'/exp
#28 control*:ab,ti OR prospectiv*:ab,ti OR volunteer*:ab,ti
#29 #25 OR #26 OR #27 OR #28
#30 #29 NOT #10
#31 #30 NOT (#11 OR #23)
#32 #11 OR #24 OR #31
#33 'open angle glaucoma'/exp
#34 'intraocular hypertension'/exp
#35 (open NEAR/2 angle):ab,ti AND (angle NEAR/2 glaucoma*):ab,ti
#36 poag:ab,ti OR oht:ab,ti
#37 ((increas* OR elevat* OR high*) NEAR/3 (ocular OR 'intra ocular')):ab,ti AND
pressure:ab,ti
#38 #33 OR #34 OR #35 OR #36 OR #37
#39 'beta adrenergic receptor blocking agent'/exp
#40 'timolol'/exp
#41 timolol*:ab,ti
#42 'metipranolol'/exp
#43 metipranolol*:ab,ti
#44 'carteolol'/exp
#45 carteolol*:ab,ti
#46 'levobunolol'/exp
#47 levobunolol*:ab,ti
#48 'betaxolol'/exp
#49 betaxolol*:ab,ti
#50 'carbonate dehydratase inhibitor'/exp
#51 (carbonic NEAR/2 anhydrase):ab,ti AND (anhydrase NEAR/2 inhibitor*):ab,ti
#52 'acetazolamide'/exp
#53 acetazolamide*:ab,ti
#54 brinzolamide*:ab,ti
#55 dorzolamide*:ab,ti
#56 'latanoprost'/exp
#57 latanoprost*:ab,ti
#58 'travoprost'/exp
#59 travoprost*:ab,ti
#60 'bimatoprost'/exp
#61 bimatoprost*:ab,ti
#62 'unoprostone isopropyl ester'/exp
#63 unoprostone*:ab,ti
#64 'brimonidine'/exp
#65 brimonidine*:ab,ti
#66 'antihypertensive agent'/exp
#67 'pilocarpine'/exp
#68 pilocarpin*:ab,ti
29
#69 'adrenalin'/exp
#70 epinephrin*:ab,ti
#71 dipivefrin*:ab,ti
#72 'alpha 2 adrenergic receptor stimulating agent'/exp
#73 (adrenergic NEAR/2 alpha*):ab,ti AND (alpha* NEAR/2 agonist*):ab,ti
#74 apraclonidin*:ab,ti
#75 'tafluprost'/exp
#76 tafluprost*:ab,ti
#77 ((drug* OR medic* OR pharmacologic*) NEAR/3 (treat* OR therap* OR intervent*)):ab,ti
#78 #39 OR #40 OR #41 OR #42 OR #43 OR #44 OR #45 OR #46 OR #47 OR #48 OR #49 OR
#50 OR #51 OR #52 OR #53 OR #54 OR #55 OR #56 OR #57 OR #58 OR #59 OR #60 OR #61
OR #62 OR #63 OR #64 OR #65 OR #66 OR #67 OR #68 OR #69 OR #70 OR #71 OR #72 OR
#73 OR #74 OR #75 OR #76 OR #77
#79 #38 AND #78
#80 #32 AND #79
PubMed #1 ((randomized controlled trial[pt]) OR (controlled clinical trial[pt]) OR (randomised[tiab] OR
randomized[tiab]) OR (placebo[tiab]) OR (drug therapy[sh]) OR (randomly[tiab]) OR (trial[tiab])
OR (groups[tiab])) NOT (animals[mh] NOT humans[mh])
#2 (open[tw] AND angle[tw] AND glaucoma*[tw]) NOT Medline[sb]
#3 (POAG[tw] OR OHT[tw]) NOT Medline[sb]
#4 (((increase*[tw] OR elevat*[tw] OR high*[tw]) AND (ocular[tw] OR intra-ocular[tw])) AND
pressure[tw]) NOT Medline[sb]
#5 #2 OR #3 OR #4
#6 timolol*[tw] NOT Medline[sb]
#7 metipranolol*[tw] NOT Medline[sb]
#8 carteolol*[tw] NOT Medline[sb]
#9 levobunolol*[tw] NOT Medline[sb]
#10 betaxolol*[tw] NOT Medline[sb]
#11 (carbonic[tw] AND anhydrase[tw] AND inhibitor*[tw]) NOT Medline[sb]
#12 acetazolamide*[tw] NOT Medline[sb]
#13 brinzolamide*[tw] NOT Medline[sb]
#14 dorzolamide*[tw] NOT Medline[sb]
#15 latanoprost*[tw] NOT Medline[sb]
#16 travoprost*[tw] NOT Medline[sb]
#17 bimatoprost*[tw] NOT Medline[sb]
#18 unoprostone*[tw] NOT Medline[sb]
#19 brimonidine*[tw] NOT Medline[sb]
#20 pilocarpin*[tw] NOT Medline[sb]
#21 epinephrin*[tw] NOT Medline[sb]
#22 dipivefrin* NOT Medline[sb]
#23 ((adrenergic[tw] AND alpha*[tw] AND receptor*[tw]) OR (adrenergic[tw] AND alpha*[tw]
AND agonist*[tw])) NOT Medline[sb]
30
#24 apraclonidin*[tw] NOT Medline[sb]
#25 tafluprost*[tw] NOT Medline[sb]
#26 ((drug*[tw] OR medic*[tw] OR pharmacologic*[tw]) AND (treat*[tw] OR therap*[tw] OR
intervent*[tw])) NOT Medline[sb]
#27 #6 OR #7 OR #8 OR #9 OR #10 OR #11 OR #12 OR #13 OR #14 OR #15 OR #16 OR #17
OR #18 OR #19 OR #20 OR #21 OR #22 OR #23 OR #24 OR #25 OR #26
#28 #5 AND #27
#29 #1 AND #28
FDA (Drugs@FDA) #1. Acetazolamide
#2. Brimonidine
#3. Betaxolol
#4. Dichlorphenamide
#5. Levobunolol
#6. Timolol
#7. Brinzolamide
#8. Dorzolamide
#9. Bimatoprost
#10. Latanoprost
#11. Travoprost
#12. Unoprostone
#13. Acetazolamide
#14. Methazolamide
#15. Pilocarpine
#16. Carbachol
#17. Echothiophate
#18. Epinephrine
#19. Dipivefrin
#20. Metipranolol
#21. Demecarium
#22. Apraclonidine
#23. Carteolol
#24. Tafluprost
31
7 Tables
Table 1. Summary estimates for intraocular pressure at 3 months derived
from pair-wise meta-analysis based on 13 direct comparisons from 16 FDA
trials
Comparison-specific heterogeneity
Num. of trials Mean difference 95% CI, lower 95% CI, upper Tau-squared I-squared Mean difference 95% CI, lower 95% CI, upper
Placebo vs.
1 -3.00 -4.53 -1.47 NA NA -4.32 -6.56 -2.08
1 -1.00 -2.76 0.76 NA NA -2.32 -4.84 0.19
1 -2.70 -4.44 -0.96 NA NA -4.02 -6.52 -1.53
Timolol vs.
3 1.32 0.63 2.00 0.12 24% 1.32 0.49 2.15
2 -2.26 -2.82 -1.70 0.00 0% -3.58 -4.75 -2.41
2 2.00 1.59 2.41 0.00 0% 0.67 -0.39 1.73
3 1.25 0.27 2.23 0.52 73% 0.14 -0.95 1.22
3 -0.98 -1.45 -0.52 0.03 19% -2.31 -3.35 -1.27
1 -1.10 -1.99 -0.21 NA NA -2.42 -3.95 -0.90
Betaxolol vs.
1 -2.00 -3.54 -0.46 NA NA -3.32 -5.57 -1.07
1 -1.70 -3.45 0.05 NA NA -3.02 -5.53 -0.52
Brinzolamide vs.
2 0.22 -0.37 0.80 0.06 32% -1.11 -2.21 0.00
Latanoprost vs.
1 -0.30 -1.03 0.43 NA NA -1.62 -2.99 -0.25
Tau square: between-study variance in random-effect models; I square: proportion of variance due to heterogeneity
*Estimated tau square is 0.039; estimated I square is 36.03%
Common heterogeneity*
Levobetaxolol
Betaxolol
Timolol
Dorzolamide
Bimatoprost
Dorzolamide
Travoprost
Unoprostone
Levobetaxolol
Travoprost
Latanoprost
Levobetaxolol
Timolol
32
Table 2. Summary estimates for intraocular pressure at 3 months derived
from network meta-analysis of 16 FDA trials
Placebo
1.01 Betaxolol
(-0.93;2.95)
3.76 2.75 Timolol
(2.09;5.40) (1.05;4.42)
2.36 1.35 -1.40 Levobetaxolol
(0.75;3.98) (-0.3;3.03) (-2.06;-0.62)
2.66 1.65 -1.10 0.30 Brinzolamide
(0.67;4.59) (-0.35;3.59) (-2.15;-0.06) (-1.02;1.51)
2.45 1.44 -1.31 0.09 -0.21 Dorzolamide
(0.61;4.24) (-0.42;3.24) (-2.04;-0.58) (-0.98;1.06) (-0.97;0.55)
6.03 5.02 2.27 3.67 3.37 3.58 Bimatoprost
(4.17;7.85) (3.13;6.87) (1.45;3.07) (2.53;4.68) (2.05;4.69) (2.5;4.67)
4.60 3.59 0.84 2.24 1.94 2.15 -1.43 Latanoprost
(2.61;6.56) (1.58;5.56) (-0.23;1.94) (0.91;3.49) (0.44;3.46) (0.86;3.46) (-2.75;-0.06)
4.75 3.74 0.99 2.39 2.09 2.30 -1.28 0.15 Travoprost
(2.95;6.5) (1.92;5.53) (0.36;1.62) (1.37;3.28) (0.87;3.31) (1.35;3.26) (-2.30;-0.25) (-0.91;1.18)
1.77 0.76 -1.99 -0.59 -0.89 -0.68 -4.26 -2.83 -2.98 Unoprostone
(-0.06;3.55) (-1.09;2.57) (-2.71;-1.26) (-1.67;0.36) (-2.16;0.39) (-1.70;0.34) (-5.33;-3.17) (-4.14;-1.55) (-3.94;-2.02)
Grey
Green
Red
Blue
Beta-blocker
Carbonic anhydrase inhibitor
Prostaglandin analog
Unit: mmHg; Color coding: drug class
Mean difference < 0 favors the drug in the coloum
Mean difference > 0 favors the drug in the row
Reported numbers are calculated by column - row under the Lu and Ades homogeneous random effects model assuming consistency
Reported posterior means and 95% Bayesian credible intervals
Placebo/vehicle/no treatment
33
Table 3. Summary estimates for intraocular pressure at 3 months derived
from pair-wise meta-analysis based on 36 direct comparisons from 105
published trials
Comparison-specific heterogeneity
Num. of trials Mean difference 95% CI, lower 95% CI, upper Tau-squared I-squared Mean difference 95% CI, lower 95% CI, upper
Brimonidine 1 -2.30 -3.99 -0.61 NA NA -0.41 -2.86 2.04
Betaxolol 2 -3.90 -5.29 -2.52 1.30 81% -0.89 -2.48 0.69
Levobunolol 2 -7.52 -8.50 -6.50 0.00 0% -5.56 -7.36 -3.75
Timolol 4 -3.91 -5.12 -2.69 0.85 57% -2.00 -3.47 -0.52
Brinzolamide 2 -2.17 -3.23 -1.10 0.00 0% -0.29 -2.09 1.51
Dorzolamide 4 -1.91 -2.92 -0.90 0.51 51% -1.89 -2.90 -0.88
Bimatoprost 1 -4.60 -5.60 -3.60 NA NA -2.71 -4.70 -0.72
Unoprostone 1 -0.20 -1.56 1.16 NA NA 1.69 -0.52 3.90
Apraclonidine vs.
Timolol 2 -0.84 -3.75 2.08 3.73 84% 0.66 -1.12 2.45
Brimonidine vs.
Betaxolol 1 1.94 0.84 3.04 NA NA 3.83 1.78 5.88
Timolol 4 0.17 -0.70 1.03 0.55 81% 2.06 0.76 3.37
Brinzolamide 2 1.01 0.50 1.53 0.00 0% 2.89 1.40 4.39
Latanoprost 5 -1.36 -2.21 -0.50 0.73 78% 0.51 -0.74 1.77
Travoprost 1 -1.20 -3.77 1.37 NA NA 0.69 -2.49 3.87
Levobunolol 2 -2.00 -3.54 -0.46 NA NA -1.41 -3.54 0.72
Timolol 8 -1.58 -2.29 -0.87 0.43 48% 0.31 -0.95 1.57
Dorzolamide 2 -0.30 -0.96 0.36 0.00 0% 1.54 -0.02 3.11
Latanoprost 2 -1.06 -2.62 0.51 0.33 25% 0.84 -1.15 2.83
Unoprostone 1 0.60 0.09 1.11 NA NA 2.49 0.71 4.26
Levobunolol 1 -2.90 -4.59 -1.22 NA NA -1.01 -3.46 1.44
Timolol 4 0.03 -0.61 0.68 0.11 24% 1.95 0.58 3.31
Levobunolol vs.
Timolol 11 -0.03 -0.44 0.39 0.01 3% 1.88 0.68 3.08
Brinzolamide 3 1.10 0.50 1.70 0.00 0% 2.82 1.32 4.33
Dorzolamide 5 0.76 0.13 1.39 0.24 47% 2.65 1.38 3.93
Bimatoprost 5 -2.07 -2.64 -1.49 0.15 35% -0.28 -1.57 1.01
Latanoprost 14 -1.32 -1.77 -0.88 0.40 64% 0.56 -0.56 1.68
Travoprost 5 -1.22 -2.20 -0.24 0.79 67% 0.67 -0.66 1.99
Tafluprost 1 -0.30 -0.72 0.12 NA NA 1.59 -0.16 3.34
Unoprostone 2 0.94 -0.43 2.31 0.85 87% 2.86 1.37 4.34
Brinzolamide vs.
Dorzolamide 2 -0.58 -1.15 0.00 0.00 0% 1.30 -0.23 2.82
Dorzolamide vs.
Latanoprost 1 -2.90 -3.70 -2.10 NA NA -1.01 -2.90 0.88
Bimatoprost vs.
Latanoprost 6 0.87 0.01 1.73 0.82 76% 2.75 1.50 3.99
Travoprost 8 0.59 -0.13 1.30 0.73 74% 2.48 1.28 3.67
Latanoprost vs.
Travoprost 7 -0.22 -0.86 0.41 0.33 48% 1.68 0.44 2.91
Tafluprost 1 -0.90 -3.40 1.60 NA NA 0.99 -2.14 4.11
Unoprostone 6 3.07 2.51 3.63 0.01 2% 4.85 3.53 6.16
Tau square: between-study variance in random-effect models; I square: proportion of variance due to heterogeneity
*Estimated tau squared is 0.4374; estimated I-squared is 59.63%
Common heterogeneity*
Placebo vs.
Betaxolol vs.
Carteolol vs.
Timolol vs.
34
Table 4. Summary estimates for intraocular pressure at 3 months derived from network meta-analysis of 105 published trials
Placebo
2.55 Apraclonidine
(0.89;4.23)
3.62 1.07 Brimonidine
(2.87;4.37) (-0.60;2.72)
2.34 -0.21 -1.28 Betaxolol
(1.63;3.05) (-1.89;1.45) (-2.01;-0.54)
3.44 0.89 -0.18 1.09 Carteolol
(2.36;4.52) (-0.94;2.7) (-1.26;0.89) (0.02;2.17)
4.54 1.99 0.92 2.19 1.10 Levobunolol
(3.77;5.31) (0.31;3.65) (0.11;1.73) (1.43;2.97) (0.06;2.15)
3.70 1.15 0.08 1.36 0.27 -0.84 Timolol
(3.11;4.29) (-0.42;2.7) (-0.48;0.65) (0.79;1.93) (-0.65;1.18) (-1.43;-0.25)
2.46 -0.09 -1.16 0.11 -0.98 -2.08 -1.25 Brinzolamide
(1.61;3.31) (-1.84;1.64) (-1.97;-0.35) (-0.79;1.02) (-2.16;0.21) (-3.03;-1.14) (-2.01;-0.48)
2.56 0.01 -1.06 0.22 -0.87 -1.98 -1.14 0.10 Dorzolamide
(1.86;3.27) (-1.67;1.68) (-1.83;-0.28) (-0.52;0.95) (-1.97;0.22) (-2.80;-1.16) (-1.74;-0.53) (-0.74;0.95)
5.65 3.10 2.03 3.30 2.21 1.11 1.94 3.19 3.08 Bimatoprost
(4.89;6.40) (1.43;4.75) (1.29;2.76) (2.54;4.07) (1.14;3.28) (0.3;1.9) (1.39;2.5) (2.27;4.1) (2.29;3.88)
4.88 2.33 1.26 2.54 1.44 0.34 1.18 2.42 2.32 -0.76 Latanoprost
(4.22;5.54) (0.71;3.93) (0.69;1.84) (1.9;3.18) (0.45;2.44) (-0.36;1.04) (0.79;1.57) (1.61;3.23) (1.64;2.99) (-1.32;-0.21)
5.02 2.47 1.40 2.67 1.58 0.48 1.31 2.56 2.45 -0.63 0.13 Travoprost
(4.23;5.80) (0.80;4.12) (0.65;2.14) (1.89;3.46) (0.49;2.66) (-0.35;1.29) (0.74;1.89) (1.63;3.48) (1.64;3.26) (-1.2;-0.05) (-0.43;0.7)
4.41 1.86 0.79 2.07 0.97 -0.13 0.71 1.95 1.85 -1.24 -0.47 -0.61 Tafluprost
(2.88;5.97) (-0.26;3.98) (-0.73;2.34) (0.54;3.63) (-0.72;2.68) (-1.67;1.43) (-0.72;2.15) (0.34;3.58) (0.31;3.42) (-2.75;0.31) (-1.92;1.00) (-2.13;0.94)
1.98 -0.57 -1.64 -0.36 -1.45 -2.55 -1.72 -0.47 -0.58 -3.66 -2.90 -3.03 -2.43 Unoprostone
(1.13;2.84) (-2.29;1.14) (-2.48;-0.79) (-1.20;0.48) (-2.61;-0.30) (-3.47;-1.65) (-2.43;-1.01) (-1.48;0.53) (-1.47;0.31) (-4.51;-2.82) (-3.58;-2.21) (-3.88;-2.18) (-4.03;-0.85)
Grey
Gold
Green
Red
Blue
Alpha-2 adrenergic agonist
Placebo/vehicle/no treatment
Beta-blocker
Carbonic anhydrase inhibitor
Prostaglandin analog
Unit: mmHg; Color coding: drug class
Mean difference < 0 favors the drug in the coloum
Mean difference > 0 favors the drug in the row
Reported numbers are calculated by column - row under the Lu and Ades homogeneous random effects model assuming consistency
Reported posterior means and 95% Bayesian credible intervals
35
Table 5. Comparisons between the results of direct pairwise meta-analyses based on FDA trials and published trials
N Effect size (95%CI) Tau-squared I-squared N Effect size (95%CI) Tau-squared I-squared
1 -1.00 (-2.76; 0.76) NA NA 2 -3.90 (-5.29;-2.52) 1.30 81%
1 -2.70 (-4.44;-0.96) NA NA 4 -3.91 (-5.12;-2.69) 0.85 57%
3 1.32 (0.63; 2.00) 0.12 24% 5 0.76 (0.13;1.39) 0.24 47%
2 -2.26 (-2.82;-1.70) 0.00 0% 5 -2.07 (-2.64;-1.49) 0.15 35%
2 2.00 (1.59;2.41) 0.00 0% 3 1.35 (0.42;2.27) 0.57 86%
3 -0.98 (-1.45;-0.52) 0.03 19% 5 -1.22 (-2.20; -0.24) 0.79 67%
1 -1.10 (-1.99;-0.21) NA NA 14 -1.32 (-1.77; -0.88) 0.40 64%
1 1.70 (0.05;-3.45) NA NA 8 1.58 (0.87;-2.29) 0.43 48%
Brinzolamide vs.
2 0.22 (-0.37;0.80) 0.06 32% 2 -0.58 (-1.15;0.00) 0.00 0%
Latanoprost vs.
1 -0.30 (-1.03;0.43) NA NA 7 -0.22 (-0.86; 0.41) 0.33 48%
Dorzolamide
Travoprost
Dorzolamide
Bimatoprost
Unoprostone
Travoprost
Latanoprost
Betaxolol
Pairwise comparisons based on FDA trials Pairwise comparisons based on published trials
Placebo vs.
Betaxolol
Timolol
Timolol vs.
36
8 Figures
Figure 1. Identification of trials
37
Figure 2. Network graphs
38
Figure 3. Ranking probabilities for any drug at any position in the FDA trials network
Bimatoprost Travoprost Latanoprost Timolol Brinzolamide Dorzolamide Levobetaxolol Betaxolol Unoprostone Placebo
10 0 0 0 0 0.002 0.001 0 0.148 0.016 0.832
9 0 0 0 0 0.013 0.011 0.008 0.645 0.175 0.148
8 0 0 0 0 0.049 0.068 0.098 0.124 0.646 0.014
7 0 0 0.001 0 0.124 0.27 0.461 0.036 0.106 0.003
6 0 0 0.002 0.001 0.224 0.508 0.2 0.025 0.039 0.002
5 0 0.001 0.006 0.02 0.564 0.142 0.23 0.02 0.016 0.001
4 0 0.003 0.048 0.925 0.019 0.001 0.003 0.001 0 0
3 0.005 0.367 0.57 0.053 0.004 0 0.001 0 0 0
2 0.022 0.621 0.354 0.001 0.001 0 0 0 0 0
1 0.972 0.008 0.02 0 0 0 0 0 0 0
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
10
9
8
7
6
5
4
3
2
1
39
Figure 4. Ranking probabilities for any drug at any position in the published trials network
40
Figure 5. Scatterplot of relative effect size between two treatments based on both FDA and published trials
41
Figure 6. Relative effect size based on FDA trials and published trials
42
Figure 7. Relative effect size between active drugs and placebo/timolol based on FDA trials and published trials
43
Figure 8. Percentage ‘SUCRA’ of drugs in the two networks
44
Curriculum vitae
PERSONAL
Qiyuan Shi was born on Oct. 21st 1990 in Shanghai, China.
EDUCATION
Johns Hopkins University School of Public Health
09/2013-05/2015
Degree: Master of Health Science
Concentration: Epidemiology, Clinical trials and Evidence Synthesis
Relevant Coursework: Epidemiology, Biostatistics, Systematic Review and
Meta-analysis, Statistical Computing, Clinical Trials Management,
Pharmaco-epidemiology, Survival Analysis, Longitudinal Analysis.
Fudan University School of Pharmacy
09/2009-06/2013
Major: Pharmacy
Degree: Bachelor of Science
PROFESSIONAL DEVELOPMENT
Software & Languages: Microsoft Office Suite, SAS (Certified Base Programmer),
R, STATA, ArcGIS
Typesetting: Latex
Language: English & Mandarin
PUBLIC HEALTH EXPERIENCE
Research Assistant, Cochrane Eyes and Vision Group Johns Hopkins University
Center for Clinical Trials, Department of Epidemiology, School of Public Health,
Johns Hopkins University
02/2014—present
Search, collect, manage and screen clinical research studies from different
databases
Extract and manage data from studies
Conduct qualitative and quantitative analysis
Research Assistant, Cochrane Hypertension Group Fudan University
Department of Clinical Pharmacology, School of Pharmacy, Fudan University
06/2012—06/2013
Search, collect, manage and screen clinical research studies from different
databases
45
Extract and manage data from studies
Conduct qualitative and quantitative analysis
TEACHING EXPERIENCE
Teaching assistant and lab instructor, Department of Epidemiology &
Department of Biostatistics, Johns Hopkins University
09/2014—12/2014
INTERNSHIP EXPERIENCE
Pharmacist
Zhongshan Hospital, Shanghai, China
07/2012-12/2012