An Open Data API for Clinical Trials

Post on 13-Apr-2017

439 views 0 download

Transcript of An Open Data API for Clinical Trials

An Open Data API for Clinical Trials

2

myTomorrows is a global healthcare platform for clinical trials and early access programs, and facilitates access to drugs in deelopment for physicians and their patients.

www.mytomorrows.com

@mytomorrows

API Documentation:

https://mytomorrows.app.box.com/v/public-api/1/11231216095/94441848186/1

3

Clinical Trials API

The myTomorrows clinical trials API is an online service providing access to a central database of worldwide medical trial and study data.

The API is provided as a public web service and research tool for application developers, healthcare professionals, and domain experts working with clinical trials.

4

Agenda

Trial API Overview

What, Why, How?

API Goals and Solutions

API Design Principles

Trial Search Engine (MeSH)

Examples

5

API Fact Sheet

Public & Accessible: no registration required; get started with a web browser

Open web technology: HTTP + REST + JSON + Schema

Cloud infrastructure: hosted on Amazon Web Services for uptime and autoscaling

Versioned: legacy support for API clients; no surprises, no breaking changes

Comprehensive Data: 16 WHO global trial databases- 300k Clinical Trials- 447k Interventions- 275k Contacts, Sponsors, and Facilities- 21k Diseases- 185 Countries

Fresh Data: full Imports run on daily basis

6

The myTomorrows Mission

Short Term:

Improve access to treatment options Improve standards of care

Long Term:

Understanding the worldwide landscape of clinical trials is a necessary condition for these goals.

7

How Can an API Help?

Access to global, harmonized, healthcare data is a catalyst for research and new insights.

Search trends reveal interest in unmet medical needs and unknown patient populations.

Technical support for “Meaningful Use” policies and practices.

An API is a way to give a much wider group of people access to and information on clinical trials and population health trends.

8

What is an “Open” API?

Non-commercial and free to use.

Datasets can be republished downstream.

Based on open community standards and conventions.

The myTomorrows clinical trials API promotes several open data principles:

9

Use Case: Trial Discoverability

Clinical trials suffer from major discoverability problems.

Solving these problems means doctors can consider more treatment options for their patients.

An openAPI is a means to generally improve discoverability.

An API is a solution for access to information.

10

Master Data Management

Schema Mapping

Data Consolidation

Source Identification & Attribution

De-duplication

Error Detection & Correction

The medical trials API provides a standards-based model for formalizing trial registration data. Various techniques are used:

11

Industry-Level Pressure on Data Exchange

Trial Reporting in ClinicalTrials.gov — The Final Rulehttp://www.nejm.org/doi/pdf/10.1056/NEJMsr1611785

US toughens rules for clinical-trial transparencyhttp://www.nature.com/news/us-toughens-rules-for-clinical-trial-transparency-1.20616

NIH aims to beef up clinical trial design as part of new data sharing ruleshttp://www.sciencemag.org/news/2016/09/nih-aims-beef-clinical-trial-design-part-new-data-sharing-rules

New federal rules target woeful public reporting of clinical trial resultshttps://www.statnews.com/2016/09/16/clinical-trials-reporting-rules

Especially in the US there is pressure to improving clinical trial studies and data publication.

12

Design Principles

Favour human-readable content (optimize for access, scrutiny, portability)

Decompose data models and schema (optimize for loose coupling, small interfaces)

Model schemas on conventions in active use, e.g. PDED, TRDS (optimize for collaboration, familiarity)

Provide simple query model (optimize for common use cases, still allow difficult queries)

Interoperability between healthcare systems and data these systems contain still a primary challenge.

The myTomorrows clinical trials API is heavily informed by domain-driven design principles and pragmatic interests, e.g.

13

Use Case: Active Trials

The simple question - “what are the active clinical trials in the world?” - is not possible to answer without interpreting trial register data.

For example, some trial statuses found in registers which could be construed to mean that a given trial is currently running include:

“active”, “available”, “on-going”, “completed, no longer recruiting”, “underway”, “in practice”, “recruitment completed”, “inclusion stopped”

The myTomorrows API provides an abstraction layer that normalizes concerns like model schema and vocabulary.

14

Leaky Abstractions

1. Be as clear and familiar as possible while remaining general enough to be broadly applicable.

2. Be as specific as possible without compromising future change.

Due to various kinds of fragmentation (market, geographic, stakeholders, etcetera) this is one of the dominant problems of healthtech.

Some level of leaking is necessary due to focus and specialism, and some level generalization should be avoided due to verbosity, complexity.

Developing the clinical trials API, our middle-ground approach was to:

15

Data Mapping

1. ClinicalTrials.gov Protocol Data Element Definitionshttps://prsinfo.clinicaltrials.gov/definitions.html

2. WHO Trial Registration Data Sethttp://www.who.int/ictrp/network/trds/en/

Two prevailing schemas for trial register data that inform the API design:

API schemas extended to support custom trial register datasets:

3. Trial Register Custom Datasets (e.g. study design, data monitoring and ethics committees, etcetera)

PDED + TRDS + WHO Partner Network

16

The Importance of Data Mapping

1. Data outlives APIs

2. APIs outlive applications

The most important design decisions happen close to the data. Leaky abstractions addressed on this level.

17

Clinical Trial Datasources

Medical trial data is imported daily from the 16 World Health Organization Primary Partner registries:

http://www.who.int/ictrp/network/primary/en/

The Australian New Zealand Clinical Trials Registry (ANZCTR), The Chinese Clinical Trial Register (ChiCTR), The Clinical Research Information Service (CRIS), The Clinical Trials Registry- India (CTRI), The German Clinical Trials Register (DRKS), The EU Clinical Trials Register (EudraCT), The Iranian Registry of Clinical Trials (IRCT), The International Clinical Trials Registry Platform (ICTRP), Japic CTI (JAPIC), ClinicalTrials.gov, The Nederlands Trial Register (NTR), The Pan African Clinical Trials Registry (PACTR), The Brazilian Clinical Trials Registry (ReBec), The Sri Lanka Clinical Trials Registry (SLCTR), The Thai Clinical Trials Registry (TCTR), and The University Hospital Medical Information Network Clinical Trials Registry (UMIN-CTR).

18

Clinical Trial Search

Main API use case is searching trials, e.g. eligibility matching, medical research.

Two core search features:

Easy to filter patient options with general information, possible to explore related treatments

MeSH (Medical Subject Headings) condition search

Trial search filters (gender, location, dates, etcetera)

19

MeSH

“The Medical Subject Headings (MeSH®) thesaurus is a controlled vocabulary produced by the National Library of Medicine and used for indexing, cataloging, and searching for biomedical and health-related information and documents.”

https://www.nlm.nih.gov/mesh/

The API links clinical trial data through diseases as identified with MeSH terms.

Searches can be more or less specific depending on disease area.

20

Trial Search Filters

Study design, intervention model

Titles, medical codes, universal trial number, secondary ids

Inclusion / exclusion criteria

Country, study sites

Interventions, drug codes

Ethics committees, review boards

Use cases include: trial eligibility and availability, finding specialist practitioners and clinics, upcoming enrollment, etcetera

Example searchable data (from 30+ distinct filters) includes:

21

URLs: The Bridge Language

1. REST is over-engineered, FHIR is over-engineered!

2. Every healthcare knowledge worker knows how to manipulate URLs

Be like the Elasticsearch of healthcare data, but with *meaningful* schema and query models

Make the API accessible to domain experts:

22

Ideal API Users

Physicians / Medical Institutions

Clinical Scientists

Academic Researchers

Patient Advocates

Healthcare Technology Developers

Contract Research Organizations

23

Example 1: Pediatric Medicine

Status of the Pediatric Clinical Trials Enterprise

http://pediatrics.aappublications.org/content/early/2012/09/26/peds.2011-3565

Performing study with clinical trials API query:

https://api.mytomorrows.com/api/v1/medical_trials.json?max_age=16&country=US&type=interventional&max_enrollment=100&sponsor=NIH

24

Example 2: Meaningful Evidence

Large-scale analysis finds majority of clinical trials don't provide meaningful evidence

https://www.sciencedaily.com/releases/2012/05/120501162702.htm

Performing study with clinical trials API query:

https://api.mytomorrows.com/api/v1/medical_trials.json?max_enrollment=100

VS.

https://api.mytomorrows.com/api/v1/medical_trials.json?max_enrollment=1000

25

Example 3: Compliance With Mandatory Reporting

Compliance with mandatory reporting of clinical trial results on ClinicalTrials.gov: cross sectional study

http://www.bmj.com/content/344/bmj.d7373

Performing study with clinical trials API query:

https://api.mytomorrows.com/api/v1/medical_trials.json?country=US&type=interventional&end_from=2009-01-01&end_to=2009-12-31&fda=true&phase=-1

26

Questions?

Please get in touch if you have any feedback, feature requests, data exchange proposals, datasource recommendations / requests, or bug reports. Thanks!

tech@mytomorrows.com

API Documentationhttps://mytomorrows.app.box.com/v/public-api/1/11231216095/94441848186/1

Terms of Servicehttps://mytomorrows.app.box.com/v/public-api/1/11231216095/94310908387/1