Friend harvard 2013-01-30

65
If not "Harnessing the power of teams to build better models of disease in real time"

description

Stephen Friend, Jan 30, 2013. At HMS Genetics Center for Cancer Systems Biology, Boston, MA

Transcript of Friend harvard 2013-01-30

Page 1: Friend harvard 2013-01-30

If not

"Harnessing the power of teams to build better models of disease in real time"

Page 2: Friend harvard 2013-01-30

Examples:

Expression Profiles

Page 3: Friend harvard 2013-01-30

2000

Page 4: Friend harvard 2013-01-30
Page 5: Friend harvard 2013-01-30

Examples:

DNA Alterations

Page 6: Friend harvard 2013-01-30
Page 7: Friend harvard 2013-01-30
Page 8: Friend harvard 2013-01-30

Examples:

Proteomics

Page 9: Friend harvard 2013-01-30
Page 10: Friend harvard 2013-01-30
Page 11: Friend harvard 2013-01-30

Examples:

Synthetic Lethal Screens

Page 12: Friend harvard 2013-01-30
Page 13: Friend harvard 2013-01-30
Page 14: Friend harvard 2013-01-30

Examples:

Network Models

Page 15: Friend harvard 2013-01-30
Page 16: Friend harvard 2013-01-30

Examples:

Drugs and Trials

Page 17: Friend harvard 2013-01-30

PARP IGF1-R m-TOR VEGF-R Wee-1

Page 18: Friend harvard 2013-01-30

Reality: Overlapping Pathways

Page 19: Friend harvard 2013-01-30
Page 20: Friend harvard 2013-01-30
Page 21: Friend harvard 2013-01-30

• alchemist

Page 22: Friend harvard 2013-01-30

Examples

Mutations

Page 23: Friend harvard 2013-01-30

How often are we hurt by going from the particular to the general

in very complex systems driven by context?

Is this going from the particular to the general a central problem in

Hypothesis Driven Biomedical Research?

How often do we inappropriately praise findings that go on to have awkward adjacencies?

Page 24: Friend harvard 2013-01-30

.

Page 25: Friend harvard 2013-01-30

TENURE FEUDAL STATES

Page 26: Friend harvard 2013-01-30

What could be done by us?

Page 27: Friend harvard 2013-01-30

BUILDING PRECISION MEDICINE

Extensions of Current Institutions

Proprietary Short term Solutions

Open Systems of Sharing in a Commons

Page 28: Friend harvard 2013-01-30

NRNB Investigators Trey Ideker, PhD Principal Investigator, NRNB Departments of Medicine and Bioengineering University of California, San Diego

Dr. Ideker uses genome-scale measurements to construct network models of DNA damage response and cancer. He was the 2009 recipient of the Overton Prize from the International Society for Computational Biology.

Gary Bader, PhD Assistant Professor, Terrence Donnelly Centre for Cellular & Biomolecular Research University of Toronto

Dr. Bader works on biological network analysis and pathway information resources.

Alex Pico, PhD Executive Director, NRNB Gladstone Institute of Cardiovascular Disease Staff Research Scientist University of California, San Francisco

Dr. Pico develops software tools and resources that help analyze, visualize and explore biomedical data in the context of these networks

James Fowler, PhD Associate Professor, CalIT2 Center for Wireless & Population Health Systems and Political Science University of California, San Diego

Dr. Fowler’s research concerns social networks, behavioral economics, evolutionary game theory, and genopolitics (the study of the genetic basis of political behavior). His research on social networks has been featured in Time’s Year in Medicine.

Chris Sander, PhD Chair, Computational Biology Center, Tri-Institutional Professor Memorial Sloan-Kettering Cancer Center

Dr. Sander’s research focuses on Computational and Systems Biology of molecules, pathways, and processes.

Benno Schwikowski, PhD Chef du Laboratoire/Group Leader Pasteur Institute

Dr. Schwikowski’s expertise lies in combinatorial algorithms for Computational and Systems Biology.

Overview Technology Software Collabs Outreach Plans

Page 29: Friend harvard 2013-01-30

The National Resource for Network Biology: Integrating genomes & networks to understand health & disease

Patient genotype Genome sequencing

1) How to assemble and visualize network models of the cell?

2) How to use networks in healthcare?

Phenotype Disease diagnosis

Response to therapy/drug Side effects

Developmental outcome Rate of aging, etc.

Gene expression & other large scale molecular state measurements

NIH NCRR / NIGMS P41 GM103504

Draft Network Assembly

Page 30: Friend harvard 2013-01-30
Page 31: Friend harvard 2013-01-30
Page 32: Friend harvard 2013-01-30

Now possible to generate massive amount of human “omic’s” data

Page 33: Friend harvard 2013-01-30

Network Modeling Approaches for Diseases are emerging

Page 34: Friend harvard 2013-01-30

IT Infrastructure and Cloud compute capacity allows a generative open approach to solving problems

Page 35: Friend harvard 2013-01-30

Nascent Movement for patients to Control Sensitive information allowing sharing

Page 36: Friend harvard 2013-01-30

Open Social Media allows citizens and experts to use gaming to solve problems

Page 37: Friend harvard 2013-01-30

1- Now possible to generate massive amount of human “omic’s” data 2-Network Modeling Approaches for Diseases are emerging 3- IT Infrastructure and Cloud compute capacity allows a generative open approach to biomedical problem solving 4-Nascent Movement for patients to Control Sensitive information allowing sharing 5- Open Social Media allows citizens and experts to use gaming to solve problems

A HUGE OPPORTUNITY -- A HUGE RESPONSIBILITY

Page 38: Friend harvard 2013-01-30

We focus on a world where biomedical research is about to fundamentally change. We think it will be often conducted in an open, collaborative way where teams of teams far beyond the current guilds of experts will contribute to making better, faster, relevant discoveries

Page 39: Friend harvard 2013-01-30

Better Models of

Disease:

KNOWLEDGE

NETWORK

Techn

olo

gy P

latform

Rewards/Challenges

Imp

actf

ul M

od

els

Governance

Page 40: Friend harvard 2013-01-30

Two recurring problems in Alzheimer’s disease research

40

Ambiguous pathology Are disease-associated molecular systems & genes destructive, adaptive, or both? Bottom line: We need to identify causal factors vs correlative or adaptive features of disease.

Diverse mechanisms How do diverse mutations and environmental factors combine into a core pathology? Bottom line: There is no rigorous / consistent global framework that integrates diverse disease factors.

Page 41: Friend harvard 2013-01-30

Identifying key disease systems and genes- Gaiteri et al.

Example “modules” of coexpressed genes, color-coded

1.) Identify groups of genes that move together – coexpressed “modules” - correlated expression of multiple genes across many patients - coexpression calculated separately for Disease/healthy groups - these gene groups are often coherent cellular subsystems, enriched in one or more GO functions

Page 42: Friend harvard 2013-01-30

1.) Identify groups of genes that move together – coexpressed “modules” 2.) Prioritize the disease-relevance of the modules by clinical and network measures

Prioritize modules through expression synchrony with clinical measures or tendency too reconfigure themselves in disease

vs

Identifying key disease systems and genes

Page 43: Friend harvard 2013-01-30

Infer directed/causal relationships and clear hierarchical structure by incorporating eSNP information (no hair-balls here)

vs

Prioritize modules through expression synchrony with clinical measures or tendency too reconfigure themselves in disease

Identifying key disease systems and genes

1.) Identify groups of genes that move together – coexpressed “modules” 2.) Prioritize the disease-relevance of the modules by clinical and network measures 3.) Incorporate genetic information to find directed relationships between genes

Page 44: Friend harvard 2013-01-30

Example network finding: microglia activation in AD

Module selection – what identifies these modules as relevant to Alzheimer’s disease?

The eigengene of a module of ~400 probes correlates with Braak score, age, cognitive disease severity and cortical atrophy. Members of this module are on average differentially expressed (both up- and down-regulated).

Evidence these modules are related to microglia function

The members of this module are enriched with GO categories (p<.001) such as “response to biotic stimulus” that are indicative of immunologic function for this module. The microglia markers CD68 and CD11b/ITGAM are contained in the module (this is rare – even when a module appears to represent a specific cell-type, the histological markers may be lacking). Numerous key drivers (SYK, TREM2, DAP12, FC1R, TLR2) are important elements of microglia signaling.

Alzgene hits found in co-regulated microglia module:

Page 45: Friend harvard 2013-01-30

Figure key:

Five main immunologic families found in Alzheimer’s-associated module Square nodes in surrounding network denote literature-supported nodes. Node size is proportional to connectivity in the full module.

(Interior circle) Width of connections between 5 immune families are linearly scaled to the number of inter-family connections.

Labeled nodes are either highly connected in the original network, implicated by at least 2 papers as associated with Alzheimer’s disease, or core members of one of the 5 immune families.

Core family members are shaded.

Page 46: Friend harvard 2013-01-30

Transforming networks into biological hypotheses

Page 47: Friend harvard 2013-01-30

Testing network-based hypotheses

Page 48: Friend harvard 2013-01-30

Design-stage AD projects at Sage

Fusing our expertise in…

Join us in uniting genes, circuits and regions to build multi-scale biophysical disease models. Contact [email protected]

Diffusion Spectrum Imaging

Microcircuits & neuronal diversity

Gene regulatory networks

Feed

back

Page 49: Friend harvard 2013-01-30

PORTABLE LEGAL CONSENT Control of Private information by Citizens allows sharing

weconsent.us

John Wilbanks

• Online educational wizard • Tutorial video • Legal Informed Consent Document • Profile registration • Data upload

John Wilbanks TED Talk “Let’s pool our medical data” weconsent.us

Page 50: Friend harvard 2013-01-30

two approaches to building common scientific knowledge

Text summary of the completed project

Assembled after the fact

Every code change versioned

Every issue tracked

Every project the starting point for new work

All evolving and accessible in real time

Social Coding

Page 51: Friend harvard 2013-01-30

Synapse is GitHub for Biomedical Data

• Data and code versioned

• Analysis history captured in real time

• Work anywhere, and share the results with anyone

• Social/Interactive Science

• Every code change versioned

• Every issue tracked

• Every project the starting point for new work

• Social/Interactive Coding

Page 52: Friend harvard 2013-01-30

Data Analysis with Synapse

Run Any Tool

On Any Platform

Record in Synapse

Share with Anyone

Page 53: Friend harvard 2013-01-30

“Synapse is a nascent compute platform for transparent, reproducible, and modular collaborative research.”

Page 54: Friend harvard 2013-01-30

Currently at 16K+ datasets and ~1M models

Page 55: Friend harvard 2013-01-30

Download analysis and meta-analysis

Download another Cluster Result Download Evaluation and view more stats

• Perform Model averaging

• Compare/contrast models

• Find consensus clusters

• Visualize in Cytoscape

Page 56: Friend harvard 2013-01-30

Pancancer collaborative subtype discovery

Page 57: Friend harvard 2013-01-30

Objective assessment of factors influencing model

performance (>1 million predictions evaluated)

Sanger CCLE Prediction accuracy

improved by…

Not discretizing data

Including expression data

Elastic net regression

130 compounds 24 compounds

Cro

ss v

alid

atio

n p

red

icti

on

acc

ura

cy (

R2)

In Sock Jang

Page 58: Friend harvard 2013-01-30
Page 59: Friend harvard 2013-01-30

Sage-DREAM Breast Cancer Prognosis Challenge #1 Building better disease models together

154 participants; 27 countries

334 participants; >35 countries

>500 models posted to Leaderboard

breast cancer data

Challenge Launch: July 17

Sep 26 Status

Caldos/Aparicio

Page 60: Friend harvard 2013-01-30

How to accelerate and make affordable the efforts required to build better

models of disease ?

Page 61: Friend harvard 2013-01-30
Page 62: Friend harvard 2013-01-30
Page 63: Friend harvard 2013-01-30

(Nolan and Haussler)

THE FEDERATION

Schadt Ideker Friend Califano Nolan Vidal

Page 64: Friend harvard 2013-01-30

How to incent the joint evolution of ideas in a rapid learning space- prepublication?

How to fund where data generators and analysts are not always the same people- repeatedly?

Should we consider

Centralized Guilds vs Distributed Dynamic Teams?

Page 65: Friend harvard 2013-01-30

If not