Data integration: The STITCH database of protein–small molecule interactions

175
Data integration The STITCH database of protein–small molecule interactions Lars Juhl Jensen

description

Chemoinformatics Course, Technical University of Denmark, Lyngby, Denmark, November 19, 2009.

Transcript of Data integration: The STITCH database of protein–small molecule interactions

Page 1: Data integration: The STITCH database of protein–small molecule interactions

Data integrationThe STITCH database of protein–small molecule interactions

Lars Juhl Jensen

Page 2: Data integration: The STITCH database of protein–small molecule interactions

Kuhn et al., Nucleic Acids Research, 2010

Page 3: Data integration: The STITCH database of protein–small molecule interactions

functional associations

Page 4: Data integration: The STITCH database of protein–small molecule interactions

protein–small molecule

Page 5: Data integration: The STITCH database of protein–small molecule interactions

protein–protein

Page 6: Data integration: The STITCH database of protein–small molecule interactions

parts lists

Page 7: Data integration: The STITCH database of protein–small molecule interactions

>2.5 million proteins

Page 8: Data integration: The STITCH database of protein–small molecule interactions

630 genomes

Page 9: Data integration: The STITCH database of protein–small molecule interactions

many databases

Page 10: Data integration: The STITCH database of protein–small molecule interactions

different formats

Page 11: Data integration: The STITCH database of protein–small molecule interactions

model organism databases

Page 12: Data integration: The STITCH database of protein–small molecule interactions

Ensembl

Page 13: Data integration: The STITCH database of protein–small molecule interactions

RefSeq

Page 14: Data integration: The STITCH database of protein–small molecule interactions

PubChem compounds

Page 15: Data integration: The STITCH database of protein–small molecule interactions

>74,000 small molecules

Page 16: Data integration: The STITCH database of protein–small molecule interactions

curated knowledge

Page 17: Data integration: The STITCH database of protein–small molecule interactions

complexes

Page 18: Data integration: The STITCH database of protein–small molecule interactions

pathways

Page 19: Data integration: The STITCH database of protein–small molecule interactions

Letunic & Bork, Trends in Biochemical Sciences, 2008

Page 20: Data integration: The STITCH database of protein–small molecule interactions

high confidence

Page 21: Data integration: The STITCH database of protein–small molecule interactions

many databases

Page 22: Data integration: The STITCH database of protein–small molecule interactions

MIPSMunich Information center

for Protein Sequences

Page 23: Data integration: The STITCH database of protein–small molecule interactions

Gene Ontology

Page 24: Data integration: The STITCH database of protein–small molecule interactions

KEGGKyoto Encyclopedia of Genes and Genomes

Page 25: Data integration: The STITCH database of protein–small molecule interactions

MetaCyc

Page 26: Data integration: The STITCH database of protein–small molecule interactions

PIDNCI-Nature Pathway Interaction Database

Page 27: Data integration: The STITCH database of protein–small molecule interactions

Reactome

Page 28: Data integration: The STITCH database of protein–small molecule interactions

different formats

Page 29: Data integration: The STITCH database of protein–small molecule interactions

different identifiers

Page 30: Data integration: The STITCH database of protein–small molecule interactions

partially redundant

Page 31: Data integration: The STITCH database of protein–small molecule interactions

interaction data

Page 32: Data integration: The STITCH database of protein–small molecule interactions

protein–small molecule

Page 33: Data integration: The STITCH database of protein–small molecule interactions

in vitro binding assays

Page 34: Data integration: The STITCH database of protein–small molecule interactions

protein–protein

Page 35: Data integration: The STITCH database of protein–small molecule interactions

yeast two-hybrid

Page 36: Data integration: The STITCH database of protein–small molecule interactions

affinity purification

Page 37: Data integration: The STITCH database of protein–small molecule interactions

fragment complementation

Page 38: Data integration: The STITCH database of protein–small molecule interactions

Jensen & Bork, Science, 2008

Page 39: Data integration: The STITCH database of protein–small molecule interactions

genetic interactions

Page 40: Data integration: The STITCH database of protein–small molecule interactions

Beyer et al., Nature Reviews Genetics, 2007

Page 41: Data integration: The STITCH database of protein–small molecule interactions

gene coexpression

Page 42: Data integration: The STITCH database of protein–small molecule interactions
Page 43: Data integration: The STITCH database of protein–small molecule interactions

many databases

Page 44: Data integration: The STITCH database of protein–small molecule interactions

BindingDB

Page 45: Data integration: The STITCH database of protein–small molecule interactions

CTDComparative Toxicogenomics Database

Page 46: Data integration: The STITCH database of protein–small molecule interactions

DrugBank

Page 47: Data integration: The STITCH database of protein–small molecule interactions

GLIDAGPCR-Ligand Database

Page 48: Data integration: The STITCH database of protein–small molecule interactions

PDSP KiPsycoactive Drug Screening Program

Page 49: Data integration: The STITCH database of protein–small molecule interactions

PharmGKBPharmacogenomics Knowledge Base

Page 50: Data integration: The STITCH database of protein–small molecule interactions

BINDBiomolecular Interaction Network Database

Page 51: Data integration: The STITCH database of protein–small molecule interactions

BioGRIDGeneral Repository for Interaction Datasets

Page 52: Data integration: The STITCH database of protein–small molecule interactions

DIPDatabase of Interacting Proteins

Page 53: Data integration: The STITCH database of protein–small molecule interactions

IntAct

Page 54: Data integration: The STITCH database of protein–small molecule interactions

MINTMolecular Interactions Database

Page 55: Data integration: The STITCH database of protein–small molecule interactions

HPRDHuman Protein Reference Database

Page 56: Data integration: The STITCH database of protein–small molecule interactions

PDBProtein Data Bank

Page 57: Data integration: The STITCH database of protein–small molecule interactions

GEOGene Expression Omnibus

Page 58: Data integration: The STITCH database of protein–small molecule interactions

different formats

Page 59: Data integration: The STITCH database of protein–small molecule interactions

different identifiers

Page 60: Data integration: The STITCH database of protein–small molecule interactions

partially redundant

Page 61: Data integration: The STITCH database of protein–small molecule interactions

literature mining

Page 62: Data integration: The STITCH database of protein–small molecule interactions

>10 km

Page 63: Data integration: The STITCH database of protein–small molecule interactions

human readable

Page 64: Data integration: The STITCH database of protein–small molecule interactions

not computer readable

Page 65: Data integration: The STITCH database of protein–small molecule interactions

different names

Page 66: Data integration: The STITCH database of protein–small molecule interactions

text corpus

Page 67: Data integration: The STITCH database of protein–small molecule interactions

MEDLINE

Page 68: Data integration: The STITCH database of protein–small molecule interactions

SGDSaccharomyces Genome Database

Page 69: Data integration: The STITCH database of protein–small molecule interactions

The Interactive Fly

Page 70: Data integration: The STITCH database of protein–small molecule interactions

OMIMOnline Mendelian Inheritance in Man

Page 71: Data integration: The STITCH database of protein–small molecule interactions

dictionary

Page 72: Data integration: The STITCH database of protein–small molecule interactions

co-mentioning

Page 73: Data integration: The STITCH database of protein–small molecule interactions

NLPNatural Language Processing

Page 74: Data integration: The STITCH database of protein–small molecule interactions
Page 75: Data integration: The STITCH database of protein–small molecule interactions

restricted access

Page 76: Data integration: The STITCH database of protein–small molecule interactions

genomic context

Page 77: Data integration: The STITCH database of protein–small molecule interactions

gene fusion

Page 78: Data integration: The STITCH database of protein–small molecule interactions

Korbel et al., Nature Biotechnology, 2004

Page 79: Data integration: The STITCH database of protein–small molecule interactions

conserved neighborhood

Page 80: Data integration: The STITCH database of protein–small molecule interactions

operons

Page 81: Data integration: The STITCH database of protein–small molecule interactions

Korbel et al., Nature Biotechnology, 2004

Page 82: Data integration: The STITCH database of protein–small molecule interactions

bidirectional promoters

Page 83: Data integration: The STITCH database of protein–small molecule interactions

Korbel et al., Nature Biotechnology, 2004

Page 84: Data integration: The STITCH database of protein–small molecule interactions

phylogenetic profiles

Page 85: Data integration: The STITCH database of protein–small molecule interactions

Korbel et al., Nature Biotechnology, 2004

Page 86: Data integration: The STITCH database of protein–small molecule interactions

integration

Page 87: Data integration: The STITCH database of protein–small molecule interactions

many data types

Page 88: Data integration: The STITCH database of protein–small molecule interactions

not comparable

Page 89: Data integration: The STITCH database of protein–small molecule interactions

variable quality

Page 90: Data integration: The STITCH database of protein–small molecule interactions

spread over 630 genomes

Page 91: Data integration: The STITCH database of protein–small molecule interactions

quality scores

Page 92: Data integration: The STITCH database of protein–small molecule interactions

reproducibility

Page 93: Data integration: The STITCH database of protein–small molecule interactions

von Mering et al., Nucleic Acids Research, 2005

Page 94: Data integration: The STITCH database of protein–small molecule interactions

intergenic distances

Page 95: Data integration: The STITCH database of protein–small molecule interactions

Korbel et al., Nature Biotechnology, 2004

Page 96: Data integration: The STITCH database of protein–small molecule interactions

benchmarking

Page 97: Data integration: The STITCH database of protein–small molecule interactions

calibrate vs. gold standard

Page 98: Data integration: The STITCH database of protein–small molecule interactions

von Mering et al., Nucleic Acids Research, 2005

Page 99: Data integration: The STITCH database of protein–small molecule interactions

raw quality scores

Page 100: Data integration: The STITCH database of protein–small molecule interactions

probabilistic scores

Page 101: Data integration: The STITCH database of protein–small molecule interactions

orthology transfer

Page 102: Data integration: The STITCH database of protein–small molecule interactions

von Mering et al., Nucleic Acids Research, 2005

Page 103: Data integration: The STITCH database of protein–small molecule interactions

combine all evidence

Page 104: Data integration: The STITCH database of protein–small molecule interactions
Page 105: Data integration: The STITCH database of protein–small molecule interactions

Acknowledgments

Michael Kuhn

Monica Campillos

Christian von Mering

Manuel Stark

Samuel Chaffron

Philippe Julien

Tobias Doerks

Jan Korbel

Berend Snel

Martijn Huynen

Peer Bork

Page 106: Data integration: The STITCH database of protein–small molecule interactions

Predicting novel targets for existing drugs using side effect information

Lars Juhl Jensen

Page 107: Data integration: The STITCH database of protein–small molecule interactions

the problem

Page 108: Data integration: The STITCH database of protein–small molecule interactions

new uses for old drugs

Page 109: Data integration: The STITCH database of protein–small molecule interactions

drug–drug network

Page 110: Data integration: The STITCH database of protein–small molecule interactions

shared target(s)

Page 111: Data integration: The STITCH database of protein–small molecule interactions

chemical similarity

Page 112: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 113: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 114: Data integration: The STITCH database of protein–small molecule interactions

similar drugs share targets

Page 115: Data integration: The STITCH database of protein–small molecule interactions

only trivial predictions

Page 116: Data integration: The STITCH database of protein–small molecule interactions

the idea

Page 117: Data integration: The STITCH database of protein–small molecule interactions

chemical perturbations

Page 118: Data integration: The STITCH database of protein–small molecule interactions

phenotypic readouts

Page 119: Data integration: The STITCH database of protein–small molecule interactions

drug treatment

Page 120: Data integration: The STITCH database of protein–small molecule interactions

side effects

Page 121: Data integration: The STITCH database of protein–small molecule interactions

the implementation

Page 122: Data integration: The STITCH database of protein–small molecule interactions

information on side effects

Page 123: Data integration: The STITCH database of protein–small molecule interactions

package inserts

Page 124: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 125: Data integration: The STITCH database of protein–small molecule interactions

text mining

Page 126: Data integration: The STITCH database of protein–small molecule interactions

side-effect ontology

Page 127: Data integration: The STITCH database of protein–small molecule interactions

backtracking

Page 128: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 129: Data integration: The STITCH database of protein–small molecule interactions

side-effect correlations

Page 130: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 131: Data integration: The STITCH database of protein–small molecule interactions

GSC weighting

Page 132: Data integration: The STITCH database of protein–small molecule interactions

side-effect frequencies

Page 133: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 134: Data integration: The STITCH database of protein–small molecule interactions

raw similarity score

Page 135: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 136: Data integration: The STITCH database of protein–small molecule interactions

p-values

Page 137: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 138: Data integration: The STITCH database of protein–small molecule interactions

side-effect similarity

Page 139: Data integration: The STITCH database of protein–small molecule interactions

chemical similarity

Page 140: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 141: Data integration: The STITCH database of protein–small molecule interactions

reference set

Page 142: Data integration: The STITCH database of protein–small molecule interactions

drug–target pairs

Page 143: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 144: Data integration: The STITCH database of protein–small molecule interactions

drug–drug pairs

Page 145: Data integration: The STITCH database of protein–small molecule interactions

score bins

Page 146: Data integration: The STITCH database of protein–small molecule interactions

benchmark

Page 147: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 148: Data integration: The STITCH database of protein–small molecule interactions

fit calibration function

Page 149: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 150: Data integration: The STITCH database of protein–small molecule interactions

probabilistic scores

Page 151: Data integration: The STITCH database of protein–small molecule interactions

the results

Page 152: Data integration: The STITCH database of protein–small molecule interactions

drug–drug network

Page 153: Data integration: The STITCH database of protein–small molecule interactions

ATC codes

Page 154: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 155: Data integration: The STITCH database of protein–small molecule interactions

categorization

Page 156: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 157: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 158: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 159: Data integration: The STITCH database of protein–small molecule interactions

map onto score space

Page 160: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 161: Data integration: The STITCH database of protein–small molecule interactions

the experiments

Page 162: Data integration: The STITCH database of protein–small molecule interactions

20 drug–drug relations

Page 163: Data integration: The STITCH database of protein–small molecule interactions

in vitro binding assays

Page 164: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 165: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 166: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 167: Data integration: The STITCH database of protein–small molecule interactions

Ki<10 µM for 11 of 20

Page 168: Data integration: The STITCH database of protein–small molecule interactions

cell assays

Page 169: Data integration: The STITCH database of protein–small molecule interactions

Campillos & Kuhn et al., Science, 2008

Page 170: Data integration: The STITCH database of protein–small molecule interactions

9 of 9 showed activity

Page 171: Data integration: The STITCH database of protein–small molecule interactions

the future

Page 172: Data integration: The STITCH database of protein–small molecule interactions

SIDER

Page 173: Data integration: The STITCH database of protein–small molecule interactions

integration with STITCH

Page 174: Data integration: The STITCH database of protein–small molecule interactions

Acknowledgments

Monica Campillos

Michael Kuhn

Anne-Claude Gavin

Peer Bork

Page 175: Data integration: The STITCH database of protein–small molecule interactions

larsjuhljensen