Gene association networks - Large-scale integration of data and text

72
Gene association networks Large-scale integration of data and text Lars Juhl Jensen

Transcript of Gene association networks - Large-scale integration of data and text

Page 1: Gene association networks - Large-scale integration of data and text

Gene association networks

Large-scale integration of data and text

Lars Juhl Jensen

Page 2: Gene association networks - Large-scale integration of data and text

9.6 million genes

Page 3: Gene association networks - Large-scale integration of data and text

association network

Page 4: Gene association networks - Large-scale integration of data and text

guilt by association

Page 5: Gene association networks - Large-scale integration of data and text
Page 6: Gene association networks - Large-scale integration of data and text

genomic context

Page 7: Gene association networks - Large-scale integration of data and text

gene fusion

Page 8: Gene association networks - Large-scale integration of data and text

Korbel et al., Nature Biotechnology, 2004

Page 9: Gene association networks - Large-scale integration of data and text

phylogenetic profiles

Page 10: Gene association networks - Large-scale integration of data and text

Korbel et al., Nature Biotechnology, 2004

Page 11: Gene association networks - Large-scale integration of data and text

experimental data

Page 12: Gene association networks - Large-scale integration of data and text

gene coexpression

Page 13: Gene association networks - Large-scale integration of data and text
Page 14: Gene association networks - Large-scale integration of data and text

physical interactions

Page 15: Gene association networks - Large-scale integration of data and text

Jensen & Bork, Science, 2008

Page 16: Gene association networks - Large-scale integration of data and text

curated knowledge

Page 17: Gene association networks - Large-scale integration of data and text

protein complexes

Page 18: Gene association networks - Large-scale integration of data and text
Page 19: Gene association networks - Large-scale integration of data and text

pathways

Page 20: Gene association networks - Large-scale integration of data and text

Letunic & Bork, Trends in Biochemical Sciences, 2008

Page 21: Gene association networks - Large-scale integration of data and text

many databases

Page 22: Gene association networks - Large-scale integration of data and text

different formats

Page 23: Gene association networks - Large-scale integration of data and text

different identifiers

Page 24: Gene association networks - Large-scale integration of data and text

variable quality

Page 25: Gene association networks - Large-scale integration of data and text

not comparable

Page 26: Gene association networks - Large-scale integration of data and text

hard work

Page 27: Gene association networks - Large-scale integration of data and text

(Ph.D. students)

Page 28: Gene association networks - Large-scale integration of data and text

parsers

Page 29: Gene association networks - Large-scale integration of data and text

mapping files

Page 30: Gene association networks - Large-scale integration of data and text

quality scores

Page 31: Gene association networks - Large-scale integration of data and text

affinity purification

Page 32: Gene association networks - Large-scale integration of data and text

von Mering et al., Nucleic Acids Research, 2005

Page 33: Gene association networks - Large-scale integration of data and text

score calibration

Page 34: Gene association networks - Large-scale integration of data and text

gold standard

Page 35: Gene association networks - Large-scale integration of data and text

von Mering et al., Nucleic Acids Research, 2005

Page 36: Gene association networks - Large-scale integration of data and text

implicit weighting by quality

Page 37: Gene association networks - Large-scale integration of data and text

common scale

Page 38: Gene association networks - Large-scale integration of data and text

cross-species transfer

Page 39: Gene association networks - Large-scale integration of data and text

Franceschini et al., Nucleic Acids Research, 2013

Page 40: Gene association networks - Large-scale integration of data and text

missing most of the data

Page 41: Gene association networks - Large-scale integration of data and text

>10 km

Page 42: Gene association networks - Large-scale integration of data and text

too much to read

Page 43: Gene association networks - Large-scale integration of data and text

text mining

Page 44: Gene association networks - Large-scale integration of data and text

comprehensive lexicon

Page 45: Gene association networks - Large-scale integration of data and text

cyclin dependent kinase 1

Page 46: Gene association networks - Large-scale integration of data and text

CDC2

Page 47: Gene association networks - Large-scale integration of data and text

orthographic variation

Page 48: Gene association networks - Large-scale integration of data and text

spaces and hyphens

Page 49: Gene association networks - Large-scale integration of data and text

cyclin dependent kinase 1

Page 50: Gene association networks - Large-scale integration of data and text

cyclin-dependent kinase 1

Page 51: Gene association networks - Large-scale integration of data and text

prefixes and suffixes

Page 52: Gene association networks - Large-scale integration of data and text

CDC2

Page 53: Gene association networks - Large-scale integration of data and text

hCdc2

Page 54: Gene association networks - Large-scale integration of data and text

“black list”

Page 55: Gene association networks - Large-scale integration of data and text

SDS

Page 56: Gene association networks - Large-scale integration of data and text

co-mentioning

Page 57: Gene association networks - Large-scale integration of data and text

counting

Page 58: Gene association networks - Large-scale integration of data and text

within documents

Page 59: Gene association networks - Large-scale integration of data and text

within paragraphs

Page 60: Gene association networks - Large-scale integration of data and text

within sentences

Page 61: Gene association networks - Large-scale integration of data and text

quality scores

Page 62: Gene association networks - Large-scale integration of data and text

score calibration

Page 63: Gene association networks - Large-scale integration of data and text

cross-species transfer

Page 64: Gene association networks - Large-scale integration of data and text

combine all evidence

Page 65: Gene association networks - Large-scale integration of data and text

Szklarczyk et al., Nucleic Acids Research, 2015string-db.org

Page 66: Gene association networks - Large-scale integration of data and text

web resource

Page 67: Gene association networks - Large-scale integration of data and text

download files

Page 68: Gene association networks - Large-scale integration of data and text

REST API

Page 69: Gene association networks - Large-scale integration of data and text

Bioconductor package

Page 70: Gene association networks - Large-scale integration of data and text

Cytoscape App

Page 71: Gene association networks - Large-scale integration of data and text

AcknowledgmentsDamian Szklarczyk

Michael KuhnAndrea Franceschini

Milan SimonovicAlexander Roth

Sune Pletscher-FrankildJohn “Scooter” MorrisChristian von Mering

Peer Bork

Page 72: Gene association networks - Large-scale integration of data and text

Unacknowledgments

Do yourself a favor, don’t fly