Multi-Omics Bioinformatics across Application Domains

62
Multi-Omics Bioinformatics across Application Domains Christoph Steinbeck European Bioinformatics Institute (EMBL-EBI)

Transcript of Multi-Omics Bioinformatics across Application Domains

Page 1: Multi-Omics Bioinformatics across Application Domains

Multi-Omics Bioinformatics across Application Domains

Christoph Steinbeck

European Bioinformatics Institute(EMBL-EBI)

Page 2: Multi-Omics Bioinformatics across Application Domains

The European Molecular Biology Laboratory

Structural biology

Grenoble, France

Bioinformatics

Hinxton, Cambridge, UK

Structural biology

Hamburg, Germany

Main laboratory

Heidelberg, Gemany

Monterotondo, Rome, Italy

Neuroscience

• 60 nationalities• 1800 personnel

Page 3: Multi-Omics Bioinformatics across Application Domains

The European Bioinformatics Institute

(EBI)

Page 4: Multi-Omics Bioinformatics across Application Domains

The European Bioinformatics Institute

(EBI)

Page 5: Multi-Omics Bioinformatics across Application Domains

The European Bioinformatics Institute

(EBI)

Page 6: Multi-Omics Bioinformatics across Application Domains

The European Bioinformatics Institute

(EBI)

Page 7: Multi-Omics Bioinformatics across Application Domains

A word about BrExit: EMBL member states

Austria, Belgium, Croatia, Czech Republic, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the UK

Associate member states: Argentina, Australia

Prospect member states: Hungary, Poland, Slovak Republic

Page 8: Multi-Omics Bioinformatics across Application Domains

Our funders

• EMBL-EBI is primarily funded by EMBL member states.

• Other major funders:• European Commission• Research Councils UK• National Institutes

of Health• Wellcome Trust• Industry Programme

Page 9: Multi-Omics Bioinformatics across Application Domains

Data and tools to support life-science research

www.ebi.ac.uk/services

Bioinformatics services

Page 10: Multi-Omics Bioinformatics across Application Domains

European Bioinformatics Institute (EBI)Genes, genomes & variation

Literature & ontologies Europe PubMed Central Gene Ontology Experimental Factor Ontology Molecular structures

Protein Data Bank in Europe Electron Microscopy Data Bank

European Nucleotide Archive 1000 Genomes

Gene, protein & metabolite expression

Protein sequences, families & motifs

Chemical biology

Reactions, interactions & pathways Systems

Ensembl Ensembl Genomes

European Genome-phenome Archive Metagenomics portal

Page 11: Multi-Omics Bioinformatics across Application Domains

Big data, big demand

~18.5 million requests to EMBL-EBI

websites every day

60 petabytesof EMBL-EBI storage capacity

EMBL-EBI handles

9.2 million jobs on average per month

Scientists at over

5 million unique sites use

EMBL-EBI websites

Page 12: Multi-Omics Bioinformatics across Application Domains

Post Genomic Era

Christoph Steinbeck

European Bioinformatics Institute(EMBL-EBI)

[O]ur understanding of the human genome has changed in the most fundamental ways. The small number of genes -- some 30,000 -- supports the notion that we are not hard wired. We now know the notion that one gene leads to one protein, and perhaps one disease, is false.

Craig Venter, June 2001

Page 13: Multi-Omics Bioinformatics across Application Domains

Genes are not the full story

Page 14: Multi-Omics Bioinformatics across Application Domains
Page 15: Multi-Omics Bioinformatics across Application Domains
Page 16: Multi-Omics Bioinformatics across Application Domains

Nutrition

Exercise

Disease

AgeDrugs

Environment

Phenome/ Exposome

Page 17: Multi-Omics Bioinformatics across Application Domains

The Metabolome is the most accessible and

dynamically changing Molecular Phenotype

Page 18: Multi-Omics Bioinformatics across Application Domains

Metabolomics

Measures occurrence and concentrations of many small molecules (metabolites) in an organism at once.

Page 19: Multi-Omics Bioinformatics across Application Domains

Organism Parts

Page 20: Multi-Omics Bioinformatics across Application Domains

Nuclear Magnetic Resonance (NMR)

Mass Spec

Metabolomics uses a wide-range of analytical techniques

Page 21: Multi-Omics Bioinformatics across Application Domains

Metabolomics has taken off world-wide

Page 22: Multi-Omics Bioinformatics across Application Domains
Page 23: Multi-Omics Bioinformatics across Application Domains
Page 24: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)

Page 25: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)•1.2 mio species identified and classified

Page 26: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)•1.2 mio species identified and classified•3000 - 4000 complete species genomes sequenced

Page 27: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)•1.2 mio species identified and classified•3000 - 4000 complete species genomes sequenced

Page 28: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)•1.2 mio species identified and classified•3000 - 4000 complete species genomes sequenced

What about completed metabolomes?

Page 29: Multi-Omics Bioinformatics across Application Domains

•8.7 mio eukaryotic species on earth (+- 1.3mio)•1.2 mio species identified and classified•3000 - 4000 complete species genomes sequenced

What about completed metabolomes?

Page 30: Multi-Omics Bioinformatics across Application Domains

Building upon extensive genomics research, we argue that the time is now right to focus intensively on model organism metabolomes. We propose a grand challenge for metabolomics studies of model organisms: to identify and map all metabolites onto metabolic pathways, to develop quantitative metabolic models for model organisms, and to relate organism metabolic pathways within the context of evolutionary metabolomics, i.e., phylometabolomics. These efforts should focus on a series of established model organisms in microbial, animal and plant research.

Metabolites. 2016 Feb 15;6(1)

Page 31: Multi-Omics Bioinformatics across Application Domains

Species Metabolomes are being assembled on the fly

right now through data sharing in Metabolomics

Page 32: Multi-Omics Bioinformatics across Application Domains

Experimental Repository

Reference Layer

Chemistry Spectroscopy Biology

Ana

lysi

s To

ols

Primary Literature

Primary data and Meta-Data, Spectra, Protocols, Synopses, ...

MetaboLights Database at the EBI

Page 33: Multi-Omics Bioinformatics across Application Domains

MetaboLights Database at the EBILabs around the world send us their data and

we…

Archive it

Classify itShare it with other data providers

Analyse it

…provide tools to help researchers

use it

A collaborative enterprise

Page 34: Multi-Omics Bioinformatics across Application Domains

MetaboLights Database at the EBILabs around the world send us their data and

we…

Archive it

Classify itShare it with other data providers

Analyse it

…provide tools to help researchers

use it

A collaborative enterprise

Data at the EBI can be

used freelyby anyone for any

purpose

Page 35: Multi-Omics Bioinformatics across Application Domains

MetaboLights Repository at the EBILabs around the world send us their data and

we…

Archive it

Classify itShare it with other data providers

Analyse it

…provide tools to help researchers

use it

A collaborative enterprise

EBI databases are supported over

decades

Page 36: Multi-Omics Bioinformatics across Application Domains

Data growth in EBI data repositories

Page 37: Multi-Omics Bioinformatics across Application Domains

Data growth in EBI data repositories

3-month doubling time

for Metabolomics

Page 38: Multi-Omics Bioinformatics across Application Domains

Data growth in EBI data repositories

3-month doubling time

for Metabolomics

MetaboLights is now the recommended repositoryfor the Nature journals,

EMBO journal, PLOS journals, Metabolomics

Journal and others

Page 39: Multi-Omics Bioinformatics across Application Domains

Sansone,… Steinbeck et al. (2012) Toward interoperable bioscience data.

Nature Genetics, 44, 121–126.

Page 40: Multi-Omics Bioinformatics across Application Domains

Sansone,… Steinbeck et al. (2012) Toward interoperable bioscience data.

Nature Genetics, 44, 121–126.

Controlled VocabulariesOntologies

Page 41: Multi-Omics Bioinformatics across Application Domains

Sansone,… Steinbeck et al. (2012) Toward interoperable bioscience data.

Nature Genetics, 44, 121–126.

Controlled VocabulariesOntologies

Minimum Information Standards

Page 42: Multi-Omics Bioinformatics across Application Domains

Repository Entry

Page 43: Multi-Omics Bioinformatics across Application Domains

Repository Entry

Page 44: Multi-Omics Bioinformatics across Application Domains

Reference Layer

Page 45: Multi-Omics Bioinformatics across Application Domains
Page 46: Multi-Omics Bioinformatics across Application Domains
Page 47: Multi-Omics Bioinformatics across Application Domains
Page 48: Multi-Omics Bioinformatics across Application Domains

7 most annotated metabolomes in MetaboLights

Page 49: Multi-Omics Bioinformatics across Application Domains

30 most annotated metabolomes in MetaboLights

Page 50: Multi-Omics Bioinformatics across Application Domains

1600 metabolome sizes in MetaboLights on a log scale

Page 51: Multi-Omics Bioinformatics across Application Domains

Number of Studies in MetaboLights per Species

Page 52: Multi-Omics Bioinformatics across Application Domains
Page 53: Multi-Omics Bioinformatics across Application Domains

Shareprivateprepublicationstudieswithreviewersandothertrustedparties.

BiocratesXMLwithrawdataisannotatedanduploaded

Page 54: Multi-Omics Bioinformatics across Application Domains

Multi-Omics Data Handling

Page 55: Multi-Omics Bioinformatics across Application Domains
Page 56: Multi-Omics Bioinformatics across Application Domains
Page 57: Multi-Omics Bioinformatics across Application Domains

A Case for Deep Metabolome Annotation

Page 58: Multi-Omics Bioinformatics across Application Domains

A Case for Deep Metabolome AnnotationHelp building species metabolomes

•Submit your metabolomics study to MetaboLights•Submit data publications (e.g. to NATURE Scientific Data)

•Be highly cited :)

Page 59: Multi-Omics Bioinformatics across Application Domains

Slides athttps://www.slideshare.net/csteinbeck

Funding

Page 60: Multi-Omics Bioinformatics across Application Domains

Thanks for your attention

Page 61: Multi-Omics Bioinformatics across Application Domains

[email protected]

Thank you!

Page 62: Multi-Omics Bioinformatics across Application Domains