BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component...

19
2016-10-10 1 BIMM34 Biomedicine Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental Medical Science Lund University Learning outcomes Knowledge and understanding On completion of the course, the students shall be able to - account for key theory - explain how research is evaluated Competence and skills On completion of the course, the students shall be able to - understand underlying concepts of central bioinformatic programs - propose, execute, interpret and critically review basic bioinformatic analyses

Transcript of BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component...

Page 1: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

1

BIMM34 Biomedicine –

Bioinformatics component

Mauno Vihinen

Protein structure and bioinformatics group

Department of Experimental Medical Science

Lund University

Learning outcomes

Knowledge and understanding

On completion of the course, the students shall be able to

- account for key theory

- explain how research is evaluated

Competence and skills

On completion of the course, the students shall be able to

- understand underlying concepts of central bioinformatic programs

- propose, execute, interpret and critically review basic bioinformatic

analyses

Page 2: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

2

Teachers

Abhishek Gabriel Gerard

Niroula Teku Schaafsma

Practical issues

Five days, whole week

Hands-on practical training

Lectures providing background

Personal assignment to search, investigate and analyse biological data

Portfolio to report and interpret your results

Be active and keep on asking questions!

Bioinformatics is a very wide subject, on this course we provide the

essential introduction

Bioinformatics skills are important for all bioscientists

Page 3: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

3

Topics

What is bioinformatics?

Introduction

What information can you find for sequences?

Databases

How to compare sequences?

Sequence comparison and alignment

How to find sequences?

Searching sequences

What can a sequence tell us?

Sequence analysis

What can a structure tell us?

Structural analysis

Assignments

LibGuide

Learning diary

explain what you have done and what the results mean

Return your assignments to the teacher of the day

We aim at fast response!

All the questions have to be answered.

Personal project i.e. gene and protein

Page 4: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

4

What is bioinformatics?

Mauno Vihinen

Protein structure and bioinformatics group

Department of Experimental Medical Science

Lund University

Definition of bioinformatics

The use of computer science, mathematics, and information theory to

model and analyze biological systems, especially systems involving

genetic material.

The Free Dictionary

Page 5: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

5

Bioinformatics is a discipline to

• create

• organize

• analyze

• store

• retrieve

• share, and

• distribute

biological and biomedical information.

My own definition

It’s all about making sense of data!

Page 6: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

6

Features of bioinformatics

(Often) large amounts of data, even Big Data

Computer assisted/based

Often statistical component needed

Closely connected to other disciplines - omicses

Collaboration

Highly variable job descriptions: from programming to special data analysis tasks

Open access, open source

Lots of freely available databases and programs

Community wide development

Features of bioinformatics

Systematics essential at numerous levels

Central data providers (NCBI, EBI, DDBJ)

Core facilities, NBIS in Sweden

Many biological/medical studies impossible without bioinformatics

Omics-wide technologies are bioinformatics driven

Fast development of the field

In the intersection of many fields and disciplines

Page 7: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

7

Application areas

Sequence analysis Metabolomics

Gene identification Metagenomics

Evolution and phylogenetics Protein engineering

Genomics Clinical applications

Comparative genomics

Expression data analysis

(transcriptomics, proteomics)

NGS data analysis

Variation studies

Systems biology

Text mining

Protein structures, modeling

Software development

Algorithms

Statistical approaches

Pharmacogenetics

Rational drug design

Areas in computer science

Algorithms

Data mining

Text mining

Data compression

Pattern recognition

Machine learning

Image processing

Simulations

Statistical inference

Program interfaces

Visualisation

One of the Big Data sciences

Page 8: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

8

Computer languages

C, C++

Java

Perl

Python

R

SQL

Matlab

Spreadsheets

Primary immunodeficiencies

Page 9: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

9

Mechanisms of amino acid substitutions

Pathogenic or not? And if so, then how?

Identification of harmful/harmless variants

Prioritization of cases

Prediction of mechanisms

Methods should have high accuracy and high throughput

Driver variants in cancers

Multigenic diseases

Exome and complete genome data analysis

Pharmacogenetics and -genomics

Personalized medicine

Variation effect analysis

Page 10: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

10

Machine learning

A computational system that learns patterns from training data to be able

to classify unknown cases.

PON-P2 principle

Page 11: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

11

PROlocalizer

B cell development, transcriptomics and

proteomics

Page 12: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

12

Signaling and metabolic pathways

Immunome data integration

Page 13: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

13

Systematics, Variation Ontology

893 genes

177 disease related

181 integral to plasma membrane

91 extracellular

119 receptors

284 novel pseudogenes

3188 pseudogene fragments

Human immunome genes

Page 14: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

14

584 proteins

1349 interactions

Evolution of immunome interactome

Disease candidate gene prediction

Page 15: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

15

Systems biology and systems medicine

Analytical approach to biological processes in health and disease

Central T-cell signaling network

Page 16: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

16

PIDs in T-cells, network analysis

Primary immunodeficiency classification

Page 17: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

17

WAS XLT Mixed

Genotype-phenotype correlations

Structural basis of diseases - XLA

Page 18: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

18

Structural consequence of variants

Variants in tRNA

Page 19: BIMM34 Biomedicine Bioinformatics component · BIMM34 Biomedicine – Bioinformatics component Mauno Vihinen Protein structure and bioinformatics group Department of Experimental

2016-10-10

19

Bioinformatics is integral part of numerous biomedical studies

Large datasets can only be analyzed with computers

Important to know what data is available

Standards, formats, data integration

Essential to understand how methods work

Interpretation of the results

Summary