
Leveraging Sequence Classification by Taxonomy-based Multitask Learning

Christian Widmer, Jose Leiva, Yasemin Altun, Gunnar Rätsch

Presented by

Meghana Kshirsagar

Outline

• Multitask Learning setting
• Application
  – prediction of splicing sites across organisms
• 3 approaches to multi-task learning
  – Top-down
  – Pairwise
  – Multitask kernel
• Experiments and Results

Prelude: Classification

[Diagram: annotated (labeled) data is used to learn a model; the model is then used to classify unlabelled data]

Multitask learning setting

Multi-task learning

• Learning multiple concepts together
  – simultaneously
  – with transfer learning between concepts
• Model quality is limited by insufficient training data
  – exploit similarity between tasks
• In computational biology:
  – organisms share evolutionary history
  – many biological processes are conserved

Why multi-task learning?

Setting

• Consider M tasks T_1, ..., T_M
• We are given data D_1, ..., D_M, one dataset per task
  – D_i = {(x_1, y_1), ...} for each task
• Build M classifiers using all available information from all tasks
• Use a taxonomy to learn these multiple tasks

Why use a taxonomy?

• A taxonomy can be used to define the relationship between two tasks
• In biology, a taxonomy arises naturally from a phylogenetic tree
• Closer tasks will benefit more from each other

Application to a biological problem

Prediction of splicing sites

[Diagram: gene expression pipeline, DNA -> (transcription) -> pre-mRNA -> (splicing) -> mRNA -> (translation) -> protein]

Given: annotated pre-mRNA data from multiple organisms.
Find: new splicing sites, i.e., donor and acceptor locations.
Question: can we build a better model using data from multiple organisms?

Hierarchy used

[Figure: taxonomy over the 15 organisms, with divergence times of roughly 450, 230, and 70 million years marked on the branches and edge weights reflecting genome similarity]

Techniques and algorithms

Exploiting hierarchies - 1

More general organism (e.g. Plantae) at the root; more specific organisms at the leaves.

Top-Down Model:
1. Build a model at the root node, using data from all leaf nodes of the tree.
2. Build a model at the next level similarly, using all data in the leaf nodes under that subtree.
3. Continue down the tree until models are built at the leaf nodes.

• Any machine learning technique can be used to build the models at each level.
• How do we build "similar" models?

Detour: Support Vector Machine

Mathematical formulation (regularizer + error/loss):

$$\min_{w,\,b} \; \underbrace{\tfrac{1}{2}\|w\|^2}_{\text{regularizer}} \;+\; \underbrace{C \sum_i \ell\big(y_i,\, \langle w, x_i \rangle + b\big)}_{\text{error loss}}$$

Classifier: $f(x) = \langle w, x \rangle + b$
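To make the formulation concrete, here is a minimal numpy sketch of the soft-margin objective with the hinge loss as $\ell$; the function names and the value of C are illustrative, not from the slides.

```python
import numpy as np

def svm_objective(w, b, X, y, C=1.0):
    """Soft-margin SVM objective: 0.5*||w||^2 + C * sum of hinge losses.

    X: (n, d) feature matrix; y: (n,) labels in {-1, +1}.
    """
    margins = y * (X @ w + b)                  # y_i * f(x_i)
    hinge = np.maximum(0.0, 1.0 - margins)     # per-example hinge loss
    return 0.5 * np.dot(w, w) + C * hinge.sum()

def classify(w, b, X):
    """Linear classifier: sign of f(x) = <w, x> + b."""
    return np.sign(X @ w + b)
```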

How to build similar models?

Given: a parent model $w_{par}$.
Want: $w \cong w_{par}$; in other words, we want $(w - w_{par})$ to be small.

Regular SVM: $\min_{w,b} \; \tfrac{1}{2}\|w\|^2 + C \sum_i \ell(y_i, \langle w, x_i \rangle + b)$
DA SVM: $\min_{w,b} \; \tfrac{1}{2}\|w - w_{par}\|^2 + C \sum_i \ell(y_i, \langle w, x_i \rangle + b)$
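A minimal sketch of the DA-SVM idea, assuming it simply swaps the regularizer $\tfrac{1}{2}\|w\|^2$ for $\tfrac{1}{2}\|w - w_{par}\|^2$ and is trained by subgradient descent; the trainer and its hyperparameters are illustrative, not the authors' solver.

```python
import numpy as np

def train_da_svm(X, y, w_par, C=1.0, lr=0.01, epochs=200):
    """Subgradient descent on 0.5*||w - w_par||^2 + C * sum_i hinge(y_i, <w, x_i> + b).

    With w_par = 0 this reduces to the regular SVM objective.
    """
    w, b = w_par.copy(), 0.0                       # warm-start at the parent model
    for _ in range(epochs):
        margins = y * (X @ w + b)
        viol = margins < 1.0                       # examples with nonzero hinge loss
        grad_w = (w - w_par) - C * (y[viol, None] * X[viol]).sum(axis=0)
        grad_b = -C * y[viol].sum()
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b
```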

Hierarchical Top-Down learning

• $w_0$ (the root model) is trained using all data from all tasks (all leaf nodes).
• Each lower model is built to be similar to its parent, e.g. $w_{11}$ is built similar to $w_1$, using data from one organism.
• The leaf-node classifiers are the ones used for prediction.
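The top-down pass can then be written as a simple recursion over the taxonomy, reusing train_da_svm from the sketch above; the nested-dict tree representation is an assumption for illustration.

```python
import numpy as np

def pooled_data(node):
    """Stack the data of all leaves under this node."""
    if not node.get('children'):
        return node['data']                         # leaf: (X, y)
    parts = [pooled_data(c) for c in node['children']]
    return np.vstack([p[0] for p in parts]), np.concatenate([p[1] for p in parts])

def train_top_down(node, w_par, train_fn):
    """Train models down the tree; each node is pulled toward its parent's model.

    node: {'name': str, 'children': [...], 'data': (X, y) at leaves}.
    Returns {organism: (w, b)} for the leaf classifiers used at prediction time.
    """
    X, y = pooled_data(node)
    w, b = train_fn(X, y, w_par)                    # regularized toward w_par
    if not node.get('children'):
        return {node['name']: (w, b)}
    models = {}
    for child in node['children']:
        models.update(train_top_down(child, w, train_fn))
    return models

# Usage (d = feature dimension): the root is trained with w_par = 0, i.e. a plain SVM:
# leaf_models = train_top_down(root, np.zeros(d), train_da_svm)
```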

Exploiting hierarchies - 2

Pairwise Approach
• Simultaneous learning of all tasks: classifiers for all M tasks are trained at the same time.
• Similarity is enforced by the regularization term, weighted by the entries of a task-similarity matrix (a sketch of the joint objective follows below).
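One common way to write such a joint objective (the slides give no formula, so the exact form and the similarity matrix A here are assumptions) is a per-task SVM loss plus a pairwise penalty that pulls the weight vectors of similar tasks together:

```python
import numpy as np

def pairwise_objective(W, Xs, ys, A, C=1.0, lam=1.0):
    """sum_t [0.5*||w_t||^2 + C*hinge_t] + lam * sum_{s,t} A[s,t] * ||w_s - w_t||^2.

    W: (M, d) stacked task weights; Xs, ys: lists of per-task data;
    A: (M, M) task-similarity matrix, e.g. derived from the taxonomy.
    """
    M = W.shape[0]
    obj = 0.0
    for t in range(M):                              # per-task loss terms
        margins = ys[t] * (Xs[t] @ W[t])
        obj += 0.5 * np.dot(W[t], W[t]) + C * np.maximum(0.0, 1.0 - margins).sum()
    for s in range(M):                              # pairwise coupling terms
        for t in range(M):
            diff = W[s] - W[t]
            obj += lam * A[s, t] * np.dot(diff, diff)
    return obj
```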

Experiments and Results

Methods compared

Multitask Learning Methods:
1. Top-Down
2. Pairwise Regularization
3. Multitask Kernel

Baselines:
• Plain (one classifier per organism, trained independently)
• Common (a single classifier trained on the union of all data)

Splice-site recognition problem

• Use biological insight (a candidate-site scan is sketched after this list):
  – GT or GC consensus exhibited at the donor site
  – AG consensus at the acceptor site
• Each input sequence is ...
• Data: 15 organisms
  – Training: 10,000 examples per organism
  – Test: 6,000 examples per organism
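As a toy illustration of the consensus insight above (the function and the window handling are illustrative, not the paper's preprocessing), candidate donor and acceptor positions can be found by a simple dinucleotide scan:

```python
def candidate_sites(seq):
    """Scan a DNA string for consensus dinucleotides marking candidate sites:
    GT or GC at a donor site, AG at an acceptor site.
    A classifier would then score a fixed-length window around each candidate.
    """
    donors = [i for i in range(len(seq) - 1) if seq[i:i + 2] in ("GT", "GC")]
    acceptors = [i for i in range(len(seq) - 1) if seq[i:i + 2] == "AG"]
    return donors, acceptors

print(candidate_sites("AAGTCCAGGA"))  # ([2], [1, 6])
```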

Performance measure: auPRC (area under the precision-recall curve), computed as sketched below.
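A minimal sketch of the metric computation, using scikit-learn on toy labels and scores:

```python
import numpy as np
from sklearn.metrics import precision_recall_curve, auc

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])                   # toy labels
scores = np.array([0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.1])  # toy classifier scores

precision, recall, _ = precision_recall_curve(y_true, scores)
auprc = auc(recall, precision)        # area under the precision-recall curve
print(f"auPRC = {auprc:.3f}")
```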

Observations

• The gain is larger at the lower levels of the hierarchy.
• The "A. nidulans" baselines do better.
• "Mouse" doesn't benefit much
  – possible reason: not much similarity to the other organisms in the taxonomy.
• No performance loss on distantly related organisms.

Critique

• The experimental evaluation is not very thorough.
• Learning in the absence of a hierarchy:
  – how should task-similarity measures be defined?
• Assumes that some training (labeled) data is available from all organisms.
  – In many scenarios there is no data available for less-studied organisms.

Comments / Thoughts?

Multitask Kernel approach
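The transcript carries no detail for this slide. A standard construction for a multitask kernel (in the style of Evgeniou et al.; whether the talk used exactly this form is an assumption) multiplies a base kernel on the inputs with a task kernel encoding task similarity, e.g. derived from the taxonomy:

```python
import numpy as np

def multitask_gram(X, tasks, K_task, base_kernel):
    """Gram matrix for K((x, s), (z, t)) = K_task[s, t] * base_kernel(x, z).

    X: (n, d) inputs; tasks: (n,) task index of each example;
    K_task: (M, M) task-similarity kernel; base_kernel: function on input pairs.
    """
    n = X.shape[0]
    G = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            G[i, j] = K_task[tasks[i], tasks[j]] * base_kernel(X[i], X[j])
    return G

linear = lambda x, z: float(np.dot(x, z))   # simple base kernel for illustration
# Usage: G = multitask_gram(X, task_ids, K_task, linear); feed G to a kernel SVM.
```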
