
Attention-based Deep Multiple Instance Learning

Maximilian Ilse, Jakub Tomczak, Max Welling

AMLAB, University of Amsterdam

ICML 2018

Motivation

Typical size of benchmark natural images: up to 256x256.

Typical size of medical images: ~10,000x10,000.

How to process it?

Motivation

Goal: Find (local) objects (abnormal changes in tissue) in an image.

Data: billions of pixels, 10^1–10^2 scans, weak labels (for regions or a scan).

Solution: Use local information in the image and look for Regions of Interest.

Ricci-Vitiani, L., et al. "Identification and expansion of human colon-cancer-initiating cells." Nature 445.7123 (2007): 111.

Supervised Learning vs. Multiple Instance Learning

Supervised learning: one image - one label. Multiple instance learning: many images (a bag) - one label.

Individual labels y_1, …, y_K are unknown.

Assumptions about the bag label Y: Y = 0 if all y_k = 0, and Y = 1 if at least one y_k = 1, i.e., Y = max_k y_k. Instances with y_k = 1 are called key instances.

Multiple Instance Learning

A MIL classifier as a probabilistic model: p(Y = 1 | X) = θ(X), where X = {x_1, …, x_K} is a bag of instances.

θ(X) must be permutation-invariant! How?

Theorem (Zaheer et al., 2017). A scoring function for a set of instances X, S(X) ∈ R, is a symmetric function (i.e., permutation-invariant to the elements of X) if and only if it can be decomposed in the following form:

S(X) = g(Σ_{x∈X} f(x)),

where f and g are suitable transformations.
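To make the decomposition concrete, here is a minimal sketch, assuming PyTorch (the framework of the authors' linked repository); the class name and layer sizes are illustrative, not taken from their code. The final check verifies that the score is unchanged under a permutation of the instances:

```python
import torch
import torch.nn as nn

class DeepSetsScore(nn.Module):
    def __init__(self, in_dim=28 * 28, hid_dim=64):
        super().__init__()
        # f: instance-level transformation applied to every x in X
        self.f = nn.Sequential(nn.Linear(in_dim, hid_dim), nn.ReLU())
        # g: set-level transformation applied after pooling
        self.g = nn.Linear(hid_dim, 1)

    def forward(self, X):          # X: (K, in_dim), one bag of K instances
        h = self.f(X)              # (K, hid_dim)
        z = h.sum(dim=0)           # permutation-invariant sum pooling
        return self.g(z)           # scalar score S(X)

X = torch.randn(5, 28 * 28)
model = DeepSetsScore()
perm = torch.randperm(5)
# The score does not depend on the order of instances in the bag:
print(torch.allclose(model(X), model(X[perm]), atol=1e-5))  # True
```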

Theorem (Qi et al., 2017). For any ε > 0, a Hausdorff-continuous symmetric function S(X) can be approximated arbitrarily closely by a function of the form g(max_{x∈X} f(x)), where max is the element-wise vector maximum operator and f and g are continuous functions, that is:

|S(X) − g(max_{x∈X} f(x))| < ε.

Multiple Instance Learning

The theorems say that we can model a permutation-invariant θ(X) by composing (see the sketch after the formula):

● a transformation f of individual instances,

● a permutation-invariant function σ, e.g., sum, mean, or max (MIL pooling),

● a transformation g of the combined instances:

θ(X) = g(σ(f(x_1), …, f(x_K)))
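A minimal sketch of this composition, again assuming PyTorch; the layer sizes and the sigmoid bag classifier g are illustrative choices, not the paper's exact architecture. The pooling σ is a swappable component:

```python
import torch
import torch.nn as nn

class MILClassifier(nn.Module):
    def __init__(self, in_dim=28 * 28, hid_dim=64, pooling="max"):
        super().__init__()
        self.f = nn.Sequential(nn.Linear(in_dim, hid_dim), nn.ReLU())
        self.g = nn.Sequential(nn.Linear(hid_dim, 1), nn.Sigmoid())  # theta(X) in (0, 1)
        self.pooling = pooling

    def forward(self, X):                  # X: (K, in_dim), one bag
        H = self.f(X)                      # instance embeddings (K, hid_dim)
        if self.pooling == "mean":
            z = H.mean(dim=0)
        elif self.pooling == "max":        # element-wise max over instances
            z = H.max(dim=0).values
        else:
            z = H.sum(dim=0)               # sum pooling as in Zaheer et al.
        return self.g(z)                   # bag-level probability theta(X)

theta = MILClassifier(pooling="mean")
print(theta(torch.randn(12, 28 * 28)))    # e.g., tensor([0.47], ...)
```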

Multiple Instance Learning: Components

We model both transformations f and g using neural networks.

Two approaches:

- embedding-based,

- instance-based.

MIL pooling:

- mean,

- max,

- other (e.g., Noisy-OR).

Multiple Instance Learning: Components

Issues:

- the embedding-based approach lacks interpretability;

- the instance-based approach propagates errors;

- max and mean are non-learnable.

Multiple Instance Learning: Attention-based approach

We propose to use the attention mechanism as MIL pooling:

z = Σ_{k=1}^{K} a_k h_k,

where:

a_k = exp(w^T tanh(V h_k)) / Σ_{j=1}^{K} exp(w^T tanh(V h_j)).

Attention with gating mechanism:

a_k = exp(w^T (tanh(V h_k) ⊙ sigm(U h_k))) / Σ_{j=1}^{K} exp(w^T (tanh(V h_j) ⊙ sigm(U h_j))),

where h_k = f(x_k) are instance embeddings, w, V, and U are learnable parameters, ⊙ denotes element-wise multiplication, and sigm(·) is the sigmoid non-linearity.
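A minimal sketch of the gated variant, assuming PyTorch; the dimensions M and L are illustrative, and unlike the equations above, nn.Linear adds a bias term:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedAttentionPooling(nn.Module):
    def __init__(self, M=500, L=128):
        super().__init__()
        self.V = nn.Linear(M, L)   # tanh branch
        self.U = nn.Linear(M, L)   # sigmoid gate
        self.w = nn.Linear(L, 1)   # attention scores

    def forward(self, H):                   # H: (K, M), instance embeddings h_k
        scores = self.w(torch.tanh(self.V(H)) * torch.sigmoid(self.U(H)))  # (K, 1)
        a = F.softmax(scores, dim=0)        # attention weights a_k, sum to 1
        z = (a * H).sum(dim=0)              # bag embedding z = sum_k a_k h_k
        return z, a.squeeze(-1)             # also return a for interpretation

H = torch.randn(7, 500)                     # a bag of 7 instance embeddings
z, a = GatedAttentionPooling()(H)
print(a)                                    # large a_k -> candidate key instances
```

Returning the weights a_k alongside the bag embedding is what makes the pooling interpretable: instances with large weights are the candidate key instances.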

Multiple Instance Learning: Attention-based approach

The attention mechanism as MIL pooling:

- the MIL operator is trainable;

- the attention weights can be interpreted (key instances).

The embedding-based approach becomes interpretable and fully trainable.

Experiments: MNIST-based problem

[Figures: example bags of MNIST digits, labeled Y = 1 if the bag contains the target digit and Y = 0 otherwise, and test performance for varying numbers of training bags.]
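A hypothetical sketch of the MNIST-bags construction (the paper draws bag sizes from a Gaussian and uses '9' as the target digit; the exact parameters below are assumptions):

```python
import torch
from torchvision import datasets

# Hypothetical sketch of the MNIST-bags setup: a bag is a random set of
# MNIST digits, labeled Y = 1 iff it contains the target digit '9'.
# The bag-size distribution parameters are illustrative assumptions.
def make_bag(images, labels, mean_size=10, std=2.0, target=9):
    k = max(1, int(round(torch.randn(1).item() * std + mean_size)))
    idx = torch.randint(0, len(images), (k,))
    bag = images[idx]                        # (k, 1, 28, 28)
    y = int((labels[idx] == target).any())   # bag label Y
    return bag, y

mnist = datasets.MNIST("data", train=True, download=True)
images = mnist.data.unsqueeze(1).float() / 255.0   # (60000, 1, 28, 28)
bag, y = make_bag(images, mnist.targets)
print(bag.shape, y)
```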

Experiments: Breast Cancer

Experiments: Colon Cancer

[Result tables and attention visualizations were shown on the original slides.]

Conclusion

Deep MIL: a flexible approach to cope with large images.

Attention mechanism: interpretable and learnable MIL pooling.

Next step: application to whole-slide classification.

Next step: taking into account spatial dependencies (non-i.i.d. instances).

Code on GitHub: https://github.com/AMLab-Amsterdam/AttentionDeepMIL

Contact: ilse.maximilian@gmail.com, jakubmkt@gmail.com

The research conducted by Jakub M. Tomczak was funded by the European Commission within the Marie Skłodowska-Curie Individual Fellowship (Grant No. 702666, ”Deep learning and Bayesian inference for medical imaging”).

The research conducted by Maximilian Ilse was funded by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (Grant DLMedIa: Deep Learning for Medical Image Analysis).