We are sorry, but the presentation on this USB device was prepared on an IBM computer…it cannot be...

Post on 01-Apr-2015

214 views 1 download

Tags:

Transcript of We are sorry, but the presentation on this USB device was prepared on an IBM computer…it cannot be...

We are sorry, but the presentation on this USB device

was prepared on an IBM computer…it cannot be run on

this Dell Computer…

ISAC's perspective on data: standards, structure and analysis

J. Paul RobinsonSVM Professor of Cytomics

Purdue UniversityPresident, International Society for Analytical Cytology

(ISAC)

The HTS rationale• The infinite monkey theorem defines the HTS rationale. It

states that a monkey hitting keys at random on a typewriter keyboard will almost surely eventually type every book in France's Bibliothèque Nationale de France (National Library).

• In the restatement of the theorem most popular among English speakers, the monkeys eventually type out the collected works of William Shakespeare.

• The original image was presented in Émile Borel's 1913 book "Mécanique Statistique et Irréversibilité”.

http://forum.swarthmore.edu/dr.math/problems/bridge8.5.98.htmlhttp://www.nutters.org/monkeys.html

One perspective…

BioIT Magazine2002

How many standards can you afford?

SssssssssSsssSssSssSssss

Monkey Business

• Let’s look at the infinite monkey theorem again…

• In 2003, scientists at Paignton Zoo and the University of Plymouth, in Devon in England reported that they had left a computer keyboard in the enclosure of six Sulawesi Crested Macaques for a month

• Not only did the monkeys produce nothing but five pages consisting largely of the letter S, they started by attacking the keyboard with a stone, and continued by urinating and defecating on it.

Historical Picture

• Began with flow cytometry– Invented in 1960s– 1970s started with single fluorescence signal and two

laser scatter signals – total 3 variables – 1977 Herzenberg et al 2 color flow compensation– 1980s two fluorescence signals and two scatter

signals– 1990s three to 11 fluorescence signals and two

scatter signals– 2000s 32 fluorescence signals and 10-15 scatter

signals

Fundamentals of Flow Cytometry

PMT 1

PMT 2

PMT 5

PMT 4

DichroicFilters

BandpassFilters

Laser(s)

Flow chamber

PMT 3

Scatter

Sensor

Sample

103 to 105 cells/sec

4-50 variables/cell

Advanced polychromatic cytometry

From Mario Roederer

ISAC Standards Implementation1984 – Introduction of FCS 1.0 (Flow Cytometry Standard)

- Murphy & Chused

1989 - FCS 2.0

1997 - FCS 3.0

• Cytometry 28:118-122 (1997)

2008 - FCS 4.0Proposed major update

Main challenges

• HCS activities must leverage information sciences – integrating information from genomics, emerging protein interaction networks, and ongoing chemical-genetic studies into a public knowledge base of biological systems

• Standards have to be defined and followed!

What is the issue?• There are thousands of analytical and diagnostic instrument in

clinics and labs• These instruments are manufactured by 10-20 different

companies• They use reagents from 1-200 companies for the same tests• Many tests used fluorescence as the reporter system• Imaging systems are not uniform – there are no accepted

imaging standards• There are no algorithm evaluation standards• There is no way to take data sets into any single management

engine• There are no organized databases that can evaluate the vast

amount of research or clinical data collected

How Big a Problem is it?

• 20,000 flow cytometers - 10 manufacturers• 5-8,000 confocal microscopes - 10 manufacturers• 20-50,000 other fluorescence systems – 20 manufacturers• 5-10,000 DNA microarray readers – 10 manufacturers• 500 Laser Scanning Microscopes – 2 manufacturers• 1000 plus HCS instruments – 20 manufacturers

How much data is enough? Flow assay

• Standard 7 tube assay• Each tube 7 colors plus 2 scatter parameters• 50,000 cells per tube• 450,000 parameters per tube x 7 =3,150,000• Run 25 patients/tests per day= 25x7=175

assays= 551,250,000 points per day• If you ran this assay 100 times in a year you

would have 55,125,000,000 points

How much data is enough? HCS assay

• 384 well plate assay• 6 images per well = 2304 per plate• Each image contains 100 cells (450k per image 600x800

pixels) (1,036,800k of image space = 1 Gbyte)• We collect 20 parameters per cell• We have 230,400 x 20 parameters = 4,608,000• For a 10 plate assay we have 46,080,000 parameters• If we run 2 assays a day 5 days a week for 40 weeks• 46,080,000 x 2 x 5 x 40 = 18,432,000,000 parameters• Total storage space =1 Gbyte x 10 x 2 x 5 x 40 = 4 Tbytes

How many images are there…

• Industry estimates indicate that 80 billion new images are created every year

• 219,178,082 per day• 9,132,420 per hour• 152,207 per minute• 2,536 per second

http://a06.cgpublisher.com/proposals/244/index_html

Calibration and Standards?• Very few real standards• Local calibration if at all• Standards processes must be created and implemented

across several fields• Necessary to identify

– Instrument standards– Reagent Standards– Analysis Standards– Data structure standards– Metadata standards– Algorithm identification (at least)

Identical microbeads with various calibrated binding capacities of goat-anti-mouse IgG on their surface:

Eve

nts

Mean fluorescence intensity (MFI)

1 2

3 4

Blank

QSC, Cat. No. 815Bangs Laboratories, Inc.www.bangslabs.com

Antibody binding capacity (ABC) provided by the manufacturer :

Blank. 0 MESF

1. 6851 MESF

2. 23379 MESF

3. 58333 MESF

4. 213369 MESF

bead

Ab site

MESF=Molecules of equivalent soluble fluorochrome

QSC Beads (Quantum Simply Cellular)

Noise measurement with a standard

R. M. Zucker and O. Price, Cytometry 43 (2001) 273 - 294

QC- Optical Filters

• Depending on location, filters can be placed under extreme stress• Environmental conditions (humidity)

Excitation Efficiency Profiles

(note – there really isn’t a 545 nm line available!!)

Management compensation of fluorescence overlap becomes crucial

Noise measurement in the images II

Wavelet

transform 1

Wavelet

transform 2

signal

wavelet shrinkage

effect ?

noise

YES

NO

scale corr. s/n filter

s/n childs/n

parent

raw image

denoised image (signal)

diff.

Light detector stability analysis I

ttt

ststt

ebI

bb

trend component: random walks (RW)

periodic component: dynamic harmonic regression

j

tjjtjjtt e)}tsin(b)tcos(a{I

What Standards are available?

• Beads for size, intensity, color• No calibration tools available for high

resolution optical microscope (Richardson slide no longer manufactured)

• 1990 we created the Handbook of Flow Cytometry Methods to exactly define methods

• 1997 we created Current Protocols in Cytometry

About original data…“It is crucially important to keep your original digital or analog data exactly asthey were acquired and to record yourinstrument settings. This primary ruleof good scientific practice will allowyou or others to return to your originaldata to see whether any informationwas lost by the adjustments made tothe images. In fact, some journal reviewersor editors request access to suchprimary data to ensure accuracy.”

J Cell Biol. 166:11-15, 2004

Workshop on Standards and Calibration in Cytometry and Biological Imaging Modalities

Jointly sponsored by

International Society for Analytical Cytology (ISAC)& the Society for Biomolecular Sciences (SBS)

Site and date not yet set

To highlight the areas of cell analysis that need to be standardized To develop a series of recommendations on:

– Data file standards– Imaging standards– Archival/storage standards– Compression modalities– Algorithms and processing – Analytical technologies

ISAC 21st Century

• Flow and imaging are equally emphasized in ISAC

• Standards and Calibration

• Biosafety issue

• Core managers support

• Education

• Public Policy

• www.isac-net.org

• www.cyto.purdue.edu– Cytometry web/email discussion– Educational materials, Tutorials, Lectures

Next ISAC Congress May 17-21, 2008, Budapest, Hungary

Some ReferencesR.A. Hoffman, Current Protocols in Cytometry, 1997 : 1.3.1-1.3.19J.C.S. Wood, Current Protocols in Cytometry, 1997 : 1.4.1-1.4.12Cytometry, Volume 33, Number 2, 1998R. M. Zucker and O. Price, Cytometry 43 (2001) 273 - 294