Adam Kutz Kern Walster. The task of sequencing genomes produces massive amounts of data ...

5
DNA Sequencing caught in deluge of data Adam Kutz Kern Walster

Transcript of Adam Kutz Kern Walster. The task of sequencing genomes produces massive amounts of data ...

Page 1: Adam Kutz Kern Walster.  The task of sequencing genomes produces massive amounts of data  Traditional data transmission is becoming a bottleneck  Researchers.

DNA Sequencing caught in deluge of data

Adam KutzKern Walster

Page 2: Adam Kutz Kern Walster.  The task of sequencing genomes produces massive amounts of data  Traditional data transmission is becoming a bottleneck  Researchers.

The task of sequencing genomes produces massive amounts of data

Traditional data transmission is becoming a bottleneck

Researchers storing data on Hard drives and shipping via FedEx◦ This is less than optimal and insecure

A Data Problem

Page 3: Adam Kutz Kern Walster.  The task of sequencing genomes produces massive amounts of data  Traditional data transmission is becoming a bottleneck  Researchers.

Bioinformatics: computing and biology New companies offer data analysis Genome sequencing can help victims or

rare genetic diseases A renewed hope for cancer patients

A New Field

Page 4: Adam Kutz Kern Walster.  The task of sequencing genomes produces massive amounts of data  Traditional data transmission is becoming a bottleneck  Researchers.

Cost of sequencing a human genome dropped from $10.9 million in 2007 to $10,500 today◦ Massive cost reduction has increased availability

Generates 13 quadrillion bases/year Researchers have to selectively dump data,

lack the capability to store it

Scale

Page 5: Adam Kutz Kern Walster.  The task of sequencing genomes produces massive amounts of data  Traditional data transmission is becoming a bottleneck  Researchers.

Cloud computing Storing data until better analysis methods

are found Google investing in DNANexus, may develop

the capacity to process the data

Solution