Adam Kutz Kern Walster. The task of sequencing genomes produces massive amounts of data ...

Post on 24-Dec-2015

213 views 0 download

Transcript of Adam Kutz Kern Walster. The task of sequencing genomes produces massive amounts of data ...

DNA Sequencing caught in deluge of data

Adam KutzKern Walster

The task of sequencing genomes produces massive amounts of data

Traditional data transmission is becoming a bottleneck

Researchers storing data on Hard drives and shipping via FedEx◦ This is less than optimal and insecure

A Data Problem

Bioinformatics: computing and biology New companies offer data analysis Genome sequencing can help victims or

rare genetic diseases A renewed hope for cancer patients

A New Field

Cost of sequencing a human genome dropped from $10.9 million in 2007 to $10,500 today◦ Massive cost reduction has increased availability

Generates 13 quadrillion bases/year Researchers have to selectively dump data,

lack the capability to store it

Scale

Cloud computing Storing data until better analysis methods

are found Google investing in DNANexus, may develop

the capacity to process the data

Solution