Slide 1
Cloud Storage Made Seamless
Marc Villemade, Technology Evangelist, Scality
Ranajit Nevatia, VP, Marketing, Panzura
There are two types of data (roughly)
Structured: We (sort of) know how to manage this
Unstructured: This is the new beast we have issues with
Slide 2
How to define Structured Data?
Structured data is a set of organized pieces of data
Relational databases are a perfect example
Atomic pieces are, on their own, meaningless
Slide 3
What about Unstructured Data?
Unstructured data is self-contained pieces of data
Self-descriptive
Meaningful in and of itself
Typically has metadata attached to it
Email, Videos, Presentations, Spreadsheets, satellite images…
An easy way to think about it: anything that can be stored in one file is unstructured data
Slide 4
Some numbers…
In 2012, humanity will generate 2.7 ZB of data (1)
It is estimated that we permanently store ~1 ZB of it (2) (~40%)
80% of it is unstructured (1)
500 quadrillion files (500,000 million million files)
Next year and beyond, it will grow by 50% year over year (1)
It will double every 2 years for the next 10 years
Kind of unfathomable, ain’t it?
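As a rough check on the growth figures above, the arithmetic can be sketched in a few lines of Python. This is only an extrapolation under the assumption that the 50% year-over-year rate stays constant; the function names are illustrative and the projected values are not figures from IDC.

```python
import math

ANNUAL_GROWTH = 0.50   # 50% year-over-year growth, as quoted on the slide
BASE_YEAR = 2012
BASE_ZB = 2.7          # zettabytes generated in 2012, as quoted on the slide


def projected_zb(year: int) -> float:
    """Data generated in `year`, assuming constant compound growth (an assumption)."""
    return BASE_ZB * (1 + ANNUAL_GROWTH) ** (year - BASE_YEAR)


def doubling_time_years(rate: float) -> float:
    """Years needed for a quantity to double at a constant annual growth rate."""
    return math.log(2) / math.log(1 + rate)


if __name__ == "__main__":
    print(f"Doubling time at 50% y-o-y: {doubling_time_years(ANNUAL_GROWTH):.2f} years")
    for year in (2012, 2014, 2017, 2022):
        print(f"{year}: {projected_zb(year):.1f} ZB")
```

At a constant 50% per year the doubling time works out to about 1.7 years, which is in the same range as the "double every 2 years" rule of thumb.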
Slide 5
(1) IDC numbers – (2) University of Southern California (2007)
Humans like organized things
Well, some of them at least…
Structured storage systems have been used for Unstructured Data
Organized in file systems, hierarchies, directories
Easier for us
And then new data creation patterns emerged in the early 2000s
The model doesn’t fit anymore
And here’s why
Slide 6
Typical SAN / NAS issues at Scale
Technology refresh and migration necessary to benefit from larger disks
Scheduled maintenance window nuisance
Limitations on # of files
Volume management is complex
Serial architecture compromises performance
RAID is less efficient for large drives
FC networks are expensive & point-to-point
Cost is prohibitive for large capacity
Slide 7
Humans like organized things
Well, some of them at least…
Structured data storage systems are used for Unstructured Data
Organized in file systems, hierarchies, directories
Easier for us
And then new data creation patterns emerged in the early 2000s
The model doesn’t fit anymore
SANs and NASes were not made to handle this
Slide 8
So what’s the solution?
We believe it’s Object Storage
Yahoo!, Amazon, Google… were the pioneers
Main Characteristics
Flat Namespace
Infinite Scalability
Elasticity
Cost-efficiency
Data availability and durability
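To make the "flat namespace" idea concrete, here is a minimal in-memory sketch: every object lives under a single opaque key, with its metadata attached, and there are no directories or hierarchies to manage. The class and method names are hypothetical, and deriving the key from a content hash is just one illustrative choice; this is not Scality's API or how any particular product assigns keys.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class StoredObject:
    """A self-contained blob plus the metadata that describes it."""
    data: bytes
    metadata: dict = field(default_factory=dict)


class FlatObjectStore:
    """Toy object store: one flat key space, no directory tree (illustrative only)."""

    def __init__(self) -> None:
        self._objects = {}  # maps key (str) -> StoredObject

    def put(self, data: bytes, **metadata: str) -> str:
        # Hypothetical keying scheme: hex SHA-256 digest of the content.
        key = hashlib.sha256(data).hexdigest()
        self._objects[key] = StoredObject(data, dict(metadata))
        return key

    def get(self, key: str) -> StoredObject:
        # Lookup is by key alone; there is no path to traverse.
        return self._objects[key]


if __name__ == "__main__":
    store = FlatObjectStore()
    key = store.put(b"quarterly results", content_type="text/plain", owner="marketing")
    obj = store.get(key)
    print(key, obj.metadata)
```

Because the application only ever holds a key, the store is free to place the object on any node, which is what makes scaling out easier than with a directory tree.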
Slide 9
Scality’s Storage Vision
Slide 10
Their DC, Their App., YOUR Data
Their DC, YOUR App., YOUR Data
YOUR DC, YOUR App., YOUR Data
Scality has developed distributed (scale-out), object-based storage software that turns x86 servers into petabyte-scale storage for unstructured data (files).
(Scality is NOT designed for VMs, VDI, or relational databases)
Slide 11
What is the Secret Sauce?
• Distributed system
• Distributed metadata
• No single point of failure
• Self-healing
• Organic upgrades
Slide 12
What’s unique about Scality RING
• Performance
  • ESG Lab report: we’re 10x faster than any other object store
• Hardware-agnostic
  • Software vendor
  • Mixed hardware (disks, nodes)
• Erasure coding with no penalty on read
  • With only 60% overhead
• Tiering
  • Policy-driven
  • Automatic, transparent
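To put the "60% overhead" figure in context, the sketch below compares the extra raw capacity needed by erasure coding and by plain replication. The slide does not say which data/parity split is used; the 10 data + 6 parity split here is an assumption chosen only because it yields 60%, and three-way replication is included as a common point of comparison.

```python
def replication_overhead(copies: int) -> float:
    """Extra raw capacity per usable byte when keeping `copies` full copies."""
    return float(copies - 1)


def erasure_overhead(data_chunks: int, parity_chunks: int) -> float:
    """Extra raw capacity per usable byte for a data + parity erasure code."""
    return parity_chunks / data_chunks


def raw_capacity_needed(usable_pb: float, overhead: float) -> float:
    """Raw petabytes required to offer `usable_pb` of usable capacity."""
    return usable_pb * (1 + overhead)


if __name__ == "__main__":
    ec = erasure_overhead(data_chunks=10, parity_chunks=6)  # assumed split
    rep = replication_overhead(copies=3)                    # assumed comparison
    print(f"Erasure coding overhead: {ec:.0%}")
    print(f"3x replication overhead: {rep:.0%}")
    print(f"Raw PB for 1 usable PB (erasure coding): {raw_capacity_needed(1.0, ec):.1f}")
    print(f"Raw PB for 1 usable PB (3x replication): {raw_capacity_needed(1.0, rep):.1f}")
```

Under these assumptions, one usable petabyte needs 1.6 PB of raw disk with erasure coding versus 3 PB with three full copies.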