Data Science Day New York: GigaOM Big Data Market Overview
-
Upload
cloudera-inc -
Category
Documents
-
view
1.264 -
download
0
Transcript of Data Science Day New York: GigaOM Big Data Market Overview
![Page 1: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/1.jpg)
Big Data Market Overview
![Page 2: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/2.jpg)
Jo Maitland, Research Director, GigaOM
• 15+ years in technology research and journalism with focus on emerging infrastructure technologies including next generation storage, networking, virtualization, and cloud computing– Forrester Research (Analyst)– The 451 Group (Analyst)– TechTarget (Executive Editor)– UBM Tech (LightReading.com, Senior
Editor)– Computerwire (Senior Writer)– PC Week (Reporter)
![Page 3: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/3.jpg)
Agenda
• Data growth, it’s big• Oh the mess we are in…• Let’s turn off all the computers• Don’t be daft!• There’s new technologies to help store and analyze all this data• Enter Hadoop, NoSQL and Hype.• It’s the apps stupid• Emerging trends• Questions to consider
![Page 4: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/4.jpg)
How Big?
![Page 5: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/5.jpg)
Data growth at Facebook
![Page 6: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/6.jpg)
Data growth at Twitter
![Page 7: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/7.jpg)
Growth of machine generated data
![Page 8: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/8.jpg)
Data growth worldwide
![Page 9: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/9.jpg)
Data growth in the enterprise is staggering
• Walmart handles more than 1 million customer transactions per hour
• There are about 90 trillion emails per year
• Google processes some 24 petabytes of data per day
• AT&T transfers 30PB of data per day
![Page 10: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/10.jpg)
Business decision-makers are screwed, basically
![Page 11: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/11.jpg)
The Answer?
![Page 12: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/12.jpg)
What to do…
• Turn off all the computers?• Turn off some of the computers? • Stop storing everything and
classify your data?• All attempts to stem the tide
of big data will fail.
![Page 13: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/13.jpg)
Two new technologies have come to our rescue
Hadoop
NoSQL
![Page 14: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/14.jpg)
Commercial solutions enter the fray
• Hadoop distribution companies– Cloudera– HortonWorks– MapR– + +
• NoSQL database companies– 10gen (MongoDB)– DataStax (Cassandra)– Basho (Riak)– + +
![Page 15: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/15.jpg)
Hadoop + big data apps = useful
![Page 16: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/16.jpg)
Big data applications are key
• Operational intelligence– Splunk, Sumo Logic
• Sales and marketing– GoodData, Media Science, Bloomreach
• Visualization– Tableau Software, QlikTech, Palantir
• Business Intelligence– Platfora, Domo, WibiData
• Online advertizing– Collective, DataXu, RocketFuel, Turn
• Data as a service• FICO, DataSift, Bluekai
![Page 17: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/17.jpg)
What’s next?
![Page 18: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/18.jpg)
Emerging trends
• More data• Focus on applications• Data democratization and trust• A shift to real time
![Page 19: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/19.jpg)
data
![Page 20: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/20.jpg)
Emerging trends
• More data• Applications• Data democratization and trust• A shift to real time
![Page 21: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/21.jpg)
Applications
![Page 22: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/22.jpg)
Square
![Page 23: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/23.jpg)
PredPol
![Page 24: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/24.jpg)
23andMe
![Page 25: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/25.jpg)
Emerging trends
• More data• Applications• Data democratization and trust• A shift to real time
![Page 26: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/26.jpg)
Data democratization and trust
![Page 27: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/27.jpg)
Emerging trends
• More data• Applications• Data democratization and trust• A shift to real time
![Page 28: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/28.jpg)
Shift to real time
![Page 29: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/29.jpg)
Questions to consider
![Page 30: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/30.jpg)
Investors
• Is the company in an area that is already well funded or over-funded?– Infrastructure
• What are the emerging sub-categories?– Cloud-based services
• What’s the new angle?– ?
![Page 31: Data Science Day New York: GigaOM Big Data Market Overview](https://reader034.fdocuments.us/reader034/viewer/2022052603/55d4fa8abb61eb764c8b457e/html5/thumbnails/31.jpg)
Customers
• Are there existing big data apps you could use instead of building a custom app?– Log file analysis
• What is your 3 year big data roadmap?– Just as companies have measured their ROI on technology
investments, they should also measure the value they receive from information.