Post on 09-May-2015
description
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
A PANEL WORKSHOP POWERED BY GEEKETTES.IO
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
Ivan P. YamshchikovSenior AnalystYandex
Ivan Yamshchikov, @kr0nikerSenior Analyst
Analytics and big data. Ecosystem.
Yandex: >6000 employees, 9 countries and closer than you think.
“Maths in internet analytics is like nailing something on a space ship. You might know how to use nails, yet it does not mean that you’ll do it right.”
Me.
Analytics in Yandex
What does analyst do? And why does he do it?
Analytics and big data.
• Giving common sense to managers and marketing.
• Giving insights to the developers.
Why do you always need to have it?
Common sense
Do you have some common sense? How is this service doing?
Do you have some common sense? How is this service doing?
Do you have some common sense? How is this service doing?
Do you have some common sense? How is this service doing?
Do not look for answers before you have questions.
Summing up:
1. - Analytics helps you to ask the right question.- Big data gives you the answer.
2. - Big data gives you a lot of stuff to analyze.- Analytics gives you understanding which data you actually need
3. - Big data gives you correlations.- Analytics helps to understand them.
Ivan Yamshchikov
Senior analyst
@kr0niker
kroniker@yandex-team.ru
Thanks.
Felienne HermansProfessor and EntrepreneurDelft University of Technology
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
Spreadsheets: The dark matter of IT
Felienne HermansDelft University of Technology
Spreadsheets: The dark matter of IT
Felienne HermansDelft University of Technology
(@felienne)
Spreadsheets exist ‘under the radar’
Spreadsheets live 5 years on average
Average sheet is used by 12 people
33%
Disaster!
Horror stories
Horror stories
European Spreadsheet Risk Interest Group
Horror stories
European Spreadsheet Risk Interest Group
eusprig.o
rg
Loss 2.4 million dollarsCause typo
Loss 2.4 million dollarsCause typo
Loss 24 million dollars
Cause copy-paste error
What can we do?
Visualize
Visualization!Felienne Hermans
(@felienne)
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
Anke Domscheit-BergPublicist, Activist & PoliticanOpengove.me
Big Government. Big Data.Anke Domscheit-Berg
@ankedwww.opengov.meBerlin, 05.05.2014
Big Government Data.The Good. The Bad. The Ugly.
Where data can help• Health registers• Accident registers• Real time traffic data• Environmental data• Public service data• Educational data• Infrastructure data• Spending data• …-> better services, less corruption, wiser spending, life-saving
Where data can be (and is) abused
• Profiling of “transparent” citizens
• Algorithms replace human judgement
• Shared data bases – exponential power
• …
-> manipulation, black mailing, oppression, control, murder (drones)
Data can be life saving (http://bluebook.scts.org/)
2005: Guardianwins legal battle to get data published on mortality rates
after heart surgery
Data can eliminate our privacy.
• MIT could identify 95% of 1.5m People based on anonymised location data of their mobile phones
• 4 location + time data sets of one mobile phone are sufficient to identify a person, in more complex cases, 11 data sets are needed
• Comparison? 12 data sets are required for a safe identification of a human being based on his/her finger print
“its only meta data”
Freedom of Speech can vanish.
“What a relief, I got an anti-flu vaccine, which only causes mild symptoms. Getting a real infection caused by the flu would have been really bad.”
US DHS finds 377 Words suspicious
US DHS finds 377 Words suspicious
German BND uses 15.000 “suspicious” words to filter
communications.
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
Olga SteidlGrowth HackerLinko
BIG DATA MANAGEMENT – forget the hype, let‘s talk facts
Thank youIvan, Felienne, Anke, OlgaBerlin Geekettes & NEXT
European Demo DayJune 6th
Berlin
Geekettes.io
Thank you!
Lisa Lang@lilaineuroe