Data: A Cautionary Tale by Daniel Katz
-
Upload
jooyoun-paek -
Category
Education
-
view
122 -
download
6
description
Transcript of Data: A Cautionary Tale by Daniel Katz
![Page 1: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/1.jpg)
A Cautionary Tale
![Page 2: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/2.jpg)
![Page 3: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/3.jpg)
![Page 4: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/4.jpg)
![Page 5: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/5.jpg)
The Big Picture Collect Clean Model Store Present
![Page 6: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/6.jpg)
{ "classes": [ { "name": "Fundamental Process of Design", "professor": "Joo Youn Paek" , "year" : " 2010 ", "semester" : "fall", "students": [ { "student" : { "name": “Joe Student", “email": “[email protected]", "twitter_name": “@itp4life" , “blog_url": “http://itp4life.blogspot.co" , } } ] } ]}
![Page 7: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/7.jpg)
<classes><class>
<name>Fundamental Process of Design</name><professor>Joo Youn Paek</professor><year>2010</year><semester>Fall</semester><students>
<student><name>Joe Student</name><email>[email protected]</email>
<twitter_name>@itp4life</twitter_name><blog_url>http://itp4life.blogspot.com</blog_url></student>
</students></class>
</classes>
![Page 8: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/8.jpg)
![Page 9: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/9.jpg)
![Page 10: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/10.jpg)
The Open Data Movement is in Full Swing Governments Institutions Scientists Enthusiasts
http://vimeo.com/2598878
![Page 11: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/11.jpg)
Commercial tools and open source are starting to converge
![Page 12: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/12.jpg)
There will always be assumptions
![Page 13: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/13.jpg)
Bring it down
![Page 14: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/14.jpg)
FreeBase – Entity Graph Info Chimp Twitter Facebook
![Page 15: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/15.jpg)
Data.gov MTA
![Page 16: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/16.jpg)
Arduino Smart Phone Other sensors
![Page 17: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/17.jpg)
![Page 18: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/18.jpg)
Don’t be intimidated by data from disparate sources
![Page 19: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/19.jpg)
![Page 20: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/20.jpg)
![Page 21: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/21.jpg)
Clean up messy data Inconsistent data points Identify patterns Combine data from disparate
sources
![Page 22: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/22.jpg)
Collection of Twitter Responses from API
Value.parseJson().user.screen_name
![Page 23: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/23.jpg)
![Page 24: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/24.jpg)
Depending on the type of data you are collecting, there are appropriate places to
store it
![Page 25: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/25.jpg)
Non-programmers Google Fusion Tables
For programmers Geo Database and programming tools
PostGIS (Postgresql) GeoTools (Java)
![Page 26: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/26.jpg)
Non-programmers Google Docs (Read into processing) Microsoft Excel (internal charting tool) Text based formatting (visualize with
Google Chart API)
For programmers Any relational database
MySql PostgresSql
![Page 27: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/27.jpg)
Graph Database
![Page 28: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/28.jpg)
http://blog.blprnt.com/blog/blprnt/your-random-numbers-getting-started-with-processing-and-data-visualization
http://code.google.com/p/gdocjdbc/
![Page 29: Data: A Cautionary Tale by Daniel Katz](https://reader035.fdocuments.us/reader035/viewer/2022062511/54c6978a4a79593e338b4588/html5/thumbnails/29.jpg)
http://www.infochimps.com/datasets/tweets-during-state-of-the-union-address
http://code.google.com/p/google-refine/
http://dev.twitter.com/doc/get/geo/search
http://flowingdata.com/2009/07/14/how-does-the-average-consumer-spend-his-money/
http://www.bls.gov/cex/ http://www.google.com/
fusiontables/Home