Data 101: Introduction to Data Visualization
-
Upload
david-newbury -
Category
Data & Analytics
-
view
444 -
download
0
Transcript of Data 101: Introduction to Data Visualization
David NewburyProfessional nerd artist
@workergnomewww.workergnome.com
Data 101. David Newbury — @workergnome 2
What We're Doing Today:
—(Brief) History of Data Visualization—(Tiny) Theory of Visualization—(Nerdy) Overview of Concepts—(Fake) Data Exploration—(Incomplete) Overview of Tools
Data 101. David Newbury — @workergnome 3
What We're not Doing Today:
—Writing Code—Thinking about Mapping—Worrying about Data Provenance
Data 101. David Newbury — @workergnome 4
Which is biggest?
15012, 8271, 30193, 1189, 9913, 16000, 92481, 49801, 100407, 2910, 3809, 8018, 61528, 18083, 38691, 1800
Data 101. David Newbury — @workergnome 5
Edward Tufte
The Visual Display of Quantitative Information.
Data 101. David Newbury — @workergnome 23
Dataviz is constructed reality.You are telling a story, not (just) stating facts.
Data 101. David Newbury — @workergnome 28
data artas opposed to
data visualization
as opposed to
statistical graphicsData 101. David Newbury — @workergnome 29
StatisticalGraphicsHow do I create Statistical Graphs in SAS 9.1.3 without Proc Gplot. UCLA: Statistical Consulting Group.http://www.ats.ucla.edu/stat/sas/notes2/Data 101. David Newbury — @workergnome 30
Data Art
Dear Data Giorgia Lupi & Stefanie Posavec.http://www.dear-data.com
Data 101. David Newbury — @workergnome 31
Two Uses1). help people grasp things outside their reach
2.) tell stories
Data 101. David Newbury — @workergnome 33
explanatory visualization work
as opposed to
exploratory visualizations
Data 101. David Newbury — @workergnome 34
Dataviz is constructed reality.Do you care how true your story is?
Do you care how accurate your story is?
Are you trying to teach, entertain, or convince?
Data 101. David Newbury — @workergnome 35
Potential Subjects.
subways, sheep, the solar system,shoes, sleep, skyline,snow, supermarket, sausages,school,the sea, spiders,staircases, syrup, soap,sawmills, stereos...
Data 101. David Newbury — @workergnome 42
Potential Subjects.
subways, sheep, the solar system,shoes, sleep, skyline,snow, supermarket, sausages,school,the sea, spiders,staircases, syrup, soap,sawmills, stereos...
...and other things that begin with S.
Data 101. David Newbury — @workergnome 43
Scope
Out of the infinite stories about any subject, which parts are you going to choose?
Data 101. David Newbury — @workergnome 45
Possible Scopes
All trains in a dayAll the rides that I've been on this yearMy train this morningAll of the stops in the cityEach lineEvery train stop in the past 50 years
Data 101. David Newbury — @workergnome 46
Dimension
Which bits of information about a subjectare you going to focus on?
Data 101. David Newbury — @workergnome 47
Possible Dimensions
number of carsduration of ridedate of a ridedifferent linesnumber of stopscost per ridenumber of stops per daytime between stopsData 101. David Newbury — @workergnome 48
Types of Data
number of cars - Numericduration of ride - Numericdate of a ride - Datedifferent lines - Categorynumber of stops - Numericcost per ride - Categorynumber of stops per day - Numerictime between stops - NumericData 101. David Newbury — @workergnome 51
number of cars - Measureduration of ride - Measuredate of a ride - Measuredifferent lines - Categoriesnumber of stops - Measurecost per ride - Categoriesnumber of stops per day - Measuretime between stops - Measurecleanliness - Categories
Data 101. David Newbury — @workergnome 54
A hidden dimension:
David (1), Daniel (2), Dawn (3), Danique (4)
Position of the item in the group.
Data 101. David Newbury — @workergnome 56
Choose one.
subways, sheep, the solar system,shoes, sleep, skyline,snow, supermarket, sausages,school,the sea, spiders,staircases, syrup, soap,sawmills, stereos...
...and other things that begin with S.
Data 101. David Newbury — @workergnome 59
Domain
number of cars - 1...8duration of ride - 30 sec...2 hoursdate of a ride - - 24ft...200ftdifferent lines - Red line, Blue line, Green line, Silver Line, Yellow Linenumber of stops - **2..20cost per ride - "$2.50, $1.75, $3.00, $0.00"number of stops per day - ??...???Data 101. David Newbury — @workergnome 62
Range
Domain is the possible input values
Range is the possible output values
Data 101. David Newbury — @workergnome 63
Data3, 7, 10, 6, 2Position of the item in the group.
Domain[0-10][1-5]
RangeX: 400px Y: 800px
MappingX: item position Y: numeric value Data 101. David Newbury — @workergnome 64
Data3, 7, 10, 6, 2Position of the item in the group.
MultiplesDimensions
Data 101. David Newbury — @workergnome 67
Dataval1: 3, 7, 10, 6, 2val2: 5, 8, 1, 8, 3val3: Cat, Dog, Cat, Cat, DogPosition of the item in the group.
MappingX: item position Y: val1 Size: val2 Color: val3
Data 101. David Newbury — @workergnome 68
Dimensions beyond X and Y.
ColorSizeShapeLabelsPatternsIconsAnything Else You Can Imagine
Data 101. David Newbury — @workergnome 69