CEE 243:Week 4 Introduction to Data Analysis · Introduction to Data Analysis CEE 243 Analyzing...

Post on 21-May-2020

7 views 0 download

Transcript of CEE 243:Week 4 Introduction to Data Analysis · Introduction to Data Analysis CEE 243 Analyzing...

Copyright 2012

Introduction to Data Analysis

CEE 243

CEE 243:Week 4

Introduction to Data Analysis

Copyright 2012

Introduction to Data Analysis

CEE 243

Big ideas

• Reason to collect or display data: enable action

– Important to clarify who should/could take action

• Energy analysis requires lots of data, and there

are multiple ways to display and manipulate

data. This session shows you

– Examples of lots of data

– Examples of different good ways to show data:

scatter plots, carpet plots, commercial examples

– Good ways to show and annotate data to support

action

2

Copyright 2012

Introduction to Data Analysis

CEE 243

Poor data display prevents the communication of

information to the intended audience

3

What is wrong with this image?

Copyright 2012

Introduction to Data Analysis

CEE 243

Poor data display in the Building Performance

Domain wastes time and money

• Obvious Clutter

• Terrible Time scale

• Poorly positioned Legend

• Poor choice of scale for second Y axis

• No so what

4

Copyright 2012

Introduction to Data Analysis

CEE 243

Tufte’s principles of Best Practice Data Display enable

unambiguous communication of information

• Integrity of Data

• Enforce Visual Comparisons

• Show causality

• Show Multivariate Data

• Integrate words, numbers and images

• Design should be content driven

Suggested Reading: “The Visual Display of

Quantitative Information” by Tufte

5

Copyright 2012

Introduction to Data Analysis

CEE 243

Interesting (Amazing!) Visualization

• http://www.ted.com/talks/hans_rosling_shows_t

he_best_stats_you_ve_ever_seen.html

• Minutes 1-12

6

Copyright 2012

Introduction to Data Analysis

CEE 243

Impressive data display for a large volume of

energy information

Shows geography plus intensity (Howard 2012) 7

Copyright 2012

Introduction to Data Analysis

CEE 243

Best Practice Data Display

• Shows geography plus time variance in army size, army

movement, temperature

• Methods: – Labels

– Line size

– Line location

– Direction - color

– Date – annotation

– Temperature

8

Courtesy of Tufte Charles Minard's flow map of Napoleon's March, http://en.wikipedia.org/wiki/Charles_Joseph_Minard

Copyright 2012

Introduction to Data Analysis

CEE 243

Jargon

• Lie factor = “the size of an effect shown in a graph

divided by the actual size of the effect in the data on

which the graph is based” – Source: http://www.infovis-wiki.net/index.php/Lie_Factor

• Example:

– Size of Gas economy effect in graph (right) = (5.3 inches –

0.6”) / 0.6” = 783%

– Size of time (left) effect in data = (27.5 – 18) mpg / 18 mpg =

53%

– Lie factor = 14.8

9 [Tufte, 1991]

Copyright 2012

Introduction to Data Analysis

CEE 243

Jargon

• Range frame: Data frame that begins and ends at the minimum

and maximum values respectively

– Minimum not 0

10 [Tufte, 1991]

Copyright 2012

Introduction to Data Analysis

CEE 243

An importance checklist to help ensure clear graphs

Grey grid

Active grid lines

Range frame – only very carefully

Graphs must be at the correct magnitude

Use graphics that are proportional to data: Lie Factor ~ 1

Option to turn grid on and off

Golden rectangle (ratio of length to height of approximately 1.6:1)

Use clear, thorough labels -- not of vertical orientation;

Label important features and so-whats in the data itself

Maximum number of data streams is six

Sans Serif font

White background, important for Excel graphs

11

Copyright 2012

Introduction to Data Analysis

CEE 243

Annotation of figures

Two crucial parts to a caption or figure

annotation:

1. Features: important geometric content of the

figure or chart: axes, colors

2. Significance: information that the figure

contains and its significance for the reader

12

Copyright 2012

Introduction to Data Analysis

CEE 243

Fundamental Performance Analysis

13

Copyright 2012

Introduction to Data Analysis

CEE 243

Analysis of a typical work day shows a recognizable

pattern • Features

• Significance:

14

Copyright 2012

Introduction to Data Analysis

CEE 243

This site load for this campus has recognizable pattern

15

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Analyzing multiple days or weeks enables further

investigations

16

A The load profile shows that overnight setbacks were relaxed. B AM peaks are far in excess of the midday peak, leading to excessive demand charges.

Granderson 2010, Energy Management handbook

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243 17

Carpet plots help identify patterns in large data

sets

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Monthly totals enable rudimentary comprehension of

building performance and annual benchmarking

18

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Benchmarking has many forms, which is most

appropriate for you?

• At the site or building level – Utility Values

• Compare with last year

• Normalized comparison – Commercial Website name Cal Arch

http://poet.lbl.gov/cal-arch/

– Home Version also home energy saver

http://hes.lbl.gov/

• GSA Building Performance Tool Kit (Augenbroe)

• Optimum Performance Benchmarks??

19

Copyright 2012

Introduction to Data Analysis

CEE 243

Jargon

Degree Day Analysis: A unit of measurement equal to a difference of one degree between the mean outdoor temperature on a certain day and a reference temperature, – used in estimating the energy needs for heating or

cooling a building.

Reference Base Temperature =

15.5oC in the UK (60o F)

18.3oC in the US (65o F)

20

Copyright 2012

Introduction to Data Analysis

CEE 243

Sample Degree Days:

American Locations and Heating Demand

• Table Illustrates number of Degree days for Variable Tbase (ASHRAE)

Copyright 2012

Introduction to Data Analysis

CEE 243

Degree Day Sample

Example of United States Heating Degree Days V Gas Consumption

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

CEE 243 Data Analysis

23

Copyright 2012

Introduction to Data Analysis

CEE 243

Process for data analysis in CEE 243

24

Select system & points for analysis

Synthesize functional intent and data context

Access and graph data

Assess data conformance to functional intent

Identify potential causes

Document and discuss findings

Step 1

Step 4

Step 2

Step 3

Step 5

Step 6

End

All Ok

All Ok

All Ok

All Ok

All Ok

All Ok

Steps 1-5 to Clarify

Else

ElseSteps 1-4 for

satisfactory fault diagnosis

Steps 1-3 for additional

clarity

Else

Else repeat Step 3 for additional clarity

Copyright 2012

Introduction to Data Analysis

CEE 243

Simple But Effective Visualization

Can you improve it?

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Effective Visualization:

CV Box 2-1-3 Offices 297,295 and 293

Functioning properly Suspicious Behavior (Revise) Problem (Immediate attention)

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Effective Visualization?

Radiant Floors

High Slab Heat •Radiant Heat On/Off?

•Sensor or Valve Failure?Test?

Valve Closed: •OAT>78°

High Slab Heat •But valve closed°

•Sensor or Valve Failure? Test?

Valve position

change not gradual

Set Point at 72°, not

71°

Unoccupied

Space Temp

setpoint≠ 66°

Valve 120% Open? Not

Possible

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

Effective Visualisation?

Natural Ventilation

Open Window: •Space Temp high

•OAT <Space Temp

Smoke Test •Once per week

Close Window: •Space Temp <70°

No Nat Vat: •68°<OAT<85

°

68°Min OAT Set Point

Window Closed

Window Open

Night Purge: •Day OAT>75°

•Night OAT<65°

•Space

Temp>65°

• Features

• Significance:

Copyright 2012

Introduction to Data Analysis

CEE 243

An importance checklist to help ensure clear graphs

In addition to today’s Slides

Traffic light green/yellow/red show assessed status

On Images, Annotations of anomalies (important features) on

which the developer wants a reader to take action, e.g., note a

times-10 conversion error

In Wiki, accompanying text to explain

Functional Intent

Functional Areas

Problems Areas (Describe annotated anomalies)

Recommended action

Labels on x and y axes

Names of graphed data

Engineering units of graphed data

29

Copyright 2012

Introduction to Data Analysis

CEE 243

Other Visualizations

30

Copyright 2012

Introduction to Data Analysis

CEE 243

These links contain thought provoking visualizations

http://vis.stanford.edu/files/2010-Narrative-InfoVis.pdf

http://www.cbe.berkeley.edu/centerline/winter2009.pdf

http://www.cs171.org/2008/final_project/web_pages/wright_w/wright-

w.htm

http://www.nytimes.com/interactive/2010/06/29/magazine/rivera-

pitches.html

http://spotlight.abs.gov.au/Flash/#/2/0

http://vanderlin.cc/projects/mbta_visualizations

31

Copyright 2012

Introduction to Data Analysis

CEE 243

This slide promotes the assertive evidence based

approach to powerpoint presentations

http://www.writing.engr.psu.edu/slides.html

No bullets

32

http://hosting.epresence.tv/LBL/1/Page/Home.aspx?=&page=1

Leader Alumni 1/27/12

Copyright 2012

Introduction to Data Analysis

CEE 243

Big ideas

• Reason to collect or display data: enable action

– Important to clarify who should/could take action

• Energy analysis requires lots of data, and there

are multiple ways to display and manipulate

data. This session shows you

– Examples of lots of data

– Examples of different good ways to show data:

scatter plots, carpet plots, commercial examples

– Good ways to show and annotate data to support

action

33