Tor Hovland: Taking a swim in the big data lake

31
Taking a swim in the big data lake Tor Hovland

Transcript of Tor Hovland: Taking a swim in the big data lake

Page 1: Tor Hovland: Taking a swim in the big data lake

Taking a swim in the big data lakeTor Hovland

Page 2: Tor Hovland: Taking a swim in the big data lake

Company vision

Powel delivers software for tomorrow’s energy and environmental needs – today

Page 3: Tor Hovland: Taking a swim in the big data lake

Smart EnergyMaximising renewable power production and trading

Page 4: Tor Hovland: Taking a swim in the big data lake

Smart InfrastructureHelping to makeinformed decisions

Page 5: Tor Hovland: Taking a swim in the big data lake

Supporting municipal processes

Page 6: Tor Hovland: Taking a swim in the big data lake

ContractingVisualising the entire construction process – start to finish

Page 7: Tor Hovland: Taking a swim in the big data lake

Powel Offices

Page 8: Tor Hovland: Taking a swim in the big data lake
Page 9: Tor Hovland: Taking a swim in the big data lake

• Storage repository in the cloud• Various sources• Typically a lot of data• Data for the future

Big data lake

Page 10: Tor Hovland: Taking a swim in the big data lake

• Data warehouse: you clean and structure data before storage.(schema-on-write)

• Data lake: you store raw data. The user figures out how to query it.(schema-on-read)

Data lake vs data warehouse

Page 11: Tor Hovland: Taking a swim in the big data lake

• When you set up a machine learning experiment, you need historic data.

• Storage in the data lake is cheap.

• Store everything now, even if you don’t know if you’ll need it.

Data for the future

Page 12: Tor Hovland: Taking a swim in the big data lake

The case

Cloud storage

Collection dashboard

Data queries

Consumption predictions

Page 13: Tor Hovland: Taking a swim in the big data lake

The building blocks

Power BI

Data Lake Analytics& U-SQL

Machine Learning

Blob storagefor raw data

Blob storagefor business data

Stream Analytics

Event Hub

Service Fabric& Stateful Actors

Web app

Page 14: Tor Hovland: Taking a swim in the big data lake

Simulated meter data

2+𝑐𝑜𝑠 (π+4 π 𝑥 )

2+𝑐𝑜𝑠 (2π 𝑥)

Daily pattern

Seasonal pattern

Combined pattern

+ a consumption factor because people have different needs+ some random noise to make it more realistic

Page 15: Tor Hovland: Taking a swim in the big data lake

• A small grid company: 10.000 meters on hour resolution.• A big grid company: 1 mill. meters or more on hour resolution.

• 500 meters on second resolution• equivalent to 500 * 60 * 60 = 1,8 mill. meters on hour resolution.

Data load

Page 16: Tor Hovland: Taking a swim in the big data lake

Meter simulator actors on Service Fabric

Meter sim

Meter sim Meter

sim

Meter sim

Meter sim

booter

Meter sim booter

web client

Meter sim

Meter sim

Meter sim

Meter sim

Meter sim

Service Fabric

Page 17: Tor Hovland: Taking a swim in the big data lake
Page 18: Tor Hovland: Taking a swim in the big data lake

github.com/torhovland@torhovland

[email protected]

Contact

Page 19: Tor Hovland: Taking a swim in the big data lake

Demos

Page 20: Tor Hovland: Taking a swim in the big data lake
Page 21: Tor Hovland: Taking a swim in the big data lake
Page 22: Tor Hovland: Taking a swim in the big data lake
Page 23: Tor Hovland: Taking a swim in the big data lake
Page 24: Tor Hovland: Taking a swim in the big data lake
Page 25: Tor Hovland: Taking a swim in the big data lake
Page 26: Tor Hovland: Taking a swim in the big data lake
Page 27: Tor Hovland: Taking a swim in the big data lake
Page 28: Tor Hovland: Taking a swim in the big data lake
Page 29: Tor Hovland: Taking a swim in the big data lake
Page 30: Tor Hovland: Taking a swim in the big data lake
Page 31: Tor Hovland: Taking a swim in the big data lake