FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2...

19
FOGengine: Towards Big Data Analy8cs in the Fog Farhad Mehdipour Tech Futures Lab & Unitec Ins8tute of Technology Auckland, New Zealand [email protected] Bahman Javadi Western Sydney University, Sydney, Australia Aniket Mahan8 University of Auckland, Auckland, New Zealand The 2 nd Interna,onal Conference on Big Data Intelligence and Compu,ng, Auckland, New Zealand, August 810, 2016

Transcript of FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2...

Page 1: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

FOG-­‐engine:  Towards  Big  Data  Analy8cs  in  the  Fog

Farhad  Mehdipour  

Tech  Futures  Lab  &  Unitec  Ins8tute  of  Technology  Auckland,  New  Zealand  

[email protected]  

 

Bahman  Javadi    

Western  Sydney  University,  Sydney,  Australia  

Aniket  Mahan8    

University  of  Auckland,  Auckland,  New  Zealand  

The  2nd  Interna,onal  Conference  on  Big  Data  Intelligence  and  Compu,ng,  Auckland,  New  Zealand,  August  8-­‐10,  2016  

Page 2: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Introduc8on

•   Challenges  of  the  current  cloud-­‐based  plaForms  –  The  cloud  physically  located  in  a  distant  datacenter    

   à    Latency  –  Ver8cally  fragmented  –  Real-­‐8me  processing  large  quan88es  of  IoT  data  

   à  more  security,  capacity,  and  analy8cs  challenges  –  Incapability  of  current  cloud  for  efficient  Big  Data  Analy8c  

2  

 Our  solu,on  •  An  on-­‐premise  and  real-­‐8me  data  analy8c  engine  (FOG-­‐Engine)  located  near  

where  data  is  generated  •  Collabora8on  and  proximity  interac8on  between  IoT  devices  in  a  distributed  

and  dynamic  manner  

Page 3: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Current  plaYorms  issues:  Not  Fully  Integrated,  No  low-­‐latency,  and  might  be  Expensive  

Data  Storage,  Analy,cs  

CLOUD

Network  &  Internet  Infrastructure  

IoTs

Network  of  Sensors  and  Actuators  (Physical  World)

Raw  

Data  

Raw  

data  

Big  Data  Flow

3  

.   .   .

CYBE  R-­‐PHYSICAL  SYSTEM

Feedback

.   .   .

Network:  latency,  bandwidth  and  cost  •  Large  geographical  distanceà  Higher  Latency  •  The  aggregated  b/w  of  sensors  >>  network  b/w  •  Big  Data  is  heavy  to  moveà  Higher  Cost  and  Latency  Ra

w  

data  

Cloud:  latency  The  cloud  uses  virtual  machines  à  unnecessary  data  movement  

Raw  data:  the  size  can  be  huge  (e.g.  camera)  

Service  Ecosystem:  Fragmented  IoT  infrastructure  à  designers  need  to  interact  with  many  services  from  sensors/actuators  to  data  analy8c

Page 4: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Fog  Compu8ng  

•  The  Fog    –  extends  the  cloud  compu8ng  paradigm  to  the  edge  of  the  network,    

–  enables  a  new  breed  of  applica8ons  and  services    –  an  appropriate  solu8on  for  the  applica8ons  and  services  that  fold  under  the  umbrella  of  the  IoTs.  

•  Benefits    –  low  latency  –  loca8on  awareness  –  widespread  geographical  distribu8on  –  mobility  support  –  the  strong  presence  of  streaming  and  real-­‐8me  applica8ons  –  heterogeneity  

4  

Page 5: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Related  Works  

  AWS Microso[ IBM Google Alibaba Service AWS  IoT Azure  IoT  Hub IBM  Watson  IoT Google  IoT AliCloud  IoT

D a t a  Collec,on

HTTP,   WebSockets ,  MQTT

HTTP,   AMQP,   MQTT   and  custom   protocols   (using  protocol  gateway  project)

MQTT,  HTTP HTTP HTTP

Security L i n k   E n c r y p 8 o n  (TLS),   Authen8ca8on  (SigV4,  X.509)

L ink   Encryp8on   (TLS) ,  Authen8ca8on   (Per-­‐device  with  SAS  token)

Link  Encryp8on  (TLS),  Authen8ca8on  (IBM  Cloud  SSO),  Iden8ty  management  (LDAP)

Link  Encryp8on  (TLS)

Link  Encryp8on  (TLS)

Integra,on REST  APIs REST  APIs REST  and  Real-­‐8me  APIs

REST  APIs,  gRPC REST  APIs

Data  Analy,cs Am a z o n   M a c h i n e  Learning  model  (Amazon  QuickSight)

Stream   Analy8cs,   Machine  Learning

IBM  Bluemix  Data  Analy8cs

Cloud  Dataflow,  BigQuery,  Datalab,  Dataproc

MaxCompute

G a t e w a y  Architecture

Device  Gateway    (in  Cloud)

Azure   IoT   Gateway   (on-­‐premises   gateway,   beta  version)

General  Gateway General  Gateway  (on-­‐premises)

Cloud  Gateway  (in  Cloud)

5  

Page 6: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

FOG FOG

Our  View:  Decentralized  Hierarchical  Big  Data  Processing  on  the  Edge  

Data  Processing,  Mining,  Storage,  and  Visualiza,on  

CLOUD

Network  Access Network  Access

Raw  

data  

.   .   .

Raw  

data  

.   .   .

Semi-­‐structured  data  

LAN:  very  high  b/w  (x1GB/s)  MAN:  high  b/w  (x100MB/s)  

WAN:  low  b/w  (x10MB/s)  

Global  data  analy,cs  on  semi-­‐structured  data  

Data  Preprocessing    &  Analy,c  

FOG-­‐Engine:  Data  Analy8c  micro-­‐Engine  

Centralized:  the  FOG-­‐Engine  supports  all  the  func8onali8es  of  the  system

IoTs

Network  of  Sensors  and  Actuators  (Physical  World)

Page 7: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

7  

Velocity  

Volume  

High  b/w  Low  Speed  

High  b/w  High  Speed  

Low  b/w  Low  Speed  

Low  b/w    High  Speed  

How  to  Realize  Big  Data’s  Vs:  Velocity,  Volume,  …

Our  Solu8on

Current  plaYorm

Current  Cloud

Our  Solu,on

Page 8: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

8  

Volume  

Towards  Real-­‐Timeness,  with  Added  Values

Current  Cloud

Our  Solu,on

Higher  Speed  Higher  bandwidth  Flexibility  Heterogeneity    Extensibility,  etc.  More  Engineerable

Page 9: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

FOG-­‐Engine  vs.  Cloud

Characteris,c FOG-­‐engine   Cloud  plaForm Processing  hierarchy Local  data  analy8cs Global  data  analy8cs

Processing  fashion In-­‐stream  processing Batch  processing

Compu,ng  power GFLOPS TFLOPS

Network  Latency Miliseconds Seconds

Data  storage Gigabytes Infinite

Data  life,me Hours/Days Infinite

Fault-­‐tolerance High High

Processing  resources Heterogeneous  (e.g.  CPU,  FPGA) Homogeneous  (Data  center)

Versa,lity Only  exists  on  demand Intangible  servers

Provisioning Limited   by   the   number   of   FOG-­‐engines  in  the  vicinity

Infinite,  with  latency

Mobility  of  nodes May  be  mobile  (e.g.  in  the  car) None

9  

Page 10: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

10  

Data  cleaning  Feature  

extrac,on  &  transforma,on  

DB  storage  

Data  analy,cs  Interpreta,on  

and  presenta,on  

Data  collec,on/  integra,on  

Raw    data  

Data  Analysts  

A  Typical  Data  Analy8c  Flow  

Page 11: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Data  cleaning  

Feature  extrac,on  &  

transforma,on  

DB  Storage  

Data  analy,cs  

Interpreta,on  &  

presenta,on  

Data  collec,on/  integra,on  

Fog-­‐Engine  

Data  Analy,cs  

Interpreta,on  &  

presenta,on  

DB  Storage  

Data  integra,on  

Cloud

Data  from  other  FOG-­‐engines/  IoT  nodes

Raw  Data  from  IoTs  

Users  

FOG-­‐engines/  Users

A  Modified  Data  Analy8c  Flow  

11  

Page 12: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Fog Fog Fog

.   .   .

Centralized  data  analy,cs  and  storage  

CLOUD

Network  Access Network  Access Network  Access

Smart  City  

Fog  engine  

Raw  data  stream  

Fog  engine   Fog  engine  

12  

Page 13: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Collabora,ng  peers  

in  the  FOG Cloud  

Data  Analy,c  Engine  

Data  Cleaning,  Aggrega,on  &  Visualiza,on  

Data  Collec,on  and  Import

Data  Storage  System

Network  Interface  to  

Physical  World

Peer-­‐to-­‐Peer  Networking  

API

Network  Interface  to  Cloud  (Gateway)

Communica,on  Unit

Data  Analy,cs  &  Storage  Unit Orchestra,on  Unit  

IoTs  

General  Architecture  of  FOG-­‐Engine    

13  

Page 14: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Data  Acquisi,on  Interfaces  (APIs)  

USB   WiFi/BT     UART   GPIO  

Sensors   IoT  Devices   Web   Local  

storage  

ETL  (Extract,  Load  and  Transform)  

Cleaning,  Filtering,  Integra,on  

Rules  library  

Data  Analy,c  

Models  library  

Cloud  Interface  (Gateway)  

Cloud  

Orchestra,on/Cluster  forma,on/Dispatching  

Module  

Peer  FOG-­‐engine  

FOG-­‐engine  scheduler  and  task  manager  

Peer-­‐to-­‐Peer  Networking  Interface  

USB:  Universal  serial  bus  BT:  Bluetooth  UART:  Universal  Asynchronous  Receiver/Transmioer    SPI:  Serial  Peripheral  Interface  Bus  GPIO:  General-­‐purpose  input/output  pins  

SPI  

Detailed  Architecture  of  FOG-­‐engine    

14  

Page 15: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Preliminary  Results  

•  Implementa8on  plaYorm:  Raspberry  Pi  2.0  and  3.0  

•  Scenarios  1)  Mul,ple  receivers,  mul,ple  analysers,  and  mul,ple  

transmibers  scenario  2)  Mul,ple  receivers,  mul,ple  analysers,  and  single  

transmiber  scenario  3)  Mul,ple  receivers,  single  analyser,  and  single  

transmiber  scenario  

15  

Page 16: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Scenario  II  

Mul,ple  receivers,  mul,ple  analysers,  and  single  transmiber  scenario:      

–  Mul8ple  FOG-­‐engines  receive  and  analyse  data  individually,    –  FEs  data  is  transmioed  to  the  cloud  via  one  of  them  which  acts  as  a  cluster  head  

FOG-­‐Engine  (FE)  

FOG-­‐Engine  (FE)   Cloud  

Page 17: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

17  

IoT-­‐FE  1kb  0  

20  

40  

1   2   3   4   5   6   7   8   9   10   11   12   13   14   15   16   17   18   19   20  

The  transmission  ,me  (ms)  for  various  data  sizes  

IoT-­‐FE  communica,on  

IoT-­‐FE  1kb   IoT-­‐FE  10kb   IoT-­‐FE  100kb   IoT-­‐FE  1mb  

FE-­‐FE  1kb  0  

1000  

2000  

1   2   3   4   5   6   7   8   9   10   11  12   13  14   15   16   17   18   19  20  

The  transmission  ,me  (ms)  for  various  data  sizes  

FE-­‐FE  communica,on  

FE-­‐FE  1kb   FE-­‐FE  10kb   FE-­‐FE  100kb   FE-­‐FE  1mb  

FE-­‐Cloud  1kb  0  

2000  

4000  

1   2   3   4   5   6   7   8   9   10   11  12   13  14   15   16   17   18   19  20  

The  transmission  ,me  (ms)  for  various  data  size  FE-­‐Cloud  communica,on  

FE-­‐Cloud  1kb   FE-­‐Cloud  10kb   FE-­‐Cloud  100kb   FE-­‐Cloud  1mb  

FE  to  Cloud  communica8on  8me  is  significant  compared  to  IoT-­‐FE  and  FE-­‐FE  communica8on  8mes  (as  expected).    

Page 18: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

18  

0  

100  

200  

300  

400  

1   2   3   4   5   6   7   8   9   10   11   12   13   14   15   16   17   18   19   20  

The  transmission  ,me  of  1KB  data  for  various  communca,on  types  

IoT-­‐FE  1kb   FE-­‐FE  1kb   FE-­‐Cloud  1kb  

0  

100  

200  

300  

400  

1   2   3   4   5   6   7   8   9   10   11   12   13   14   15   16   17   18   19   20  

The  transmission  ,me  of  10KB  data  for  various  communca,on  types  

IoT-­‐FE  10kb   FE-­‐FE  10kb   FE-­‐Cloud  10kb  

0  

500  

1000  

1500  

2000  

1   2   3   4   5   6   7   8   9   10   11   12   13   14   15   16   17   18   19   20  

The  transmission  ,me  of  100KB  data  for  various  communica,on  types  

IoT-­‐FE  100kb   FE-­‐FE  100kb   FE-­‐Cloud  100kb  

Data  transmission  8me  increases  by  increasing  the  size  of  data  for  all  communica8on  types  

Page 19: FOG$engine:*Towards*Big*Data Analy8cs*in*the*Fogbjavadi/... · The2 nd!Interna,onal!Conference!on!Big!Data!Intelligence!and!Compu,ng,!Auckland,!New!Zealand,!August!8A10,!2016! Introduc8on

Thank  you  

Any  Ques8ons/Comments?  

19