Big data overwiew

18
Big Data Overview

Transcript of Big data overwiew

Page 1: Big data overwiew

Big Data Overview

Page 2: Big data overwiew

○ What is Big Data?

○ Big Data in a real life.

○ Big Data problems & challenges?

○ Ways to solve these problems.

○ Big Data tools and technologies.

Crucial Questions

Page 3: Big data overwiew

What is Big Data?

○ Too big to fit in memory

○ Too fast data acquisition

requirements

○ Too fast data processing

○ Too complex for

traditional processing

Page 4: Big data overwiew

Big Data: numbers & facts

701,389 Facebook logins

38,194 posts to instagram

2.4 Million search queries

2.78 Million video views

Page 5: Big data overwiew

Example: car fleet management○ 1M car profiles

○ Daily reports

○ Track position

by request

○ Keep history in

database

Page 6: Big data overwiew

Real-time car fleet management○ 1K cars connected in real time

○ Gather data via OBD2 scanners in real-time

○ Gather data from cars’ GPS sensors in real-time

○ Store the data for future processing

○ Real-time calculation to predict traffic, engine

problems, accidents

Page 7: Big data overwiew

What can we do with (Big)Data?○ Data ingestion & acquisition○ Data storage (search, transfer, sharing)○ Data processing & analysis○ Data visualization

Page 8: Big data overwiew

Data Ingestion & Acquisition

○ Extract:

RDBMS, file systems, messaging

systems, sensors, log files

○ Transform:

Filter, encode/decode, aggregate,

validate

○ Load:

Data warehouse, messaging system

Page 9: Big data overwiew

Data Storage

Big Data storage challenges:

○ Size (keep and search huge

amount of data)

○ Speed (data acquisition, data

search)

○ Availability (fault tolerance,

partition tolerance)

Page 10: Big data overwiew

○ Consistency: all nodes see the same data at the same time

○ Availability: every request gets a response (success or failure)

○ Partition tolerance: system works despite of network failures

CAP Theorem

Page 11: Big data overwiew

Streaming vs Batch processing

Batch Batch

Stream

Data

Page 12: Big data overwiew

Data processing: Lambda Architecture

Page 13: Big data overwiew

Data processing: Kappa Architecture

Page 14: Big data overwiew

Data processing: MapReduce

Page 15: Big data overwiew

Data Visualization

Page 16: Big data overwiew

Everything as a Service

Page 17: Big data overwiew

Example: Amazon Web Services

Page 18: Big data overwiew

Q & A ?

○ What is Big Data?

○ Big Data in a real life.

○ Big Data problems & challenges?

○ Ways to solve these problems.

○ Big Data tools and technologies.