Teaching Big Data Analtyics to Business School MS Students

Post on 17-Oct-2021

3 views 0 download

Transcript of Teaching Big Data Analtyics to Business School MS Students

Teaching Big Data Analtyics to Business School MS Students

Ramesh Shankar

Uconn School of Business

IT Teaching Workshop 2019, Wharton

1

MSBAPM Curriculum

2

Hadoop books Source: David Tilson, IT Teaching Workshop 2018

3

Hadoop resources

4

Cloudera VM Enabling virtualization

5

AWS EMR (Elastic MapReduce) Cluster

Source: David Tilson, IT Teaching Workshop 2018

6

7

AWS EC2:

8

Topics covered• Linux

• Hadoop Distributed File System

• Apache Sqoop• Extract data from RDBMS, into HDFS

• Apache Pig• Extract, Transform, Load (ETL) on data obtained via Sqoop• Schema on read, no permanent schema, flat files

• Apache Hive• Hadoop Data Warehousing Tool• Schema on read, permanent schema required, flat files

• MapReduce – conceptual overview

• Spark• In-memory Analytics

• Recommender Systems • Illustrates Spark

9

HDFS

10

Sqoop

11

Pig

12

Hive

13

14

15

Spark – recommender system (ALS)

16