Data Visualization Using R Programming

26
Slide ‹#› © 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com Data Visualization via R Programming

Transcript of Data Visualization Using R Programming

Page 1: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Data Visualizationvia R Programming

Page 2: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Session Objectives

At the end of this session, you will be able to

ᗍ Understand Data Visualization through use-casesᗍ Learn about Rᗍ Know how R language helps in visualizing dataᗍ Understand the Benefits of Data Visualization ᗍ Interpret Large Data Setsᗍ Perform Data Visualization

Page 3: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

What is R?

One of the things we like about R is that it can create beautiful graphics compared to other data analysis tools

ᗍ The R programming language is a free and open source software which is quickly gaining popularity due to its data handling capacity

ᗍ R has many nice GUI’s. The most famous GUI is R-studio

ᗍ R comes with supportive help and good inbuilt graphic commands

ᗍ It has inbuilt data sets from real data

Page 4: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 1: Flights Data

The table gives the data about the flights that departed NYC (i.e. JFK, LGA, EWR) for 2013

Source: RITA, Bureau of transportation statistics

1. Which month in the year 2013 witnessed maximum average departure delay? 2. From which airport shall I book my flight to reduce waiting time? 3. Which airline should I choose to reduce waiting time?

Are you able to answer these questions by looking at the huge data in the table?

Page 5: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 2: Ordered Items DataAfter looking at this data: Can you answer the following questions?

1. Which region has the highest number of sellers?2. What is the Region-wise maximum and minimum sales?

Page 6: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

What’s the Solution?

It is difficult to recognize data with such huge data

There is a solution for this in R using Data Visualization

Let us get to know what is data visualization and how we can find results easily with large amount of data using data visualization

Page 7: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Data visualization is the presentation of data in a pictorial or graphical format

It helps in decision making by providing accurate and complete insights of data

Data visualization is the art and practice of gathering, analyzing, and graphically representing empirical information Data visualizations are sometimes called information graphics or just charts and graphs

What is Data Visualization?

Page 8: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

It is unrealistic to expect that data visualization tools and techniques will unleash a barrage of ready-made stories from datasets

Every new visualization is likely to give us some insights into our data

Using Visualization to Discover Insights

Visualise

Analyze,interpret

Documents Insights

Transform Dataset

Finding Insights in

Data

Page 9: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Why is Data Visualization Important?

As more and more data is collected and analyzed, decision makers at all levels welcome data visualization software that enables them to see analytical results presented visually

Because of the way the human brain processes information, it is faster for people to grasp the meaning of many data points when they are displayed in charts and graphs rather than poring over piles of spreadsheets or reading pages and pages of reports

Raw data is boring and it’s difficult to make sense of it in its natural form. Adding visualization to it can be useful

Page 10: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Let’s look at how we can visualize our data in R

A popular way to both analyse and visualize data is to use the R language

ggplot2 is one of the basic static visualization packages in R…

Page 11: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

ggplot: Line plot.

Lets Visualize the Data in R!

Page 12: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Challenges

High throughput data sets are growing exponentially both in size and complexity

Extracting meaningful information from this data requires not only programming skills, but also understanding of the analysis workflows and visualization tools that help to condense a large amount of information into a comprehensible story

Page 13: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 1: SolutionMonth wise departure delay

Page 14: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 1: Solution

Airport and month-wise departure delay

Page 15: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 1: Solution

Airline wise average departure delay

Page 16: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 2: Solution

Let us look at the following plots

Page 17: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 2: Solution

Now try answering these questions:

Question: Which region has the highest number of sellers?

Answer: Central Region has the highest number of sellers(as can be seen as the maximum area covered in pie chart)

Moreover, you can now also tell which region has the lowest number of sellers, here the West Region.

Page 18: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Use Case – 2: Solution

Question:What is the Region-wise maximum and minimum sales?

Answer: As can be seen from the plot each region has its specific maximum and minimum units sale.(The top represents the maximum and bottom is for minimum)

The dark line represents the median of sales of that region

Page 19: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Big Deal about R in Data Visualization

Companies that incorporate R practices get more done, plain and simple

Data visualization using R helps stakeholders interpret data according to their department’s or project’s or customer’s (or whomever’s) specific needs

Page 20: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Business Benefits of Data Visualization

Each year, the amount of data being created is increasing tremendously. Due to the massive availability of data, businesses are becoming more & more number-driven in their decision making

ᗍ Absorb more information easily

ᗍ Discover relationships & patterns between business & operational activities

ᗍ Identify emerging trends faster

ᗍ Directly interact with data which increases productivity and saves time

ᗍ Improves Collaboration

Page 21: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Job Trends – R

Page 22: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Why SkillSpeed?

Course Curriculum

from Industry Experts

Instructor Led Live Virtual Sessions

Lifetime access to Course

Content via LMS

100% Placement Assistance

24x7 Support

24x7

Page 23: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Corporate Partners

Page 24: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Contact us..

Lines open 24/7

To know more about the course, Please contact:

IND+91-90660-20904 USA1866-607-6547 (Toll Free)

Or reach us [email protected]

Page 25: Data Visualization Using R Programming

Slide ‹#›© 2015 BlueCamphor Technologies (P) Ltd. www.skillspeed.com

Image References

http://datajournalismhandbook.org/1.0/en/understanding_data_7.html

Google images

Page 26: Data Visualization Using R Programming