M 2 Data Analytics Lifecycle

22
Data Analytics Lifecycle B.Bhuvaneswaran Assistant Professor (SS) Department of Computer Science & Engineering Rajalakshmi Engineering College Thandalam Chennai – 602 105 [email protected]

description

M 2 Data Analytics Lifecycle

Transcript of M 2 Data Analytics Lifecycle

Page 1: M 2 Data Analytics Lifecycle

Data Analytics LifecycleB.Bhuvaneswaran

Assistant Professor (SS)Department of Computer Science & Engineering

Rajalakshmi Engineering CollegeThandalam

Chennai – 602 [email protected]

Page 2: M 2 Data Analytics Lifecycle
Page 3: M 2 Data Analytics Lifecycle

Phase 1: Discovery Phase 2: Data Preparation Phase 3: Model Planning Phase 4: Model Building Phase 5: Communicate Results Phase 6: Operationalize

Page 4: M 2 Data Analytics Lifecycle
Page 5: M 2 Data Analytics Lifecycle

Phase 1: Discovery Learn the business domain, including relevant history,

such as whether the organization or business unit has attempted similar projects in the past, from which you can learn.

Assess the resources you will have to support the project, in terms of people, technology, time, and data.

Frame the business problem as an analytic challenge that can be addressed in subsequent phases.

Formulate initial hypotheses (IH) to test and begin learning the data.

Page 6: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-16) In which lifecycle stage are initial

hypotheses formed? A. Discovery B. Model planning C. Model building D. Data preparation

Page 7: M 2 Data Analytics Lifecycle
Page 8: M 2 Data Analytics Lifecycle

Of all of the phases, the step of Data Preparation is generally the most iterative and time intensive.

Page 9: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-20) In which phase of the data analytics

lifecycle do Data Scientists spend the most time in a project? A. Discovery B. Data Preparation C. Model Building D. Communicate Results

Page 10: M 2 Data Analytics Lifecycle
Page 11: M 2 Data Analytics Lifecycle

Phase 4: Model Building Develop data sets for testing, training, and

production purposes. Get the best environment you can for

executing models and workflows, including fast hardware and parallel processing.

Page 12: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-29) In which lifecycle stage are test and

training data sets created? A. Model building B. Model planning C. Discovery D. Data preparation

Page 13: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-29) In which lifecycle stage are appropriate

analytical techniques determined? A. Model planning B. Model building C. Data preparation D. Discovery

Page 14: M 2 Data Analytics Lifecycle
Page 15: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-31) In which phase of the analytic lifecycle

would you expect to spend most of the project time? A. Discovery B. Data preparation C. Communicate Results D. Operationalize

Page 16: M 2 Data Analytics Lifecycle
Page 17: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-33) Which activity is performed in the

Operationalize phase of the Data Analytics Lifecycle? A. Define the process to maintain the model B. Try different analytical techniques C. Try different variables D. Transform existing variables

Page 18: M 2 Data Analytics Lifecycle
Page 19: M 2 Data Analytics Lifecycle

Question? (Ref. Module-2, Page-37) What is an appropriate data visualization

to use in a presentation for an analyst audience? A. Pie chart B. Area chart C. Stacked bar chart D. ROC curve

Page 20: M 2 Data Analytics Lifecycle
Page 21: M 2 Data Analytics Lifecycle

References Data Science and Big Data Analytics

(DSBDA), EMC.

Page 22: M 2 Data Analytics Lifecycle