LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

15
Course Details “Machine Learning gives computer the ability to learn without being explicitly programmed” ~l.Samuel DataScience Training Build your own predictive models in 45 days with zero prior knowledge LEARN BY DOING BY-SASMITA PANIGRAHI In this project ,we will build a Interactive Visualization dashboard In this course, you will learn how to apply Data Science through seven pragmatic steps - Frame, Acquire, Refine, Transform, Explore, Model, and Insight - to any business problem. The focus will be to learn the principles through an applied case study and by actually coding in Python to solve this. Objective Learn how to employ statistical and machine learning algorithms to solve real life problems by working on real time Projects. Develop proficiency in using python and its libraries like Pandas, numPy, Seaborn, Interactive and live coding session Taught by Real time Practitioners Project-1 -Pandas,Plotly, python core Project-2 Sales Forecasting Project-3 Fraud Detection Project-4 Document Clustering

Transcript of LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Page 1: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Course Details “Machine Learning gives computer the ability to learn without being explicitly programmed” ~l.Samuel

DataScience Training Build your own predictive models in 45 days with zero prior knowledge

LEARN BY DOING BY-SASMITA PANIGRAHI

In this project ,we will build a Interactive Visualization dashboard

In this course, you will learn how to apply Data Science through seven pragmatic steps - Frame, Acquire, Refine, Transform, Explore, Model, and Insight - to any business problem. The focus will be to learn the principles through an applied case study and by actually coding in Python to solve this.

Objective ‣ Learn how to employ statistical and machine

learning algorithms to solve real life problems by working on real time Projects.

‣ Develop proficiency in using python and its libraries like Pandas, numPy, Seaborn,

‣ Interactive and live coding session

‣ Taught by Real time Practitioners

Project-1

-Pandas,Plotly,python core

Project-2

Sales Forecasting

Project-3

Fraud Detection

Project-4

Document Clustering

Page 2: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

‣ What is Artificial Intelligence ‣ Introduction To DataScience ‣ Real Time UseCases Of DataScience ‣ Who is a DataScientist?? ‣ DataScience Project Lifecycle ‣ Skillsets needed for DataScientist ‣ Difference between DataEngineer,

DataScientist and DataAnalyst ‣ 6 Steps to take in 3.5 Months for a

complete transformation to DataScience from any other domain

‣ Machine Learning-Giving Computers The ability to learn from data ‣ Supervised vs Unsupervised ‣ DeepLearning vs Machine Learning

‣ Software Installation ‣ Jupyter Notebook Tutorial ‣ Introduction to Python ‣ Comments ‣ Variable,Operators,DataTypes ‣ If Else,For and While Loops ‣ Functions ‣ Lambda Expression ‣ Taking input from keyboard ‣ List ‣ Tuple ‣ Set ‣ Dictionary ‣ INTERVIEW QUESTIONS-

ASSIGNMENT-1

Module-1(Python Basics)

Python Fundamentals

Welcome To The Course

Page 3: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Module-2(Python Advance)

‣ Introduction to Numpy ‣ Creating Arrays ‣ arange(),linspace() etc. ‣ Creating Arrays of Random Numbers ‣ Basic Operations on an Array ‣ Applying Universal functions on an array ‣ Linear Algebra operations on an array ‣ Numpy DataTypes ‣ Type Conversion ‣ Array Stacking

‣ASSIGNMENT-2

NumPy

Pandas

‣ Introduction to Pandas ‣ Creating DataFrames ‣ Reading data from csv,excel etc. into a

DataFrame & writing df into csv,excel ‣ Selection and Indexing ‣ Conditional Selection ‣ Groupby ‣ Pivot Table ‣ Merging , Joining, Cancatenation ‣ Missing Value Treatment ‣ Data Visualisation using Pandas

‣ASSIGNMENT-3

Page 4: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

we’ll begin curriculum focused on various data visualization techniques and how they can help us engage and learn from our data using Plotly

Visualisation-Plotly

‣ Line Plots ‣ Scatter Plots ‣ Pair Plots ‣ Histograms ‣ Heat Maps ‣ Bar Plots ‣ Count Plots ‣ Factor Plots ‣ Box Plots ‣ Violin Plots ‣ Swarm Plots ‣ Strip Plots ‣ Pandas Builtin Visualisation Library

‣ASSIGNMENT-4

Module-3( Visualisation)

Prcatice , Practice and Practice!!!!!!! Implement what you have learnt so far by working in a real time Project……..

Project-1

Pandas Numpy Plotly

Python Core

Skills Required

Page 5: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Module-4 (Statistics)

‣ Descriptive vs Inferential Statistics ‣ Mean,Median,Mode ‣ Central Limit Theorm ‣ Measure of dispersion ‣ Inter Quartile Range ‣ Variance ‣ Standard Deviation ‣ Box Plot ‣ Z score ‣ Scatter Plot ‣ Pearson’s Product Moment Correlation-r ‣ R square ‣ Adjusted R-square ‣ Normal Distribution ‣ Standard Normal Distribution ‣ Emprical rule of Normal Distribution ‣ What is an Outlier ‣ Outlier Detection and Removal

Statistics

Page 6: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Module-5 (ML-Linear Reg)

Linear Regression, Cost Function,Gradient

Descent,sklearn

‣ Introduction to Machine Learning ‣ Supervised vs Unsupervised ‣ Regression vs Classification ‣ Linear Regression Theory ‣ Gradients/Derivative Theory ‣ Assumption of Linear Regression ‣ Cost Function ‣ Optimize Cost function using Gradient

Descent ‣ Gradient Descent Detail Explanation ‣ Mathematical Derivation ‣ Multi- Colinearity ‣ MAE ‣ MSE ‣ RMSE

Prcatice , Practice and Practice!!!!!!! Implement what you have learnt so far

Project-2(Sales Prediction)

Sklearn Pandas Plotly

Python Core Linear Regression

Page 7: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

‣ What is Bootstap ‣ Bagging ‣ Difference between Random Forest and

Decision Tree ‣ Feature Selection using Random Forest ‣ Hyperparameter tuning

Random Forest

Module-6 (Decision Tree, Random Forest)

Decision Tree‣ What is ID3 Algorithm ‣ Entropy ‣ Calculating Information Gain ‣ Overfitting,Underfitting,Best fit

‣ Confusion Matrix ‣ Classification Report ‣ Recall ‣ Precision ‣ AUC ‣ ROC ‣ Cross Validation

CLASSIFICATION VALIDATION TECHNIQUES

Page 8: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

‣ Implement PCA from scratch using Numpy and using sklearn is a real time dataset

Step By Step Implementation of PCA From scratch (with out sklearn) and by using

sklearn

Module-7 (PCA)

Principal Component Analysis

‣ Introduction to Dimensionality Reduction ‣ PCA Theory discussion ‣ Eigen Values , Eigen Vectors ‣ Step by Step Detail Mathematical

Derivation ‣ Individual and Cummulative Variation

Ratio

Prcatice , Practice and Practice!!!!!!! Implement what you have learnt so far bworking in a real time Project……..

Project-3(Fraud Detection)

Sklearn Python Pandas Plotly PCA

Decision Tree Random Forest

Page 9: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Module-9 (KMeans)

‣ Introduction to Unsupervised Machine Learning

‣ KMeans Theory ‣ How to decide K in KMeans

Module-10 (NLP)

‣ Introduction to NLP ‣ Text Preprocessing Techniques using

Space and NLTK ‣ Word Tokens ‣ StopWord Removal ‣ Count Vectorizer ‣ Tf-Idf Vectorizer

KMeans Clustering

Text Mining

Project-4(Document Clustering)

Sklearn Python Pandas Plotly PCA

KMeans NLTK,Spacy

Page 10: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Last Class.........

Discussion on Some Real Time Projects from •different domains(Banking,Retail,Manufacturing etc. domain projects)

Resume Preparation Tips•

Discussion of some Magic Python Packages and •Libraries for Data Analysis, Visualization, Outlier Detection, Time Series and For Handling Class Imbalance Problem

Page 11: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Recordings For Bonus Chapters............

Introduction DeepLearning•What is a Perceptron•Neural Network Architecture •Activation Function•

- Sigmoid - hyperbolic tangent - relu - leaky relu - parametric relu

Hidden Layers•Feed Forward, Back Propagation•Gradient Descent•Introduction to KERAS•Implementing first Deep Learning model using •KERAS

Page 12: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training
Page 13: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training
Page 14: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training
Page 15: LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training

Bonus Recording