Bringing Open Data Science into Excel with Anaconda Fusion
-
Upload
continuum-analytics -
Category
Data & Analytics
-
view
4.154 -
download
0
Transcript of Bringing Open Data Science into Excel with Anaconda Fusion
© 2016 Continuum Analytics- Confidential & Proprietary
BRING OPEN DATA SCIENCE INTO EXCEL WITH ANACONDA FUSION
Peter Wang, CTO Christine Doig, Senior Data Scientist & Anaconda Fusion Product Manager Fabio Pliger, Senior Software Engineer & Anaconda Fusion Tech Lead
2
• Empowering Analysts with Power of Anaconda • What is Anaconda? • Demo 1: Interactive Data Visualizations in Excel • Demo 2: Machine Learning in Excel • Demo 3: Big Data & ETL Processes in Excel • Demo 4: Python as a VBA replacement - Advanced mode • Summary
Agenda
Empower Business Analysts to: • Leverage data science assets easily through MS Excel • Enrich MS Excel analysis with the power of Python • Go beyond simple viz to powerful interactive viz
Empower Data Scientists to: • Extend their work to business analysts • Showcase their work through powerful interactive storytelling
3
Empowering Analysts with Power of Anaconda
5
is…. leading Open Data Science platformPowered by Python the fastest growing data science language
• Accelerate Time-to-Value • Connect Data, Analytics & Compute • Empower Data Science Teams
© 2016 Continuum Analytics- Confidential & Proprietary 6
ACCELERATE Time-to-Value
INNOVATE faster through managed agile experimentation MOVE from analysis to deployment immediately DELIVER high performance analytics processing
CONNECT Data, Analytics & Compute
LEVERAGE innovative open source analytics to extract value from data MAXIMIZE your computational power to easily analyze all your data CONNECT and integrate all your data sources for predictive models
EMPOWER Data Science Teams
ITERATE quickly to create powerful analysis and predictive models COLLABORATE and share with your data science team PUBLISH interactive results to the business
© 2016 Continuum Analytics- Confidential & Proprietary 7
Data ScientistBiz Analyst Data EngineerDeveloper DevOps
Deploy & Operate
Explore & Analyze
Collaborate & Publish
Data Science Team
9
Connects Open Data Science with Microsoft Excel
Anaconda Fusion
• BRING interactive visualizations, machine learning and ETL to Excel
• BRIDGE Excel Data to Python and R through notebooks
• ACCESS all the power of Python and Big Data, natively embedded inside Excel
10
Biz Analysts
Explore & Analyze
Data Scientist
Collaborate
Empower the Data Science Team• Connect Analysts and Excel users with Open Data Science • Leverage analysis tools the team is already skilled with: Excel • Make it easy for Data Scientists to share their work with business leaders
11
Accelerate Time-to-Value• Increase impact of Data Science throughout the organization • Drive business value of analytic solutions by involving analysts and management • Make analyst teams more productive
Data ScientistBiz AnalystsBig Data & ETL
Interactive Data Visualizations
Client Machine Compute Node
Compute Node
Compute Node
Head Node
Machine Learning
Statistics and Advanced Analytics
12
• Connect Excel to the Anaconda Platform via Jupyter notebooks • Unified UI with spreadsheets (data), notebooks (analytics) and a kernel (compute) • Expose and explore Big Data in Excel
Connect Data, Analytics and Compute
ANALYTICS
COMPUTE
DATA
13
Excel, SQL, Tableau
Wor
ks
wit
hD
eliv
ers
Dat
a
spreadsheets spreadsheets, dataframes, tables dataframes, tables
spreadsheets, reports, visualizations
spreadsheets with macros predictive models, data transformations,
interactive data visualizations
Data ScientistAnalyst/Manager Advanced Analyst
Excel, SQL, VBA, SAS
(Python, R)
SQL, Hadoop Python, R
14
Data ScientistAnalyst/Manager Advanced Analyst
in magic mode in expert mode
Anaconda Fusion for Everyone
from fusion import fusion
16
Anaconda Repository
DeveloperBiz Analyst Data Scientist
Anaconda Enterprise Notebook Server
Anaconda Fusion
Build and publish
software packages
Install software packages
Collaborate with
notebooksAccess
notebooks functionality
in Excel
DevOps / IT
Manage and control accounts, software and data access and
guarantee security and complianceDeploying Anaconda in Enterprises
• No Python install needed for Business Analysts (small footprint)
• Computations can run server-side
• Python 2 and 3 support
18
Demo 1: Interactive Data Visualization for Analysts
Interactive visualizations for exploring Excel data • Move around, zoom in and out • Easily change visualization types • Select & highlight points
19
Demo 2: Machine Learning for everyone
• Analysts • Choose functions & parameters with “magic
mode” • Access and reference Excel data
• Data Scientists • Learn how to use Anaconda Fusion decorator
in notebooks to configure “magic mode” • Everyone
• Access all the rich capabilities in Python (including scikit-learn for machine learning)
20
Demo 3: Harness Big Data from Excel
• For Analysts: • Execute queries from Excel • Import and manage data from Hadoop
• For Data Scientists • Connect to Hadoop via Dask, Ibis, Impala • Build interactive query apps
Client Machine Compute Node
Compute Node
Compute Node
Head Node
21
Demo 4: Python as VBA replacement for Advanced Analysts
• Write Python scripts in Excel • Leverage Python ecosystem for
Open Data Science
23
Why Anaconda Fusion?
Anaconda Fusion Connects Open Data Science with Microsoft Excel
EMPOWER Data Science Teams
ACCELERATE Time-to-Value
CONNECT Data, Analytics & Compute
24
https://www.continuum.io/anaconda-subscriptions
Anaconda Fusion is available with Anaconda Enterprise Subscription
25
Join the Innovators Program
Free of cost!
https://go.continuum.io/anaconda-fusion-innovators/