Doing data science with F#

Post on 18-Nov-2014

description

The ability to take data, understand it, visualize it and extract useful information from it is becoming a hugely important skill. How can you turn all those logs, histories of purchases and trades, or open government data into useful information that helps your business make money? In this talk, we’ll look at doing data science using F#. The F# language is perfectly suited for this task: type providers integrate external data directly into the language, so your language suddenly _understands_ CSV, XML, JSON, REST services and other sources. The interactive development style makes it easy to explore data and test your algorithms as you’re writing them. A rich set of libraries for working with data frames, time series and visualization gives you all the tools you need. And finally, F# easily integrates with statistical environments like R and MATLAB, giving you access to the industry-standard libraries.

Transcript of Doing data science with F#

Doing data science with F#

Tomas Petricek tomas@tomasp.net | @tomaspetricek

PhD Student at Cambridge & Coordinator of http://fsharp.org

F# Software Foundation

http://www.fsharp.org

software stacks

trainings teaching F# user groups snippets

mac and linux community books and tutorials

consulting open-source MonoDevelop

contributions research support cross-platform mailing lists

Community matters!

All the Data of the World

kaggle vega grammar data sources presentation

R provider visualization

F# Data Science Working Group

www.fslab.org

data acquisition

statistics data cleaning machine learning

data transformation visualization type providers

time-series data aggregation

Acquire

Analyze

Visualize

Demo: Analyzing Titanic survivors

Deedle data frame

Data exploration

Indexing and aggregation

F# Charting library

Simple & composable

Interactive style
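The indexing and aggregation step from this demo can be sketched roughly as follows. This is a minimal sketch, not the code shown in the talk: the `Passenger` record and its four rows are made up for illustration, and it assumes the Deedle NuGet package is available to F# Interactive.

```fsharp
#r "nuget: Deedle"
open Deedle

// Hypothetical passenger records; the values are invented for illustration
// and are not taken from the real Titanic data set
type Passenger = { Name : string; Sex : string; Survived : int }

let passengers =
  [ { Name = "A"; Sex = "female"; Survived = 1 }
    { Name = "B"; Sex = "female"; Survived = 1 }
    { Name = "C"; Sex = "male"; Survived = 0 }
    { Name = "D"; Sex = "male"; Survived = 1 } ]

// Build a data frame with one column per record field
let titanic = Frame.ofRecords passengers

// Group rows by the "Sex" column, then average "Survived" within each group
let survivalRate : Series<string, float> =
  titanic
  |> Frame.groupRowsByString "Sex"
  |> Frame.getCol "Survived"
  |> Stats.levelMean fst

printfn "%A" survivalRate
```

Grouping produces a two-level row index (group key plus original row key), and `Stats.levelMean fst` collapses it back to one mean per group.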

Demo: Understanding the world

F# Data type providers

First-class data

CSV, REST, WorldBank…

R Type provider

Statistics & visualization

5000 tested packages
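To give a flavour of what "first-class data" means for the F# Data type providers, here is a small sketch using the JSON provider. The inline sample document and the `Person` type name are assumptions chosen for illustration; the demo itself used other sources such as CSV and WorldBank. It assumes the FSharp.Data NuGet package is available to F# Interactive.

```fsharp
#r "nuget: FSharp.Data"
open FSharp.Data

// The inline sample is illustrative; from it the provider infers a type
// with Name : string and Age : int at compile time
type Person = JsonProvider<""" { "name": "John", "age": 94 } """>

// Parse another document of the same shape; fields are statically typed
let p = Person.Parse(""" { "name": "Tomas", "age": 4 } """)

printfn "%s is %d" p.Name p.Age
```

A typo such as `p.Nmae` is rejected by the compiler rather than failing at run time, which is the point of integrating the data source into the language.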

Demo: US debt over the last century

Deedle data frame

Time-series alignment

Data transformations

Vega visualization

F# wrapper for Vega

Pre-alpha version
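The time-series alignment mentioned for this demo might look something like the sketch below. The two series and their values are toy data, not debt figures, and the column names `A` and `B` are placeholders; it assumes the Deedle NuGet package is available to F# Interactive.

```fsharp
#r "nuget: Deedle"
open Deedle

// Two toy series keyed by year; the keys only partly overlap
let a = series [ 2000 => 1.0; 2001 => 2.0; 2002 => 3.0 ]
let b = series [ 2001 => 10.0; 2002 => 20.0; 2003 => 30.0 ]

// Putting both series into one frame aligns them on the union of the keys;
// years present in only one series get a missing value in the other column
let aligned = frame [ "A" => a; "B" => b ]

// Arithmetic works key by key; rows with a missing operand are dropped
let ratio = (aligned?B / aligned?A) |> Series.dropMissing

printfn "%A" ratio
```

Only the years 2001 and 2002 appear in both series, so those are the only keys left in `ratio`.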

F# for Data Science

acquire, analyze, visualize

interactive experience

safety and efficiency of .NET

ready for production

Going forward

Use #fsharp for fun & profit

Join local user groups

Help us build data science tools

fsharp.org | fslab.org | tomasp.net
