KEMBAR78
Doing data science with F# | PDF
Doing data science with F#
Tomas Petricek
tomas@tomasp.net | @tomaspetricek
PhD Student at Cambridge & Coordinator of http://fsharp.org
software stacks

trainings
mac and linux

teaching F#

user groups

snippets

community books and tutorials

F# Software Foundation
consulting

open-source MonoDevelop

http://www.fsharp.org
contributions research support
cross-platform

mailing lists
Community matters!
All the Data of the World
data acquisition

statistics data cleaning machine learning
data transformation

visualization type providers

F# Data Science Working Group
kaggle

vega grammar

R provider

data sources presentation

www.fslab.org

time-series

visualization

data aggregation
Acquire

Visualize

Analyze
Demo: Analyzing Titanic survivors
Deedle data frame
Data exploration
Indexing and aggregation

F# Charting library
Simple & composable
Interactive style

www.fslab.org
Demo: Understanding the world
F# Data type providers
First-class data
CSV, REST, WorldBank…

R Type provider
Statistics & visualization
5000 tested packages

www.fslab.org
Demo: US debt over the last century
Deedle data frame
Time-series alignment
Data transformations

Vega visualization
F# wrapper for Vega
Pre-alpha version

www.fslab.org
F# for Data Science
acquire, analyze, visualize
interactive experience
safety and efficiency of .net
ready for production
@tomaspetricek
Going forward
Use #fsharp for fun & profit
Join local user groups
Help us build data science tools
fsharp.org | fslab.org | tomasp.net
@tomaspetricek

Doing data science with F#

  • 1.
    Doing data sciencewith F# Tomas Petricek tomas@tomasp.net | @tomaspetricek PhD Student at Cambridge & Coordinator of http://fsharp.org
  • 2.
    software stacks trainings mac andlinux teaching F# user groups snippets community books and tutorials F# Software Foundation consulting open-source MonoDevelop http://www.fsharp.org contributions research support cross-platform mailing lists
  • 3.
  • 4.
    All the Dataof the World
  • 5.
    data acquisition statistics datacleaning machine learning data transformation visualization type providers F# Data Science Working Group kaggle vega grammar R provider data sources presentation www.fslab.org time-series visualization data aggregation
  • 6.
  • 7.
  • 8.
    Deedle data frame Dataexploration Indexing and aggregation F# Charting library Simple & composable Interactive style www.fslab.org
  • 9.
  • 10.
    F# Data typeproviders First-class data CSV, REST, WorldBank… R Type provider Statistics & visualization 5000 tested packages www.fslab.org
  • 11.
    Demo: US debtover the last century
  • 12.
    Deedle data frame Time-seriesalignment Data transformations Vega visualization F# wrapper for Vega Pre-alpha version www.fslab.org
  • 13.
    F# for DataScience acquire, analyze, visualize interactive experience safety and efficiency of .net ready for production @tomaspetricek
  • 14.
    Going forward Use #fsharpfor fun & profit Join local user groups Help us build data science tools fsharp.org | fslab.org | tomasp.net @tomaspetricek