Datathon presentation for distro

9
1 DATA ANALYTICS BEST PRACTICES DATA SCIENCE March 22, 2014

Transcript of Datathon presentation for distro

Page 1: Datathon presentation for distro

1

DATA ANALYTICS BEST PRACTICES

DATA SCIENCE

March 22, 2014

Page 2: Datathon presentation for distro

22

TODAY’S GOALS

Datathon (March 22, 2014)

Data analytics best practices

Use the Python language

Application to the 37BillionMiles dataset

NumPy Pandas IPython Matplotlib

Page 3: Datathon presentation for distro

33

WHO AM I? [ZACH FREEMAN]

Datathon (March 22, 2014)

Professional

Academic

Favorite Podcasts

My presentation preferences

Page 4: Datathon presentation for distro

44Datathon (March 22, 2014)

WHO

ARE

YOU?

Page 5: Datathon presentation for distro

55Datathon (March 22, 2014)

Page 6: Datathon presentation for distro

66Source: http://cacm.acm.org/blogs/blog-cacm/169199-data-science-workflow-overview-and-challenges/fulltext Datathon (March 22, 2014)

Page 7: Datathon presentation for distro

77

CLOSING COMMENTS

Datathon (March 22, 2014)

• Skills can be picked up all over the place

• Iterate, start small, just dive in

• It’s easier in a team

• Try to avoid getting frustrated

• Have fun!

Page 8: Datathon presentation for distro

88

RESOURCES

Datathon (March 22, 2014)

• http://cacm.acm.org/blogs/blog-cacm/169199-data-

science-workflow-overview-and-challenges/fulltext

• https://datasense.withgoogle.com/preview

• http://cs109.org/

Page 9: Datathon presentation for distro

9

THANK YOU!