9. Bokeh
• Interactive visualization
• Novel graphics
• Streaming, dynamic, large data
• For the browser, with or without a server
• No need to write Javascript
• Support for R, Scala, Julia, Lua
http://bokeh.pydata.org
22. Dask: Out of Core Scheduler for Python
• A parallel computing framework
23. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
24. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
25. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
26. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
27. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
Core Ideas
28. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
Core Ideas
• Dynamic task scheduling yields sane parallelism
29. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
Core Ideas
• Dynamic task scheduling yields sane parallelism
• Simple library to enable parallelism
30. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
Core Ideas
• Dynamic task scheduling yields sane parallelism
• Simple library to enable parallelism
• Dask.array/dataframe to encapsulate the functionality
31. Dask: Out of Core Scheduler for Python
• A parallel computing framework
• That leverages the excellent Python ecosystem
• Using blocked algorithms and task scheduling
• Written in pure Python
Core Ideas
• Dynamic task scheduling yields sane parallelism
• Simple library to enable parallelism
• Dask.array/dataframe to encapsulate the functionality
• Distributed scheduler coming
38. PyData's Future
• Dozens of international meetup groups
• Intl conferences each year, including collab
with EuroPython, Strata, and others
• More companies investing in the ecosystem
• Dato - SFrame, SGraph, ...
• Cloudera - Impyla, Ibis, ...
• Microsoft - Python in AzureML
• Databricks - PySpark
• Continuum - *.*