PixieDust
An open source helper library for Python notebooks. It makes working with data simpler.
Jupyter Notebooks are powerful interactive tools for fast and flexible data experimentation and analysis. They can contain live code, rich text, equations, and visualizations. However, the learning curve is steep. Coding a simple visualization requires lots of research, which works against the promise of data science notebooks as tools for easy collaboration.
This is where PixieDust comes in. It’s an open source helper library that’s designed to lower the barrier to entry for scientists and developers working in Jupyter Notebooks. PixieDust speeds the main steps of data science:
- Data loading from remote files
- Data visualisation with a simple
display()
API - Dashboard creation with minimal coding using PixieApps
With PixieDust
Without PixieDust
Both produce a chart, like so
PixieDust & Spark
If you’re processing data at scale, PixieDust also works with Apache Spark™ data structures like PySpark DataFrames, DataSets, and GraphFrames. When detecting that Spark is available, PixieDust enables these extra capabilities:
- PackageManager to install third-party Spark packages
- Spark Progress Monitoring to see how Spark Jobs are progressing
- Scala bridge to write Scala/Java code in the same Python Notebook
Getting started
Sample notebooks are available on IBM’s Data Science Experience:
- Welcome to PixieDust
- Analyze traffic data using PixieDust & Spark
- Analyze open data sets with Spark & PixieDust
If you prefer to work locally, you can start by installing PixieDust using its local installer. For more advanced applications using PixieDust, check out the projects featured here.
Projects
-
Not the Chart You’re Looking for? Filter and Try Again.
MediumFiltering data with PixieDust.
-
Analyzing Data with IBM Cloud SQL Query
Medium | NotebookEasily make SQL queries on your object storage data.
-
Quick Access to IBM Cloud Service Credentials from a Python Notebook
Medium | NotebookSee all your orgs, spaces, and services together in one place.
-
The Visual Python Debugger for Jupyter Notebooks You’ve Always Wanted
MediumIntroducing PixieDebugger.
-
My First Pull Request for the Open Source PixieDust Project
MediumSetting up Python test environments, collaborating on GitHub, and sharing my changes with the world.
-
Notebooks for Spreadsheet Users
MediumDreading pivot tables? Reach for a Python Notebook + PixieDust instead.
-
Share Your Jupyter Notebook Charts on the Web
Medium | GitHubQuickly turn charts into standalone web pages.
-
Building Your First Machine Learning System
Medium | NotebookTrain your model and deploy it, Watson ML for developers (part 2).
-
Deploy Your Analytics as Web Apps Using PixieDust’s 1.1 Release
Notebook | MediumFrom data science, directly to the web.
-
Visualizing Data with Jupyter Notebooks, PixieDust, and Compose MongoDB
Compose ArticlesConnecting to Compose MongoDB and creating rich presentations for your data inside a Jupyter notebook is made easier with PixieDust.
-
Medium.com, More Stats Please
GitHub | MediumAnalyzing Medium publication stats in a Python notebook.
-
Analyzing Podcast Metadata Using Jupyter Notebooks
GitHub | MediumBuilding podcast playlists based on your current mood.
-
Using Watson Machine Learning to Predict a Post's Potential
Notebook | MediumGaze into the Reddit crystal ball.
-
Mapping for Data Science with PixieDust and Mapbox
Notebook | MediumAdd another layer to your Jupyter notebooks with built-in map rendering.
-
pixiedust_peacetech
GitHubLocal visibility to mitigate business disruption in emerging economies.
-
Share the (PixieDust) Magic on PyPI
MediumPackaging and distributing your PixieDust plugin as a Python module.
-
You Too Can Make Magic (in Jupyter Notebooks with PixieDust)
MediumGetting started with custom visualizations, simple tables & word clouds.
-
The Most Popular Search Term at SXSW, According to Our Chatbot
GitHub | Medium | NotebookAnalyzing conversation data using Spark, Jupyter, and PixieDust.
-
I am Not a Data Scientist (But I play one in this blog post, thanks to PixieDust)
Medium | developerWorksNotebooks aren't just for Data Scientists.
-
Sentiment Analysis of Twitter Hashtags With Spark
Notebook | Medium | developerWorksBuild a real-time dashboard with PixieDust & Jupyter Notebooks.
-
Predict Flight Delays with Apache Spark
Notebook | MediumKnow about delayed flights before they happen.
-
Weather Forecast with PixieDust
GitHub | developerWorksMake your own forecast in a Python notebook.