Skip to content

JUPYTER NOTEBOOK

JUPYTER NOTEBOOK is an incredibly powerful and popular tool used by data analysts and scientist to interactively develop, analyze and present data.
Its simple web based user interface helps us to visualize analytics, run machine learning models, and explore new possibilities.

jupyter notebook example output

jupyter notebook output

OPTIMUS enhances the power of JUPYTER NOTEBOOKS for users from analysts to engineers by simplifying the integration of notebook analysis with the broader process of extracting and preparing data for analysis and exporting the results from notebooks to other processes.

Capabilities

By leveraging and integrating Papermill - an opensource solution developed by NETFLIX - into the solution, OPTIMUS allows you to:

  • Directly run one or multiple notebooks with different parameter values from the automation script. This allows the user to create resusable parameterized JUPYTER NOTEBOOKS, in effect turning the notebooks into reusable data processing and automation functions.

  • The Jupyter notebooks can be written in various languages (Python, Scala, SparkSQL, and R). You are also not limited by the installed libraries in OPTIMUS as you can setup your own virtual environments with additional python libraries and use Jupyter with your own custom virtual environment

  • Data can be shared between cells via temporary tables. Or across notebooks and with the automation script through the use of scrapbook.

  • Results from the notebook can be easily exported as Excel, csv or image files and used for further downstream automation, such as sharing on a cloud drive, generating a PDF document and/or sending a report to the user via email.

  • And using OPTIMUS, multiple notebooks can be chained and scheduled in a workflow to accomplish complex automation and data transformation tasks.

Some examples of how NETFLIX has revolutonized the use and deployment of JUPYTER NOTEBOOKS for data analysis.

image

image

image

Installing Jupyter Notebook

Jupyter Notebook and Papermill will be automatically installed with your OPTIMUS installation. But should you wish to install your own instance, check out:
How to install Jupyter Notebook in Windows

Getting Started

Its assumed that you have already some working knowledge of tools like Pandas and matplotlib to fully utilize the power of Jupyter Notebook for data analysis. Here are some useful guides to get started or to refresh your memory.

Pandas

Quick Start guide on Jupyter Notebook and using Pandas

Pandas Cookbook from Py Data Org

Dataframe Styling using Pandas

Style Pandas dataframe like a master

Matplotlib

Matplotlib Intro from W3 schools