Visualizations in Databricks notebooks

Azure Databricks has built-in support for charts and visualizations in both Databricks SQL and notebooks. This page describes how to work with visualizations in a Databricks notebook. For information about using visualizations in Databricks SQL, see Visualization in Databricks SQL.

To view the types of visualizations, see visualization types.

Create a new visualization

To recreate the example in this section, use the following code:

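# Load the bike-sharing sample dataset; the first row holds column names and Spark infers the column types.
sparkDF = spark.read.csv("/databricks-datasets/bikeSharing/data-001/day.csv", header=True, inferSchema=True)
# Render the DataFrame as a results table in the notebook.
display(sparkDF)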

To create a visualization, click + above a result and select Visualization. The visualization editor appears.

New visualization menu

  1. In the Visualization Type drop-down, choose a type.

    Visualization editor

  2. Select the data to appear in the visualization. The fields available depend on the selected type.

  3. Click Save.

Visualization tools

If you hover over the top right of a chart in the visualization editor, a Plotly toolbar appears where you can perform operations such as select, zoom, and pan.

Notebook visualization editor toolbar

If you hover over the top right of a chart outside the visualization editor, a smaller subset of tools appears:

Notebook chart toolbar

Create a new data profile

Note

Available in Databricks Runtime 9.1 LTS and above.

Data profiles display summary statistics of an Apache Spark DataFrame, a pandas DataFrame, or a SQL table in tabular and graphic format. To create a data profile from a results cell, click + and select Data Profile.

Azure Databricks calculates and displays the summary statistics.

Data Profile

  • Numeric and categorical features are shown in separate tables.
  • At the top of the tab, you can sort or search for features.
  • At the top of the chart column, you can choose to display a histogram (Standard) or quantiles.
  • Check expand to enlarge the charts.
  • Check log to display the charts on a log scale.
  • You can hover your cursor over the charts for more detailed information, such as the boundaries of a histogram column and the number of rows in it, or the quantile value.
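To make the histogram and quantile views concrete, the following is a minimal sketch of how bin boundaries, row counts per bin, and quartile values can be computed. It uses only the Python standard library and assumes equal-width bins; the binning that the data profile actually uses is not documented on this page, and the `histogram` helper and sample values are hypothetical.

```python
import statistics

def histogram(values, bins=5):
    """Return (lower, upper, count) tuples for equal-width bins over values."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 when all values are equal
    counts = [0] * bins
    for v in values:
        # The maximum value is placed in the last bin instead of a new one.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return [(lo + i * width, lo + (i + 1) * width, counts[i]) for i in range(bins)]

# Hypothetical sample values, not taken from the bike-sharing dataset.
rentals = [10, 12, 15, 20, 22, 25, 30, 35, 40, 50]

for lower, upper, count in histogram(rentals, bins=4):
    print(f"{lower:.0f} to {upper:.0f}: {count} rows")

# Quartiles: three cut points that split the sorted values into four groups.
q1, q2, q3 = statistics.quantiles(rentals, n=4)
print(f"quartiles: {q1}, {q2}, {q3}")
```

The bin boundaries and per-bin counts correspond to what the hover text reports in the histogram view, and the cut points correspond to the quantile view.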

You can also generate data profiles programmatically; see summarize command (dbutils.data.summarize).
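As an illustration of what a profile contains, the sketch below separates numeric from categorical columns and computes a few summary statistics for each group. It is a standard-library approximation for small in-memory data, not the implementation behind the Data Profile tab or `dbutils.data.summarize`; the `profile` function, its output keys, and the sample rows are hypothetical.

```python
import statistics
from collections import Counter

def is_number(value):
    """True for ints and floats, excluding booleans."""
    return isinstance(value, (int, float)) and not isinstance(value, bool)

def profile(rows):
    """Summarize numeric and categorical columns separately.

    `rows` is a list of dicts with identical keys; None marks a missing value.
    """
    numeric, categorical = {}, {}
    for name in rows[0]:
        present = [r[name] for r in rows if r[name] is not None]
        missing = len(rows) - len(present)
        if present and all(is_number(v) for v in present):
            numeric[name] = {
                "count": len(present),
                "missing": missing,
                "mean": statistics.mean(present),
                "min": min(present),
                "median": statistics.median(present),
                "max": max(present),
            }
        else:
            top = Counter(present).most_common(1)
            categorical[name] = {
                "count": len(present),
                "missing": missing,
                "distinct": len(set(present)),
                "top": top[0][0] if top else None,
            }
    return numeric, categorical

# Hypothetical rows; column names and values are illustrative only.
rows = [
    {"season": "spring", "cnt": 100},
    {"season": "spring", "cnt": 200},
    {"season": "summer", "cnt": None},
    {"season": "fall", "cnt": 300},
]
numeric, categorical = profile(rows)
print(numeric["cnt"])
print(categorical["season"])

# In a Databricks notebook, the built-in command takes the DataFrame
# directly, for example: dbutils.data.summarize(sparkDF)
```

For real workloads in a notebook, prefer the built-in command, which runs on the cluster and renders the same interactive view as the Data Profile tab.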

Work with visualizations and data profiles

Note

Data profiles are available in Databricks Runtime 9.1 LTS and above.

Rename, duplicate, or remove a visualization or data profile

To rename, duplicate, or remove a visualization or data profile, click the downward-pointing arrow at the right of the tab name.

Notebook visualization drop down menu

You can also change the name by clicking directly on it and editing the name in place.

Edit a visualization

Click the Edit visualization button beneath the visualization to open the visualization editor. When you have finished making changes, click Save.

Edit colors

You can customize a visualization's colors when you create the visualization or by editing it.

  1. Create or edit a visualization.
  2. Click Colors.
  3. To modify a color, click the square and select the new color by doing one of the following:
    • Click it in the color selector.
    • Enter a hex value.
  4. Click anywhere outside the color selector to close it and save changes.
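If your palette is defined as RGB channel values, you need to convert it before entering a hex value. The following is a small sketch of that conversion, assuming the color selector accepts six-digit values of the form #RRGGBB; the helper names `rgb_to_hex` and `hex_to_rgb` are hypothetical and not part of any Databricks API.

```python
def rgb_to_hex(red, green, blue):
    """Format three 0-255 channel values as a #RRGGBB string."""
    for channel in (red, green, blue):
        if not 0 <= channel <= 255:
            raise ValueError(f"channel out of range: {channel}")
    return f"#{red:02X}{green:02X}{blue:02X}"

def hex_to_rgb(value):
    """Parse a #RRGGBB string back into a (red, green, blue) tuple."""
    digits = value.lstrip("#")
    if len(digits) != 6:
        raise ValueError(f"expected six hex digits, got {value!r}")
    return tuple(int(digits[i:i + 2], 16) for i in (0, 2, 4))

print(rgb_to_hex(31, 119, 180))
print(hex_to_rgb("#1F77B4"))
```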

Temporarily hide or show a series

To hide a series in a visualization, click the series in the legend. To show the series again, click it again in the legend.

To show only a single series, double-click the series in the legend. To show other series, click each one.

Download a visualization

To download a visualization in .png format, click the camera icon in the notebook cell or in the visualization editor.

  • In a result cell, the camera icon appears at the upper right when you move the cursor over the cell.

    camera in notebook cell

  • In the visualization editor, the camera icon appears when you move the cursor over the chart. See Visualization tools.

Add a visualization or data profile to a dashboard

  1. Click the downward-pointing arrow at the right of the tab name.
  2. Select Add to dashboard. A list of available dashboard views appears, along with a menu option Add to new dashboard.
  3. Select a dashboard or select Add to new dashboard. The dashboard appears, including the newly added visualization or data profile.