August 2020
These features and Azure Databricks platform improvements were released in August 2020.
Note
The release date and content listed below only corresponds to actual deployment of the Azure Public Cloud in most case.
It provide the evolution history of Azure Databricks service on Azure Public Cloud for your reference that may not be suitable for Azure operated by 21Vianet.
Note
Releases are staged. Your Azure Databricks account may not be updated until up to a week after the initial release date.
Important
Version 3.26 has been released to customers in the Canada Central and Central India regions only. All other regions will get the 3.26 features at the same time that 3.27 is released.
Token Management API is GA and admins can use the Admin Console to grant and revoke user access to tokens
August 26 - September 1, 2020: Version 3.27
Token management is now generally available. Azure Databricks administrators can use the Token Management API and the Admin Console to manage their users' Azure Databricks personal access tokens. As an admin, you can:
- Monitor and revoke users' personal access tokens.
- Control the lifetime of future tokens in your workspace.
- Control which users can create and use tokens via the Permissions API or in the Admin Console.
In the transition from Public Preview to GA, the Token Management API parameter created_by
was changed to created_by_id
, and a new parameter, created_by_username
was added.
For more information, see Monitor and manage access to personal access tokens.
Message size limits for Shiny apps increased
August 26 - September 1, 2020: Version 3.27
The maximum application size for Shiny apps has been increased from 10 MB to 20 MB. If your application's total size exceeds this limit, review the recommendations in the Shiny FAQ.
Improved instructions for setting up a cluster in local mode
August 26 - September 1, 2020: Version 3.27
In the cluster UI:
- If you create a cluster with 0 workers, a tool tip appears recommending that you use local mode and showing the associated configuration setting (
spark.master local[*]
). - You can no longer set
spark.master local[*]
for a cluster, unless the cluster has 0 workers.
View version of notebook associated with a run
August 26 - September 1, 2020: Version 3.27
From the Experiments sidebar, you can now display the version of a notebook associated with a run. For details, see View notebook experiment.
Databricks Runtime 7.2 GA
August 20, 2020
Databricks Runtime 7.2 brings many additional features and improvements over Databricks Runtime 7.1, including:
- Auto Loader is generally available: Auto Loader is an efficient method for incrementally ingesting a large number of files into Delta Lake. It is now GA and adds the following features:
- Directory listing mode option: Auto Loader adds a new directory listing mode, in addition to the existing file notification mode, for determining when there are new files.
- Cloud resource management API: You can now use our Scala API to manage cloud resources created by Auto Loader. You can list notification services and tear down specific notification services using this API.
- Rate limiting option: You can now use the
cloudFiles.maxBytesPerTrigger
option to limit the amount of data processed in each microbatch. - Option validation: Auto Loader now validates the options you provide.
validation
will fail. To skip option validation, setcloudFiles.validateOptions
tofalse
.
- Efficiently copy a Delta table with clone.
- Improvements:
- Snowflake connector has been upgraded to version 2.8.1, which includes Spark 3.0 support.
- Credential passthrough improvements
- TensorBoard improvements
- Upgraded Python and R libraries
For details, see the complete Databricks Runtime 7.2 (EoS) release notes.
Databricks Runtime 7.2 ML GA
August 20, 2020
Databricks Runtime 7.2 for Machine Learning is built on top of Databricks Runtime 7.2 and brings new and improved Python and system libraries. For details, see the complete Databricks Runtime 7.2 (EoS) release notes.
Databricks Runtime 7.2 Genomics GA
August 20, 2020
Databricks Runtime 7.2 for Genomics is built on top of Databricks Runtime 7.2 and significantly speeds up the conversion of literal numpy 1D and 2D float-typed ndarrays to Java arrays. The Glow genome-wide association study documentation reflects the usage.
Permissions API (Public Preview)
August 18, 2020
Databricks is pleased to announce the public preview of the Permissions API, which lets you manage permissions for:
- Tokens
- Clusters
- Pools
- Jobs
- Notebooks
- Folders (directories)
- MLflow registered models
For more information, see Permissions API.
Databricks Connect 7.1 (GA)
August 12, 2020
Databricks Connect now supports Databricks Runtime 7.1.
In Databricks Runtime 7.1, Databricks recommends that you always use the most recent version of Databricks Connect.
Repeatable installation order for cluster libraries
August 12-25, 2020: Version 3.26
On a cluster running Databricks Runtime 7.2 or above, Azure Databricks now processes all cluster libraries in the order that they were installed.
Create model from MLflow registered models page (Public Preview)
August 12-25, 2020: Version 3.26
You can now create a new model from the MLflow registered models page. For details, see Create a new registered model and assign a logged model to it.
Databricks Container Services supports GPU images
August 12-25, 2020: Version 3.26
You can now use Databricks Container Services on clusters with GPUs to create portable deep learning environments with customized libraries.
For details, see Databricks Container Services on GPU compute.