Koalas

Koalas lets you use the pandas DataFrame API to work with data in Apache Spark.

Requirements

On Databricks Runtime 7.0 or below, install Koalas as an Azure Databricks PyPI library.

Notebook

The following notebook shows how to migrate from pandas to Koalas.

pandas to Koalas notebook

Get notebook
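As a rough illustration of what such a migration can look like (this is not taken from the notebook itself), the sketch below writes the analysis once using only methods that pandas and Koalas share, then reuses it on a Koalas DataFrame. The function names, column names, and sample data are hypothetical. The Koalas path assumes the `koalas` package and a working Spark environment, so it is wrapped in a function that is defined but not called here.

```python
import pandas as pd


def mean_by_group(df):
    """Average `value` per `key`, using only methods shared by pandas and Koalas."""
    return df.groupby("key")["value"].mean().sort_index()


def mean_by_group_with_koalas(pdf):
    """Run the same logic on Spark through Koalas (needs koalas and Spark)."""
    # Imported lazily so the pandas-only part of this sketch runs anywhere.
    import databricks.koalas as ks

    kdf = ks.from_pandas(pdf)  # convert the pandas DataFrame to a Koalas DataFrame
    return mean_by_group(kdf).to_pandas()  # bring the small result back to pandas


# Hypothetical sample data, for illustration only.
pdf = pd.DataFrame({"key": ["a", "b", "a", "b"], "value": [1.0, 2.0, 3.0, 4.0]})

pandas_result = mean_by_group(pdf)
print(pandas_result)
```

On a cluster where Koalas is installed, calling `mean_by_group_with_koalas(pdf)` should give the same per-group averages. In this sketch the body of `mean_by_group` is unchanged between the two libraries, and only the DataFrame passed to it differs.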

Resources