外部数据源External data sources

重要

此功能目前以公共预览版提供。This feature is in Public Preview. 请联系 Azure Databricks 代表,以申请访问权限。Contact your Azure Databricks representative to request access.

外部数据源是一种计算资源,支持对 Azure Databricks 环境外部的一组数据对象运行 SQL 命令。An external data source is a computation resource that lets you run SQL commands on a set of data objects external to the Azure Databricks environment.

外部数据源上的 SQL 命令通过单个帐户,因此无法确定运行查询的人员。SQL commands on an external data source go through a single account so you cannot identify who ran the query. 无法在 Azure Databricks SQL Analytics 中查看外部数据源的状态,并且配置外部数据源时必须配置对数据源内对象的访问权限。You cannot see the status of an external data source in Azure Databricks SQL Analytics and you must configure access to objects within the data source when you configure the external data source.

外部数据源支持外部数据源固有的 SQL 命令。External data sources support SQL commands native to the external data source.

必须是 Azure Databricks 管理员才能管理外部数据源。You must be an Azure Databricks admin to manage external data sources.

另一类型的数据源是 SQL 终结点The other type of data source is a SQL endpoint.

备注

对外部数据源执行 SQL Analytics 查询不会产生 DBU 费用。You do not incur DBU charges to process SQL Analytics queries on external data sources. Azure Databricks 保留对此免费使用进行限制的权利,后续也可能会开始对此使用收费。Azure Databricks reserves the right to cap or throttle this free usage or may begin charging for such usage in the future.

创建外部数据源Create an external data source

  1. 单击边栏底部的用户设置图标图标,然后选择“设置”。Click the User Settings Icon icon at the bottom of the sidebar and select Settings.

  2. 单击“外部数据源”选项卡。Click the External Data Sources tab.

  3. 单击“+ 新建数据源”。Click + New Data Source.

  4. 选择数据源类型,然后按照配置说明进行操作:Select a data source type and follow the configuration instructions:

  5. 单击“创建”。Click Create.

支持的外部数据源类型Supported external data source types

SQL Analytics 支持以下外部数据源类型:SQL Analytics supports the following external data source types:

  • Axibase 时序数据库Axibase Time Series Database
  • CassandraCassandra
  • ClickHouseClickHouse
  • CockroachDBCockroachDB
  • IBM 支持的 DB2DB2 by IBM
  • DruidDruid
  • ElasticSearchElasticSearch
  • Google AnalyticsGoogle Analytics
  • GraphiteGraphite
  • HiveHive
  • ImpalaImpala
  • InfluxDBInfluxDB
  • JIRAJIRA
  • JSONJSON
  • Apache KylinApache Kylin
  • MemSQLMemSQL
  • Microsoft Azure Synapse AnalyticsMicrosoft Azure Synapse Analytics
  • Microsoft Azure SQL 数据库Microsoft Azure SQL Database
  • Microsoft SQL ServerMicrosoft SQL Server
  • MongoDBMongoDB
  • MySQLMySQL
  • OracleOracle
  • PostgreSQLPostgreSQL
  • PrestoPresto
  • PrometheusPrometheus
  • QuboleQubole
  • RocksetRockset
  • SalesforceSalesforce
  • ScyllaDBScyllaDB
  • SnowflakeSnowflake
  • TreasureDataTreasureData
  • VerticaVertica
  • Yandex AppMetricaYandex AppMetrica
  • Yandex MetricaYandex Metrica