将两个 KLL bigint 草图缓冲区合并为一个。
Syntax
from pyspark.sql import functions as sf
sf.kll_sketch_merge_bigint(left, right)
参数
| 参数 | 类型 | Description |
|---|---|---|
left |
pyspark.sql.Column 或 str |
第一个 KLL bigint 草图。 |
right |
pyspark.sql.Column 或 str |
第二个 KLL bigint 草图。 |
退货
pyspark.sql.Column:合并的 KLL 草图。
例子
示例 1:合并两个 KLL bigint 草图
from pyspark.sql import functions as sf
df = spark.createDataFrame([1,2,3,4,5], "INT")
sketch_df = df.agg(sf.kll_sketch_agg_bigint("value").alias("sketch"))
result = sketch_df.select(sf.kll_sketch_merge_bigint("sketch", "sketch")).first()[0]
result is not None and len(result) > 0
True