Azure Cosmos DB 中的更改源拉取模型

适用于: ✅ NoSQL

使用更改源拉取模型可以按自己的节奏使用 Azure Cosmos DB 更改源。与更改源处理器类似，你可以使用更改源拉取模型来并行处理多个更改源使用者之间的更改。

对比更改源处理器

许多情况下，既可以使用更改源处理器又可以使用更改源拉取模型来处理更改源。拉取模型的延续令牌和更改源处理器的租约容器都可作为更改源中最后处理项（或一批项）的书签。

但是，延续令牌无法转换为租约（反之亦然）。

注释

在大多数情况下，如果需要从更改源中读取数据，最简单的方法是使用更改源处理器。

你应该在这些情况下考虑使用拉取模型：

读取特定分区键的更改
为了控制客户端软件接收和处理更改的速度
执行对变更源中现有数据的一次性读取（例如，进行数据迁移）

下文阐述了更改源拉取模型与更改源处理器之间的几点关键差异：

功能 / 特点	更改源处理器	更改源请求模型
持续追踪更改源处理的当前点	租赁（存储在 Azure Cosmos DB 容器中）	继续标记（存储在内存中或手动进行保存）
能够重播过去的更改	是（在使用推送模型的情况下）	是（在使用拉取模型的情况下）
轮询将来的更改	基于用户指定的 `WithPollInterval` 值自动检查更改	Manual
没有新更改时的行为	自动等待 `WithPollInterval` 值，然后重新检查	必须检查状态并手动重新检查
处理整个容器的更改	是的，自动并行处理从同一容器使用更改的多个线程和机器	支持，请使用 `FeedRange` 来手动并行处理
仅处理单个分区键的更改	不支持	是的

注释

与使用更改源处理器进行读取不同，当使用拉取模型时，如果未出现新变化，则需要显式处理。这由 HTTP 304 NotModified指示。返回 0 个文档且 HTTP 状态代码为 200 OK 的更改源响应并不一定意味着已到达更改源的末尾，应继续进行轮询。

使用拉取模型

若要使用拉取模型来处理变更事件流，请创建一个 FeedIterator 实例。在最初创建 FeedIterator 时，必须指定所需的 ChangeFeedStartFrom 值，该值由读取更改的起始位置和所需的 FeedRange 值组成。 FeedRange 是分区键值范围，指定根据特定 FeedIterator 可从更改源中读取的项。此外，必须为所需的更改处理模式指定必需的 ChangeFeedMode 值：最新版本或所有版本和删除模式。使用 ChangeFeedMode.LatestVersion 或 ChangeFeedMode.AllVersionsAndDeletes 指示读取更改源的模式。使用所有版本和删除模式时，必须选择从 Now() 值或从特定延续令牌开始的更改源。

你还可以选择指定 ChangeFeedRequestOptions 以设置 PageSizeHint。设置后，此属性会对每页收到的项目的最大数目进行设置。如果受监视集合中的操作通过存储过程执行，则在从更改源读取项时，会保留事务范围。因此，收到的项数可能高于指定的值，通过同一事务更改的项会作为某一原子批的一部分返回。

以下示例以最新版本模式获取一个返回实体对象（在本例中为 FeedIterator 对象）的 User：

FeedIterator<User> InteratorWithPOCOS = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.Beginning(), ChangeFeedMode.LatestVersion);

小窍门

对于早于 3.34.0的版本，可以通过设置 ChangeFeedMode.Incremental来使用最新版本模式。 Incremental和LatestVersion都指向更改记录的最新版本模式，而使用任一模式的应用程序将观察到相同的行为。

所有版本和删除模式都处于预览状态，可与预览版 .NET SDK 版本 >= 3.32.0-preview 一起使用。以下示例以所有版本和删除模式获取一个返回 FeedIterator 对象的 User：

FeedIterator<ChangeFeedItem<User>> InteratorWithPOCOS = container.GetChangeFeedIterator<ChangeFeedItem<User>>(ChangeFeedStartFrom.Now(), ChangeFeedMode.AllVersionsAndDeletes);

注释

在最新版本模式下，你将收到代表被更改项的对象和一些附加元数据。所有版本和删除模式会返回一个不同的数据模型。

可以获取最新版本模式的完整示例，或所有版本和删除模式。

通过流使用更改源

两种更改源模式的 FeedIterator 都有两种选项。除了返回实体对象的示例之外，还可以获取提供 Stream 支持的响应。利用流，你可以在不先将数据反序列化的情况下读取数据，从而节省客户端资源。

以下是如何在最新版本模式下获取 FeedIterator 以返回 Stream 的示例：

FeedIterator iteratorWithStreams = container.GetChangeFeedStreamIterator(ChangeFeedStartFrom.Beginning(), ChangeFeedMode.LatestVersion);

使用整个容器的更改

如果没有向 FeedRange 提供 FeedIterator，则可以按自己的节奏处理整个容器的更改源。这是一个示例，展示如何使用最新版本模式，从当前时间开始读取所有更改：

FeedIterator<User> iteratorForTheEntireContainer = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.Now(), ChangeFeedMode.LatestVersion);

while (iteratorForTheEntireContainer.HasMoreResults)
{
    FeedResponse<User> response = await iteratorForTheEntireContainer.ReadNextAsync();

    if (response.StatusCode == HttpStatusCode.NotModified)
    {
        Console.WriteLine($"No new changes");
        await Task.Delay(TimeSpan.FromSeconds(5));
    }
    else 
    {
        foreach (User user in response)
        {
            Console.WriteLine($"Detected change for user with id {user.id}");
        }
    }
}

由于更改源实际上是包含所有后续写入和更新项的无穷列表，因此 HasMoreResults 的值始终为 true。在尝试读取更改源时，如果未出现新更改，你会收到 NotModified 状态的响应。这不同于接收没有更改和 OK 状态的响应。在有更多更改可用时，可以获取空的更改源响应，并且应继续轮询，直到收到 NotModified。在前面的示例中，NotModified 通过在重新检查更改之前等待 5 秒来处理。

使用分区键的更改

在某些情况下，你可能希望仅处理特定分区键的更改。可以获取特定分区键的 FeedIterator，并采用处理整个容器的方式来处理更改。

FeedIterator<User> iteratorForPartitionKey = container.GetChangeFeedIterator<User>(
    ChangeFeedStartFrom.Beginning(FeedRange.FromPartitionKey(new PartitionKey("PartitionKeyValue")), ChangeFeedMode.LatestVersion));

while (iteratorForThePartitionKey.HasMoreResults)
{
    FeedResponse<User> response = await iteratorForThePartitionKey.ReadNextAsync();

    if (response.StatusCode == HttpStatusCode.NotModified)
    {
        Console.WriteLine($"No new changes");
        await Task.Delay(TimeSpan.FromSeconds(5));
    }
    else
    {
        foreach (User user in response)
        {
            Console.WriteLine($"Detected change for user with id {user.id}");
        }
    }
}

使用 FeedRange 实现并行化

在更改源处理器中，工作自动分布到多个使用者。在更改源拉取模型中，可以使用 FeedRange 来并行处理更改源。 FeedRange 表示分区键值的一个范围。

下面的示例展示了如何获取容器的范围列表：

IReadOnlyList<FeedRange> ranges = await container.GetFeedRangesAsync();

获取容器的 FeedRange 值列表时，每个FeedRange都会获得一个。

使用 FeedRange 可以创建一个 FeedIterator，以便跨多个计算机或线程并行处理更改源。上面的示例展示了如何获取整个容器或某一个分区键的 FeedIterator，与之不同的是，你可以使用 FeedRanges 来获取多个 FeedIterator，这样就可以并行处理更改源。

如果您想使用 FeedRanges，需要通过一个协调器进程来获取 FeedRanges，并将其分配给这些计算机。此分配可能是：

使用 FeedRange.ToJsonString 并分发此字符串值。使用者可以将此值用于 FeedRange.FromJsonString。
如果分发正在进行，则传递 FeedRange 对象引用。

下面的示例展示了如何使用两个并行读取的独立虚构计算机从容器的更改源开头进行读取：

机器 1：

FeedIterator<User> iteratorA = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.Beginning(ranges[0]), ChangeFeedMode.LatestVersion);
while (iteratorA.HasMoreResults)
{
    FeedResponse<User> response = await iteratorA.ReadNextAsync();

    if (response.StatusCode == HttpStatusCode.NotModified)
    {
        Console.WriteLine($"No new changes");
        await Task.Delay(TimeSpan.FromSeconds(5));
    }
    else
    {
        foreach (User user in response)
        {
            Console.WriteLine($"Detected change for user with id {user.id}");
        }
    }
}

机器 2:

FeedIterator<User> iteratorB = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.Beginning(ranges[1]), ChangeFeedMode.LatestVersion);
while (iteratorB.HasMoreResults)
{
    FeedResponse<User> response = await iteratorA.ReadNextAsync();

    if (response.StatusCode == HttpStatusCode.NotModified)
    {
        Console.WriteLine($"No new changes");
        await Task.Delay(TimeSpan.FromSeconds(5));
    }
    else
    {
        foreach (User user in response)
        {
            Console.WriteLine($"Detected change for user with id {user.id}");
        }
    }
}

保存延续令牌

可以通过获取延续令牌来保存 FeedIterator 的位置。延续令牌是字符串值，它会跟踪 FeedIterator 的上次处理的更改，并且允许 FeedIterator 稍后在此点进行恢复。如果指定了延续令牌，则优先于开始时间并从起始值开始。以下代码读取自容器创建以来的更改源。当没有更多更改可用时，它保留一个继续标记，以便以后可以继续使用更改源。

FeedIterator<User> iterator = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.Beginning(), ChangeFeedMode.LatestVersion);

string continuation = null;

while (iterator.HasMoreResults)
{
    FeedResponse<User> response = await iterator.ReadNextAsync();

    if (response.StatusCode == HttpStatusCode.NotModified)
    {
        Console.WriteLine($"No new changes");
        continuation = response.ContinuationToken;
        // Stop the consumption since there are no new changes
        break;
    }
    else
    {
        foreach (User user in response)
        {
            Console.WriteLine($"Detected change for user with id {user.id}");
        }
    }
}

// Some time later when I want to check changes again
FeedIterator<User> iteratorThatResumesFromLastPoint = container.GetChangeFeedIterator<User>(ChangeFeedStartFrom.ContinuationToken(continuation), ChangeFeedMode.LatestVersion);

使用最新版本模式时，只要 Azure Cosmos DB 容器存在，FeedIterator 延续令牌就不会过期。使用所有版本和删除模式时，只要更改发生在连续备份的保留窗口内，FeedIterator 延续令牌就有效。

若要使用拉取模型来处理变更事件流，请创建一个 Iterator<FeedResponse<JsonNode>> responseIterator 实例。创建 CosmosChangeFeedRequestOptions 时，必须指定从何处开始读取更改源，并传递要使用的 FeedRange 参数。 FeedRange 是分区键值范围，指定可从更改源中读取的项。

如果想要以所有版本和删除模式读取更改源，则还必须在创建 allVersionsAndDeletes() 时指定 CosmosChangeFeedRequestOptions。所有版本和删除模式都不支持从头或某个时间点处理更改源。必须从现在开始或从延续令牌处理更改。所有版本模式和删除模式目前为预览版，可以在 Java SDK 版本 > = 4.42.0 中使用。

使用整个容器的更改

指定 FeedRange.forFullRange() 后就可以按自己的节奏处理整个容器的更改源。还可选择在 byPage() 中指定一个值。设置后，此属性会对每页收到的项目的最大数目进行设置。

注释

以下所有代码片段都取自 GitHub 中的示例。可以使用最新版本模式示例和所有版本和删除模式示例。

以下示例展示如何在最新版本模式下获取 responseIterator 值：

CosmosChangeFeedRequestOptions options = CosmosChangeFeedRequestOptions
        .createForProcessingFromBeginning(FeedRange.forFullRange());
Iterator<FeedResponse<JsonNode>> responseIterator = container
    .queryChangeFeed(options, JsonNode.class)
    .byPage()
    .toIterable()
    .iterator();

以下示例展示如何在所有版本和删除模式下获取 responseIterator：

CosmosChangeFeedRequestOptions options = CosmosChangeFeedRequestOptions
    .createForProcessingFromNow(FeedRange.forFullRange())
    .allVersionsAndDeletes();

Iterator<FeedResponse<JsonNode>> responseIterator = container
    .queryChangeFeed(options, JsonNode.class)
    .byPage()
    .toIterable()
    .iterator();

然后，我们可以循环访问结果。由于更改源实际上是包含所有后续写入和更新项的无穷列表，因此 responseIterator.hasNext() 的值始终为 true。以下示例使用最新版本模式从头开始读取所有更改。每次迭代在处理所有事件后都会保留一个延续令牌。它从更改源中的最后一个处理点进行选取，并使用 createForProcessingFromContinuation 进行处理：

int i = 0;
List<JsonNode> results;
while (responseIterator.hasNext()) {
    FeedResponse<JsonNode> response = responseIterator.next();
    results = response.getResults();
    logger.info("Got " + results.size() + " items(s)");

    // applying the continuation token
    // only after processing all events
    options = CosmosChangeFeedRequestOptions
            .createForProcessingFromContinuation(response.getContinuationToken());
    i++;
    if (i >= 5) {
        // artificially breaking out of loop - not required in a real app
        System.out.println("breaking....");
        break;
    }
}

使用分区键的更改

在某些情况下，你可能希望仅处理特定分区键的更改。可以按照处理整个容器的相同方式处理特定分区键的更改。以下是使用最新版本模式的示例：

options = CosmosChangeFeedRequestOptions
        .createForProcessingFromBeginning(FeedRange.forLogicalPartition(new PartitionKey(partitionKey)));

responseIterator = container
    .queryChangeFeed(options, JsonNode.class)
    .byPage()
    .toIterable()
    .iterator();

int pkIndex = 0;

while (responseIterator.hasNext()) {
    FeedResponse<JsonNode> response = responseIterator.next();
    results = response.getResults();
    logger.info("Got " + results.size() + " items(s) retrieved");

    // applying the continuation token
    // only after processing all events
    options = CosmosChangeFeedRequestOptions
        .createForProcessingFromContinuation(response.getContinuationToken());
    pkIndex++;
    if (pkIndex >= 5) {
        // artificially breaking out of loop
        System.out.println("breaking....");
        break;
    }
}

使用 FeedRange 实现并行化

在更改源处理器中，工作自动分布到多个使用者。在更改源拉取模型中，可以使用 FeedRange 来并行处理更改源。 FeedRange 表示分区键值的一个范围。

以下示例使用最新版本模式演示如何获取容器的范围列表：

Mono<List<FeedRange>> feedranges = resources.container.getFeedRanges();
List<FeedRange> feedRangeList = feedranges.block();

获取容器的 FeedRange 列表时，每个FeedRange你都会获得一个。

通过使用 FeedRange，您可以在多台计算机或线程间并行处理变更流。上面的示例展示了如何处理整个容器或某一个分区键的更改，与之不同的是，你可以使用 FeedRanges 来并行处理更改源。

如果您想使用 FeedRanges，需要通过一个协调器进程来获取 FeedRanges，并将其分配给这些计算机。此分配可能是：

使用 FeedRange.toString() 并分发此字符串值。
如果分发正在进行，则传递 FeedRange 对象引用。

以下示例使用最新版本模式。该示例展示如何使用两个并行读取的独立虚构计算机从容器的更改源开头进行读取：

机器 1：

FeedRange range1 = feedRangeList.get(0);
options = CosmosChangeFeedRequestOptions
        .createForProcessingFromBeginning(range1);

int machine1index = 0;
responseIterator = container
    .queryChangeFeed(options, JsonNode.class)
    .byPage()
    .toIterable()
    .iterator();

while (responseIterator.hasNext()) {
    FeedResponse<JsonNode> response = responseIterator.next();
    results = response.getResults();
    logger.info("Got " + results.size() + " items(s) retrieved");

    // applying the continuation token
    // only after processing all events
    options = CosmosChangeFeedRequestOptions
        .createForProcessingFromContinuation(response.getContinuationToken());

    machine1index++;

    if (machine1index >= 5) {
        // artificially breaking out of loop - not required in a real app
        System.out.println("breaking....");
        break;
    }
}

机器 2:

FeedRange range2 = feedRangeList.get(1);
options = CosmosChangeFeedRequestOptions
        .createForProcessingFromBeginning(range2);

responseIterator = container
    .queryChangeFeed(options, JsonNode.class)
    .byPage()
    .toIterable()
    .iterator();

int machine2index = 0;

while (responseIterator.hasNext()) {
    FeedResponse<JsonNode> response = responseIterator.next();
    results = response.getResults();
    logger.info("Got " + results.size() + " items(s) retrieved");

    // applying the continuation token
    // only after processing all events
    options = CosmosChangeFeedRequestOptions
        .createForProcessingFromContinuation(response.getContinuationToken());

    machine2index++;
    if (machine2index >= 5) {
        // artificially breaking out of loop - not required in a real app
        System.out.println("breaking....");
        break;
    }
}

若要使用拉取模型处理更改提要，请创建一个类型为ItemPaged[Dict[str, Any]]的 responseIterator 实例。调用变更源 API 时，您必须指定从何处开始读取变更源，并传递您要使用的feed_range参数。 feed_range 是分区键值范围，指定可从更改源中读取的项。

你还可以为想要处理更改的更改源模式指定 mode 参数：LatestVersion 或 AllVersionsAndDeletes。默认值为 LatestVersion。使用 LatestVersion 或 AllVersionsAndDeletes 指示读取更改源的模式。使用 AllVersionsAndDeletes 模式时，可以从现在开始处理更改，也可以从 continuation 令牌开始处理更改。不支持使用 start_time 从头或某个时间点读取更改源。

注释

AllVersionsAndDeletes 模式为预览版，在 Python SDK 版本 4.9.1b1 或更高版本中可用。

使用整个容器的更改

如果没有提供 feed_range 参数，则可以按自己的节奏处理整个容器的更改源。

注释

以下所有代码片段都取自 GitHub 中的示例。

下面是一个示例，说明如何在responseIterator模式下从LatestVersion中获取Beginning。由于 LatestVersion 是默认模式， mode 因此不需要传递参数：

responseIterator = container.query_items_change_feed(start_time="Beginning")

下面是如何在responseIterator模式下从AllVersionsAndDeletes中获取Now的示例，因为Now是start_time参数的默认值，因此无需传递：

responseIterator = container.query_items_change_feed(mode="AllVersionsAndDeletes")

然后，我们可以循环访问结果。由于更改馈送实际上是一个包含所有未来写入和更新的无限项列表，responseIterator 可以无限循环。以下示例使用最新版本模式从头开始读取所有更改。每次迭代都会打印文档的更改源。

responseIterator = container.query_items_change_feed(start_time="Beginning")
for doc in responseIterator:
    print(doc)

使用分区键的更改

在某些情况下，你可能希望仅处理特定分区键的更改。您可以像处理整个容器的更改一样，使用 partition_key 参数进行处理。下面是使用 LatestVersion 模式的示例：

pk = "partition_key_value"
responseIterator = container.query_items_change_feed(start_time="Beginning", partition_key=pk)
for doc in responseIterator:
    print(doc)

使用 FeedRange 实现并行化

在更改源拉取模型中，可以使用 feed_range 来并行处理更改源。 feed_range 表示分区键值的一个范围。

以下示例演示如何获取容器的范围列表。 list 命令将迭代器转换为列表：

rangesIterator = container.read_feed_ranges(force_refresh=False)
ranges = list(rangesIterator)

获取容器的 feed_range 值列表时，每个feed_range都会获得一个。

可以使用 feed_range 创建一个迭代器，以便跨多个计算机或线程并行处理更改源。与上一个演示如何获取 responseIterator 整个容器或单个分区键的示例不同，可以使用 feed_range 获取多个迭代器，这些迭代器可以并行处理更改馈送。

下面的示例展示了如何使用两个并行读取的独立虚构计算机从容器的更改源开头进行读取：

机器 1：

responseIterator = container.query_items_change_feed(start_time="Beginning", feed_range=ranges[0])
for doc in responseIterator:
    print(doc)

机器 2:

responseIterator = container.query_items_change_feed(start_time="Beginning", feed_range=ranges[1])
for doc in responseIterator:
    print(doc)

保存延续令牌

可以通过获取延续令牌来保存迭代器的位置。延续标记是一个字符串值，用于跟踪 responseIterator 上次处理的更改，并允许迭代器稍后恢复。如果指定了延续令牌，则优先于开始时间并从起始值开始。以下代码读取自容器创建以来的更改源。当没有更多更改可用时，它保留一个继续标记，以便以后可以继续使用更改源。

responseIterator = container.query_items_change_feed(start_time="Beginning")
for doc in responseIterator:
    print(doc)
continuation_token = container.client_connection.last_response_headers['etag']

注释

由于 continuation 令牌包含之前使用的 mode 参数，如果使用了 continuation，则会忽略 mode 参数，并改为使用来自 mode 令牌的 continuation。

以下示例展示了如何使用 continuation 令牌读取容器的更改源：

responseIterator = container.query_items_change_feed(continuation=continuation_token)
for doc in responseIterator:
    print(doc)

若要使用拉取模型来处理变更事件流，请创建一个 ChangeFeedPullModelIterator 实例。最初创建ChangeFeedPullModelIterator时，必须在其中changeFeedStartFrom指定一个必需ChangeFeedIteratorOptions值，该值包括读取更改的起始位置以及要提取更改的资源（分区键或 FeedRange）。

注释

如果changeFeedStartFrom未指定任何值，则会从Now()中提取整个容器的更改提要。目前，JavaScript SDK 仅支持最新版本，默认情况下处于选中状态。

可以选择在 maxItemCount中使用 ChangeFeedIteratorOptions 来设置每页接收的最大项目数。下面的示例介绍了如何在最新版本模式中获取返回实体对象的迭代器：

const options = {
    changeFeedStartFrom: ChangeFeedStartFrom.Now()
};

const iterator = container.items.getChangeFeedIterator(options);

使用整个容器的更改

如果未在 FeedRange 内提供 PartitionKey 或 ChangeFeedStartFrom 参数，则可以按照自己的节奏处理整个容器的更改源。下面的示例将从当前时间开始读取所有更改：

async function waitFor(milliseconds: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, milliseconds));
}

const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning()
};

const iterator = container.items.getChangeFeedIterator(options);

let timeout = 0;

while(iterator.hasMoreResults) {
    const response = await iterator.readNext();
    if (response.statusCode === StatusCodes.NotModified) {
        timeout = 5000;
    } 
    else {
        console.log("Result found", response.result);
        timeout = 0;
    }
    await waitFor(timeout);
}

由于更改源实际上是包含所有后续写入和更新项的无穷列表，因此 hasMoreResults 的值始终为 true。在尝试读取更改源时，如果未出现新更改，你会收到 NotModified 状态的响应。这不同于接收没有更改和 OK 状态的响应。在有更多更改可用时，可以获取空的更改源响应，并且应继续轮询，直到收到 NotModified。在前面的示例中，NotModified 通过在重新检查更改之前等待 5 秒来处理。

使用分区键的更改

在某些情况下，你可能希望仅处理特定分区键的更改。可以获取特定分区键的迭代器，并以与处理整个容器相同的方式处理更改。

async function waitFor(milliseconds: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, milliseconds));
}

const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning("partitionKeyValue")
};

const iterator = container.items.getChangeFeedIterator(options);

let timeout = 0;

while(iterator.hasMoreResults) {
    const response = await iterator.readNext();
    if (response.statusCode === StatusCodes.NotModified) {
        timeout = 5000;
    } 
    else {
        console.log("Result found", response.result);
        timeout = 0;
    }
    await waitFor(timeout);
}

使用 FeedRange 实现并行化

在更改源拉取模型中，可以使用 FeedRange 来并行处理更改源。 FeedRange 表示分区键值的一个范围。

下面的示例展示了如何获取容器的范围列表：

const ranges = await container.getFeedRanges();

获取容器的 FeedRange 值列表时，每个FeedRange都会获得一个。

可以使用 FeedRange 创建一个迭代器，以便跨多个计算机或线程并行处理更改源。与上一个演示如何获取整个容器或单个分区键的更改源迭代器的示例不同，可以使用 FeedRanges 获取多个迭代器，后者可以并行处理更改源。

下面的示例展示了如何使用两个并行读取的独立虚构计算机从容器的更改源开头进行读取：

机器 1：

async function waitFor(milliseconds: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, milliseconds));
}

const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning(ranges[0])
};

const iterator = container.items.getChangeFeedIterator(options);

let timeout = 0;

while(iterator.hasMoreResults) {
    const response = await iterator.readNext();
    if (response.statusCode === StatusCodes.NotModified) {
        timeout = 5000;
    } 
    else {
        console.log("Result found", response.result);
        timeout = 0;
    }
    await waitFor(timeout);
}

机器 2:

async function waitFor(milliseconds: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, milliseconds));
}

const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning(ranges[1])
};

const iterator = container.items.getChangeFeedIterator(options);

let timeout = 0;

while(iterator.hasMoreResults) {
    const response = await iterator.readNext();
    if (response.statusCode === StatusCodes.NotModified) {
        timeout = 5000;
    } 
    else {
        console.log("Result found", response.result);
        timeout = 0;
    }
    await waitFor(timeout);
}

保存延续令牌

可以通过获取延续令牌来保存迭代器的位置。延续标记是一个字符串值，用于跟踪更改源迭代器的上次处理更改，并允许迭代器稍后恢复。如果指定了延续令牌，则优先于开始时间并从起始值开始。以下代码读取自容器创建以来的更改源。当没有更多更改可用时，它保留一个继续标记，以便以后可以继续使用更改源。

const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning()
};

const iterator = container.items.getChangeFeedIterator(options);

let timeout = 0;
let continuation = "";
while(iterator.hasMoreResults) {
    const response = await iterator.readNext();
    if (response.statusCode === StatusCodes.NotModified) {
        continuation = response.continuationToken;
        break;
    } 
    else {
        console.log("Result found", response.result);
    }
}

// For checking any new changes using the continuation token
const continuationOptions = {
    changeFeedStartFrom: ChangeFeedStartFrom(continuation)
}
const newIterator = container.items.getChangeFeedIterator(continuationOptions);

只要 Azure Cosmos DB 容器仍然存在，延续令牌永远不会过期。

使用异步迭代器

可以使用 JavaScript AsyncIterator 获取变更提要。以下是一个示例 AsyncIterator。

async function waitFor(milliseconds: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, milliseconds));
}
const options = {
      changeFeedStartFrom: ChangeFeedStartFrom.Beginning()
};
let timeout = 0;

for await(const result of container.items.getChangeFeedIterator(options).getAsyncIterator()) {
    if (result.statusCode === StatusCodes.NotModified) {
      timeout = 5000;
    }
    else {
      console.log("Result found", result.result);
      timeout = 0;
    }
    await waitFor(timeout);
}

后续步骤

Last updated on 2026-01-14

通过

Azure Cosmos DB 中的更改源拉取模型

对比更改源处理器

使用拉取模型

通过流使用更改源

使用整个容器的更改

使用分区键的更改

使用 FeedRange 实现并行化

保存延续令牌

后续步骤

其他资源