Supported metrics for Microsoft.MachineLearningServices/workspaces/onlineEndpoints

The following table lists the metrics available for the Microsoft.MachineLearningServices/workspaces/onlineEndpoints resource type.

Table headings

Metric - The metric display name as it appears in the Azure portal.
Name in REST API - The metric name as referred to in the REST API.
Unit - The unit of measure.
Aggregation - The default aggregation type. Valid values: Average, Minimum, Maximum, Total, Count.
Dimensions - The dimensions available for the metric.
Time Grains - The intervals at which the metric is sampled. For example, PT1M indicates that the metric is sampled every minute, PT30M every 30 minutes, PT1H every hour, and so on.
DS Export - Whether the metric is exportable to Azure Monitor Logs via Diagnostic Settings.
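The time grain values follow the ISO 8601 duration format. As a minimal sketch of how such a value maps to a sampling interval, the following converts a grain to seconds. The helper name `grain_to_seconds` is hypothetical, and the parser assumes only the simple day, hour, minute, and second forms (such as PT1M, PT30M, and PT1H) need to be handled.

```python
import re

# Matches simple ISO 8601 durations such as PT1M, PT30M, PT1H, or P1D.
# Assumption: months, years, weeks, and fractional values are not needed here.
_GRAIN_RE = re.compile(
    r"^P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?(?:(?P<minutes>\d+)M)?(?:(?P<seconds>\d+)S)?)?$"
)


def grain_to_seconds(grain: str) -> int:
    """Convert a time grain such as 'PT1M' to a number of seconds."""
    match = _GRAIN_RE.match(grain)
    # Reject strings that do not match or that carry no components ('P', 'PT').
    if not match or not any(match.groupdict().values()):
        raise ValueError(f"unsupported time grain: {grain!r}")
    parts = {name: int(value) if value else 0
             for name, value in match.groupdict().items()}
    return (parts["days"] * 86400
            + parts["hours"] * 3600
            + parts["minutes"] * 60
            + parts["seconds"])
```

For example, `grain_to_seconds("PT1M")` gives 60 and `grain_to_seconds("PT1H")` gives 3600.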

For information on exporting metrics, see Create diagnostic settings in Azure Monitor.

For information on metric retention, see Azure Monitor Metrics overview.

For a list of supported logs, see Supported log categories - Microsoft.MachineLearningServices/workspaces/onlineEndpoints.

Category: Traffic

Metric	Name in REST API	Unit	Aggregation	Dimensions	Time Grains	DS Export
Connections Active: The total number of concurrent TCP connections active from clients.	`ConnectionsActive`	Count	Average	<none>	PT1M	No
Data Collection Errors Per Minute: The number of data collection events dropped per minute.	`DataCollectionErrorsPerMinute`	Count	Minimum, Maximum, Average	`deployment`, `reason`, `type`	PT1M	No
Data Collection Events Per Minute: The number of data collection events processed per minute.	`DataCollectionEventsPerMinute`	Count	Minimum, Maximum, Average	`deployment`, `type`	PT1M	No
Network Bytes: The bytes per second served for the endpoint.	`NetworkBytes`	BytesPerSecond	Average	<none>	PT1M	No
New Connections Per Second: The average number of new TCP connections per second established from clients.	`NewConnectionsPerSecond`	CountPerSecond	Average	<none>	PT1M	No
Request Latency: The average total time taken to respond to a request, in milliseconds.	`RequestLatency`	Milliseconds	Average	`deployment`	PT1M	Yes
Request Latency P50: The average P50 request latency, aggregated over all request latency values collected during the selected time period.	`RequestLatency_P50`	Milliseconds	Average	`deployment`	PT1M	Yes
Request Latency P90: The average P90 request latency, aggregated over all request latency values collected during the selected time period.	`RequestLatency_P90`	Milliseconds	Average	`deployment`	PT1M	Yes
Request Latency P95: The average P95 request latency, aggregated over all request latency values collected during the selected time period.	`RequestLatency_P95`	Milliseconds	Average	`deployment`	PT1M	Yes
Request Latency P99: The average P99 request latency, aggregated over all request latency values collected during the selected time period.	`RequestLatency_P99`	Milliseconds	Average	`deployment`	PT1M	Yes
Requests Per Minute: The number of requests sent to the online endpoint within a minute.	`RequestsPerMinute`	Count	Average	`deployment`, `statusCode`, `statusCodeClass`, `modelStatusCode`	PT1M	No
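The REST API names, time grain, and dimensions in the table can be combined into a metrics query. The sketch below only builds a request URL. It assumes the URL shape of the Azure Monitor "Metrics - List" REST operation and the `2018-01-01` API version; verify both against the current Azure Monitor REST reference. The function name `build_metrics_url` and the resource ID in the usage note are hypothetical. No network call is made and authentication is not handled.

```python
from urllib.parse import urlencode, quote


def build_metrics_url(resource_id, metric_names, timespan,
                      interval="PT1M", aggregation="Average",
                      dimension_filter=None, api_version="2018-01-01"):
    """Build a metrics query URL for one online endpoint (sketch only).

    resource_id      -- full Azure resource ID of the online endpoint
    metric_names     -- values from the 'Name in REST API' column
    timespan         -- ISO 8601 interval, 'start/end'
    interval         -- a value from the 'Time Grains' column
    dimension_filter -- optional OData filter on a listed dimension
    """
    params = {
        "api-version": api_version,   # assumption: check the current version
        "metricnames": ",".join(metric_names),
        "timespan": timespan,
        "interval": interval,
        "aggregation": aggregation,
    }
    if dimension_filter:
        params["$filter"] = dimension_filter
    # Keep characters that are common in these values readable in the query.
    query = urlencode(params, quote_via=quote, safe="$,/:'*")
    path = "/" + resource_id.strip("/")
    return (f"https://management.azure.com{path}"
            f"/providers/Microsoft.Insights/metrics?{query}")
```

A call such as `build_metrics_url("/subscriptions/<sub>/resourceGroups/<rg>/providers/Microsoft.MachineLearningServices/workspaces/<ws>/onlineEndpoints/<endpoint>", ["RequestLatency_P95", "RequestsPerMinute"], "2026-05-01T00:00:00Z/2026-05-01T01:00:00Z", dimension_filter="deployment eq '*'")` would request both metrics at the PT1M grain, split by the `deployment` dimension that both metrics list in the table.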

Next steps

Last updated on 2026-05-13
