Query consistency in Kusto
Last modified: 04/13/2023
Strong consistency vs. weak consistency
By default, queries in Kusto run with strong consistency. Meaning - the query planning stage and the query finalization stage occur on the same node that is in charge of managing updates in the database. This node is called the database admin node, and by default there’s a single database admin node in the cluster. The same node is in charge of orchestrating all control commands that are run in the context of databases it manages, and committing the changes made by them to the database metadata.
While strong consistency is advantageous in providing queries access to the most recent updates that occurred in the database (data appends, data deletion, schema modifications, etc.), under very high loads this could overload the database admin node and impact its availability.
Using weak consistency enables spreading that load to additional nodes in the cluster that can serve as query heads. In this mode, clients making queries might observe some latency (up to a few minutes) between updates made in the database, and their queries reflecting those changes.
For example, if 1000 records are ingested each minute into a table in the database, queries over that table running with strong consistency will have access to the most-recently ingested records, whereas queries over that table running with weak consistency may not have access to a few thousands of records from the last few minutes.
- Query consistency in Kusto
Weakly consistent query heads
By default - 20% of the nodes in the cluster, with a minimum of 2 nodes, and a maximum of 30 nodes - can serve as weakly consistent query heads.
- For example, for a cluster with 15 nodes this means 3 nodes.
These parameters, as well as a few others, can be controlled using the cluster-level Query weak consistency policy.
The same policy allows controlling the refresh rate of the database metadata on the weakly consistency query heads. By default, these nodes will refresh the latest database metadata every 2 minutes. This process that usually takes up to a few seconds, unless the amount of changes that occur in that period is very high.
It’s advised to start with the default values, and only adjust if necessary.
Fetching the database metadata
As mentioned above, the database metadata is managed by the database admin node. Each transaction that modifies it (e.g., appends data, drops data, changes the schema, etc.) gets committed to this node’s memory, and changes are also written to persistent storage.
When a weakly consistency query head starts refreshing the database metadata, it fetches of changes between the current database version it currently holds in memory, and the database’s current version. This delta is downloaded from persistent storage.
In order to not increase the load on persistent storage, the amount of nodes in the cluster that can serve as weakly consistency query heads is limited, and so is the frequency at which these node fetch the latest version of the metadata.
The time it takes to complete fetching the database metadata may increase significantly when the database has many child entities (tables, columns, extents), in which case the size of the persistent storage artifacts can be very large. As of this writing, there’s no way to reduce that time, aside from distributing these entities across more than a single database.
Weak consistency modes
There are 4 modes of weak query consistency:
Mode | Description |
---|---|
Random | Queries are routed randomly to one of the nodes in the cluster that can serve as a weakly consistent query head. |
Affinitized by database | All queries that run in the context of the same database get routed to the same weakly consistent query head. |
Affinitized by query text | All queries that have the same hash for their query text get routed to the same weakly consistent query head. |
Affinitized by session ID | All queries that have the same hash for their session ID (provided separately, explained below) get routed to the same weakly consistent query head. |
When should I use weak consistency?
Whenever you want to reduce the load from the database admin node, and don’t have a strong dependency on updates that occurred in the database in the last few minutes.
For example, if you’re running the following query, which counts the number of error records per week in the last 90 days, your insights will probably not be impacted if records ingested in the past few minutes are omitted.
my_table
| where timestamp between(ago(90d) .. now())
| where level == "error"
| summarize count() by level, startofweek(Timestamp)
When should I use affinity by query?
This mode of weak consistency can be helpful when queries are also leveraging the Query results cache. This way, repeating weakly consistent queries that are run frequently by the same identity leverages results cached from recent executions of the same query on the same query head, and reduce the load on the cluster.
When should I use affinity by database?
This mode of weak consistency can be helpful if it is important that queries running against the same database will all get executed against the same (though, not most recent) version of the database.
If, however, there’s an imbalance in the amount of queries running against databases in the cluster (e.g. 70% of queries are run in the context of a specific database), then the query head serving queries for that database will be more loaded than other query heads in the cluster, which is suboptimal.
When should I use affinity by session ID?
This mode of weak consistency can be helpful if it is important that queries that belong to the same user activity/session will all get executed against the same (though, not most recent) version of the database.
It does, however, require you to explicitly specify the session ID as part of each query’s client request properties. See below.
When shouldn’t I use weak consistency?
When you have a strong dependency on updates that occurred in the database in the last few minutes, you should stick with the default mode of strong consistency.
For example, if you are running the following query, which counts the number of error records in the 5 minutes, and triggers an alert if that count is larger than 0.
my_table
| where timestamp between(ago(5m)..now())
| where level == "error"
| count
Another case is when the database metadata is very large (e.g. there are millions of data shards/extents in the database) - this could result with weakly consistent query heads spending resources on frequently downloading & deserializing large metadata artifacts from persistent storage, which would also increase the potential for transient failures in these downloads and other operations running against the same persistent storage.
How do I specify query consistency?
Specifying the query consistency mode can be done either by the client sending the request, or using a server side policy. If it isn’t specified by either, the default mode of strong consistency applies.
Specifying in client request properties
The query consistency mode can be set in the client request properties for the query.
The name of the query option to set is queryconsistency
, and the values to set are:
Mode | Client request option value |
---|---|
Strong | strongconsistency |
Weak (Random) | weakconsistency |
Weak (Affinitized by database) | weakconsistency_by_database |
Weak (Affinitized by query text) | weakconsistency_by_query |
Weak (Affinitized by session ID) | weakconsistency_by_session_id |
When setting the queryconsistency
option to weakconsistency_by_session_id
, one should also set the query option named query_weakconsistency_session_id
with a string value that represents the session’s ID.
A potential mistake is setting the above properties as if they were boolean options (e.g. weakconsistency
= true
, instead of queryconsistency
= weakconsistency
). Doing so doesn’t have any impact on the effective consistency mode (which is either the default strong, or the one specified on the server side policy).
Specifying in a workload group query consistency policy
The query consistency mode can be controlled on the server side, as part of a Query consistency policy at the workload group level.
Doing so can eliminate the need for users to specify the consistency mode in their client request properties, as well as provide the admin of the cluster with the ability to prevent users from running with an undesired consistency mode.
As the policy is defined per workload group, you can easily define that different workloads use different consistency modes (depending on their needs), without having to change the calling code for these workloads.
The names of the value to in the query consistency policy are:
Mode | Query consistency policy value |
---|---|
Strong | Strong |
Weak (Random) | Weak |
Weak (Affinitized by database) | WeakAffinitizedByDatabase |
Weak (Affinitized by query text) | WeakAffinitizedByQuery |
Weak (Affinitized by session ID) | WeakAffinitizedBySessionId |
Setting IsRelaxable
to false
prevents the value set by the user in the client request properties to override the one that was set in the query consistency policy.
For example, the policy defined by the following control command, will result with:
- All queries that get classified to the
default
workload group will run with weak consistency. - The consistency mode defined by the user in the client request properties is ignored.
.alter-merge workload_group default ```
{
"QueryConsistencyPolicy": {
"QueryConsistency": {
"IsRelaxable": false,
"Value": "Weak"
}
}
} ```
Specifying in Kusto.Explorer
Controlling the consistency mode (strong or weak) can be done by following these steps:
- Click on “Tools” in the ribbon.
- Click on “Options”.
- Click on “Connections”.
- Set the “Query weak consistency” option to the desired mode.
- Click “OK”.
Specifying in Kusto Web UI
Controlling the consistency mode (strong or weak) can be done by following these steps:
- Click on the cog wheel icon (“Settings”) in right part of the top bar.
- Click on “Connection”
- Use the “Weak consistency” toggle to select the desired mode.
Query consistency in cross-cluster queries
When you run a query on cluster A, that invokes a remote query on cluster B, the effective consistency mode is the one that was determined on cluster A, unless it gets overridden on cluster B.
For example, if the query consistency was determined as weakconsistency
, due to the query consistency policy defined on the workload group in cluster A, then the sub-query to cluster B will be sent with the same weakconsistency
.
Only if the query consistency policy defined on the workload group in cluster B overrides the consistency mode (by specifying IsRelaxable
= false
), then the mode defined on cluster B applies.