Further query perf updates for oximeter field lookups. by jmcarp · Pull Request #10110 · oxidecomputer/omicron

jmcarp · 2026-03-20T14:25:34Z

Joins are expensive in ClickHouse. In #9262, we dropped the number of joins in the field lookup query from the number of fields to the number of relevant field tables. However, this still leaves us with up to six field tables to join. In this patch, we do away with joins for the field lookup query entirely. Instead, we select the relevant rows from each field table, combine them with UNION ALL, and use anyIf and GROUP BY to pivot from long format to wide. This speeds up the field query while returning identical results, with greater speedups for timeseries that use more field tables. A timeseries whose labels are all strings will see no difference in performance; a series that has string, uuid, u8, u16, u32, and i16 labels (like the switch_port_control_data_link metrics) will return results much faster.
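
For readers following along, here is a minimal sketch of the query shape described above. It is not the code from this PR: the function name `pivot_query`, its parameters, and the `timeseries_key` / `timeseries_name` / `field_value` column names are assumptions made for illustration. Only the `fields_{type}` aliases and the `assumeNotNull(anyIf(...))` form are taken from the test output in the diff.

```rust
/// Build a join-free field lookup query (illustrative sketch only).
///
/// `tables` lists the field tables to read, e.g. `fields_i32`.
/// `fields` pairs each field name with the table that stores its values.
/// Column names other than `fields_{type}` are assumptions.
fn pivot_query(timeseries: &str, tables: &[&str], fields: &[(&str, &str)]) -> String {
    // One SELECT per field table. Each exposes its own value column and
    // NULL placeholders for every other table, so the column lists line up
    // for UNION ALL.
    let union_parts: Vec<String> = tables
        .iter()
        .map(|&this| {
            let cols: Vec<String> = tables
                .iter()
                .map(|&t| {
                    if t == this {
                        format!("field_value AS {t}")
                    } else {
                        format!("NULL AS {t}")
                    }
                })
                .collect();
            format!(
                "SELECT timeseries_key, field_name, {} FROM {this} WHERE timeseries_name = '{timeseries}'",
                cols.join(", ")
            )
        })
        .collect();

    // Pivot from long (one row per field) to wide (one column per field).
    let selects: Vec<String> = fields
        .iter()
        .map(|(name, table)| {
            format!("assumeNotNull(anyIf({table}, field_name = '{name}')) AS {name}")
        })
        .collect();

    format!(
        "SELECT timeseries_key, {} FROM ({}) GROUP BY timeseries_key",
        selects.join(", "),
        union_parts.join(" UNION ALL ")
    )
}
```

The sketch interpolates names directly into the SQL text, which is only reasonable when they come from a trusted schema rather than user input.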

bnaecker

This is clever! I'd like to see some performance numbers if possible. Could we look at the returned query summary from some representative OxQL queries, and see what the performance impact is? If possible, looking at memory consumption would be nice too, but we don't have an easy way to access that per-query. You would have to take the query ID and go back to the system.query_log table, I think.
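
As a rough sketch of the query-log lookup mentioned above, the helper below builds a query against ClickHouse's system.query_log for a single query ID. The function name `query_log_lookup` is hypothetical, and the character check assumes the ID is a UUID-like string; the selected columns (`query_duration_ms`, `memory_usage`, `read_rows`, `read_bytes`) are standard columns of that table.

```rust
/// Build a lookup against ClickHouse's `system.query_log` for one query.
///
/// Hypothetical helper, not part of oximeter. Returns `None` when the ID
/// contains anything other than ASCII alphanumerics or '-', so that the ID
/// can be interpolated into the SQL text safely.
fn query_log_lookup(query_id: &str) -> Option<String> {
    let is_safe = !query_id.is_empty()
        && query_id.chars().all(|c| c.is_ascii_alphanumeric() || c == '-');
    if !is_safe {
        return None;
    }
    Some(format!(
        "SELECT query_duration_ms, memory_usage, read_rows, read_bytes \
         FROM system.query_log \
         WHERE query_id = '{query_id}' AND type = 'QueryFinish'"
    ))
}
```

Rows reach system.query_log with a delay, so a lookup right after the query finishes may come back empty until the log is flushed.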

bnaecker · 2026-03-20T15:48:59Z

oximeter/db/src/client/oxql.rs

+                    .into_iter()
+                    .collect();


Does this actually need to be a Vec<_>? It looks like we iterate over it below, which could happen just as well with the set itself.
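
To illustrate the point, here is a small sketch that iterates over a set directly instead of collecting it into a Vec first. It assumes the set is a `BTreeSet`; the actual collection type in the PR is not visible in this hunk, and `table_list` is a made-up name.

```rust
use std::collections::BTreeSet;

/// Deduplicate field types and render the matching table names.
///
/// Illustrative only: iterates the set in place, with no intermediate Vec.
fn table_list(types: &[&str]) -> String {
    let unique: BTreeSet<&str> = types.iter().copied().collect();
    let mut out = String::new();
    // A BTreeSet iterates in sorted order, so the output is deterministic.
    for (i, ty) in unique.iter().enumerate() {
        if i > 0 {
            out.push_str(", ");
        }
        out.push_str("fields_");
        out.push_str(ty);
    }
    out
}
```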

bnaecker · 2026-03-20T15:50:43Z

oximeter/db/src/client/oxql.rs

-            } else {
-                query.push_str(" WHERE ");
-            }
+            query.push_str(" HAVING ");


Is it helpful to have a test checking this new syntax element? I always find it hard to build a SQL query programmatically without seeing the final output.
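
One possible shape for such a test, sketched with a hypothetical `push_having` helper rather than the real query builder: assert on the exact final string, so the HAVING clause is visible in the test itself.

```rust
/// Append a HAVING clause combining `predicates` with AND.
///
/// Hypothetical stand-in for the query-building code under review.
fn push_having(query: &mut String, predicates: &[&str]) {
    if predicates.is_empty() {
        return;
    }
    query.push_str(" HAVING ");
    query.push_str(&predicates.join(" AND "));
}

#[test]
fn having_clause_is_rendered() {
    let mut query = String::from("SELECT 1 GROUP BY timeseries_key");
    push_having(&mut query, &["foo = 1", "index = 2"]);
    assert_eq!(
        query,
        "SELECT 1 GROUP BY timeseries_key HAVING foo = 1 AND index = 2"
    );
}
```

A snapshot file like the test-output/all-fields-query.sql added in this PR serves the same purpose for the full query.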

bnaecker · 2026-03-20T15:55:00Z

oximeter/db/test-output/all-fields-query.sql

@@ -0,0 +1,44 @@
+SELECT
+  assumeNotNull(anyIf(fields_i32, field_name = 'foo')) AS foo,
+  assumeNotNull(anyIf(fields_u32, field_name = 'index')) AS INDEX,


Why is INDEX capitalized here?

bnaecker · 2026-03-20T16:02:08Z

oximeter/db/src/client/oxql.rs

+                let union_parts: Vec<String> = field_types
+                    .iter()
+                    .map(|&this_type| {
+                        let value_cols: Vec<String> = field_types


This nested loop strikes me as inefficient, given that we know that all but one of the elements is going to be NULL AS <some_table_name>. I wonder if we can:

1. build an array of all the possible NULL AS fields_{type} entries we'll need
2. on L995, call .iter().enumerate() instead of just .iter()
3. inside the .map() on L996, clone the array and overwrite the one element at that index with a string like field_value AS fields_{type} instead

I'm not positive that will be noticeably better, given the small sizes, but it's definitely asymptotically better.
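
The suggestion above could look roughly like the following. This is a sketch under assumptions, not the PR's code: `union_value_columns` is a hypothetical name, and field types are passed as plain strings.

```rust
/// For each field type, produce the value-column list of its UNION ALL arm.
///
/// Builds the all-NULL column list once, then clones it per arm and
/// overwrites the single entry that carries the real value.
fn union_value_columns(field_types: &[&str]) -> Vec<Vec<String>> {
    let nulls: Vec<String> = field_types
        .iter()
        .map(|ty| format!("NULL AS fields_{ty}"))
        .collect();

    field_types
        .iter()
        .enumerate()
        .map(|(i, ty)| {
            let mut cols = nulls.clone();
            cols[i] = format!("field_value AS fields_{ty}");
            cols
        })
        .collect()
}
```

Note that `clone()` still copies every string in the list, so what this saves per arm is the comparison and formatting of each entry, not the copy.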

jmcarp requested a review from bnaecker March 20, 2026 14:25

bnaecker reviewed Mar 20, 2026

jmcarp wants to merge 1 commit into main from jmcarp/oximeter-field-union
