Optimize parquet row filter auto strategy with adaptive fallback by hhhizzz · Pull Request #9956 · apache/arrow-rs

hhhizzz · 2026-05-10T10:36:35Z

Which issue does this PR close?

Closes [Parquet] Better heuristics to pick between RowSelection and Mask filter representation #8846
Part of [EPIC] Faster performance for parquet predicate evaluation for non selective filters #7456
Addresses Filter pushdown selectivity threshold #9591
Related to Parquet decoding/pushdown performance improvements #9589
Follow-up to [Parquet]Performance Degradation with RowFilter on Unsorted Columns due to Fragmented ReadPlan #8565

Rationale for this change

RowFilter can be much slower than a full scan when predicate pushdown produces a highly fragmented RowSelection. In that shape, the reader spends substantial time repeatedly skipping and decoding tiny row runs. #8565 showed an extreme case where row-filter pushdown was around 10x slower than scanning and filtering afterwards.

This PR makes RowSelectionPolicy::Auto more cost-aware. Instead of treating predicate pushdown as always beneficial once planned, Auto now chooses among:

- selector-backed pushdown when it can skip useful work;
- mask-backed execution when fragmented selections are better represented as a dense bitmap;
- adaptive post-filter execution when pushdown is unlikely to save enough decoding work.

This is not intended to disable predicate evaluation. It changes where the predicate is evaluated when the observed row-selection shape suggests that pushdown overhead is likely to dominate.
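To make the three-way choice concrete, here is a minimal sketch of shape-based strategy selection. The `Run`, `Shape`, and `Strategy` types and both thresholds are hypothetical and chosen only for illustration; they are not the types or constants used in this PR.

```rust
/// Hypothetical stand-in for one run of a row selection.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Run {
    rows: usize,
    select: bool, // true = decode these rows, false = skip them
}

#[derive(Debug, PartialEq)]
enum Strategy {
    Selectors,
    Mask,
    PostFilter,
}

/// Summary of how a selection is shaped.
struct Shape {
    selectivity: f64, // fraction of rows that pass the predicate
    avg_run_len: f64, // mean length of a select/skip run
}

fn analyze(runs: &[Run]) -> Shape {
    let total: usize = runs.iter().map(|r| r.rows).sum();
    if total == 0 {
        return Shape { selectivity: 0.0, avg_run_len: 0.0 };
    }
    let selected: usize = runs.iter().filter(|r| r.select).map(|r| r.rows).sum();
    Shape {
        selectivity: selected as f64 / total as f64,
        avg_run_len: total as f64 / runs.len() as f64,
    }
}

fn choose(shape: &Shape) -> Strategy {
    // Both thresholds are assumptions for this sketch only.
    const HIGH_SELECTIVITY: f64 = 0.8;
    const SHORT_RUN: f64 = 32.0;
    if shape.selectivity >= HIGH_SELECTIVITY {
        // Almost everything passes: pushdown cannot skip much decoding.
        Strategy::PostFilter
    } else if shape.avg_run_len < SHORT_RUN {
        // Many tiny runs: a dense bitmap avoids per-run overhead.
        Strategy::Mask
    } else {
        // Long runs: skipping them saves real work.
        Strategy::Selectors
    }
}

fn main() {
    let sparse = [Run { rows: 1_000, select: true }, Run { rows: 9_000, select: false }];
    let fragmented: Vec<Run> = (0..1_000).map(|i| Run { rows: 1, select: i % 2 == 0 }).collect();
    println!("sparse     -> {:?}", choose(&analyze(&sparse)));
    println!("fragmented -> {:?}", choose(&analyze(&fragmented)));
}
```

The ordering of the checks is a design choice of the sketch: selectivity is tested first so that a selection which keeps nearly every row never pays pushdown overhead, regardless of how its runs are shaped.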

This PR also fixes a correctness issue in the explicit Mask path. With sparse page-loaded ranges, a mask-backed read plan could previously try to consume selected rows outside the pages that were actually loaded, causing decoding failures. Loaded row ranges are now represented explicitly, and explicit Mask can safely execute over sparse page-loaded data.
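As a rough illustration of what "loaded row ranges represented explicitly" can mean, the sketch below cuts a row-group-wide mask into one piece per loaded range, so a consumer never indexes rows whose pages were not fetched. The function name `mask_over_loaded` and the `Vec<bool>` mask are assumptions made for this example, not the PR's actual data structures.

```rust
use std::ops::Range;

/// Split a row-group-wide mask into per-range chunks, one per loaded range.
/// Rows outside `loaded` are never touched.
fn mask_over_loaded(mask: &[bool], loaded: &[Range<usize>]) -> Vec<(Range<usize>, Vec<bool>)> {
    loaded
        .iter()
        .map(|r| {
            // Clamp to the mask length so malformed ranges cannot panic.
            let end = r.end.min(mask.len());
            let start = r.start.min(end);
            (start..end, mask[start..end].to_vec())
        })
        .filter(|(r, _)| !r.is_empty())
        .collect()
}

fn main() {
    // Ten rows; rows 1, 2 and 7 are selected.
    let mask = [false, true, true, false, false, false, false, true, false, false];
    // Only the pages covering rows 0..4 and 6..8 were loaded.
    let loaded = [0..4, 6..8];
    for (range, chunk) in mask_over_loaded(&mask, &loaded) {
        println!("rows {:?}: {:?}", range, chunk);
    }
}
```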

What changes are included in this PR?

Auto strategy and cost model

- Adds structured row-selection shape analysis for Auto.
- Adds CostModelObservation / decision reasons so the reader can explain why it chose pushdown or post-filter execution.
- Adds an adaptive post-filter cost model for row groups:
  - observe early pushdown output shape;
  - keep pushdown for sparse / low-selectivity cases where it still saves output decoding;
  - switch later row groups to post-filter execution for high-selectivity or fragmented moderate/high-selectivity cases.
- Starts directly with post-filter execution for selected cheap cases where predicate columns are already projected and pushdown cannot avoid decoding them.
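The observe-then-switch behavior described above can be sketched as a small state machine. Everything here is hypothetical: the `AdaptiveModel` and `Observation` names, the three thresholds, and the assumption that the switch is one-way are illustrative choices, not a description of the PR's cost model.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum Mode {
    Pushdown,
    PostFilter,
}

/// What the reader saw after evaluating the predicate on one row group.
#[derive(Debug, Clone, Copy)]
struct Observation {
    rows_total: usize,
    rows_selected: usize,
    runs: usize, // number of select/skip runs in the resulting selection
}

struct AdaptiveModel {
    mode: Mode,
}

impl AdaptiveModel {
    // Hypothetical thresholds, for illustration only.
    const HIGH_SELECTIVITY: f64 = 0.8;
    const MODERATE_SELECTIVITY: f64 = 0.3;
    const SHORT_RUN: f64 = 16.0;

    fn new() -> Self {
        Self { mode: Mode::Pushdown }
    }

    /// Feed one row-group observation; returns the mode for later row groups.
    fn observe(&mut self, obs: Observation) -> Mode {
        // Assumption of this sketch: once switched, stay switched.
        if self.mode == Mode::PostFilter || obs.rows_total == 0 || obs.runs == 0 {
            return self.mode;
        }
        let selectivity = obs.rows_selected as f64 / obs.rows_total as f64;
        let avg_run = obs.rows_total as f64 / obs.runs as f64;
        let high = selectivity >= Self::HIGH_SELECTIVITY;
        let fragmented = selectivity >= Self::MODERATE_SELECTIVITY && avg_run < Self::SHORT_RUN;
        if high || fragmented {
            self.mode = Mode::PostFilter;
        }
        self.mode
    }
}

fn main() {
    let mut model = AdaptiveModel::new();
    // Sparse output: pushdown still saves decoding, so keep it.
    println!("{:?}", model.observe(Observation { rows_total: 100_000, rows_selected: 1_000, runs: 20 }));
    // Half the rows pass in runs of two: pushdown overhead likely dominates.
    println!("{:?}", model.observe(Observation { rows_total: 100_000, rows_selected: 50_000, runs: 50_000 }));
}
```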

Mask / selector planning

- Splits row-selection strategy resolution into a dedicated planning layer.
- Keeps explicit Mask and Selectors behavior intact.
- Makes Auto conservative around sparse page-loaded ranges.
- Adds sparse loaded-range tracking so explicit Mask no longer assumes all page data is dense.
- Replaces loaded-range intersection with a linear two-pointer merge.
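For readers unfamiliar with the technique, a linear two-pointer intersection of two range lists looks roughly like the sketch below. It assumes both inputs are sorted by start and internally non-overlapping; the function name `intersect` and the use of `Range<usize>` are assumptions for illustration and may not match the PR's code.

```rust
use std::ops::Range;

/// Intersect two sorted, non-overlapping range lists in O(a.len() + b.len()).
fn intersect(a: &[Range<usize>], b: &[Range<usize>]) -> Vec<Range<usize>> {
    let (mut i, mut j) = (0, 0);
    let mut out = Vec::new();
    while i < a.len() && j < b.len() {
        // Overlap of the two current ranges, if any.
        let start = a[i].start.max(b[j].start);
        let end = a[i].end.min(b[j].end);
        if start < end {
            out.push(start..end);
        }
        // Advance whichever range ends first; it cannot overlap anything later.
        if a[i].end <= b[j].end {
            i += 1;
        } else {
            j += 1;
        }
    }
    out
}

fn main() {
    let selected = [0..10, 20..30];
    let loaded = [5..25];
    println!("{:?}", intersect(&selected, &loaded));
}
```

Each loop iteration advances exactly one pointer, so the work is bounded by the combined length of the two lists rather than their product.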

Post-filter execution

- Adds a post-filter reader path that decodes the required output and predicate columns once, evaluates the RowFilter, and then projects back to the caller-requested output columns.
- Handles nested projection conservatively by requiring whole-root batch projection support.
- Avoids reusing sparse predicate chunks when rebuilding a base/full read.
- Avoids evaluating the same row-group predicate twice when caller-provided row selection is present.
- Disables adaptive post-filter execution for try_next_reader handoff paths.
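The decode-once, filter, then project flow of the post-filter path can be pictured with the standard-library-only sketch below. The `Batch` type and `post_filter` function are hypothetical stand-ins (integer columns only, no Arrow types); they illustrate the order of operations, not the reader's real implementation.

```rust
/// Hypothetical stand-in for a decoded batch: named columns of equal length.
#[derive(Debug, Clone, PartialEq)]
struct Batch {
    names: Vec<&'static str>,
    columns: Vec<Vec<i64>>,
}

/// `decoded` holds the union of output and predicate columns, decoded once.
/// The predicate is evaluated per row, then only `output` columns are kept.
fn post_filter(decoded: &Batch, predicate: impl Fn(&Batch, usize) -> bool, output: &[&str]) -> Batch {
    let num_rows = decoded.columns.first().map_or(0, |c| c.len());
    // Step 1: evaluate the filter over the already-decoded rows.
    let keep: Vec<bool> = (0..num_rows).map(|row| predicate(decoded, row)).collect();
    // Step 2: project back to the caller-requested columns, dropping filtered rows.
    let mut names = Vec::new();
    let mut columns = Vec::new();
    for (name, col) in decoded.names.iter().zip(&decoded.columns) {
        if output.iter().any(|o| *o == *name) {
            names.push(*name);
            columns.push(col.iter().zip(&keep).filter(|(_, k)| **k).map(|(v, _)| *v).collect());
        }
    }
    Batch { names, columns }
}

fn main() {
    let decoded = Batch {
        names: vec!["a", "b"],
        columns: vec![vec![1, 95, 100], vec![10, 20, 30]],
    };
    // Filter on `a > 90`, but the caller only asked for column `b`.
    let result = post_filter(&decoded, |b, row| b.columns[0][row] > 90, &["b"]);
    println!("{:?}", result);
}
```

Note that the sketch keeps columns in decoded order rather than in the order listed in `output`; a real reader would need to honor the caller's projection order.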

Benchmarks and tests

- Extends arrow_reader_row_filter benchmarks with strategy-sensitive cases.
- Adds focused coverage for:
  - sparse loaded page ranges;
  - explicit Mask correctness;
  - Auto strategy decisions;
  - adaptive post-filter decisions;
  - caller-provided row selections;
  - predicate cache behavior;
  - async reader snapshots.

Are these changes tested?

Yes.

Unit / integration validation:

cargo fmt -p parquet -- --check
git diff --check
cargo test -p parquet --lib arrow::push_decoder
cargo test -p parquet --lib arrow::arrow_reader::read_plan
cargo test -p parquet --lib arrow::arrow_reader::selection
cargo test -p parquet --lib
cargo test -p parquet --test arrow_reader --all-features
cargo bench -p parquet --bench arrow_reader_row_filter --features arrow,async --no-run
cargo clippy -p parquet --all-targets --all-features -- -D warnings
cargo +nightly doc --document-private-items --no-deps --workspace --all-features

Benchmark evidence:

Lower current/main is better.

`arrow_reader_row_filter`

Summary across the arrow_reader_row_filter benchmark cases:

| group | current/main | delta |
| --- | --- | --- |
| all | 0.9685x | -3.15% |
| async | 0.9353x | -6.47% |
| sync | 1.0028x | +0.28% |

The improvement is concentrated in async row-filter cases where fragmented pushdown previously paid extra planning/selection overhead. Sync cases are broadly neutral.

Most notable async improvements:

| mode | filter | projection | main | current | current/main | delta |
| --- | --- | --- | --- | --- | --- | --- |
| async | `utf8View <> ''` | all columns | 8.652 ms | 6.365 ms | 0.7357x | -26.43% |
| async | `int64 > 90` | all columns | 7.958 ms | 5.924 ms | 0.7445x | -25.55% |
| async | `int64 > 90` | exclude filter column | 7.603 ms | 5.893 ms | 0.7751x | -22.49% |
| async | `utf8View <> ''` | exclude filter column | 7.610 ms | 6.003 ms | 0.7889x | -21.11% |

The remaining cases are mostly within noise or show small regressions. The largest sync regressions in this run were around +1.5%, while the aggregate sync result was +0.28%.

TPC-DS with predicate pushdown enabled (SF10 on an AMD64 machine)

Against main, with predicate pushdown enabled, aggregate current speedup was:

| suite | aggregate speedup |
| --- | --- |
| TPC-DS | +3.271% |

Largest median improvements:

| query | main median | current median | current/main | time change | speedup |
| --- | --- | --- | --- | --- | --- |
| q9 | 613.776 ms | 294.409 ms | 0.4797x | -52.03% | +108.48% |
| q59 | 190.115 ms | 133.384 ms | 0.7016x | -29.84% | +42.53% |
| q70 | 207.927 ms | 150.346 ms | 0.7231x | -27.69% | +38.30% |
| q65 | 342.422 ms | 249.613 ms | 0.7290x | -27.10% | +37.18% |
| q26 | 128.938 ms | 107.054 ms | 0.8303x | -16.97% | +20.44% |

q9 was the largest improvement and was stable: current was faster in 10/10 rounds.
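The table columns are related by simple arithmetic. The sketch below reproduces the q9 row from its two medians, under the reading that "time change" is current/main minus one and "speedup" is main/current minus one; that reading matches the reported figures, but the helper names are made up for this example.

```rust
/// current/main ratio: below 1.0 means current is faster.
fn ratio(main_ms: f64, current_ms: f64) -> f64 {
    current_ms / main_ms
}

/// Relative change in elapsed time, in percent (negative = faster).
fn time_change_pct(main_ms: f64, current_ms: f64) -> f64 {
    (current_ms / main_ms - 1.0) * 100.0
}

/// Relative throughput gain, in percent (positive = faster).
fn speedup_pct(main_ms: f64, current_ms: f64) -> f64 {
    (main_ms / current_ms - 1.0) * 100.0
}

fn main() {
    // q9 medians taken from the table above.
    let (main_ms, current_ms) = (613.776, 294.409);
    println!("current/main = {:.4}x", ratio(main_ms, current_ms));
    println!("time change  = {:+.2}%", time_change_pct(main_ms, current_ms));
    println!("speedup      = {:+.2}%", speedup_pct(main_ms, current_ms));
}
```

The two percentages are not symmetric: halving the run time is roughly a -50% time change but a +100% speedup, which is why q9 shows -52.03% next to +108.48%.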

Largest median regressions:

| query | main median | current median | current/main | time change | slow rounds |
| --- | --- | --- | --- | --- | --- |
| q2 | 68.829 ms | 76.223 ms | 1.1074x | +10.74% | 9/10 |
| q92 | 45.358 ms | 47.347 ms | 1.0439x | +4.39% | 5/10 |
| q68 | 177.821 ms | 185.364 ms | 1.0424x | +4.24% | 8/10 |
| q77 | 87.460 ms | 91.013 ms | 1.0406x | +4.06% | 8/10 |
| q19 | 115.416 ms | 119.864 ms | 1.0385x | +3.85% | 7/10 |

Overall, the microbenchmark shows the intended improvement in async fragmented-row-filter cases, while sync behavior remains approximately neutral. The TPC-DS run shows a positive aggregate result with several large stable wins, but also identifies q2 as the main remaining regression to investigate.

Are there any user-facing changes?

No intended breaking API changes.

RowSelectionPolicy::Auto may choose different internal execution strategies than before. Explicit Mask and Selectors policies remain available for callers that want fixed behavior.

The Mask execution path is now more robust for sparse page-loaded ranges, which makes future use of Mask safer in page-index / fragmented-selection cases.

alamb · 2026-05-14T20:12:48Z

👋

hhhizzz · 2026-05-15T02:41:16Z

👋

👋🏻 I found that the results were still not stable the day after I published this. I can reproduce a regression in some rare cases and am still working on it.

alamb · 2026-05-15T14:45:42Z

Yeah, we are at the point where the code is already pretty fast, so additional optimizations get harder and harder

hhhizzz · 2026-05-22T14:30:46Z

A bit of context on how this PR evolved.

The initial motivation was #8565: predicate pushdown is not always cheaper than scanning when the produced RowSelection becomes highly fragmented. After my last PR (#8733), I thought there was still room for improvement. My first attempt was to make Auto prefer Mask more often for fragmented selections, because a dense bitmap can avoid some of the tiny select/skip run overhead.
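To illustrate why a bitmap helps with fragmented selections, the sketch below expands a run-length selection into a dense mask. The `Run` type is a hypothetical stand-in for the crate's selector type, and `to_mask` is an illustrative helper, not an arrow-rs function.

```rust
/// Hypothetical run-length selection entry.
#[derive(Debug, Clone, Copy)]
struct Run {
    rows: usize,
    select: bool, // true = keep these rows, false = skip them
}

/// Expand runs into one bool per row.
fn to_mask(runs: &[Run]) -> Vec<bool> {
    let total: usize = runs.iter().map(|r| r.rows).sum();
    let mut mask = Vec::with_capacity(total);
    for run in runs {
        mask.extend(std::iter::repeat(run.select).take(run.rows));
    }
    mask
}

fn main() {
    // A worst-case fragmented selection: every other row passes.
    let runs: Vec<Run> = (0..8).map(|i| Run { rows: 1, select: i % 2 == 0 }).collect();
    let mask = to_mask(&runs);
    // Eight separate select/skip steps collapse into a single eight-row mask.
    println!("{} runs -> mask {:?}", runs.len(), mask);
}
```

With run-length selectors the reader performs one select or skip step per run, so cost grows with the number of runs; with a mask it decodes a contiguous span and filters once, so cost grows with the number of rows instead.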

That turned out to be incomplete. Page pruning means the loaded pages may be sparse, and the previous mask path could assume rows were available even when their pages had not been loaded. So the work split into two parts:

- make explicit Mask safe by tracking loaded row ranges;
- keep Auto free to choose Selectors, Mask, or post-filter execution based on the shape of the actual read.

I also tried several purely static rules based on selectivity, projection shape, and data type. Some helped, but the results were fragile: rules that improved one fragmented case could regress sparse output reads or cacheable predicate cases. In particular, string / variable-width cases were easy to overfit. That is why the final design moved toward a small adaptive cost model instead of a larger pile of static heuristics.

The current implementation is roughly:

- use row-selection shape analysis to decide between Mask and Selectors;
- represent sparse loaded ranges explicitly so Mask does not fail on page-pruned data;
- observe early row-group pushdown behavior;
- switch later row groups to post-filter execution only when pushdown appears unlikely to pay for itself;
- keep explicit Mask / Selectors policies as caller intent and only apply this adaptive behavior to Auto.

I also added focused benchmark cases after seeing that the original benchmark suite did not clearly expose the cost-model-sensitive cases. The goal was to cover both the original fragmented-selection cliff and the cases where an overly aggressive rule could regress.

So the PR is larger than a single heuristic change because the important part was separating the concepts:

- what rows are selected;
- which pages are actually loaded;
- how that selection should be represented;
- whether pushdown or post-filter execution is cheaper for Auto.

That separation is what makes the Mask correctness fix and the adaptive Auto behavior fit together.

alamb · 2026-05-27T19:37:45Z

Thank you @hhhizzz -- I will try and review this later today or tomorrow

alamb · 2026-05-27T19:38:14Z

(I am not likely going to be able to review 8k lines in detail, however, so I will probably look at the high level first)

hhhizzz · 2026-05-28T03:06:45Z

(I am not likely going to be able to review 8k lines in detail, however, so I will probably look at the high level first)

Thanks for taking a look at this PR! I completely understand that an 8k-line diff is daunting to review in detail.

To help make the review process easier, I wanted to clarify that only about 3,450 lines are production code, while the remaining 4,800+ lines are benchmarks and extensive unit/integration tests.

If you still feel this is too large to review as a single PR, I would be more than happy to split this into smaller, incremental PRs. 😄 Here is how we can cleanly divide the work:

PR 1 (Infrastructure & Metrics): Expose ArrowReaderMetrics + refactor/extract strategy.rs and its isolated test file selection/tests.rs (No functional changes to the reader).
PR 2 (Post-Decode Filter State): Introduce post_filter.rs with its internal unit tests (Laying the groundwork for the fallback path).
PR 3 (Selection Policy & Cost Model): Add cost_model.rs, selection_policy.rs, integrate them into the push decoder state machine, and add the main benchmarks.

etseidl · 2026-05-28T15:37:42Z

If you still feel this is too large to review as a single PR, I would be more than happy to split this into smaller, incremental PRs. 😄 Here is how we can cleanly divide the work:

Please do...that would help me, and would provide a baseline to compare the improvements against.

hhhizzz added 3 commits May 10, 2026 16:53

perf(parquet): add adaptive row filter fallback

3576d7e

fix(parquet): address auto fallback review issues

5592f85

fix(parquet): harden auto fallback review fixes

22b2911

github-actions Bot added the parquet Changes to the parquet crate label May 10, 2026

hhhizzz added 3 commits May 10, 2026 19:11

fix(parquet): address CI failures

713979e

refactor(parquet): split row filter fallback helpers

a5f3a17

docs(parquet): explain row filter fallback design

5db1ea7

hhhizzz marked this pull request as ready for review May 11, 2026 10:06

hhhizzz marked this pull request as draft May 12, 2026 08:47

Qiwei Huang and others added 18 commits May 16, 2026 15:02

Merge origin/main into parquet-reader-auto-fallback-pr

e5c2f9d

docs: design row filter fallback readability refactor

a809215

refactor(parquet): clarify row filter fallback transition

f747e49

fix(parquet): address row filter CI failures

d31e805

refactor(parquet): frame auto post-filter as cost model

bbf7064

fix(parquet): gate row filter profiling by async feature

55341f4

bench(parquet): add focused row filter cost model cases

235bf05

Optimize post-filter selection resolve

d03f920

bench(parquet): add row filter cost model focus cases

61078c9

refactor(parquet): clarify reader cost model flow

6a0c4f6

fix(parquet): clean rustdoc link

7ea8132

Merge origin/main into parquet-reader-auto-fallback-pr

fd650e4

refactor(parquet): clarify row filter cost model

5b9576b

fix(parquet): keep cost model test feature neutral

bfee76e

fix(parquet): satisfy row filter bench clippy

bd48c95

fix(parquet): keep sparse projected filters on pushdown

7c5fde8

Limit projected predicate cost model switch

75f0d9f

Support post-filter for whole nested projections

eab1642

hhhizzz and others added 10 commits May 21, 2026 15:37

Refine post-filter root projection planning

269a84e

perf(parquet): start post-filter for cheap projected reads

891aa5e

Merge remote-tracking branch 'origin/main' into codex/parquet-reader-…

cefef89

…auto-fallback-pr # Conflicts: # parquet/src/arrow/push_decoder/mod.rs # parquet/src/arrow/push_decoder/reader_builder/mod.rs # parquet/src/arrow/push_decoder/remaining.rs

test(parquet): update row filter expectations for post-filter auto

7db5c9e

fix(parquet): satisfy clippy and rustdoc checks

f8e9e47

fix(parquet): observe cacheable projected predicates

b024d07

Merge remote-tracking branch 'hhhizzz/codex/parquet-reader-auto-fallb…

1227872

…ack-pr' into codex/parquet-reader-auto-fallback-pr

test(parquet): update async row filter snapshots

51d5abe

Refactor row filter execution planning

ef1fa0b

Merge remote-tracking branch 'origin/main' into codex/parquet-reader-…

c8bf3bb

…auto-fallback-pr

hhhizzz marked this pull request as ready for review May 22, 2026 14:16

Qiwei Huang added 5 commits May 25, 2026 00:03

Merge remote-tracking branch 'origin/main' into codex/parquet-reader-…

1ad89d7

…auto-fallback-pr

Add row filter regression benchmarks

d6838a1

perf(parquet): gate projected predicate post-filter by deferred outpu…

3088783

…t cost

Merge remote-tracking branch 'origin/main' into codex/parquet-reader-…

c487b8c

…auto-fallback-pr

Format parquet tests after main merge

4d59bcd

hhhizzz force-pushed the codex/parquet-reader-auto-fallback-pr branch from c472348 to 4d59bcd Compare May 28, 2026 00:54

hhhizzz wants to merge 39 commits into apache:main from hhhizzz:codex/parquet-reader-auto-fallback-pr

3 participants
