vello_hybrid: Make probe more asynchronous by LaurenzV · Pull Request #1671 · linebender/vello

LaurenzV · 2026-05-26T08:32:34Z

Context

Right now, the probing API is fully synchronous: We first construct the scene, render into using WebGL, and then cause a full pipeline stall by directly requesting a read-back from GPU to CPU memory. While it's incredibly hard to get reliable measurements, this does seem to incur a large latency on low-tier devices. Here are some numbers (again, they can vary wildly from run to run for reasons unknown to me, but overall they are in the same ballpark):

Vivo Y21:

Xiaomi RedNote 11:

Solution

Therefore, this PR proposes to switch to a slightly different approach: It makes use of WebGL Sync objects and pixel pack buffers to ensure that the vast majority of the work can happen asynchronously in the background without stalling the CPU. On the CPU side, we can then simply continuously poll the Sync object (for example in a RAF callback) to see whether the operation has finished (by calling `clientWaitSync with a timeout of 0, such that it returns immediately in case it's not done yet. I've done some crude measurements to confirm that this is very cheap). If it's not done yet, we can simply do other stuff, so the whole point is to avoid blocking while rendering is still in progress.

When applying this, basically the only synchronous parts are 1) scene construction + submitting commands to the GPU (start_probe in the screenshots below) and 2) doing the final read-back from the pixel pack buffer to the CPU (readback in the screenshots below). While the overall time can be longer sometimes, the synchronous parts seem to be much faster than before, therefore this seems like a win to me. Especially if we plan on extending the probe scene in the future.

Vivo Y21:

Xiaomi RedNote 11:

LaurenzV · 2026-05-26T15:29:15Z

I don't know why, but CI doesn't seem to like running the test. :/ I will have to look into it tomorrow...

This PR introduces two changes to our CI for testing sparse strips in CI: 1) Instead of having one single job, we split them up into three jobs: The first job for checking the examples, the second for running the tests without WASM and the third for running the tests with WASM. This cuts the "bottleneck" time for this test specifically from around 13 minutes to around 6-7 minutes. This won't have an immediate effect since the overall bottleneck is still Windows CI, but it's an easy change and will likely be worth it in the future once we reduce the time of Windows CI. 2) Use MacOS-based runners instead of Ubuntu for running the WASM tests. When working on linebender#1671, I have encountered the problem that the probe test does not pass anymore [due to some weird timeout](https://github.com/linebender/vello/actions/runs/26457569847/job/77896141078). <img width="1318" height="443" alt="image" src="https://github.com/user-attachments/assets/d1b4264b-30a2-4eb8-b3a8-7f2a55d624d9" /> I've spent more than a day trying to debug this now, but have not gotten further than concluding that for some reason, invoking asynchronous methods like `setTimeout` or `requestAnimationFrame` causes those timeouts to occur, regardless of what happens in there (even when removing the actual probing, it still happened when I invoked those methods). Locally everything works fine, and apparently it also works fine when switching to a MacOS-based runners. I'm fairly certain that this is some weirdness with the Github CI runners and not with probing failing completely on Linux (though I admittedly haven't tested it since I don't have a linux laptop), so I think this is the easiest thing to do for now. I'd also like to get to the bottom of this, but I've already spent more time on this than I can really justify, so I hope that this is fine doing for now.

ajakubowicz-canva · 2026-06-01T03:53:25Z

+    // Just to make sure they are kept alive until the probe
+    // finished. This is _probably_ not necessary because WebGl is a synchronous API
+    // so the `delete_` commands should only be executed once rendering has finished, but
+    // safe is safe.


Could this be expanded upon. Is my assumption that this comment means "kept alive via a reference to prevent garbage collection of resources"?

So basically, if we don't keep it around it would be dropped after probe_inner is done:

vello/sparse_strips/vello_hybrid/src/render/webgl.rs

Lines 616 to 620 in a7916e4

render_result?;

let pending = launch_probe(&self.gl, probe_resources, width, height);

Ok(pending)

I was just a bit worried if it's a problem if the drop is already triggered at that point, however, I'm pretty sure it's fine because as mentioned in the comment, WebGL is conceptually synchronous, meaning that if the fence_sync and the transfer to pixel pack buffer is called before the end of the method, it is guaranteed that those operations have finished before the deletions are triggered. If you agree with that, then happy to revert that part!

I am happy to remove (also, I would really hope that tests would catch something like this if it's wrong after removal)

ajakubowicz-canva · 2026-06-01T04:18:18Z

+    /// which can be checked again in the future. Otherwise, the probe result or an error will be
+    /// returned.
+    pub fn try_finish(mut self) -> Result<WebGlProbeStatus, WebGlProbeError> {
+        let status = self.gl.client_wait_sync_with_u32(


Luckily clientWaitSync is supported by Safari 15.

ajakubowicz-canva · 2026-06-01T04:49:49Z

-        self.gl.delete_framebuffer(Some(&probe_framebuffer));
-        self.gl.delete_texture(Some(&probe_texture));
-        Ok(pixels)
+        render_result?;


Is it worth a comment that we check for a valid render result here because WebGlProbeResources drop frees resources?

Alternatively could the WebGlProbeResources be created above render_scene such that the ? can be applied right there.

Yes this is the reason, will add a comment.

ajakubowicz-canva

Nice!

laurenz-canva force-pushed the laurenz/probe_2 branch from 5e2cd8b to dfbef54 Compare May 26, 2026 15:20

LaurenzV marked this pull request as draft May 26, 2026 15:29

laurenz-canva force-pushed the laurenz/probe_2 branch 5 times, most recently from 597e6f1 to 47475a6 Compare May 28, 2026 09:32

LaurenzV mentioned this pull request May 28, 2026

Tweaks to WASM CI for sparse strips #1680

Merged

laurenz-canva force-pushed the laurenz/probe_2 branch from 47475a6 to 4a4e046 Compare May 29, 2026 07:00

Make probing API asynchronous

77b641f

laurenz-canva force-pushed the laurenz/probe_2 branch from 4a4e046 to 77b641f Compare May 29, 2026 07:07

fix message

a7916e4

LaurenzV requested a review from ajakubowicz-canva May 29, 2026 08:33

LaurenzV marked this pull request as ready for review May 29, 2026 08:33

ajakubowicz-canva reviewed Jun 1, 2026

View reviewed changes

Add a comment

82149cb

ajakubowicz-canva approved these changes Jun 1, 2026

View reviewed changes

Don't keep resources around

e636265

laurenz-canva enabled auto-merge June 1, 2026 05:11

laurenz-canva added this pull request to the merge queue Jun 1, 2026

github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 1, 2026

laurenz-canva added this pull request to the merge queue Jun 1, 2026

github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 1, 2026

laurenz-canva force-pushed the laurenz/probe_2 branch from 66ca6bf to 99d8ff3 Compare June 1, 2026 07:44

Ignore test by default

fbd0db0

laurenz-canva force-pushed the laurenz/probe_2 branch from 99d8ff3 to fbd0db0 Compare June 1, 2026 07:45

laurenz-canva added this pull request to the merge queue Jun 1, 2026

Merged via the queue into main with commit c67158d Jun 1, 2026
19 checks passed

laurenz-canva deleted the laurenz/probe_2 branch June 1, 2026 08:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vello_hybrid: Make probe more asynchronous#1671

vello_hybrid: Make probe more asynchronous#1671
laurenz-canva merged 5 commits into
mainfrom
laurenz/probe_2

LaurenzV commented May 26, 2026 •

edited

Loading

Uh oh!

LaurenzV commented May 26, 2026

Uh oh!

ajakubowicz-canva Jun 1, 2026

Uh oh!

LaurenzV Jun 1, 2026

Uh oh!

ajakubowicz-canva Jun 1, 2026

Uh oh!

ajakubowicz-canva Jun 1, 2026

Uh oh!

ajakubowicz-canva Jun 1, 2026

Uh oh!

LaurenzV Jun 1, 2026

Uh oh!

ajakubowicz-canva left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	render_result?;

	let pending = launch_probe(&self.gl, probe_resources, width, height);

	Ok(pending)

Conversation

LaurenzV commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Solution

Uh oh!

LaurenzV commented May 26, 2026

Uh oh!

ajakubowicz-canva Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

LaurenzV Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

LaurenzV Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

LaurenzV commented May 26, 2026 •

edited

Loading