Improved rendering engine, recording resilience, and analytics trackingRendering engine tweaks by richiemcilroy · Pull Request #1674 · CapSoftware/Cap

richiemcilroy · 2026-03-21T16:56:42Z

Rendering engine: Rewrite cursor interpolation to use a fixed-timestep spring simulation with lookahead click targeting, replace densification with decimation, and simplify both cursor and display motion blur shaders into cleaner box/radial kernels. Add cursor x-axis tilt based on movement delta, anticipatory click animations, idle fade lookahead, edge-snapped zoom follow, and continuous zoom-out transitions with ramp easing. Switch zoom focus interpolation to lazy on-demand precomputation with cluster-based targeting for auto-zoom segments.
Recording pipeline resilience: Extract shared blocking-thread-finish and mux-send-error helpers used across all macOS/Windows muxer implementations. Propagate encoder failures as errors instead of warn-and-continue. Track optional pipeline (mic, camera, system audio) failures separately from display so a degraded track no longer aborts the entire recording — failures are captured in a recording-diagnostics.json sidecar.
Editor playback: Tune mid-start warmup parameters, drain prefetch buffers more aggressively, add extended retry with forward-skip recovery on frame misses, and switch to blocking render sends with a larger channel and drain-flush cycle.
Decoder: Use forward-only cache fallback during sequential playback (non-scrubbing) and add decoder pool bounds checking to prevent out-of-range access.
Encoding: Fix AVFoundation asset writer failures on UYVY camera sample buffers by rebuilding the sample buffer with a fresh format description instead of using copy_with_new_timing.
Analytics: Associate desktop PostHog events with the authenticated user's distinct ID, add auth_surface property to all auth tracking events, fix is_signup flags on login forms, add a 7-day window guard for signup tracking, and correct the Stripe webhook platform field to a proper enum.
Defaults: Tune click spring (530/1.0/40), default motion blur (1.0), cursor rotation amount (0.15), and add a cursor tilt slider to the editor config sidebar.

Greptile Summary

This is a broad, well-scoped improvement PR touching the rendering engine, recording resilience, editor playback, decoder, encoder, and analytics layers. The changes are generally sound and well-tested, but two issues in the rendering pipeline need attention before merge.

Key changes:

Cursor interpolation: Replaced event-driven spring simulation with a fixed-timestep (60 fps) simulation loop; densification replaced with decimation; click lookahead and idle-fade lookahead added. Previously flagged O(n) linear scans replaced with binary-search partition_point.
Recording resilience: Optional pipeline tracks (mic, camera, system audio) now fail gracefully and log to a recording-diagnostics.json sidecar instead of aborting the entire recording. Encoder failures now propagate as errors rather than warn-and-continue. Shared helpers extracted for blocking-thread-finish and mux-send-error patterns.
Encoder fix: AVFoundation UYVY camera sample buffers are now rebuilt with a fresh format description instead of copy_with_new_timing, fixing writer failures on camera overlay tracks.
Decoder: Forward-only cache fallback during sequential playback; decoder pool bounds check fixes a potential out-of-range access (best_id vs decoder_idx mismatch).
Zoom focus: Lazy on-demand incremental precomputation replaces full upfront precompute; cluster-based cursor targeting for auto-zoom segments.
Analytics: PostHog desktop events associated with authenticated user's distinct ID; auth_surface added to all auth tracking events; is_signup flags corrected on login forms; 7-day signup tracking window guard; Stripe webhook platform field corrected to a proper enum.
loadOp: "load" in webgpu-renderer.ts: Both render paths changed from "clear" to "load", which risks undefined/stale pixels on the first frame or after a GPU device-lost event.
cursor.wgsl offset_base: The expression offset_base = -vel_len / 2.0 / vel_len - 0.5 always simplifies to -1.0, making the motion blur sampling range a fixed constant. This is likely unintentional and could produce incorrect visual results depending on the design intent.

Confidence Score: 3/5

Safe to merge for recording/analytics changes; two rendering issues (loadOp and cursor shader offset_base) need verification before merge.
The recording pipeline, encoder, decoder, and analytics changes are well-implemented and well-tested. The cursor spring simulation rewrite is solid. However, both rendering issues are in the hot visual path: the loadOp: "load" change risks first-frame artifacts in the WebGPU renderer for both playback paths, and the offset_base expression in cursor.wgsl always evaluates to -1.0 regardless of velocity, which may or may not be the intended blur range but is at minimum misleading and could produce incorrect visual output.
apps/desktop/src/utils/webgpu-renderer.ts (both loadOp changes) and crates/rendering/src/shaders/cursor.wgsl (offset_base expression).

Important Files Changed

Filename	Overview
crates/rendering/src/shaders/cursor.wgsl	Cursor motion blur simplified from Gaussian to box kernel; `offset_base` expression always simplifies to `-1.0` due to `vel_len` cancellation, making the sampling range a constant regardless of intent.
apps/desktop/src/utils/webgpu-renderer.ts	`loadOp` changed from `"clear"` to `"load"` on both render paths; risks stale/garbage pixels on first frame render or after GPU device loss events.
crates/rendering/src/cursor_interpolation.rs	Full rewrite from event-driven to fixed-timestep spring simulation; replaces densification with decimation, adds click lookahead and idle-fade lookahead; O(n) scans from previous review replaced with binary-search `partition_point`.
crates/recording/src/output_pipeline/core.rs	Shared helpers extracted for blocking-thread-finish and mux-send-error; encoder failures now propagate as errors rather than warn-and-continue; covered by new unit tests.
crates/recording/src/studio_recording.rs	Optional pipeline tracks (mic, camera, system audio) now fail gracefully instead of aborting the recording; diagnostics are captured in a `recording-diagnostics.json` sidecar; well-tested.
crates/enc-avfoundation/src/mp4.rs	Fixes AVFoundation asset writer failures on UYVY camera buffers by rebuilding sample buffers with fresh format descriptions; comprehensive test matrix added for UYVY camera scenarios.
crates/rendering/src/decoder/multi_position.rs	Adds bounds check for empty decoder pool and filters pool positions by `decoder_count`, preventing out-of-range access; fixes `best_id` vs `decoder_idx` mismatch in `update_decoder_position`.
crates/rendering/src/zoom_focus_interpolation.rs	Replaces full upfront precompute with lazy on-demand incremental precomputation; adds cluster-based cursor targeting for auto-zoom segments; adds `click_spring` propagation to cursor interpolation.
apps/web/actions/analytics/track-user-signed-up.ts	Moves signup-tracking window check from SQL `CURRENT_DATE()` to a JS-side 7-day window guard; adds `getAffectedRows` helper for cross-ORM result shape compatibility.
apps/web/app/api/webhooks/stripe/route.ts	Stripe webhook `platform` field corrected from a boolean expression to a proper `"desktop"

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Recording Input] --> B{Track Type}
    B -->|Display - required| C[Display Pipeline]
    B -->|Mic/Camera/System - optional| D[Optional Track Pipelines]

    C --> E{Display Error?}
    E -->|Yes| F[Abort Recording]
    E -->|No| G[Continue]

    D --> H{Track Error?}
    H -->|Yes - Runtime| I[record_track_failure\nRuntime stage]
    H -->|Yes - Stop| J[finalize_optional_track\nStop stage]
    H -->|No| K[Track finished OK]

    I --> L[SharedTrackFailures\nArc-Mutex-Vec]
    J --> L
    K --> G

    G --> M[stop_recording]
    M --> N[take_track_failures]
    N --> O{Any failures?}
    O -->|Yes| P[Write recording-diagnostics.json]
    O -->|No| Q[Clean finish]
    P --> Q

    subgraph Cursor Rendering
        R[CursorEvents] --> S[decimate_cursor_moves]
        S --> T[build_smoothed_timeline\nFixed-timestep 60fps loop]
        T --> U[Click lookahead target\n500ms window]
        T --> V[Spring profile\nbinary search]
        U --> W[interpolate_timeline\nIndex lookup]
        V --> W
        W --> X[InterpolatedCursorPosition]
    end

Prompt To Fix All With AI

This is a comment left during a code review.
Path: crates/rendering/src/shaders/cursor.wgsl
Line: 97

Comment:
**`offset_base` always simplifies to `-1.0`**

The expression:
```wgsl
let offset_base = -vel_len / 2.0 / vel_len - 0.5;
```
cancels to `-(vel_len / 2.0) / vel_len - 0.5 = -0.5 - 0.5 = -1.0` regardless of `vel_len`. The loop then samples from `input.uv - velocity_uv` (i=0) to `input.uv - 0.05 * velocity_uv` (i=19), placing all 20 trailing samples strictly behind the cursor with no forward coverage.

If the intent was to center the kernel symmetrically (samples from `-0.5*velocity_uv` to `+0.5*velocity_uv`), `offset_base` should be `-0.5`. If the intent was to trail behind the cursor (samples from `-velocity_uv` to `0.0`), `offset_base` should simply be `-1.0`.

Either way, using the current expression with `vel_len` terms that cancel is misleading and masks the actual behavior. Suggest replacing with the explicit constant:
```suggestion
    let offset_base = -1.0;
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: apps/desktop/src/utils/webgpu-renderer.ts
Line: 302

Comment:
**`loadOp: "load"` may expose stale pixels on first frame**

Switching from `"clear"` to `"load"` skips resetting the attachment to `clearValue` before the render pass. In WebGPU, on the very first frame (or after a resize/GPU-device-lost recovery), the texture contents are **undefined** — `"load"` will then composite on top of garbage memory rather than a clean black background. This can manifest as single-frame visual glitches during playback start or after seeking.

The same change is applied to both `renderFrameWebGPU` (line 301) and `renderNv12FrameWebGPU` (line 415).

If the goal is to avoid the clear cost when the composite shader always writes every pixel, an alternative is to keep `"clear"` but only for the first frame render of a new session. Otherwise, at minimum add a comment confirming the shader guarantees full-pixel coverage so future readers understand the invariant.

```suggestion
				loadOp: "clear",
				clearValue: { r: 0, g: 0, b: 0, a: 1 },
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: apps/desktop/src/utils/webgpu-renderer.ts
Line: 415

Comment:
**Same `loadOp: "load"` concern for Nv12 path**

This is the same `"clear"` → `"load"` change applied to `renderNv12FrameWebGPU`. If the composite pass does not guarantee full pixel coverage on the first frame, the Nv12 render path will have the same stale-pixel risk.

```suggestion
				loadOp: "clear",
				clearValue: { r: 0, g: 0, b: 0, a: 1 },
```

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: "fix(web): handle arr..."}

Greptile also left 3 inline comments on this PR.

Made-with: Cursor

…btraction Made-with: Cursor

Made-with: Cursor

… simulation Made-with: Cursor

Made-with: Cursor

…d op Made-with: Cursor

Made-with: Cursor

…pers Made-with: Cursor

…mentations Made-with: Cursor

…decar Made-with: Cursor

Made-with: Cursor

…ecovery Made-with: Cursor

…cycle Made-with: Cursor

…breakdown Made-with: Cursor

…zoom_segments Made-with: Cursor

Made-with: Cursor

apps/desktop/src/routes/editor/ConfigSidebar.tsx

crates/rendering/src/cursor_interpolation.rs

crates/recording/src/output_pipeline/win_fragmented_m4s.rs

apps/web/app/s/[videoId]/_components/AuthOverlay.tsx

crates/rendering/src/zoom_focus_interpolation.rs

crates/rendering/src/decoder/multi_position.rs

crates/enc-avfoundation/src/mp4.rs

apps/web/app/api/webhooks/stripe/route.ts

…ate encoder finish logic Made-with: Cursor

…buffer rebuild Made-with: Cursor

…olation Made-with: Cursor

Made-with: Cursor

…n range Made-with: Cursor

Made-with: Cursor

…thOverlay Made-with: Cursor

richiemcilroy · 2026-03-21T17:45:59Z

@greptileai please re-review the PR

greptile-apps · 2026-03-21T17:51:42Z

crates/rendering/src/shaders/cursor.wgsl

+    let kernel_size = 21;
+    let k = kernel_size - 1;
+    let offset_base = -vel_len / 2.0 / vel_len - 0.5;



offset_base always simplifies to -1.0

The expression:

let offset_base = -vel_len / 2.0 / vel_len - 0.5;

cancels to -(vel_len / 2.0) / vel_len - 0.5 = -0.5 - 0.5 = -1.0 regardless of vel_len. The loop then samples from input.uv - velocity_uv (i=0) to input.uv - 0.05 * velocity_uv (i=19), placing all 20 trailing samples strictly behind the cursor with no forward coverage.

If the intent was to center the kernel symmetrically (samples from -0.5*velocity_uv to +0.5*velocity_uv), offset_base should be -0.5. If the intent was to trail behind the cursor (samples from -velocity_uv to 0.0), offset_base should simply be -1.0.

Either way, using the current expression with vel_len terms that cancel is misleading and masks the actual behavior. Suggest replacing with the explicit constant:

Suggested change

let offset_base = -1.0;

Prompt To Fix With AI

This is a comment left during a code review. Path: crates/rendering/src/shaders/cursor.wgsl Line: 97 Comment: **`offset_base` always simplifies to `-1.0`** The expression: ```wgsl let offset_base = -vel_len / 2.0 / vel_len - 0.5; ``` cancels to `-(vel_len / 2.0) / vel_len - 0.5 = -0.5 - 0.5 = -1.0` regardless of `vel_len`. The loop then samples from `input.uv - velocity_uv` (i=0) to `input.uv - 0.05 * velocity_uv` (i=19), placing all 20 trailing samples strictly behind the cursor with no forward coverage. If the intent was to center the kernel symmetrically (samples from `-0.5*velocity_uv` to `+0.5*velocity_uv`), `offset_base` should be `-0.5`. If the intent was to trail behind the cursor (samples from `-velocity_uv` to `0.0`), `offset_base` should simply be `-1.0`. Either way, using the current expression with `vel_len` terms that cancel is misleading and masks the actual behavior. Suggest replacing with the explicit constant: ```suggestion let offset_base = -1.0; ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-21T17:51:43Z

apps/desktop/src/utils/webgpu-renderer.ts

 				view: currentTexture.createView(),
-				clearValue: { r: 0, g: 0, b: 0, a: 1 },
-				loadOp: "clear",
+				loadOp: "load",


loadOp: "load" may expose stale pixels on first frame

Switching from "clear" to "load" skips resetting the attachment to clearValue before the render pass. In WebGPU, on the very first frame (or after a resize/GPU-device-lost recovery), the texture contents are undefined — "load" will then composite on top of garbage memory rather than a clean black background. This can manifest as single-frame visual glitches during playback start or after seeking.

The same change is applied to both renderFrameWebGPU (line 301) and renderNv12FrameWebGPU (line 415).

If the goal is to avoid the clear cost when the composite shader always writes every pixel, an alternative is to keep "clear" but only for the first frame render of a new session. Otherwise, at minimum add a comment confirming the shader guarantees full-pixel coverage so future readers understand the invariant.

Suggested change

loadOp: "load",

loadOp: "clear",

clearValue: { r: 0, g: 0, b: 0, a: 1 },

Prompt To Fix With AI

This is a comment left during a code review. Path: apps/desktop/src/utils/webgpu-renderer.ts Line: 302 Comment: **`loadOp: "load"` may expose stale pixels on first frame** Switching from `"clear"` to `"load"` skips resetting the attachment to `clearValue` before the render pass. In WebGPU, on the very first frame (or after a resize/GPU-device-lost recovery), the texture contents are **undefined** — `"load"` will then composite on top of garbage memory rather than a clean black background. This can manifest as single-frame visual glitches during playback start or after seeking. The same change is applied to both `renderFrameWebGPU` (line 301) and `renderNv12FrameWebGPU` (line 415). If the goal is to avoid the clear cost when the composite shader always writes every pixel, an alternative is to keep `"clear"` but only for the first frame render of a new session. Otherwise, at minimum add a comment confirming the shader guarantees full-pixel coverage so future readers understand the invariant. ```suggestion loadOp: "clear", clearValue: { r: 0, g: 0, b: 0, a: 1 }, ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-21T17:51:44Z

apps/desktop/src/utils/webgpu-renderer.ts

 				view: context.getCurrentTexture().createView(),
-				clearValue: { r: 0, g: 0, b: 0, a: 1 },
-				loadOp: "clear",
+				loadOp: "load",


Same loadOp: "load" concern for Nv12 path

This is the same "clear" → "load" change applied to renderNv12FrameWebGPU. If the composite pass does not guarantee full pixel coverage on the first frame, the Nv12 render path will have the same stale-pixel risk.

Suggested change

loadOp: "load",

loadOp: "clear",

clearValue: { r: 0, g: 0, b: 0, a: 1 },

Prompt To Fix With AI

This is a comment left during a code review. Path: apps/desktop/src/utils/webgpu-renderer.ts Line: 415 Comment: **Same `loadOp: "load"` concern for Nv12 path** This is the same `"clear"` → `"load"` change applied to `renderNv12FrameWebGPU`. If the composite pass does not guarantee full pixel coverage on the first frame, the Nv12 render path will have the same stale-pixel risk. ```suggestion loadOp: "clear", clearValue: { r: 0, g: 0, b: 0, a: 1 }, ``` How can I resolve this? If you propose a fix, please make it concise.

richiemcilroy added 28 commits March 21, 2026 16:53

chore: add target-agent and tmp to gitignore

09844dd

Made-with: Cursor

fix(lint): rename unchecked_duration_subtraction to unchecked_time_su…

4642b5a

…btraction Made-with: Cursor

feat(web): add posthog-growth analytics script

8412033

Made-with: Cursor

feat(desktop): associate posthog events with authenticated user

2b8ac90

Made-with: Cursor

feat(web): add auth_surface tracking to login and signup forms

09eb901

Made-with: Cursor

feat(web): track auth events from share overlay

d92b0c1

Made-with: Cursor

fix(web): add 7-day window and safer result parsing for signup tracking

66369fd

Made-with: Cursor

fix(web): use proper platform enum in stripe webhook

8273138

Made-with: Cursor

refactor(rendering): rewrite cursor interpolation with fixed-timestep…

98094b8

… simulation Made-with: Cursor

feat(rendering): anticipatory click animation and idle fade lookahead

a2326c8

Made-with: Cursor

perf(rendering): simplify cursor and display motion blur shaders

dcd31ef

Made-with: Cursor

feat(project): tune cursor spring, motion blur, and rotation defaults

4718c18

Made-with: Cursor

feat(desktop): add cursor tilt slider to config sidebar

f680366

Made-with: Cursor

fix(desktop): remove unused variables and switch webgpu render to loa…

2cd5d57

…d op Made-with: Cursor

feat(rendering): add edge snapping and continuous zoom-out animation

0f964dd

Made-with: Cursor

refactor(rendering): lazy precomputation and cluster-based zoom focus

05e12de

Made-with: Cursor

perf(rendering): forward-only decoder fallback and pool bounds safety

48e0b5d

Made-with: Cursor

refactor(recording): extract blocking thread finish and mux error hel…

db7a216

…pers Made-with: Cursor

refactor(recording): use shared finish helpers across all muxer imple…

a759889

…mentations Made-with: Cursor

feat(recording): track optional pipeline failures with diagnostics si…

dc5f9e3

…decar Made-with: Cursor

fix(encoding): rebuild video sample buffer for UYVY camera frames

4132529

Made-with: Cursor

perf(editor): improve playback warmup, buffer drain, and frame skip r…

b9b1a1c

…ecovery Made-with: Cursor

perf(editor): increase renderer channel and add drain-flush-blocking …

bc6abd6

…cycle Made-with: Cursor

feat(rendering): integrate cursor tilt, zoom focus, and NV12 startup …

8d7a032

…breakdown Made-with: Cursor

refactor: update ZoomFocusInterpolator callers with click_spring and …

48f2edc

…zoom_segments Made-with: Cursor

feat(export): add first-frame benchmark with NV12 startup breakdown

b305a4f

Made-with: Cursor

feat(export): add export startup time benchmark example

6959c29

Made-with: Cursor

feat(recording): add camera-writer-repro diagnostic example

d3d7c74

Made-with: Cursor

greptile-apps bot reviewed Mar 21, 2026

View reviewed changes

apps/desktop/src/routes/editor/ConfigSidebar.tsx Outdated Show resolved Hide resolved

crates/rendering/src/cursor_interpolation.rs Show resolved Hide resolved

crates/recording/src/output_pipeline/win_fragmented_m4s.rs Show resolved Hide resolved

tembo bot reviewed Mar 21, 2026

View reviewed changes

richiemcilroy added 7 commits March 21, 2026 17:41

refactor(recording): extract FinishableEncoderState trait to deduplic…

bada32f

…ate encoder finish logic Made-with: Cursor

perf(enc-avfoundation): prefer copy_with_new_timing over full sample …

08e7e51

…buffer rebuild Made-with: Cursor

perf(rendering): use binary search for click lookups in cursor interp…

baf66fb

…olation Made-with: Cursor

perf(rendering): use binary search for cursor idle opacity move lookup

8932583

Made-with: Cursor

fix(rendering): include segment start time in zoom focus interpolatio…

3e89f9b

…n range Made-with: Cursor

fix(desktop): remove unnecessary type cast on cursor rotationAmount

f6fcbb2

Made-with: Cursor

fix(web): handle array videoId param and use safer array access in Au…

a440a32

…thOverlay Made-with: Cursor

greptile-apps bot reviewed Mar 21, 2026

View reviewed changes

richiemcilroy merged commit 8be2327 into main Mar 21, 2026
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved rendering engine, recording resilience, and analytics trackingRendering engine tweaks#1674

Improved rendering engine, recording resilience, and analytics trackingRendering engine tweaks#1674
richiemcilroy merged 35 commits intomainfrom
rendering-engine-tweaks

richiemcilroy commented Mar 21, 2026 •

edited by greptile-apps bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

richiemcilroy commented Mar 21, 2026

Uh oh!

greptile-apps bot Mar 21, 2026

Uh oh!

greptile-apps bot Mar 21, 2026

Uh oh!

greptile-apps bot Mar 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	loadOp: "load",
	loadOp: "clear",
	clearValue: { r: 0, g: 0, b: 0, a: 1 },

Conversation

richiemcilroy commented Mar 21, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

richiemcilroy commented Mar 21, 2026

Uh oh!

greptile-apps bot Mar 21, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 21, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

richiemcilroy commented Mar 21, 2026 •

edited by greptile-apps bot

Loading