implement snapshot retention based on trigger buffer age by snowp · Pull Request #383 · bitdriftlabs/shared-core

snowp · 2026-01-28T20:40:27Z

Adds a callback that can be installed on trigger buffers, allowing us to update the snapshot retention time based on the age of the buffer entries
Adds a mechanism to peek at the next entry to be read. This allows configuring the initial retention handle value without having to wait for an eviction
Adds a runtime flag that controls how many snapshots we keep at any given time. This gives us a way to control the number of snapshot files left on disk.
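
For readers skimming the description, here is a rough sketch of how the eviction callback and the peek operation could fit together. It is not the code from this PR: `TriggerBuffer`, `set_callback`, `peek_oldest_timestamp`, and `push` are hypothetical names, entries are reduced to bare timestamps, and the real buffer is a byte-oriented ring buffer rather than a `VecDeque`.

```rust
use std::collections::VecDeque;

/// Hypothetical bounded buffer. Each entry is reduced to its write timestamp
/// (microseconds); payloads are omitted to keep the sketch short.
struct TriggerBuffer {
  timestamps: VecDeque<u64>,
  capacity: usize,
  /// Invoked with the timestamp of the new oldest entry after an eviction.
  on_oldest_changed: Option<Box<dyn FnMut(u64)>>,
}

impl TriggerBuffer {
  fn new(capacity: usize) -> Self {
    assert!(capacity > 0, "capacity must be non-zero");
    Self {
      timestamps: VecDeque::with_capacity(capacity),
      capacity,
      on_oldest_changed: None,
    }
  }

  /// Installs the callback used to move the retention time forward.
  fn set_callback(&mut self, callback: impl FnMut(u64) + 'static) {
    self.on_oldest_changed = Some(Box::new(callback));
  }

  /// Peeks at the next entry to be read without consuming it, so the initial
  /// retention value can be set before any eviction has happened.
  fn peek_oldest_timestamp(&self) -> Option<u64> {
    self.timestamps.front().copied()
  }

  /// Appends an entry, evicting the oldest one if the buffer is full.
  fn push(&mut self, timestamp_micros: u64) {
    let evicted = if self.timestamps.len() == self.capacity {
      self.timestamps.pop_front().is_some()
    } else {
      false
    };
    self.timestamps.push_back(timestamp_micros);

    if evicted {
      // Read the new oldest timestamp first so the borrows do not overlap.
      let oldest = self.timestamps.front().copied();
      if let (Some(callback), Some(ts)) = (self.on_oldest_changed.as_mut(), oldest) {
        callback(ts);
      }
    }
  }
}
```

In this sketch a caller would seed its retention value from `peek_oldest_timestamp()` once, then rely on the callback to advance it as old entries are evicted.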

Fixes BIT-7270

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

mattklein123

Very cool, some small comments. We definitely have to do some aggressive manual QA on all of this.

mattklein123 · 2026-01-29T21:30:15Z

bd-buffer/src/buffer/common_ring_buffer.rs

+        let record_start = (next_read.start + guard.extra_bytes_per_record) as usize;
+        let record_end = record_start + next_read_size as usize;
+        let record_data = if record_end <= guard.memory().len() {
+          guard.memory()[record_start .. record_end].to_vec()


Why do we need to copy here?

I got this working by 1) adding a const_memory that allows non-mut access to the data and 2) moving the slice read to after we advance the read cursor. I think this is correct, but some eyes on it would be helpful.
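
To illustrate the borrow ordering described in the reply above, here is a minimal sketch under stated assumptions. Only the name `const_memory` comes from the thread; `RecordBuffer`, `next_record_range`, `peek`, `read`, and the 4-byte little-endian length prefix are hypothetical and do not reflect the actual layout in common_ring_buffer.rs.

```rust
use std::ops::Range;

/// Assumed record layout for this sketch: a 4-byte little-endian length
/// prefix followed by the payload bytes.
const LEN_PREFIX: usize = 4;

struct RecordBuffer {
  memory: Vec<u8>,
  read_offset: usize,
}

impl RecordBuffer {
  /// Non-mut access to the underlying bytes.
  fn const_memory(&self) -> &[u8] {
    &self.memory
  }

  /// Computes the payload range of the next record, or None if the buffer
  /// does not contain a complete record at the read cursor.
  fn next_record_range(&self) -> Option<Range<usize>> {
    let start = self.read_offset.checked_add(LEN_PREFIX)?;
    let prefix = self.const_memory().get(self.read_offset .. start)?;
    let mut len_bytes = [0u8; LEN_PREFIX];
    len_bytes.copy_from_slice(prefix);
    let len = u32::from_le_bytes(len_bytes) as usize;
    let end = start.checked_add(len)?;
    (end <= self.memory.len()).then(|| start .. end)
  }

  /// Peeks at the next record without copying it or moving the cursor.
  fn peek(&self) -> Option<&[u8]> {
    let range = self.next_record_range()?;
    Some(&self.const_memory()[range])
  }

  /// Advances the cursor first, then borrows the slice immutably. Doing the
  /// mutation before taking the shared borrow is what avoids the `to_vec()`.
  fn read(&mut self) -> Option<&[u8]> {
    let range = self.next_record_range()?;
    self.read_offset = range.end;
    Some(&self.const_memory()[range])
  }
}
```

The returned slice borrows from the buffer, so the caller cannot write to the buffer while holding it. Whether that constraint is acceptable depends on how the real buffer hands out records.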

mattklein123 · 2026-01-29T21:36:27Z

bd-runtime/src/runtime.rs

+  // Controls the maximum number of snapshots to retain for persistent state.
+  // A value of 0 disables count-based cleanup.
+  int_feature_flag!(MaxSnapshotCount, "state.max_snapshot_count", 0);


Seems like we should default this on? Otherwise won't we get unbounded growth?

Yeah, let me set this to some sane default for now and we can handle this more generally. The limit only matters in cases where we end up rotating a lot and creating a lot of snapshots within the time period permitted by the retention handles, so it wouldn't necessarily grow unbounded, but a default makes sense. With the 1 MB snapshot size you'd have to write a ton of state updates before actually running into this limit.

Going to leave this at 0 by default so we can turn off snapshot retention completely by default, which seems like the safer option given how we handle default runtime values as part of crash loop detection.

Oh sorry, I didn't realize what this meant. I thought it meant that it just doesn't clean anything up, which is what the comment sounds like. Update the comment?
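
A small sketch may help pin down the two readings of 0 discussed in this thread. It assumes the reading from the later replies, where the flag is the number of snapshots to keep and 0 therefore keeps none; that is an interpretation of the conversation, not confirmed behavior. The function `snapshots_to_delete` and the use of bare timestamps to identify snapshots are hypothetical.

```rust
/// Returns the snapshots (identified here by creation timestamp) that fall
/// outside the newest `max_snapshot_count` and would be removed.
///
/// Assumption for this sketch: a limit of 0 retains nothing, so every
/// snapshot is returned for deletion.
fn snapshots_to_delete(mut snapshot_timestamps: Vec<u64>, max_snapshot_count: usize) -> Vec<u64> {
  // Sort newest first so the snapshots to keep sit at the front.
  snapshot_timestamps.sort_unstable_by(|a, b| b.cmp(a));
  let keep = max_snapshot_count.min(snapshot_timestamps.len());
  // Everything after the first `keep` entries is over the limit.
  snapshot_timestamps.split_off(keep)
}
```

Under the other reading, where 0 disables count-based cleanup, the function would instead return an empty list for a limit of 0, which is why the wording of the flag comment matters.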

snowp and others added 12 commits January 28, 2026 10:04

record session ID into bd-state

68cab28

Wire trigger retention from buffer eviction

540907c

revert api bump

84c24d9

move handle into buffer, better type alias

06ed281

update protos again

e553b6b

fix some test

75b9933

Update retention registry to read runtime watch

a0c03e2

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Adjust resilient-kv tests for retention watch

e97d39a

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Wire runtime loader through bd-state stores

c75e8f8

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Update retention watch usage in tests

b7c8c75

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Add runtime watch to fuzz harness

a2188a6

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

cleanup

d86cf6d

snowp assigned mattklein123 Jan 29, 2026

handle initial lookback

d70e99c

mattklein123 reviewed Jan 29, 2026


snowp added 11 commits January 29, 2026 13:56

fix test

45a7059

rework runtime flag

8a2233b

format

3bc70f6

fix tests

c5f4c1b

avoid forcing copy during peek operation

ab295d5

comment

61027a5

better tests

3e40d5f

clippy

4d4442e

fix test

dca4a46

fmt

56997ca

fix

f1678e2

snowp wants to merge 24 commits into main from retention-handle-eviction
