Release v0.6.0#31
Merged
Merged
Conversation
Workspace `version` bumped from `0.5.0` to `0.6.0`. All internal
path-deps updated. CHANGELOG `[Unreleased]` renamed to
`[0.6.0] - 2026-05-07`; a fresh empty `[Unreleased]` block stays
on top for ongoing work.
Highlights since 0.5.0:
* **Word-level Media Overlay sync** (M6.5). whisper.cpp's
per-token timestamps drive karaoke-style
highlight-along-with-audio in Thorium / Readium /
Colibrio. Default-on with `--transcribe`; opt out via
`--no-word-sync`.
* **First-run UX overhaul.**
- `dpub doctor` — diagnostic for build state, runtime
tools, and Whisper model cache. `--json` for CI.
- `dpub setup --whisper-model <size>` — SHA-verified
download to `~/.cache/dpub/models/`.
- `scripts/build.sh` — host-aware release build,
auto-picks `--features metal` on Apple Silicon and
`--features cuda` on Linux+nvcc.
- `--transcribe` auto-discovers cached models; an
interactive TTY prompt offers to download
`ggml-medium.bin` when no model is cached.
- `dpub doctor --install` — opt-in installer for missing
runtime tools via `brew` / `apt-get` / `dnf` with
per-tool consent.
Pushing the `v0.6.0` tag after merge fires the release workflow,
which produces unsigned binaries for Linux x86_64, macOS arm64
(with Metal Whisper acceleration), and Windows x86_64 and uploads
each as a release asset.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Workspace version bumped from `0.5.0` to `0.6.0`. All internal path-deps updated. CHANGELOG `[Unreleased]` renamed to `[0.6.0] - 2026-05-07`; fresh empty `[Unreleased]` block stays on top.
After this PR merges, push the `v0.6.0` tag (`git tag v0.6.0 && git push origin v0.6.0`) and the release workflow builds unsigned binaries for Linux x86_64, macOS arm64 (with Metal Whisper acceleration), and Windows x86_64.
What ships in 0.6.0
Two big additions on top of v0.5.0's feature-complete v1 candidate:
Word-level Media Overlay sync (M6.5)
whisper.cpp's per-token timestamps now drive karaoke-style highlight-along-with-audio in compatible reading systems (Thorium, Readium, Colibrio). BPE tokens are coalesced back into whole words via a leading-space rule with punctuation attachment; each word becomes one SMIL `` wrapped in a per-paragraph ``. Default-on; `--no-word-sync` opts out.
Materially valuable for dyslexic readers, language learners, and low-vision users tracking with magnifiers — a class of accessibility experience no other open-source DAISY → EPUB toolchain ships.
First-run UX overhaul
Five-command path from fresh clone to working binary:
```sh
brew install cmake epubcheck ffmpeg
git clone https://github.com/11ways/dpub && cd dpub
./scripts/build.sh # auto-picks Metal on Apple Silicon
./target/release/dpub setup --whisper-model medium # SHA-verified download
./target/release/dpub doctor # confirms everything green
```
Test plan