fix: OGG/Opus audio truncation — final page lost in write_chunk finalize by will-assistant · Pull Request #448 · remsky/Kokoro-FastAPI

will-assistant · 2026-02-14T23:24:28Z

Summary

One-line fix: container.close() must be called before output_buffer.getvalue() in the write_chunk finalize block. The current order loses the final OGG page containing ~1-2 seconds of audio.

The Bug

When using response_format: "opus" on /v1/audio/speech, output audio is consistently truncated. The last 1-2 seconds are silently dropped. All other formats (MP3, WAV, FLAC, PCM) work correctly.

Related issue: #447

Root Cause

In api/src/services/streaming_audio_writer.py, the finalize block does:

# ❌ BEFORE (broken)
data = self.output_buffer.getvalue()  # reads buffer BEFORE final page is written
self.close()                           # closes container, writing final OGG page to buffer (too late)
return data                            # returns incomplete audio

For OGG/Opus, the container writes the final audio page to the output buffer during close(). By reading the buffer first, that last page is lost. MP3/WAV/FLAC aren't affected because their container close only writes metadata trailers, not audio frames.
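The same ordering hazard can be reproduced with only the standard library. The sketch below is an analogy, not the project's code: it puts `gzip.GzipFile` over an `io.BytesIO` in place of the OGG container, since both hold data back until `close()`. The function name `snapshot` is hypothetical.

```python
import gzip
import io


def snapshot(payload: bytes, close_first: bool) -> bytes:
    """Write payload through a GzipFile and return the buffer contents.

    GzipFile stands in for the OGG container here (an analogy only):
    both keep data back until close() is called.
    """
    buffer = io.BytesIO()
    # mtime=0 keeps the gzip header identical between runs.
    writer = gzip.GzipFile(fileobj=buffer, mode="wb", mtime=0)
    writer.write(payload)
    if close_first:
        writer.close()            # flushes remaining data and the trailer
        data = buffer.getvalue()  # complete stream
    else:
        data = buffer.getvalue()  # read too early: the tail is not written yet
        writer.close()
    buffer.close()
    return data


payload = b"final page of audio " * 50
broken = snapshot(payload, close_first=False)
fixed = snapshot(payload, close_first=True)
```

Here `broken` is a strict prefix of `fixed`, which mirrors the truncated Opus output: everything written before `close()` is present, and only the tail is missing.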

Fix

# ✅ AFTER (fixed)
self.container.close()                 # writes final OGG page to buffer
data = self.output_buffer.getvalue()   # now includes all audio data
self.output_buffer.close()
return data

Test Results

Same text, same voice, same speed — only response_format differs:

Before fix

Text	MP3 duration	Opus duration	Lost
Short	3.408s	2.000s	1.4s
Medium	5.016s	3.000s	2.0s
Long	10.224s	9.000s	1.2s

Note the round-number Opus durations: OGG pages are emitted at ~1s granule boundaries, and the final partial page was being dropped.

After fix

Text	MP3 duration	Opus duration	Delta
Short	3.408s	3.347s	0.06s ✅
Medium	5.016s	4.959s	0.06s ✅
Long	10.224s	10.163s	0.06s ✅

Durations now match within ~60ms (normal codec framing overhead).
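The ~60ms figure can be recomputed from the "After fix" table. This is a small arithmetic check only; the helper name `deltas_ms` is hypothetical.

```python
# Durations in seconds (MP3, Opus), copied from the "After fix" table.
after_fix = {
    "Short": (3.408, 3.347),
    "Medium": (5.016, 4.959),
    "Long": (10.224, 10.163),
}


def deltas_ms(rows):
    """Return the MP3-minus-Opus duration gap per row, in whole milliseconds."""
    return {name: round((mp3 - opus) * 1000) for name, (mp3, opus) in rows.items()}


gaps = deltas_ms(after_fix)
# gaps -> {'Short': 61, 'Medium': 57, 'Long': 61}
```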

Changed Files

api/src/services/streaming_audio_writer.py — 10 lines changed in write_chunk() finalize block

Testing

Tested on GPU Docker build (CUDA 12.9.1, PyTorch)
Verified with voice blending (am_puck(1)+am_liam(1)+am_onyx(0.5) at 1.2x speed)
Confirmed MP3/WAV/FLAC output unchanged
Sent fixed opus output as Discord voice messages — plays completely, no cutoff
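To repeat the comparison, the same request can be issued once per format. The sketch below only builds the requests and does not send them. The base URL and the `model` value are assumptions that this PR does not state, and `build_speech_request` is a hypothetical helper; the endpoint path, voice blend, and speed come from the notes above.

```python
import json
import urllib.request

# Assumption: the server listens on localhost:8880; adjust to your deployment.
BASE_URL = "http://localhost:8880"


def build_speech_request(text: str, response_format: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/audio/speech."""
    payload = {
        "model": "kokoro",  # assumption: the model name is not stated in this PR
        "input": text,
        "voice": "am_puck(1)+am_liam(1)+am_onyx(0.5)",  # blend from the Testing notes
        "speed": 1.2,
        "response_format": response_format,
    }
    return urllib.request.Request(
        BASE_URL + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Same text, voice, and speed; only response_format differs.
requests_by_format = {
    fmt: build_speech_request("Hello there.", fmt) for fmt in ("mp3", "opus")
}
```

Each request can then be sent with `urllib.request.urlopen(req).read()` and the saved files compared by duration, for example with a tool such as `ffprobe`.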

The finalize block in write_chunk() called output_buffer.getvalue() before container.close(). For OGG/Opus, the final page of audio data is only written to the buffer during close(), causing ~1-2 seconds of audio to be lost. Swap the order: close container first, then read buffer. Fixes: remsky#447

will-assistant wants to merge 1 commit into remsky:master from will-assistant:fix/opus-truncation
