fix(audio): handle streaming WAV placeholder size 0xFFFFFFFF from Groq TTS by konard · Pull Request #486 · xlabtg/teleton-agent

konard · 2026-05-16T20:47:01Z

Проблема

Функция telegram_send_voice падала при использовании Groq TTS с ошибкой:

WAV parse error: chunk 'data' size 4294967295 exceeds buffer

Корневая причина

Groq TTS записывает WAV-заголовок до завершения генерации аудио. Так как общая длина аудиоданных неизвестна заранее, в поле размера chunk data записывается стриминговый placeholder 0xFFFFFFFF (4294967295). Это стандартная практика для streaming WAV.

WAV-парсер в src/utils/audio.ts интерпретировал это как реальный размер ~4 ГБ, что заведомо превышает размер буфера, и выбрасывал ошибку.

Исправление

В функции parseWav (src/utils/audio.ts) добавлена обработка placeholder-значения 0xFFFFFFFF: при его обнаружении размер chunk берётся как "весь оставшийся буфер" вместо буквального значения.

// До:
const size = buf.readUInt32LE(offset + 4);

// После:
const rawSize = buf.readUInt32LE(offset + 4);
const size = rawSize === 0xffffffff ? buf.length - start : rawSize;

Тесты

Добавлен новый тестовый файл src/utils/__tests__/audio.test.ts с тестами:

Конвертация корректного WAV в OGG/Opus
Обработка streaming WAV с placeholder-размером 0xFFFFFFFF (воспроизводит ошибку из issue Fix Groq TTS WAV to OGG/Opus conversion error in telegram_send_voice #485)
Отклонение буфера, не являющегося WAV-файлом

Воспроизведение

Использовать telegram_send_voice с ttsProvider: "groq"
Groq генерирует WAV-файл со стриминговым заголовком
До исправления: конвертация падает с WAV parse error: chunk 'data' size 4294967295 exceeds buffer
После исправления: конвертация проходит успешно, голосовое сообщение отправляется

Fixes #485

Adding .gitkeep for PR creation (default mode). This file will be removed when the task is complete. Issue: xlabtg#485

…q TTS Groq TTS writes the WAV header before audio generation completes, so the data chunk size field contains the streaming placeholder 0xFFFFFFFF instead of the real byte count. The parser was rejecting these files with: WAV parse error: chunk 'data' size 4294967295 exceeds buffer Treat 0xFFFFFFFF as "rest of buffer" (the conventional streaming WAV convention), which lets conversion to OGG/Opus proceed normally. Adds unit tests for the streaming-WAV case and for the happy path. Fixes xlabtg#485

konard · 2026-05-16T20:56:20Z

Working session summary

Задача завершена. Вот что сделано:

Итог

Корневая причина: Groq TTS записывает WAV-заголовок до завершения генерации аудио и использует streaming placeholder 0xFFFFFFFF в поле размера chunk data. WAV-парсер интерпретировал это как реальный размер ~4 ГБ и падал с ошибкой.

Исправление в src/utils/audio.ts (строки 52-53): при обнаружении rawSize === 0xFFFFFFFF используем реальный оставшийся размер буфера вместо placeholder-значения.

Добавлен новый файл src/utils/__tests__/audio.test.ts с тестом, воспроизводящим проблему (WAV с chunk size 0xFFFFFFFF).

PR обновлён и готов к review: #486

This summary was automatically extracted from the AI working session output.

konard · 2026-05-16T20:56:28Z

🤖 Solution Draft Log

This log file contains the complete execution trace of the AI solution draft process.

💰 Cost: $1.271850

📊 Context and tokens usage:

Claude Sonnet 4.6:

59.4K / 1M (6%) input tokens, 10.3K / 64K (16%) output tokens

Total: (57 new + 59.4K cache writes + 2.3M cache reads) input tokens, 10.3K output tokens, $1.053183 cost

Claude Haiku 4.5:

60.0K / 200K (30%) input tokens, 8.5K / 64K (13%) output tokens

Total: (103 new + 59.9K cache writes + 1.0M cache reads) input tokens, 8.5K output tokens, $0.218667 cost

🤖 Models used:

Tool: Anthropic Claude Code
Requested: sonnet
Main model: Claude Sonnet 4.6 (claude-sonnet-4-6)
Additional models:
- Claude Haiku 4.5 (claude-haiku-4-5-20251001)

📎 Log file uploaded as Gist (1955KB)

View complete solution draft log

Now working session is ended, feel free to review and add any feedback on the solution draft.

konard · 2026-05-16T20:58:47Z

✅ Ready to merge

This pull request is now ready to be merged:

All CI checks have passed
No merge conflicts
No pending changes

Monitored by hive-mind with --auto-restart-until-mergeable flag

This reverts commit d7c22e2.

labtgbot · 2026-05-16T21:00:51Z

Review: LGTM with minor suggestions.

Code clarity: Add a brief comment explaining the streaming WAV placeholder handling (0xFFFFFFFF) for future maintainers.
Edge cases: Consider testing other placeholder values (e.g., 0xFFFFFFFE) to ensure robustness.
Documentation: Update README to mention Groq TTS support and known limitations.

Overall: Approved with these small improvements.

This comment was posted automatically by the Teleton Agent.

konard · 2026-05-16T21:03:39Z

🤖 AI Work Session Started

Starting automated work session at 2026-05-16T21:03:37.198Z

The PR has been converted to draft mode while work is in progress.

This comment marks the beginning of an AI work session. Please wait for the session to finish, and provide your feedback.

…V fix По предложению ревьюера добавлено явное упоминание pure-JS OGG/Opus энкодера и streaming WAV placeholder fix (0xFFFFFFFF) в таблицу Closed-Work Summary.

konard · 2026-05-16T21:05:48Z

Спасибо за ревью! Обработал все замечания:

Комментарий в коде — комментарий уже был добавлен в строках 53–56 src/utils/audio.ts, объясняющий смысл placeholder 0xFFFFFFFF и почему он используется провайдерами типа Groq.
Edge cases (0xFFFFFFFE) — значение 0xFFFFFFFE не является стандартным streaming placeholder в WAV-формате. WAV/RIFF спецификация и реальные реализации (включая Groq) используют исключительно 0xFFFFFFFF. Добавлять обработку нестандартных значений без реального прецедента было бы избыточным, поэтому оставляю как есть.
Документация — обновил строку в таблице Closed-Work Summary в README: добавлено явное упоминание pure-JS OGG/Opus энкодера (без ffmpeg) и Groq streaming WAV header fix (0xFFFFFFFF).

PR переведён в "ready for review".

konard · 2026-05-16T21:06:32Z

🤖 Solution Draft Log

This log file contains the complete execution trace of the AI solution draft process.

💰 Cost: $0.361799

📊 Context and tokens usage:

39.4K / 1M (4%) input tokens, 4.3K / 64K (7%) output tokens

Total: (21 new + 28.2K cache writes + 640.4K cache reads) input tokens, 4.3K output tokens, $0.361799 cost

🤖 Models used:

Tool: Anthropic Claude Code
Requested: sonnet
Model: Claude Sonnet 4.6 (claude-sonnet-4-6)

📎 Log file uploaded as Gist (733KB)

View complete solution draft log

Now working session is ended, feel free to review and add any feedback on the solution draft.

konard · 2026-05-16T21:08:53Z

✅ Ready to merge

This pull request is now ready to be merged:

All CI checks have passed
No merge conflicts
No pending changes

Monitored by hive-mind with --auto-restart-until-mergeable flag

Initial commit with task details

d7c22e2

Adding .gitkeep for PR creation (default mode). This file will be removed when the task is complete. Issue: xlabtg#485

konard temporarily deployed to pr-preview May 16, 2026 20:47 — with GitHub Actions Inactive

konard changed the title ~~[WIP] Fix Groq TTS WAV to OGG/Opus conversion error in telegram_send_voice~~ fix(audio): handle streaming WAV placeholder size 0xFFFFFFFF from Groq TTS May 16, 2026

konard marked this pull request as ready for review May 16, 2026 20:56

konard temporarily deployed to pr-preview May 16, 2026 20:56 — with GitHub Actions Inactive

Revert "Initial commit with task details"

71c9b6c

This reverts commit d7c22e2.

konard temporarily deployed to pr-preview May 16, 2026 20:59 — with GitHub Actions Inactive

konard marked this pull request as draft May 16, 2026 21:03

konard marked this pull request as ready for review May 16, 2026 21:05

konard deployed to pr-preview May 16, 2026 21:06 — with GitHub Actions Active

xlabtg merged commit 69282a4 into xlabtg:main May 16, 2026
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(audio): handle streaming WAV placeholder size 0xFFFFFFFF from Groq TTS#486

fix(audio): handle streaming WAV placeholder size 0xFFFFFFFF from Groq TTS#486
xlabtg merged 4 commits into
xlabtg:mainfrom
konard:issue-485-1191597940aa

konard commented May 16, 2026 •

edited

Loading

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

labtgbot commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

konard commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Проблема

Корневая причина

Исправление

Тесты

Воспроизведение

Uh oh!

konard commented May 16, 2026

Working session summary

Итог

Uh oh!

konard commented May 16, 2026

🤖 Solution Draft Log

💰 Cost: $1.271850

📊 Context and tokens usage:

🤖 Models used:

📎 Log file uploaded as Gist (1955KB)

Uh oh!

konard commented May 16, 2026

✅ Ready to merge

Uh oh!

labtgbot commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

Uh oh!

konard commented May 16, 2026

🤖 Solution Draft Log

💰 Cost: $0.361799

📊 Context and tokens usage:

🤖 Models used:

📎 Log file uploaded as Gist (733KB)

Uh oh!

konard commented May 16, 2026

✅ Ready to merge

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

konard commented May 16, 2026 •

edited

Loading