feat(fishaudio): add speech-to-text support #5668

Open
Kilerd wants to merge 1 commit into livekit:main from Kilerd:feat/fishaudio-stt

Kilerd commented May 7, 2026

Adds Speech-to-Text support to the Fish Audio plugin.

This implements a new fishaudio.STT provider using Fish Audio's /v1/asr endpoint, alongside the existing TTS support. The STT implementation uploads LiveKit audio buffers as WAV multipart form data, supports optional language hints, preserves returned timestamp segments in transcript metadata, and reports the Fish ASR model as transcribe-1 for metrics/logging.
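For readers unfamiliar with the upload path described above, here is a minimal standard-library sketch of wrapping 16-bit PCM in a WAV container and encoding it as multipart form data. It is not the code in this PR. The helper names `pcm_to_wav` and `build_asr_form` are hypothetical, and the form field names `audio` and `language` are assumptions about the `/v1/asr` request shape.

```python
import io
import uuid
import wave
from typing import Optional, Tuple


def pcm_to_wav(pcm: bytes, sample_rate: int, num_channels: int) -> bytes:
    """Wrap raw 16-bit little-endian PCM samples in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(num_channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


def build_asr_form(wav_bytes: bytes, language: Optional[str] = None) -> Tuple[bytes, str]:
    """Return (body, content_type) for a multipart/form-data upload.

    The field names "audio" and "language" are assumed for illustration.
    The language part is omitted when no hint is given.
    """
    boundary = uuid.uuid4().hex
    parts = []
    if language is not None:
        language_part = (
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="language"\r\n\r\n'
            f"{language}\r\n"
        )
        parts.append(language_part.encode("utf-8"))
    audio_header = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="audio"; filename="audio.wav"\r\n'
        "Content-Type: audio/wav\r\n\r\n"
    )
    parts.append(audio_header.encode("utf-8") + wav_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    content_type = f"multipart/form-data; boundary={boundary}"
    return b"".join(parts), content_type
```

The returned body and content type could then be sent with any HTTP client; an async client's own form helper would normally replace the manual encoding shown here.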

Changes

  • Add livekit.plugins.fishaudio.STT
  • Add FishAudioSTT alias for explicit naming
  • Add STTModels = Literal["transcribe-1"]
  • Export STT symbols from the Fish Audio plugin package
  • Update Fish Audio README with separate STT and TTS examples
  • Update package description to mention STT support
  • Add unit tests for:
    • multipart WAV form upload
    • transcript mapping
    • language auto-detection behavior
    • HTTP error handling/logging
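As a rough illustration of the transcript mapping and language auto-detection items above, the sketch below maps an ASR response into a transcript while keeping timestamp segments in metadata. The `Transcript` dataclass and `map_asr_response` function are hypothetical stand-ins rather than the plugin's actual types, and the response keys (`text`, `segments`, `start`, `end`) are assumed. Only `STTModels` and the model name `transcribe-1` come from this PR.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Literal, Optional

# Type alias named in the PR's change list.
STTModels = Literal["transcribe-1"]


@dataclass
class Transcript:
    """Hypothetical stand-in for the plugin's transcript result."""

    text: str
    language: Optional[str]  # None means no hint was given (auto-detect)
    model: str = "transcribe-1"
    metadata: Dict[str, Any] = field(default_factory=dict)


def map_asr_response(
    payload: Dict[str, Any], language: Optional[str] = None
) -> Transcript:
    """Map an assumed ASR JSON payload to a Transcript.

    Timestamp segments are copied into metadata so callers can still
    reach per-segment start and end times.
    """
    segments = [
        {"text": s.get("text", ""), "start": s.get("start"), "end": s.get("end")}
        for s in (payload.get("segments") or [])
    ]
    return Transcript(
        text=payload.get("text", ""),
        language=language,
        metadata={"segments": segments},
    )
```

A call without a `language` argument leaves the transcript's language as `None`, which is one possible way to represent auto-detection.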

devin-ai-integration (bot) left a comment

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 3 additional findings.

CLAassistant commented May 7, 2026

CLA assistant check
All committers have signed the CLA.

2 participants