Skip to content

Conversation

@devin-ai-integration
Copy link
Contributor

Summary

Adds a new hidden documentation page for Whisper multilingual streaming under the Universal Streaming section. The page documents the whisper-streaming speech model which supports 99 languages with automatic language detection.

Key sections include:

  • Configuration for using speech_model: "whisper-streaming"
  • Language detection with language_code and language_confidence response fields
  • Non-speech tags section documenting [Silence], [Music] detection
  • Quickstart examples for Python SDK, Python, JavaScript SDK, and JavaScript

The page is placed directly under "Multilingual transcription" in the navigation and marked as hidden.

Review & Testing Checklist for Human

  • Verify page renders correctly: Run pnpm docs locally and navigate to /docs/universal-streaming/whisper-multilingual-streaming to confirm the page displays properly
  • Review code examples: The quickstart examples are extensive - verify the SDK parameters (speechModel: "whisper-streaming", languageDetection: true) match the actual SDK API
  • Confirm content accuracy: The documentation content was provided externally - verify the language detection output format and non-speech tag behavior matches actual API behavior

Recommended test plan: Run the local docs server and visually inspect the hidden page to ensure all MDX components (Accordion, Tabs, Notes, code blocks) render correctly.

Notes

  • The fern check passed for docs validation. Pre-existing errors in openapi.yml are unrelated to this change.
  • Content was generated from a Claude code session and adapted to match the existing multilingual.mdx page structure.

Link to Devin run: https://app.devin.ai/sessions/0c82ef0f73db4904b5d3b5cde11e1e31
Requested by: Ryan Seams (@ryanseams)

- Create new whisper-multilingual-streaming.mdx page with 99 language support
- Add language detection configuration and output format documentation
- Add non-speech tags section for silence and music detection
- Add quickstart examples for Python SDK, Python, JavaScript SDK, and JavaScript
- Add hidden page entry in docs.yml under multilingual transcription

Co-Authored-By: Ryan Seams <ryan.seams@gmail.com>
@devin-ai-integration
Copy link
Contributor Author

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

  • Disable automatic comment and CI monitoring

@github-actions
Copy link

- Add hyperlink to WhisperLiveKit package in intro
- Update supported languages to correct 99 language list with codes
- Replace language list with table showing code and language name
- Update formatting section to explain format_turns requirement
- Add latency tradeoff guidance (off for voice agents, on for notetaking)
- Add formatted vs unformatted example comparison

Co-Authored-By: Ryan Seams <ryan.seams@gmail.com>
@github-actions
Copy link

- Make supported languages table collapsible with Accordion
- Update sample rate to 48000 and FRAMES_PER_BUFFER to 3200 to match multilingual quickstart
- Reorder tabs to make Python the default shown option

Co-Authored-By: Ryan Seams <ryan.seams@gmail.com>
@github-actions
Copy link

@ryanseams ryanseams self-requested a review January 22, 2026 00:24
Copy link
Collaborator

@ryanseams ryanseams left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm - send it

@ryanseams ryanseams merged commit b9ef487 into main Jan 22, 2026
2 of 4 checks passed
@ryanseams ryanseams deleted the devin/1769039320-whisper-multilingual-streaming branch January 22, 2026 00:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants