Processing audio streaming from the backend (mastra only) #104

isabelle-cedar · 2025-08-23T21:11:41Z

No description provided.

greptile-apps

Greptile Summary

This PR implements voice streaming functionality for Cedar OS, specifically targeting Mastra backend providers. The implementation adds the capability to process audio responses in real-time as they arrive from the backend, rather than waiting for complete responses.

The core changes introduce a new voiceStreamLLM method to the agent connection architecture, following the established streaming pattern used for text responses. The implementation adds new types (VoiceStreamEvent, VoiceStreamHandler) to handle various voice-specific events including transcription updates, audio chunks, and structured objects. The voice slice is enhanced with a new stream boolean configuration setting that determines whether to use streaming or traditional voice processing.

Key architectural additions include:

Provider abstraction: The ProviderImplementation interface gains an optional voiceStreamLLM method, maintaining backward compatibility
Event handling: Comprehensive event processing for different voice stream event types (transcription, audio, chunk, object, done, error)
Dual-path implementation: The agent connection slice intelligently detects provider capabilities and falls back to non-streaming voice processing when streaming isn't supported
Mastra integration: Specific implementation for Mastra providers with proper URL construction, header management, and event transformation

The implementation maintains consistency with the existing Cedar OS streaming architecture while extending it to support voice use cases. Helper functions were extracted in the Mastra provider to promote code reuse between streaming and non-streaming voice methods. The voice slice introduces a handled flag mechanism to prevent duplicate processing when responses contain multiple data types.

Confidence score: 4/5

This PR introduces complex streaming logic but follows established patterns and includes comprehensive error handling
Score reflects well-structured implementation with proper fallback mechanisms and backward compatibility
Pay close attention to the event handling logic in voiceStreamLLM and the dual-path processing in the voice slice

_{4 files reviewed, 3 comments}

_{Edit Code Review Bot Settings | Greptile}

greptile-apps · 2025-08-23T21:12:24Z

packages/cedar-os/src/store/agentConnection/agentConnectionSlice.ts

+					if (response.audioData || response.audioUrl) {
+						wrappedHandler({
+							type: 'audio',
+							audioData: response.audioData || response.audioUrl || '',
+							audioFormat: response.audioFormat,
+						});


logic: Audio data fallback logic uses audioData || audioUrl || '' which could result in empty string for audio data if both are undefined

greptile-apps · 2025-08-23T21:12:27Z

packages/cedar-os/src/store/agentConnection/providers/mastra.ts

+			try {
+				const headers = createVoiceHeaders(config);
+				const baseUrl = resolveVoiceEndpoint(params.voiceSettings, config);
+				const streamUrl = `${baseUrl}/stream`;


logic: appending '/stream' to baseUrl could create malformed URLs if baseUrl already ends with '/stream' or has query parameters

greptile-apps · 2025-08-23T21:12:42Z

packages/cedar-os/src/store/voice/voiceSlice.ts

+			// Voice processing completed successfully (streaming or non-streaming)
+			get().setIsProcessing(false);


logic: Processing state is cleared after streaming completion, but error handling at line 296 also clears it. Consider moving the success case inside a try block to ensure consistent state management.

processing audio streaming from the backend (mastra only)

e0d0f03

isabelle-cedar marked this pull request as draft August 23, 2025 21:11

greptile-apps bot reviewed Aug 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Processing audio streaming from the backend (mastra only) #104

Processing audio streaming from the backend (mastra only) #104

Uh oh!

isabelle-cedar commented Aug 23, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot Aug 23, 2025

Uh oh!

greptile-apps bot Aug 23, 2025

Uh oh!

greptile-apps bot Aug 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		// Voice processing completed successfully (streaming or non-streaming)
		get().setIsProcessing(false);

Processing audio streaming from the backend (mastra only) #104

Are you sure you want to change the base?

Processing audio streaming from the backend (mastra only) #104

Uh oh!

Conversation

isabelle-cedar commented Aug 23, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Summary

Confidence score: 4/5

Uh oh!

greptile-apps bot Aug 23, 2025

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Aug 23, 2025

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Aug 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants