Skip to content

Marktechpost/Voice-AI-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

Voice-AI-Projects

List of Voice AI/ TTS/Audio Model Projects with Full Implementation codes

▶ A Coding Implementation on Deepgram Python SDK for Transcription, Text-to-Speech, Async Audio Processing, and Text Intelligence Codes Tutorial

▶ A Hands-On Coding Tutorial for Microsoft VibeVoice Covering Speaker-Aware ASR, Real-Time TTS, and Speech-to-Speech Pipelines Codes Tutorial

▶ How to Design a Fully Streaming Voice Agent with End-to-End Latency Budgets, Incremental ASR, LLM Streaming, and Real-Time TTS Codes Tutorial

▶ How to Build an Agentic Voice AI Assistant that Understands, Reasons, Plans, and Responds through Autonomous Multi-Step Intelligence Codes Tutorial

▶ How to Build an Advanced Voice AI Pipeline with WhisperX for Transcription, Alignment, Analysis, and Export? Codes Tutorial

▶ Building a Speech Enhancement and Automatic Speech Recognition (ASR) Pipeline in Python Using SpeechBrain Codes Tutorial

▶ How to Build an Advanced End-to-End Voice AI Agent Using Hugging Face Pipelines? Codes Tutorial

Releases

No releases published

Packages

 
 
 

Contributors