(SpeakBot: Could you repeat that?)
🔗 Demo Link: https://parlabot.io 🔗
ParlaBot is a voice-enabled app that gives you real-time feedback on your Italian pronunciation. Speak into your mic, and ParlaBot will transcribe what you said, compare it to a target phrase, and return constructive feedback — powered by modern open-source AI and traditional DSP filtering techniques.
My first real speech recognition project was my 2007 Master’s thesis: a vowel recognition frontend built using FFTs, Mel filters, and CMU Sphinx. It’s old-school compared to today’s AI toolkits, but that line of research (not my own work so much as the work I studied) laid the foundation for the models that power ParlaBot.
More on that here →
Nearly two decades later, I’ve been studying Italian seriously for three years and wanted to build something that merges:
- A return to my past studies in speech recognition
- Hands-on exploration of modern STT and AI toolkits
- My passion for learning Italian
Entra ParlaBot
(Enter ParlaBot)
- Build a practical voice-powered Italian pronunciation coach
- Showcase my ability to design, develop, and deploy AI-based microservices
- Reinforce skills in Python, Go, C/C++, and container-based architecture
ParlaBot is composed of several Dockerized microservices:
- **Frontend UI** in React
  - Displays the target phrase from the Phrase Service
  - Records mic input and sends audio to the Orchestrator (an example request is sketched below)
  - Displays multiple transcriptions and feedback
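
The browser recording ultimately arrives at the orchestrator as an HTTP POST to `/transcribe`. The exact request shape isn't documented here, so the field names, port, and pipeline identifiers below are assumptions; this is just a minimal Python sketch of an equivalent call made outside the browser:

```python
# Minimal sketch of calling the orchestrator's /transcribe endpoint from Python.
# Field names, the port (8080), and pipeline names are assumptions, not the
# documented API contract.
import requests

ORCHESTRATOR_URL = "http://localhost:8080/transcribe"  # assumed host/port

with open("recording.wav", "rb") as f:
    resp = requests.post(
        ORCHESTRATOR_URL,
        files={"audio": ("recording.wav", f, "audio/wav")},
        data={
            "phrase_id": "42",                     # hypothetical target-phrase id
            "pipelines": "highpass,telephone",     # hypothetical pipeline names
        },
        timeout=60,
    )

resp.raise_for_status()
for result in resp.json():  # assumed: one entry per (pipeline, model) pair
    print(result)
```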
- **API Orchestrator** in Go/Gin
  - Fetches all target phrases from the Phrase Service
  - Fetches all pipelines from the Audio Preprocessing Service
  - Exposes a `/transcribe` endpoint
  - Concurrently, via goroutines (see the Python sketch below):
    - Forwards the user’s audio to each selected preprocessing pipeline
    - Forwards the filtered audio to the STT Service for transcription and scoring
  - (Planned) Routes results to the Feedback Service
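
The orchestrator itself is written in Go and does this fan-out with goroutines. Purely as an illustration of the pattern (not the project’s code), here is the same idea in Python with a thread pool: each selected pipeline gets the raw audio, and each filtered result is then forwarded to the STT service. The service URLs and payload fields are assumptions.

```python
# Illustration only: the real orchestrator is Go + goroutines. This Python
# sketch shows the same concurrent fan-out with a thread pool. URLs and
# payload fields are assumptions.
from concurrent.futures import ThreadPoolExecutor

import requests

PREPROCESS_URL = "http://preprocessing:8000/process"  # assumed
STT_URL = "http://stt:8000/transcribe"                # assumed


def run_pipeline(audio_bytes: bytes, pipeline: str, target: str) -> dict:
    """Send raw audio through one preprocessing pipeline, then to the STT service."""
    filtered = requests.post(
        PREPROCESS_URL,
        files={"audio": ("in.wav", audio_bytes, "audio/wav")},
        data={"pipeline": pipeline},
        timeout=60,
    ).content
    scored = requests.post(
        STT_URL,
        files={"audio": ("filtered.wav", filtered, "audio/wav")},
        data={"target": target},
        timeout=120,
    ).json()
    return {"pipeline": pipeline, **scored}


def transcribe_all(audio_bytes: bytes, pipelines: list[str], target: str) -> list[dict]:
    # One worker per pipeline, mirroring "one goroutine per pipeline" in the Go service.
    with ThreadPoolExecutor(max_workers=len(pipelines)) as pool:
        futures = [pool.submit(run_pipeline, audio_bytes, p, target) for p in pipelines]
        return [f.result() for f in futures]
```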
- **Audio Preprocessing Service** in Python/FastAPI + Torch Transformers
  - Accepts `.wav` audio
  - Runs audio through specified preprocessing pipelines (see the sketch below)
  - (Planned) Consume/integrate compiled C++ shared-object filter chains for audio preprocessing from a registry
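
A rough idea of what a preprocessing pipeline could look like, assuming torchaudio’s built-in biquad filters; the pipeline names and filter parameters here are illustrative, not the service’s actual registry:

```python
# Sketch of a pipeline registry built on torchaudio's biquad filters.
# Pipeline names and filter parameters are illustrative assumptions.
import torchaudio
import torchaudio.functional as F

# Each pipeline is an ordered list of (filter_fn, kwargs) applied to the waveform.
PIPELINES = {
    "highpass": [(F.highpass_biquad, {"cutoff_freq": 80.0})],
    "telephone": [
        (F.highpass_biquad, {"cutoff_freq": 300.0}),
        (F.lowpass_biquad, {"cutoff_freq": 3400.0}),
    ],
}


def apply_pipeline(in_path: str, out_path: str, name: str) -> None:
    """Load a .wav file, run it through one named pipeline, and write the result."""
    waveform, sample_rate = torchaudio.load(in_path)
    for filter_fn, kwargs in PIPELINES[name]:
        waveform = filter_fn(waveform, sample_rate, **kwargs)
    torchaudio.save(out_path, waveform, sample_rate)
```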
- **STT Service** in Python/FastAPI + HuggingFace language model transcribers
  - Accepts filtered `.wav` audio
  - Transcribes speech using a language model; currently only `wav2vec2-large-xlsr-53-italian` is supported (example below)
  - Scores the transcription against the target phrase
  - Returns the model, preprocessing info, and transcript
  - (Planned) Add support for multiple models
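
A minimal transcription-and-scoring sketch, assuming the `jonatasgrosman/wav2vec2-large-xlsr-53-italian` checkpoint on Hugging Face. The checkpoint id, the 16 kHz resampling, and the similarity-ratio scoring are my assumptions, not necessarily what the service does:

```python
# Sketch: wav2vec2 transcription plus a crude similarity score.
# The checkpoint id and the scoring method are assumptions.
from difflib import SequenceMatcher

import torch
import torchaudio
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

MODEL_ID = "jonatasgrosman/wav2vec2-large-xlsr-53-italian"  # assumed checkpoint
processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID)
model.eval()


def transcribe(wav_path: str) -> str:
    waveform, sr = torchaudio.load(wav_path)
    if sr != 16_000:  # wav2vec2 expects 16 kHz input
        waveform = torchaudio.functional.resample(waveform, sr, 16_000)
    mono = waveform.mean(dim=0)  # downmix to mono
    inputs = processor(mono.numpy(), sampling_rate=16_000, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits
    ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(ids)[0].lower().strip()


def score(transcript: str, target: str) -> float:
    """Crude 0-1 similarity between what was said and the target phrase."""
    return SequenceMatcher(None, transcript.lower(), target.lower()).ratio()
```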
- **Phrase Service** in Python/FastAPI + MongoDB + Coqui TTS (with Mozilla and personal speaker training files) + Google TTS API
  - Accepts a text phrase and a TTS speaker, and generates audio using TTS (sketched below)
  - (Planned) Tracks user progress
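
As a sketch of the phrase-generation side, here is how a phrase could be synthesized with Coqui TTS and its metadata stored in MongoDB. The model name, database/collection names, and document fields are assumptions (Coqui ships several Italian models; `tts --list_models` shows what is available):

```python
# Sketch: phrase audio generation with Coqui TTS plus MongoDB bookkeeping.
# Model name, database/collection names, and document fields are assumptions.
import os
import uuid

from pymongo import MongoClient
from TTS.api import TTS

tts = TTS(model_name="tts_models/it/mai_female/glow-tts")  # assumed Italian model
phrases = MongoClient("mongodb://localhost:27017")["parlabot"]["phrases"]


def add_phrase(text: str, speaker: str = "mai_female") -> str:
    """Synthesize a target phrase and record where its audio lives."""
    os.makedirs("audio", exist_ok=True)
    out_path = f"audio/{uuid.uuid4().hex}.wav"
    tts.tts_to_file(text=text, file_path=out_path)
    phrases.insert_one({"text": text, "speaker": speaker, "audio_path": out_path})
    return out_path


print(add_phrase("Vorrei un caffè, per favore."))
```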
All services are containerized and connected via docker-compose.
To run ParlaBot locally:

- `git clone https://github.com/richvigorito/parlabot.git`
- `cd parlabot`
- `docker-compose up --build`
- `open http://localhost:3000`

See milestones for the project roadmap, issues, etc.
MIT License
