Did you clear cache before opening an issue?
Is there an existing issue for this?
Does the issue happen when logged in?
Yes
Does the issue happen when logged out?
Yes
Does the issue happen in incognito mode when logged in?
Yes
Does the issue happen in incognito mode when logged out?
Yes
Account name
No response
Account config
No response
Current Behavior
The spoken words are often unclear or difficult to distinguish, especially at normal playback speed. Some words blend together, making listening-based typing practice frustrating and less accurate.
Expected Behavior
The TTS voice should be clearer and easier to understand consistently during typing practice. Ideally, users should be able to accurately hear and identify spoken words without needing to repeatedly replay them.
Steps To Reproduce
- Open Monkeytype
- Enable the TTS / listen-based typing feature in the settings
- Start a typing session
- Listen to the words
- Notice that some words are difficult to understand clearly
Environment
- OS: macOS
- Browser: Google Chrome
- Browser Version: Latest stable version
Anything else?
Possible improvements:
- Add clearer or more natural voice options
- Allow users to choose between multiple TTS providers/voices
- Improve pronunciation clarity
- Add controls for speech speed and voice style
- Prefer higher-quality system/browser TTS voices when available
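
A rough sketch of how the voice-selection and speed ideas above could work, assuming the feature is built on the browser's Web Speech API (I have not checked how Monkeytype implements it). `pickVoice`, `clampRate`, and `VoiceLike` are hypothetical names, and the name hints ("Natural", "Enhanced", "Premium") are only a guess at how better voices tend to be labelled; the API has no standard quality field.

```typescript
// Minimal shape of the SpeechSynthesisVoice fields used for ranking,
// declared here so the logic can also run outside a browser.
interface VoiceLike {
  name: string;
  lang: string;
  default: boolean;
  localService: boolean;
}

// Primary language subtag, e.g. "en" for "en-US" or "en_US".
function baseLang(tag: string): string {
  return tag.toLowerCase().split(/[-_]/)[0];
}

// Hypothetical helper: pick the most suitable voice for a language.
// ASSUMPTION: the name hints are a heuristic, not a real quality signal.
function pickVoice<T extends VoiceLike>(
  voices: T[],
  lang: string,
  nameHints: string[] = ["Natural", "Enhanced", "Premium"],
): T | undefined {
  const wanted = lang.toLowerCase().replace("_", "-");
  const candidates = voices.filter((v) => baseLang(v.lang) === baseLang(lang));

  const score = (v: T): number => {
    let s = 0;
    if (v.lang.toLowerCase().replace("_", "-") === wanted) s += 4; // exact locale
    if (nameHints.some((h) => v.name.includes(h))) s += 2; // name heuristic
    if (v.default) s += 1; // browser default as a tie-breaker
    return s;
  };

  // Highest score first; returns undefined when no voice matches the language.
  return [...candidates].sort((a, b) => score(b) - score(a))[0];
}

// Keep a user-chosen speed inside the 0.1 to 10 range that
// SpeechSynthesisUtterance.rate accepts; fall back to the default of 1.
function clampRate(rate: number): number {
  if (!Number.isFinite(rate)) return 1;
  return Math.min(10, Math.max(0.1, rate));
}
```

In a browser, the chosen voice could be assigned to `utterance.voice` on a `SpeechSynthesisUtterance`, with the list taken from `speechSynthesis.getVoices()` and the speed set through `utterance.rate`. In Chrome the voice list may be empty until the `voiceschanged` event fires, so the selection would probably need to run after that event.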