Skip to content

DerJanniku/VibeFlow

Repository files navigation

🌊 VibeFlow

Rust CI License: MIT Version

High-Performance Voice-to-Text Transcription Suite powered by Whisper AI.

VibeFlow is a cross-platform desktop application designed for seamless, near-instantaneous audio-to-text conversion. By leveraging OpenAI's Whisper models locally, it ensures maximum privacy and minimal latency without relying on external APIs.


🛠 Technical Stack

  • Backend: Rust (High-concurrency audio processing & system integration)
  • Frontend: Vue.js 3 + TypeScript (Reactive UI / Overlay)
  • Framework: Tauri (Memory-efficient bridge between Web & Native)
  • AI Core: whisper.cpp (Optimized C++ inference via FFI)
  • OS Integration: Low-level global hotkey hooks and simulated keyboard input.

🏗 System Architecture

VibeFlow utilizes a decoupled multi-threaded architecture to ensure the UI remains responsive even during heavy AI inference.

graph TD
    A[Global Hotkey Listener] -->|Event| B(Core Controller)
    B --> C[Audio Capture Service]
    C -->|PCM Buffer| D[Whisper Inference Engine]
    D -->|Token Stream| E[Post-Processor]
    E -->|String| F[OS Input Simulator]
    F -->|Virtual Keys| G[Active Application]
    B -->|State Sync| H[Vue.js UI Layer]
Loading

Key Components:

  • Audio Capture Service: Implemented using cpal for low-latency PCM data acquisition.
  • Inference Engine: Managed via a worker-pool to utilize multi-core CPU/GPU acceleration.
  • Auto-Paste Module: Utilizes system-level clipboard management and enigo for cross-platform keyboard emulation.

✨ Features

  • 🚀 Global Hotkey Integration: Instant start/stop transcription from any application.
  • 🔒 Local-First Privacy: No audio data ever leaves your machine. Processing is done entirely offline.
  • 🎛 Adaptive AI Models: Choose between four performance tiers (Tiny to Large) based on your hardware capabilities.
  • 🖥 Interactive Overlay: Real-time visual feedback of voice activity levels and transcription status.
  • ⚙️ Dynamic Configuration: Custom hotkeys, auto-start, and model management.

🚀 Getting Started

Prerequisites

  • Node.js 20.x or higher
  • Rust 1.77.x or higher (stable)
  • C++ Build Tools (for optimized inference libraries)

Installation (Development)

# Clone the repository
git clone https://github.com/DerJanniku/VibeFlow.git
cd VibeFlow

# Install frontend dependencies
cd ui && npm install && cd ..

# Run development server
npm run dev

Building for Production

# Cross-platform build via Tauri
cargo tauri build

🤝 Contributing

Contributions are welcome! We follow a strict coding standard to maintain high performance and readability.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'feat: add amazing feature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

Developed with precision by DerJanniku
Focusing on performance, privacy, and user experience.

About

High-performance, local-first voice-to-text transcription suite powered by OpenAI's Whisper. Engineered with Rust, Vue.js, and Tauri for maximum privacy and speed.

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors