FlexiGPT is a local-first BYOK AI workspace for power users and teams who need repeatable prompts, tools, skills, model choices, assistants/agents, and private local history across multiple LLM providers.
FlexiGPT is built for people who use LLMs as part of repeatable work, not just one-off chat.
- Power and local-first users who want provider choice, private local history, and control over configuration and orchestration.
- Developers and technical writers who develop features, review diffs, debug failures, write tests/docs, and reuse assistants/agents, prompts, attachments, tools, and model setups.
- Consultants and small teams who want consistent assistant workflows without sending chat history through another hosted app.
- Download the latest release from GitHub Releases.
  - macOS: .pkg
  - Linux: .flatpak
  - Windows: .exe
- Install the package. Detailed installation steps are in Installation.
- Launch FlexiGPT.
- Get an API key for a provider.
- Add the key in Settings -> Auth Keys.
- Open Chats.
- Start from a built-in assistant preset or choose a model preset directly.
- Attach files, folders, notes, PDFs, URLs, or code when the model needs source material.
- Send.
Good first workflows:
- Use a home screen workflow card such as Develop a Feature, Review Code, or Investigate a Bug.
- Attach only the relevant source material.
- For code changes, start with a repo path or changed files and let the Feature Developer workflow inspect, scope, implement, and verify the change.
- Send the prefilled prompt as-is or adjust it for your task.
- Reuse or customize the assistant preset once the workflow fits your style.
FlexiGPT does not bill you directly. Usage costs and limits come from the provider account behind the key you configure.
FlexiGPT does not proxy normal LLM calls through a FlexiGPT-hosted service. Requests go directly to the provider or compatible endpoint you configure.
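To make "requests go directly to the provider" concrete, here is a minimal sketch in Go of a bring-your-own-key request to an OpenAI Chat Completions style endpoint. It is an illustration under stated assumptions, not FlexiGPT's actual code: `buildChatRequest`, the base URL, the key, and the model name are all hypothetical placeholders. Only the `/chat/completions` path, the `Authorization: Bearer` header, and the `model`/`messages` body fields follow the publicly documented Chat Completions shape.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"strings"
)

// chatMessage and chatRequest mirror the basic Chat Completions
// request body: a model name plus a list of role/content messages.
type chatMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatRequest struct {
	Model    string        `json:"model"`
	Messages []chatMessage `json:"messages"`
}

// buildChatRequest is a hypothetical helper (not part of FlexiGPT).
// It builds a POST aimed straight at the configured endpoint and
// authenticates with the user's own API key; no intermediate hosted
// service appears anywhere in the URL.
func buildChatRequest(baseURL, apiKey, model, prompt string) (*http.Request, error) {
	body, err := json.Marshal(chatRequest{
		Model:    model,
		Messages: []chatMessage{{Role: "user", Content: prompt}},
	})
	if err != nil {
		return nil, err
	}
	url := strings.TrimRight(baseURL, "/") + "/chat/completions"
	req, err := http.NewRequest(http.MethodPost, url, bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Authorization", "Bearer "+apiKey)
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	// Placeholder values; a real setup would use the provider's base URL
	// and a key read from secure storage.
	req, err := buildChatRequest("https://api.example.invalid/v1", "sk-placeholder", "example-model", "Hello")
	if err != nil {
		panic(err)
	}
	fmt.Println(req.Method, req.URL.String())
}
```

The sketch only builds the request; passing it to `http.DefaultClient.Do` would send it, and the provider would bill the account behind the key.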
- Built-in support for OpenAI, Anthropic, Google Gemini API, xAI, Mistral, Hugging Face, OpenRouter, and local llama.cpp.
- Compatible custom endpoints across OpenAI Chat Completions, OpenAI Responses, Anthropic Messages, and Google GenerateContent style APIs.
- Curated built-in providers and model presets so you can start quickly without manually defining endpoints or defaults first.
- API keys are stored securely through the OS keyring, not in plain-text exported settings.
- One interface for chats, tabs, reusable assistant presets, model presets, prompt templates, attachments, tools, skills, search, and exports.
- Build repeatable workflows by combining model choices, instructions, attachments, tools, and skills.
- Switch providers or models as you iterate.
- Multi-tab conversations with local history search and resume flows.
- Export the current conversation as JSON.
- Assistant presets bundle starting text, model choice, instructions, tools, and skills into reusable starting setups.
- Tools can be attached per conversation or per message and configured for manual review or auto-execution.
- When an eligible auto-execute tool is called, FlexiGPT can run it and submit the result back to the model.
- Keep tools manual when you want tighter control over execution.
- Markdown rendering with syntax-highlighted code blocks.
- Mermaid diagram rendering with zoom, plus export as source or image.
- KaTeX math rendering.
- Citations, token usage, and per-message request/response details for inspection and debugging.
- Message-level controls for copying, inspection, and follow-up iteration.
- Local conversation storage and full-text search.
- File, folder, image, PDF, and URL attachments.
- Bundled offline docs shipped inside the app.
- Conversations, workflow catalogs, and configuration are stored locally.
- Selected request context is sent to the provider or endpoint you choose when you send.
- Use your own provider accounts. FlexiGPT does not proxy or bill model usage.
- Develop bounded features and enhancements from local repo context with a spec, implementation steps, edits, and focused verification.
- Review code, diffs, and PRs for correctness, security, reliability, maintainability, and test gaps.
- Investigate bugs from logs, stack traces, failing outputs, source files, and config.
- Refactor code, design tests, implement tests, explore codebases, and review architecture with built-in software assistant presets.
- Use read-only presets for review/investigation and write/shell-capable presets for implementation, with manual review for write and shell tools.
- Built-in assistants cover PRD/MRD writing, decision records, user feedback analysis, roadmap prioritization, delivery risk review, and stakeholder status updates.
- Technical-writing assistants cover docs audits, docs authoring, API reference, release notes, and troubleshooting guides.
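The split between manual review and auto-execution for tools, mentioned in the feature list above, can be pictured with a small Go sketch. Everything here is an assumption made for illustration: the `ExecMode`, `ToolCall`, and `ToolConfig` types, the `partitionCalls` function, and the tool names `read_file` and `run_shell` are hypothetical and do not describe FlexiGPT's internal design. The sketch also assumes that a tool with no explicit configuration falls back to manual review, which is a conservative choice rather than documented behavior.

```go
package main

import "fmt"

// ExecMode is a hypothetical per-tool setting.
type ExecMode int

const (
	ManualReview ExecMode = iota // the user approves each call
	AutoExecute                  // the app may run the call itself
)

// ToolCall is a hypothetical record of a call requested by the model.
type ToolCall struct {
	Name string
	Args string
}

// ToolConfig is a hypothetical per-tool configuration.
type ToolConfig struct {
	Mode ExecMode
}

// partitionCalls splits requested calls into those eligible to run
// immediately and those that wait for the user. Tools missing from
// cfg are treated as manual (an assumption of this sketch).
func partitionCalls(calls []ToolCall, cfg map[string]ToolConfig) (auto, pending []ToolCall) {
	for _, c := range calls {
		if tc, ok := cfg[c.Name]; ok && tc.Mode == AutoExecute {
			auto = append(auto, c)
		} else {
			pending = append(pending, c)
		}
	}
	return auto, pending
}

func main() {
	cfg := map[string]ToolConfig{
		"read_file": {Mode: AutoExecute},
		"run_shell": {Mode: ManualReview},
	}
	calls := []ToolCall{
		{Name: "read_file", Args: `{"path":"README.md"}`},
		{Name: "run_shell", Args: `{"cmd":"go test ./..."}`},
	}
	auto, pending := partitionCalls(calls, cfg)
	fmt.Println("auto:", len(auto), "pending:", len(pending))
}
```

In this sketch the results of the `auto` calls would be submitted back to the model, while the `pending` calls stay visible for review, which matches the guidance to keep write and shell tools manual.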
Start here:
Context and reusable setup:
Setup, safety, and help:
Recipes:
Architecture reference:
- Architecture Overview
- Backend Roles and Responsibilities
- Frontend Roles and Responsibilities
- Chats Workspace and Composer Design
- Data storage: JSON and SQLite files in the local filesystem.
- Go backend.
- Wails desktop application platform.
- Official Go SDKs by OpenAI, Anthropic, and Google GenAI.
- Vite and React Router v7 frontend in TypeScript.
- DaisyUI with Tailwind CSS for styling.
- Tooling: GolangCI-Lint, Knip, ESLint, Prettier, and GitHub Actions.
Developer setup is documented in devsetup.md.
Copyright (c) 2024 - Present - Pankaj Pipada
All source code in this repository, unless otherwise noted, is licensed under the Mozilla Public License, v. 2.0. See LICENSE for details.




