AI Browser

An AI-powered intelligent browser built with Next.js and Electron. Features multi-modal AI task execution, scheduled tasks, social media integration, and advanced file management capabilities with support for multiple AI providers.

Built with Next.js and Electron.

⚠️ Important Notes

This project is for learning purposes. There may be bugs and issues - your understanding is appreciated.

Windows users: Building on Windows may require multiple attempts to succeed. If the build fails, try running the build command again.

macOS recommended: The macOS version provides a better user experience. Limited resources have been allocated to Windows optimization.

Tech Stack

Frontend: Next.js 15 + React 19
Desktop: Electron 33
UI: Ant Design + Tailwind CSS
State Management: Zustand
Storage: IndexedDB (via electron-store)
AI Agent: @jarvis-agent (based on Eko - production-ready agent framework)
Build Tools: Vite + TypeScript

Development Environment Configuration

Node version: 20.19.3

Getting Started

1. Development Setup

First, run the development server:

# Install dependencies
pnpm install

# Build desktop application client for mac
pnpm run build:deps

# Build desktop application client for windows
pnpm run build:deps:win

# Start web development server
pnpm run next

# Start desktop application
pnpm run electron

2. Configure API Keys (After Launch)

After launching the application:

Click the Settings icon (⚙️) in the top-right corner
Navigate to Providers panel
Select your AI provider (DeepSeek, Qwen, Google Gemini, Claude, or OpenRouter)
Click Edit API Key and enter your API key
Click the checkmark to save

For detailed configuration instructions, see CONFIGURATION.md.

3. Building Desktop Application

To build the desktop application for distribution:

# Build the application for mac
pnpm run build

# Build the application for windows
pnpm run build:win

Note: End users will configure their API keys through the Settings UI after installation. No environment files needed.

Features

Multiple AI Providers: Support for DeepSeek, Qwen, Google Gemini, Anthropic Claude, and OpenRouter
Complete Settings System: Configure everything through the UI - no file editing required
- Providers: API keys, models, and provider-specific settings
- General: Language, startup behavior, window preferences
- Chat: Temperature, max tokens, streaming, and chat behavior
- Agent: Browser/File agent settings, custom prompts, MCP tools
- UI: Theme (Dark/Light/System), font size, density, editor preferences
- Network: Proxy, request timeout, stream timeout, retry attempts
Scheduled Tasks: Create and manage automated recurring tasks with custom intervals
AI-Powered Browser: Intelligent browser with automated task execution
Multi-Modal AI: Vision and text processing capabilities
Speech & TTS: Voice recognition and text-to-speech integration
File Management: Advanced file operations and management
Internationalization: Full English and Chinese language support

RoadMap

✅ Completed Features

v0.0.1 - v0.0.4: Core Functionality

AI-powered browser with automated task execution
Multiple AI provider support (DeepSeek, Qwen, Google Gemini, Claude, OpenRouter)
Multi-modal AI capabilities (vision and text processing)
Scheduled tasks system with custom intervals
File management capabilities
UI configuration for API keys and models

v0.0.5 - v0.0.7: UI/UX Enhancements

Purple theme redesign with improved UI/UX
Agent Configuration system (custom prompts, MCP tools management)
Toolbox page (centralized feature hub)
Internationalization support (English/Chinese)
WebGL animated background with gradient fallback
Improved modal sizes and layout optimization

v0.0.8 - v0.0.10: Advanced Features

Human interaction support (AI can ask questions during execution)
Task continuation with file attachment management
Atomic fragment-based history playback with typewriter effects
Advanced playback controls (play/pause/restart/speed adjustment)
Context restoration and session management
Optimized auto-scroll behavior for messages
Enhanced message display and rendering

v0.0.11+: Unified Settings System

Complete Settings redesign with 6 panels (Providers, General, Chat, Agent, UI, Network)
Unified configuration management with electron-store
All settings configurable through UI - no manual file editing required
Real-time settings sync across all windows
Import/Export/Reset functionality for all settings
Dark/Light/System theme support with smooth transitions
Network configuration (Proxy, Timeout, Retry)
Scheduled Tasks system with IndexedDB storage

🚀 Future Plans

Phase 1: Enhanced User Experience

Voice input support (speech-to-text integration)
Theme customization system (multiple color schemes)
Dark/Light mode toggle
Enhanced accessibility features

Phase 2: Workflow Enhancement

Workflow configuration export/import functionality
Refactored scheduled task steps based on workflow configuration
Visual workflow editor with drag-and-drop interface
Step management (reorder, add, remove, edit workflow steps)
Workflow templates and presets

Phase 3: Plugin Ecosystem

MCP plugin marketplace
Community plugin sharing platform
Plugin version management system
One-click plugin installation and updates
Plugin development toolkit and documentation

Phase 4: Advanced Capabilities

Multi-tab browser support
Collaborative task execution
Cloud sync for tasks and configurations
Mobile companion app
Performance optimization and caching improvements

Screenshots

Start Loading

Animated splash screen on application launch.

Home

Simple and clean interface - input your task and let AI execute automatically.

Task Execution

Left: AI thinking process and execution steps. Right: Real-time browser operation preview.

History

View past tasks with search functionality and playback capabilities.

Settings

Comprehensive settings interface with 8 panels:

General: Language, startup behavior, window preferences
Providers: AI provider selection and API key management
Chat: Temperature, max tokens, and chat behavior
Agent: Browser/File agent settings and MCP tools
Scheduled Tasks: Create and manage automated tasks
User Interface: Theme, font size, density preferences
Network: Proxy, timeout, and retry configuration
Memory: Context management settings (coming soon)

Supported AI Providers

DeepSeek: deepseek-chat, deepseek-reasoner
Qwen (Alibaba Cloud): qwen-max, qwen-plus, qwen-vl-max
Google Gemini: gemini-1.5-flash, gemini-2.0-flash, gemini-1.5-pro, and more
Anthropic Claude: claude-3.7-sonnet, claude-3.5-sonnet, claude-3-opus, and more
OpenRouter: Multiple providers (Claude, GPT, Gemini, Mistral, Cohere, etc.)

Documentation

Configuration Guide - Detailed API key setup instructions

Acknowledgements

Special thanks to Eko - A production-ready agent framework that powers the AI capabilities of this project.

Community and Support

⭐ If you find this project helpful, please consider giving it a star! Your support helps us grow and improve.

Report issues on GitHub Issues
Join discussions and share feedback
Contribute to make AI Browser better

Contributing

Contributions are welcome! Please read our contributing guidelines before submitting pull requests.

Important: Never commit actual API keys to the repository. Use the Settings UI to configure your development environment.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 140 Commits
.vscode		.vscode
assets		assets
docs		docs
electron		electron
public		public
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
electron-builder.yml		electron-builder.yml
electron-update.yml		electron-update.yml
eslint.config.mjs		eslint.config.mjs
middleware.ts		middleware.ts
next.config.js		next.config.js
nodemon.json		nodemon.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
postcss.config.mjs		postcss.config.mjs
server.js		server.js
server.ts		server.ts
tsconfig.json		tsconfig.json
tsconfig.server.json		tsconfig.server.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Browser

Tech Stack

Development Environment Configuration

Getting Started

1. Development Setup

2. Configure API Keys (After Launch)

3. Building Desktop Application

Features

RoadMap

✅ Completed Features

🚀 Future Plans

Screenshots

Start Loading

Home

Task Execution

History

Settings

Supported AI Providers

Documentation

Acknowledgements

Community and Support

Contributing

License

About

Uh oh!

Releases 14

Packages

Languages

License

DeepFundAI/ai-browser

Folders and files

Latest commit

History

Repository files navigation

AI Browser

Tech Stack

Development Environment Configuration

Getting Started

1. Development Setup

2. Configure API Keys (After Launch)

3. Building Desktop Application

Features

RoadMap

✅ Completed Features

🚀 Future Plans

Screenshots

Start Loading

Home

Task Execution

History

Settings

Supported AI Providers

Documentation

Acknowledgements

Community and Support

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 14

Packages 0

Languages

Packages