🚀 KoboldCPP Vector Database
- 🔥 Supported on Windows systems with an NVIDIA GPU.
- 📁 Git
- 🌐 Pandoc
- CUDA does NOT need to be installed system-wide; the installer pulls all the CUDA 12.8 runtime wheels it needs.
- Build Tools. Some dependencies don't ship pre-compiled "wheels," so you must be able to build them with Microsoft Build Tools and/or Visual Studio. Visual Studio is recommended: make sure to select the "Desktop development with C++" workload and check the four boxes on the right that contain "SDK."
Download the latest "release," extract its contents, and navigate to the src folder to run the commands below.
NOTE: If you clone this repository directly you will get the in-development version of this program, which may or may not be stable.
Open a command prompt inside the src folder and create a virtual environment:
python -m venv .
Activate the virtual environment:
.\Scripts\activate
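If you are unsure whether activation took effect, a quick check from Python can confirm it. This is only a convenience sketch and not part of the project; the helper name in_virtualenv is made up for illustration. It relies on the standard behavior that sys.prefix differs from sys.base_prefix inside a virtual environment.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when this interpreter is running inside a virtual environment."""
    # Inside a venv, sys.prefix points at the venv directory, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("Virtual environment active:", in_virtualenv())
    print("Interpreter prefix:", sys.prefix)
```

If this prints False, run the activation command again in the same command prompt before continuing.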
Run the setup script. It will:
- Walk you through pre-flight checks (Python version, NVIDIA GPU, Git/Git LFS/Pandoc/Build Tools).
- Install uv as the package manager.
- Install torch 2.9 with CUDA 12.8, flash-attention, and the matching NVIDIA wheels.
- Install the rest of the project's pinned dependencies.
- Overlay the project's patched versions of pdf.py, instructor.py, and SentenceTransformer.py into the freshly installed packages.
python setup_windows.py
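If you want to spot missing prerequisites before running the setup script, the pre-flight idea can be roughly approximated by hand. The sketch below is not the code in setup_windows.py; the tool list and the find_missing_tools helper are assumptions for illustration. It only checks whether a few executables are on PATH, and does not cover Git LFS or Build Tools, which are harder to detect that way.

```python
import shutil
import sys

# Hypothetical mapping of label -> executable name, for illustration only.
# The real setup script may detect these tools differently.
REQUIRED_TOOLS = {
    "Git": "git",
    "Pandoc": "pandoc",
    "NVIDIA driver utility": "nvidia-smi",
}


def find_missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the labels of tools whose executable was not found on PATH."""
    return [label for label, exe in tools.items() if which(exe) is None]


if __name__ == "__main__":
    print("Python version:", sys.version.split()[0])
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All checked tools were found on PATH.")
```

A tool reported as missing may still be installed but not on PATH, so treat the output as a hint and let the setup script's own checks have the final word.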
If something goes wrong and you want to reinstall from scratch, the easiest path is to deactivate, delete the venv folder, and start again from Step 2. If you'd rather wipe just the installed packages:
pip freeze > requirements_uninstall.txt && pip uninstall -r requirements_uninstall.txt -y && del requirements_uninstall.txt
Launch the GUI:
python gui.py
- Run python download_kobold.py from your activated virtual environment to grab the latest KoboldCpp binary, or download the appropriate executable directly from KoboldCpp's releases page. The installer presents three options:
  - koboldcpp.exe: default, includes CUDA. Pick this if you have an NVIDIA GPU.
  - koboldcpp-nocuda.exe: Vulkan/CPU only. Pick this if you don't have an NVIDIA GPU.
  - koboldcpp-oldpc.exe: for older CPUs without AVX2.
- Configure your launch options as desired, but uncheck "open browser". The "quiet" option is recommended to keep KoboldCpp's terminal output from duplicating.
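The choice between the three KoboldCpp executables can be summarized as a small decision function. This is a sketch, not something download_kobold.py is known to contain: the function name choose_kobold_binary and its two boolean parameters are hypothetical, and checking AVX2 before the GPU is an assumption about precedence.

```python
def choose_kobold_binary(has_nvidia_gpu: bool, cpu_has_avx2: bool) -> str:
    """Pick a KoboldCpp executable name from the three options listed above."""
    # Assumption: a CPU without AVX2 takes priority over the GPU question.
    if not cpu_has_avx2:
        return "koboldcpp-oldpc.exe"
    # Default build with CUDA for machines that have an NVIDIA GPU.
    if has_nvidia_gpu:
        return "koboldcpp.exe"
    # Vulkan/CPU-only build for everything else.
    return "koboldcpp-nocuda.exe"


if __name__ == "__main__":
    # Example: a modern CPU paired with an NVIDIA GPU.
    print(choose_kobold_binary(has_nvidia_gpu=True, cpu_has_avx2=True))
```

Since this project targets Windows systems with an NVIDIA GPU, most users will end up with the default koboldcpp.exe.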

