Conversation


@lstein lstein commented Jan 21, 2026

Summary

This PR adds the ability to configure standalone text encoder models to run exclusively on the CPU, freeing up VRAM that would otherwise compete with the denoiser and other large models. Users can set a text encoder to run on the CPU from the Model Manager by clicking a new toggle in the details area shown below:

(Screenshot: Model Manager details panel showing the new "Run text encoder model on CPU only" toggle.)

All the text encoders are supported, including CLIPEmbed, T5Encoder, Qwen3Encoder, CLIPVision, SigLIP, and LlavaOnevision. However, Invoke only exposes the option to change the text encoder for some of the more recent main models, chiefly Flux.1, Flux.2, and Z-Image.

In most cases it does not make sense to run the text encoder on the CPU, as execution speed suffers greatly (up to 5x slower for Qwen3 encoders). However, for users with very low VRAM (e.g. 8 GB), this may allow them to run encoder models that would otherwise be inaccessible.
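For context, here is a minimal sketch of how the toggle could persist a per-model CPU-only flag from the frontend. The field name (`cpu_only`), the endpoint path, and the helper function are assumptions for illustration only, not the actual InvokeAI schema or API:

```ts
// Hypothetical shape of the setting stored on the encoder's model record.
type EncoderModelSettings = {
  cpu_only: boolean; // assumption: run this encoder on CPU exclusively
};

// Sketch of persisting the toggle via a model-update request.
// The endpoint path is an assumption; the real route may differ.
async function setEncoderCpuOnly(modelKey: string, cpuOnly: boolean): Promise<void> {
  const res = await fetch(`/api/v2/models/i/${modelKey}`, {
    method: 'PATCH',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ cpu_only: cpuOnly }),
  });
  if (!res.ok) {
    throw new Error(`Failed to update model ${modelKey}: ${res.status}`);
  }
}
```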

Related Issues / Discussions

Brief discussion on Discord regarding Comfy's use of a similar strategy: https://discord.com/channels/1020123559063990373/1020123559831539744/1462795385469796591

QA Instructions

CPU mode on

  1. Go to the model manager and select one of the standalone encoders, e.g. Z-Image Qwen3 Text Encoder (for Z-Image).
  2. The details panel will show a new setting, "Run text encoder model on CPU only". Turn it on and click "Save".
  3. Go to a generation pane (linear, canvas, or workflow), select a main model that uses this encoder, and then, under Advanced, select the text encoder you modified.
  4. Run a generation and look at the log messages. You should not see any messages about the text encoder being loaded onto the CUDA device.
  5. When the generation is finished, examine the performance statistics. The text encoder should have taken an unusually long time to run.

CPU mode off

  1. Repeat the instructions above, but this time turn the CPU toggle off.
  2. You should see log messages about the text encoder being loaded onto CUDA.
  3. Execution speed should be fast.

Repeat this with other text encoders and main models.

Merge Plan

Simple merge.

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • ❗Changes to a redux slice have a corresponding migration
  • Documentation added / updated (if applicable)
  • Updated What's New copy (if doing a release after this PR)

Co-authored-by: lstein <111189+lstein@users.noreply.github.com>

Add frontend UI for CPU-only model execution toggle

Co-authored-by: lstein <111189+lstein@users.noreply.github.com>
@github-actions github-actions bot added the python (PRs that change python files), invocations (PRs that change invocations), backend (PRs that change backend files), services (PRs that change app services), and frontend (PRs that change frontend files) labels Jan 21, 2026

JPPhoto commented Jan 21, 2026

@lstein This is failing a frontend check. Once you resolve that, I'll do a deeper dive.

I think you need `import type { FormField } from 'features/modelManagerV2/subpanels/ModelPanel/MainModelDefaultSettings/MainModelDefaultSettings';` at the top of `invokeai/frontend/web/src/features/modelManagerV2/subpanels/ModelPanel/EncoderModelSettings/EncoderModelSettings.tsx`. Put it right before `import { toast } from 'features/toast/toast';` to avoid an import-ordering error. A sketch of the resulting import order is shown below.
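For reference, a minimal sketch of the relevant part of that file's import block after the fix (only the two imports mentioned above are shown; the rest of the file's imports are omitted):

```ts
// Type-only import added first so the import-ordering lint rule is satisfied.
import type { FormField } from 'features/modelManagerV2/subpanels/ModelPanel/MainModelDefaultSettings/MainModelDefaultSettings';
import { toast } from 'features/toast/toast';
```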


@JPPhoto JPPhoto left a comment


After making the changes above, I was able to build and run and it worked as advertised. Approval is pending that fix and successful tests.
