Offline language runtime on ESP32-C3: where does a flash-resident execution path fit relative to ExecuTorch? #18221
Alpha-Guardian started this conversation in Show and tell
Hi ExecuTorch folks,
I wanted to share a small on-device language-runtime experiment and ask how people here would think about it relative to the usual graph/runtime view of on-device AI.
We built a public demo line called Engram and deployed it on a commodity ESP32-C3.
Current public numbers:
Host-side benchmark capability:
- LogiQA = 0.392523
- IFEval = 0.780037

Published board proof:
- LogiQA 642: 249 / 642 = 0.3878504672897196
- host_full_match = 642 / 642
- 1,380,771 bytes

Important scope note:
This is not presented as unrestricted open-input native LLM generation on MCU.
The board-side path is closer to a flash-resident, table-driven runtime than to a general autoregressive decoder.
What makes this interesting to us is that it seems to sit somewhere between a standard dense-graph model executed by an operator runtime and a fully hand-specialized embedded program.
So I’m curious how people here would think about it in relation to ExecuTorch’s world.
If a language-task system is no longer best expressed as a standard dense graph executed by a familiar operator runtime, but instead as a highly specialized flash-resident execution path, does that still feel like “model deployment” in the usual sense? Or is it a different category entirely?
Repo:
https://github.com/Alpha-Guardian/Engram
Would love to hear any thoughts.