Ember

An open-source, full-stack AI inference platform.

Ember covers the entire inference stack from GPU kernels to serving, built on Triton, IREE, and SGLang.

Architecture

  AI Evolution Layer   Auto-Evolve (AI-driven continuous optimization)
  ─────────────────────────────────────────────────────────────────
  Layer 8  Serve       SGLang-based, 3-process, continuous batching
  Layer 7  Pipeline    Text gen / speculative / constrained decoding
  Layer 6  KV Cache    Paged + multi-tier (GPU, CPU, SSD)
  Layer 5  NN Module   Transformer, Attention, RoPE, weight loading
  Layer 4  Graph       Static graph API + IREE compiler (custom passes)
  Layer 3  Runtime     IREE Runtime + LLM scheduling extensions
  Layer 2  Kernels     Triton kernels + FlashAttention + FlagGems
  Layer 1  Compiler    MLIR/LLVM + Triton compiler + CUTLASS
  ─────────────────────────────────────────────────────────────────
  Hardware             NVIDIA (PTX) | AMD (ROCm) | Apple (Metal)

Status

Early development. Not yet functional.

Build

bazel build //...

License

Apache 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
.ai		.ai
benchmarks		benchmarks
compiler/passes		compiler/passes
docs/tasks		docs/tasks
kernels		kernels
models		models
python		python
runtime/hal_extensions		runtime/hal_extensions
tests		tests
third_party		third_party
.bazelrc		.bazelrc
.gitignore		.gitignore
BUILD.bazel		BUILD.bazel
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
MODULE.bazel		MODULE.bazel
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ember

Architecture

Status

Build

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ember

Architecture

Status

Build

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages