Skip to content

Issue/253: (1) Refactor attention KV cache quantization to layers/kv_…

4aa8c3e
Select commit
Loading
Failed to load commit list.
Open

Issue/253: feat: support offline int8 kv cache quantization #254

Issue/253: (1) Refactor attention KV cache quantization to layers/kv_…
4aa8c3e
Select commit
Loading
Failed to load commit list.

There are no checks for this commit