ponytaill

HeteroCache Public

[ACL 2026] HeteroCache: A Dynamic Retrieval Approach to Heterogeneous KV Cache Compression for Long-Context LLM Inference

Python 3
SCM Public

AutoSmoothQuant Public

Forked from AniZpZ/AutoSmoothQuant

An easy-to-use package for implementing SmoothQuant for LLMs

Python