[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
[IJCV 2025] The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
CLI proxy that reduces LLM token usage by 60-90%. Declarative YAML filters for Claude Code, Cursor, Copilot, Gemini. rtk alternative in Go.
A discovery and compression tool for your Python codebase. Creates a knowledge graph for an LLM context window, efficiently outlining your project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Model tooling #LLM #AI #Python #CodeAnalysis #ContextWindow #DeveloperTools
AI-powered text compression library for RAG systems and API calls. Reduces token usage by 50-60% while preserving semantic meaning with advanced compression strategies.
A smart context filter that removes noise, improves responses, and reduces token usage by up to 90%.
A lightweight tool to optimize your JavaScript/TypeScript project for LLM context windows by using a knowledge graph | AI code understanding | LLM context enhancement | Code structure visualization | Static analysis for AI | Large Language Model tooling #LLM #AI #JavaScript #TypeScript #CodeAnalysis #ContextWindow #DeveloperTools
[CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models
ZON → 35-70% cheaper LLM prompts than JSON/TOON. Zero overhead.
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
97% token reduction for AI coding sessions — zero deps, 21 languages, MCP server
A lightweight tool to optimize your C# project for LLM context windows by using a knowledge graph | Code structure visualization | Static analysis for AI | Large Language Model tooling | .NET ecosystem support #LLM #AI #CSharp #DotNet #CodeAnalysis #ContextWindow #DeveloperTools
A discovery and compression tool for your Java codebase. Creates a knowledge graph for an LLM context window, efficiently outlining your project #LLM #AI #Java #CodeAnalysis #ContextWindow #DeveloperTools #StaticAnalysis #CodeVisualization
CLI proxy for coding agents that cuts noisy terminal output while preserving command behavior
⚡ Cut Claude token usage by 90%+ — free, open-source, local-first context compression for Claude Code. Hybrid RAG (BM25 + ONNX vectors), AST chunking, reranking. No API needed.
😎 Awesome papers on token redundancy reduction
The official implementation of CVPR Workshop 2025 paper: Window Token Concatenation for Efficient Visual Large Language Models.
Context-Optimized Memory Bank — Reduce AI token usage with structured documentation and cache-aware reading strategies
Script workflow management via MCP. Converts AI workflows to persistent scripts, reducing tokens & delays while minimizing hallucination risks.
Token-compression skill. An adaptation of caveman style: short common words, trust context, say just enough, be laconic.
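Most of the tools above share the same core move: strip low-signal text from the context before it reaches the model. As a minimal illustration only (this is not taken from any listed repository; the `compress_context` helper and the rough 4-characters-per-token heuristic are assumptions for the sketch), a standalone Python example:

```python
def compress_context(source: str) -> str:
    """Drop blank lines, full-line comments, and trailing whitespace
    so a code snippet spends fewer tokens in an LLM prompt."""
    kept = []
    for line in source.splitlines():
        stripped = line.strip()
        # Blank lines and comment-only lines cost tokens but carry
        # little signal for the model; skip them.
        if not stripped or stripped.startswith("#"):
            continue
        kept.append(line.rstrip())
    return "\n".join(kept)


def approx_tokens(text: str) -> int:
    # Rough heuristic (an assumption, not a real tokenizer):
    # roughly 4 characters per token for English text and code.
    return max(1, len(text) // 4)


snippet = '''
# helper module
def add(a, b):
    # return the sum
    return a + b
'''

compact = compress_context(snippet)
print(approx_tokens(snippet), "->", approx_tokens(compact))
```

Real tools in this list go far beyond this — AST-aware chunking, knowledge graphs, BM25/vector retrieval — but the cost model is the same: fewer characters in, fewer tokens billed.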