Pull requests: tinybirdco/llm-benchmark
Pull requests list
Add benchmark results for bytedance-seed/seed-1.6-flash
#265 opened Mar 4, 2026 by github-actions bot
Add benchmark results for liquid/lfm-2.5-1.2b-thinking:free
#264 opened Mar 4, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.3-chat
#263 opened Mar 4, 2026 by github-actions bot
Add benchmark results for google/gemini-3.1-flash-lite-preview
#262 opened Mar 3, 2026 by github-actions bot
Add benchmark results for bytedance-seed/seed-2.0-mini
#261 opened Feb 27, 2026 by github-actions bot
Add benchmark results for google/gemini-3.1-flash-image-preview
#260 opened Feb 26, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-122b-a10b
#258 opened Feb 26, 2026 by github-actions bot
Add benchmark results for google/gemini-3.1-pro-preview-customtools
#257 opened Feb 26, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-flash-02-23
#256 opened Feb 26, 2026 by github-actions bot
Add benchmark results for liquid/lfm-2-24b-a2b
#255 opened Feb 26, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-35b-a3b
#254 opened Feb 26, 2026 by github-actions bot
Add benchmark results for aion-labs/aion-2.0
#253 opened Feb 25, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.3-codex
#252 opened Feb 25, 2026 by github-actions bot
Add benchmark results for google/gemini-3.1-pro-preview
#251 opened Feb 19, 2026 by github-actions bot
Add benchmark results for anthropic/claude-sonnet-4.6
#250 opened Feb 17, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-397b-a17b
#249 opened Feb 16, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-plus-02-15
#248 opened Feb 16, 2026 by github-actions bot
Add benchmark results for minimax/minimax-m2.5
#246 opened Feb 12, 2026 by github-actions bot
Add benchmark results for qwen/qwen3-max-thinking
#244 opened Feb 10, 2026 by github-actions bot
Add benchmark results for openrouter/aurora-alpha
#243 opened Feb 9, 2026 by github-actions bot
Add benchmark results for openrouter/pony-alpha
#242 opened Feb 6, 2026 by github-actions bot
Add benchmark results for anthropic/claude-opus-4.6
#241 opened Feb 5, 2026 by github-actions bot