Skip to content

feat: evalmonkey web ui and benchmark stability fixes#3

Merged
himmi-01 merged 1 commit intoCorbell-AI:mainfrom
himmi-01:feature/evalmonkey-web-ui
May 6, 2026
Merged

feat: evalmonkey web ui and benchmark stability fixes#3
himmi-01 merged 1 commit intoCorbell-AI:mainfrom
himmi-01:feature/evalmonkey-web-ui

Conversation

@himmi-01
Copy link
Copy Markdown
Contributor

@himmi-01 himmi-01 commented May 6, 2026

  • Added Next.js & FastAPI Web UI for live benchmarking
  • Fixed macOS torch shared memory permission crash
  • Improved HuggingFace datasets loading logic (trust_remote_code=True)
  • Fixed Hellaswag and MMLU strict LLM judge options mapping
  • Updated UI to auto-detect LLM judge from environment

Issue : #2

Screenshot 2026-05-05 at 10 48 33 PM Screenshot 2026-05-05 at 10 48 46 PM Screenshot 2026-05-05 at 10 49 01 PM

- Added Next.js & FastAPI Web UI for live benchmarking
- Fixed macOS torch shared memory permission crash
- Improved HuggingFace datasets loading logic (trust_remote_code=True)
- Fixed Hellaswag and MMLU strict LLM judge options mapping
- Updated UI to auto-detect LLM judge from environment
@himmi-01 himmi-01 merged commit 1ad2b0e into Corbell-AI:main May 6, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant