feat: add benchmark automation bot [WIP] by andygrove · Pull Request #3557 · apache/datafusion-comet

andygrove · 2026-02-20T17:35:20Z

Summary

I have been testing a version of this code for several weeks and it seems to work fairly well now, so I would like to get the code into OSS for transparency, and allow others to help make improvements.

It is just a first step. The benchmarks do run in k8s in a constrained environment, which is good, but the tests run in Spark local mode. It would be better to deploy as a real cluster in k8s later on.

There is currently an assumption that TPC-H 100GB data already exists on the k8s nodes. It would be better to generate the data using tpchgen-cli directly in the containers. It would also be nice to support different scale factors.

There are likely many other improvements that can be made in the future.

Changes

Adds a GitHub bot (cometbot) that monitors PR comments for slash commands (/run tpch, /run micro, /help) and automatically runs benchmarks in Kubernetes, posting results back as PR comments
Includes a Click CLI for manual benchmark runs, Docker image build/push, K8s job management, and deployment tooling
Adds contributor guide documentation explaining how to trigger benchmarks and how the bot works
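
To illustrate the slash-command monitoring described above, here is a minimal sketch of how a comment body might be matched against the three commands named in this PR (`/run tpch`, `/run micro`, `/help`). The function name `parse_slash_command`, the rule that a command must start a line, and the tolerance for trailing arguments are assumptions made for illustration; they are not taken from `bot.py`.

```python
from typing import Optional

# Commands named in the PR description. The mapping values are illustrative labels.
KNOWN_COMMANDS = {"/run tpch": "tpch", "/run micro": "micro", "/help": "help"}


def parse_slash_command(comment_body: str) -> Optional[str]:
    """Return 'tpch', 'micro', or 'help' if any line starts with a known command.

    Assumption: a command only counts when it begins a line, and anything
    after the command (separated by whitespace) is treated as arguments.
    """
    for line in comment_body.splitlines():
        # Collapse runs of whitespace so "/run   tpch" still matches.
        normalized = " ".join(line.strip().split())
        for prefix, name in KNOWN_COMMANDS.items():
            if normalized == prefix or normalized.startswith(prefix + " "):
                return name
    return None
```

Requiring the command at the start of a line avoids triggering a benchmark when someone merely quotes or discusses a command mid-sentence.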

Details

The bot lives in dev/benchmarking-bot/ and includes:

Bot (src/cometbot/bot.py): Polls GitHub for slash commands on open Comet PRs
K8s (src/cometbot/k8s.py): Builds Docker images, creates/manages Kubernetes jobs
CLI (src/cometbot/cli.py): Click-based CLI for manual benchmark runs and bot management
Dockerfile: Container with JDK 17, Rust, Maven, and Spark 3.5 for running benchmarks
K8s templates (k8s/): Job and deployment manifests
Deploy script (deploy/deploy.sh): Automated deployment to a remote host via SSH

All configuration is via COMETBOT_* environment variables (registry, GitHub token, deploy host, etc.).
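
As a rough illustration of environment-driven configuration, the sketch below reads the three deploy variables that appear in the README excerpt quoted later in this review (`COMETBOT_DEPLOY_HOST`, `COMETBOT_DEPLOY_USER`, `COMETBOT_DEPLOY_DIR`). The `DeployConfig` class and `load_deploy_config` function are hypothetical names, and failing fast on missing values is an assumed design choice, not necessarily what the bot does.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class DeployConfig:
    host: str
    user: str
    directory: str


def load_deploy_config(env: Optional[Mapping[str, str]] = None) -> DeployConfig:
    """Build a DeployConfig from COMETBOT_DEPLOY_* variables (hypothetical helper)."""
    env = os.environ if env is None else env
    required = ("COMETBOT_DEPLOY_HOST", "COMETBOT_DEPLOY_USER", "COMETBOT_DEPLOY_DIR")
    # Report every missing or empty variable at once instead of one at a time.
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return DeployConfig(
        host=env["COMETBOT_DEPLOY_HOST"],
        user=env["COMETBOT_DEPLOY_USER"],
        directory=env["COMETBOT_DEPLOY_DIR"],
    )
```

Passing the environment as a mapping keeps the function easy to test without mutating `os.environ`.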

…rks on PRs Adds a GitHub bot that monitors PR comments for slash commands (/run tpch, /run micro, /help) and automatically runs benchmarks in Kubernetes, posting results back as PR comments. Includes CLI for manual benchmark runs, Docker image build/push, K8s job management, and deployment tooling. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

andygrove · 2026-02-20T17:55:56Z

@Shekharrajak fyi

milenkovicm · 2026-02-21T16:57:21Z

I will try to review it as soon as possible, though it might be a bit too big for a review.
If it works similarly to

[at]sqlbenchmark tpch --iterations 5 --scale-factor 10

I would suggest one minor change: when the bot comments, it would be good if it mentioned the person who triggered the benchmark. In my case, I have GitHub app notifications disabled for comments but still get notifications for mentions. Would that make sense?
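
The suggestion above could look something like the following sketch, which prefixes the results comment with an @-mention of whoever triggered the run. The function `format_results_comment` and its parameters are hypothetical; how the bot actually builds its comment body is not shown in this thread.

```python
def format_results_comment(triggered_by: str, benchmark: str, results_markdown: str) -> str:
    """Build a PR comment body that @-mentions the user who triggered the benchmark.

    Hypothetical helper: `triggered_by` is the GitHub login of the commenter,
    with or without a leading "@".
    """
    login = triggered_by.lstrip("@")
    return f"@{login} results for `{benchmark}`:\n\n{results_markdown}"
```

Because GitHub sends mention notifications separately from ordinary comment notifications, leading with the login would reach users in the situation described above.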

andygrove · 2026-02-24T19:09:13Z

> I will try to review it as soon as possible, though it might be a bit too big for a review. If it works similarly to
> [at]sqlbenchmark tpch --iterations 5 --scale-factor 10
> I would suggest one minor change: when the bot comments, it would be good if it mentioned the person who triggered the benchmark. In my case, I have GitHub app notifications disabled for comments but still get notifications for mentions. Would that make sense?

Thanks. I was thinking that we could create a similar PR in Ballista once we get this through review.

milenkovicm

I think that makes sense @andygrove

I guess the main comment I have is about how to update the authorised users: I can't really tell from the PR, and there is no documentation section about it.

milenkovicm · 2026-02-24T21:14:24Z

docs/source/contributor-guide/benchmark-bot.md

+This path is mounted into the container as `/data/tpch` via a `hostPath` volume (see `k8s/comet-job-template.yaml`).
+
+You must generate and place this data before running TPC-H benchmarks. Data generation scripts are
+available in the [DataFusion Benchmarks](https://github.com/apache/datafusion-benchmarks) repository.


Has this repo been retired, or is it going to be?

milenkovicm · 2026-02-24T21:17:09Z

docs/source/contributor-guide/benchmark-bot.md

+
+### TPC-H Data Prerequisite
+
+The benchmark jobs expect TPC-H SF100 data to already exist on the Kubernetes nodes at `/mnt/bigdata/tpch`.


If the data is pre-generated, would it make sense to be able to retrieve that info as part of the help output or in some other way?

milenkovicm · 2026-02-24T21:18:41Z

docs/source/contributor-guide/benchmark-bot.md

+   clones the Comet repo, checks out the PR, builds Comet (`make release`), and runs the requested benchmark.
+6. **Completion Tracking**: While jobs are running, the bot checks Kubernetes job status every 10 seconds.
+   On completion, a :rocket: reaction is added to the original comment.
+7. **Results Posting**: The benchmark container itself posts results as a PR comment using the GitHub API


Mentioning the user who triggered the benchmark would make sense, in my opinion.
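
For readers following the completion-tracking step quoted in the excerpt above, here is a minimal sketch of a loop that checks job status every 10 seconds. The status strings, the timeout, and the injected `get_status` callable are assumptions for illustration; the real bot queries Kubernetes, which this sketch deliberately leaves abstract.

```python
import time
from typing import Callable


def wait_for_job(
    get_status: Callable[[], str],
    poll_seconds: float = 10.0,
    timeout_seconds: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll `get_status` until it reports a terminal state or the timeout elapses.

    Assumed status values: "running", "succeeded", "failed". Returns the
    terminal status, or "timeout" if the job never finishes in time.
    """
    waited = 0.0
    while True:
        status = get_status()
        if status in ("succeeded", "failed"):
            return status
        if waited >= timeout_seconds:
            return "timeout"
        sleep(poll_seconds)
        waited += poll_seconds
```

Injecting `sleep` and `get_status` keeps the loop testable without a cluster or real delays.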

milenkovicm · 2026-02-24T21:23:33Z

docs/source/contributor-guide/benchmark-bot.md

+## Authorization
+
+Only DataFusion committers are authorized to trigger benchmarks. The list of authorized GitHub usernames
+is maintained in `dev/benchmarking-bot/authorized_users.txt`. Unauthorized users receive a reply


If the user list is updated, how does the change get propagated to the cluster?

It's a rather big PR to trace, but I hope a PR author can't abuse the file by adding themselves to it and then running benchmarks.
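
One way to address the concern above is for the bot to check commenters only against the copy of the list it was deployed with, never against the file as it appears in the PR under test. The sketch below assumes a one-username-per-line format with optional `#` comments, which is a guess at the layout of `authorized_users.txt`; the helper names are hypothetical.

```python
from typing import Iterable, Set


def parse_authorized_users(lines: Iterable[str]) -> Set[str]:
    """Parse usernames, one per line, ignoring blanks and '#' comments (assumed format)."""
    users = set()
    for line in lines:
        entry = line.split("#", 1)[0].strip()
        if entry:
            # GitHub logins are case-insensitive, so normalize to lowercase.
            users.add(entry.lower())
    return users


def is_authorized(login: str, authorized: Set[str]) -> bool:
    """Check a commenter's login against the trusted, already-loaded list."""
    return login.lower() in authorized
```

With this arrangement, a PR that edits the file has no effect until the change is merged and the bot is redeployed or reloads its trusted copy.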

milenkovicm · 2026-02-24T21:40:29Z

dev/benchmarking-bot/README.md

+export COMETBOT_DEPLOY_HOST=myhost
+export COMETBOT_DEPLOY_USER=myuser
+export COMETBOT_DEPLOY_DIR=/home/myuser/cometbot
+./deploy/deploy.sh


Should this script be run when the list of authorised users is updated?

milenkovicm · 2026-02-24T21:43:01Z

docs/source/contributor-guide/benchmark-bot.md

+# Start the bot
+cometbot bot start --github-token <token>
+```
+


Maybe the CLI could be used to propagate updates to the authorised users?

andygrove changed the title ~~feat: add benchmark automation bot~~ feat: add benchmark automation bot [WIP] Feb 20, 2026

andygrove force-pushed the benchmark-bot branch 2 times, most recently from 821b893 to 360a448 Compare February 20, 2026 17:37

andygrove force-pushed the benchmark-bot branch 2 times, most recently from 9c00b57 to d43d71c Compare February 20, 2026 17:43

reduce authorized users

51b5008

andygrove force-pushed the benchmark-bot branch from d43d71c to 51b5008 Compare February 20, 2026 17:52

andygrove requested a review from milenkovicm February 21, 2026 00:13

milenkovicm reviewed Feb 24, 2026

andygrove wants to merge 2 commits into apache:main from andygrove:benchmark-bot
