Fix KernelAgent backend integration issues #205

jiannanWang · 2026-01-14T23:52:32Z

This PR mainly solves KernelAgent integration issues with BackendBench.

Key changes:

Centralized Kernel Generation: Adds a method to generate kernels for all specified operators, with per-op summaries and an overall report.
Consistent Operator Name: Extracts the correct folder names using a util function.
Organized Kernel Files: The generated kernel files are now grouped by operator and aligned with the directory backend.
Operator Filtering (torchbench suite): Switches to exact name matching to avoid false positives. (e.g., --ops relu.default now test only aten.relu.default, not aten.leaky_relu.default)

Example Usage:
Command:
uv run python BackendBench/scripts/main.py --suite torchbench --backend kernel_agent --ops relu.default,leaky_relu.default --topn 5

Result

# KernelAgent logs are skipped

correctness score (mean pass rate over all operators): 1.00
performance score (geomean speedup over all operators): 1.00
perf@p score (rate of correct samples with a speedup greater than p, p=1.0): 0.50

Folder Structure Example: (some files are skipped for simplicity.)

kernel_agent_run_20260114_152026
├── agent_logs
│   └── ...
├── leaky_relu__default
│   ├── leaky_relu__default_implementation_v1.py
│   └── leaky_relu__default_summary.txt
├── leaky_relu__default_session_session_20260114_152046_533793
│   └── ...
├── relu__default
│   ├── relu__default_implementation_v1.py
│   └── relu__default_summary.txt
├── relu__default_session_session_20260114_152250_994449
│   └── ...
├── OVERALL_SUMMARY.txt
└── README.md

msaroufim · 2026-01-15T04:51:24Z

BackendBench/backends/kernel_agent.py

 '''
            return kernel_code + wrapper_code

+    def _generate_kernel_file_path(self, folder_name: str, attempt: int) -> str:


so we had some existing setup directory utils IIRC, wouldn't this just duplicate stuff?

We do have a similar function in the llm backend and this function is a duplicate for that.

This kernel_file_path is only used in the llm and kernel_agent backends because those backends requires generating kernels.

Since kernel_file_path is only used in the llm and kernel_agent backends, I could create a new class, GenerativeBackend, which inherits from Backend. I can move the shared functions (e.g., _generate_kernel_file_path) from both llm and kernel_agent into GenerativeBackend. Then, both the LLM and kernel_agent classes can inherit from GenerativeBackend instead of directly from Backend, which will help reduce duplication.

would it not be possible to have the same function?

msaroufim · 2026-01-15T04:53:20Z

BackendBench/backends/kernel_agent.py

-            f.write(kernel_code)
+        try:
+            compiled_kernel = self.compile_kernel_from_string(kernel_code, op_name, attempt=attempt)
+            self.compiled_kernels[op] = compiled_kernel


this function means something totally different now, we went from something like write_kernel_to_file to bind the compiled op? so maybe the function just needs a better name

msaroufim · 2026-01-15T04:54:12Z

BackendBench/backends/kernel_agent.py

+
+    def generate_kernels(self, suite, daemon=True):
+        """Generate kernels for all operators in the suite with comprehensive feedback."""
+        self.daemon = daemon


what is this doing? I'd have expected a daemon to be a service and not a boolean. Also generally boolean args in a function typically hint that you might need 2 seperate functions

msaroufim · 2026-01-15T04:55:35Z

BackendBench/backends/kernel_agent.py

+        # Generate and save overall summary
+        self._write_overall_summary(successful_ops, total_ops)
+
+    def _write_summary(


I feel like a few stuff is getting mixed here

What the eval is expected to output

What kernelagent is expected to output

What the specific integration of kernelagent and the eval is supposed to follow

For instance Sahan also had some extensive reporting code in the past, so I'd like to hear some ideas for how to clean all this up

fix kernel agent

a6791e8

jiannanWang requested a review from msaroufim January 14, 2026 23:52

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 14, 2026

msaroufim reviewed Jan 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix KernelAgent backend integration issues #205

Fix KernelAgent backend integration issues #205

jiannanWang commented Jan 14, 2026

Uh oh!

msaroufim Jan 15, 2026

Uh oh!

jiannanWang Jan 15, 2026 •

edited

Loading

Uh oh!

msaroufim Jan 18, 2026

Uh oh!

msaroufim Jan 15, 2026

Uh oh!

msaroufim Jan 15, 2026

Uh oh!

msaroufim Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix KernelAgent backend integration issues #205

Are you sure you want to change the base?

Fix KernelAgent backend integration issues #205

Conversation

jiannanWang commented Jan 14, 2026

Uh oh!

msaroufim Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

jiannanWang Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

msaroufim Jan 18, 2026

Choose a reason for hiding this comment

Uh oh!

msaroufim Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

msaroufim Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

msaroufim Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jiannanWang Jan 15, 2026 •

edited

Loading