Add python script to launch vtk benchmark by Lambourl · Pull Request #1 · tipi-build/kitware-benchmark

Lambourl · 2026-04-28T08:02:14Z

No description provided.

pysco68

Good start, but I don't think we're tracking all the relevant data in the most useful way yet.

We need to time:

full rebuild (clean)
Initial full build
Configure

For both the baseline cmake+ninja and cmake-re, AFAICT.
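One way to record the phases listed above separately is a small timing helper. This is only a sketch: `timed_phase`, the phase names, and the `timings` dictionary are hypothetical and not part of the PR, and the placeholder body stands in for the real configure and build commands.

```python
import time
from contextlib import contextmanager
from typing import Dict, Iterator


@contextmanager
def timed_phase(name: str, timings: Dict[str, float]) -> Iterator[None]:
    """Record the wall-clock duration of the enclosed block under `name`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        # Store the duration even if the phase raised, so failed runs stay visible.
        timings[name] = time.perf_counter() - start


timings: Dict[str, float] = {}
for phase in ("configure", "initial_full_build", "clean_rebuild"):
    with timed_phase(phase, timings):
        pass  # placeholder: run the configure / build command for this phase here
```

Running the same three phases once for cmake+ninja and once for cmake-re would give directly comparable numbers per phase.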

pysco68 · 2026-04-28T12:55:54Z

+        tc_name = Path(toolchain).stem
+        output_file = f"cmake-run_{tc_name}.txt"


You shouldn't be manually collecting that data and writing it to files, especially since you're not using structured data anyway; use the logging module instead:

    import logging
    from datetime import datetime

    log_filename = f"kitwarebench_{datetime.now().strftime('%Y-%m-%d_%H-%M-%S')}.log"

    # Configure the logging framework
    logging.basicConfig(
        level=logging.INFO,
        format='%(asctime)s - %(levelname)s - %(message)s',
        handlers=[
            logging.FileHandler(log_filename),  # Writes to the file
            logging.StreamHandler()             # Also prints to your console
        ]
    )

    # Use it!
    logging.info("Application started successfully.")
    logging.warning("This is a warning message.")
    logging.error("An error occurred.")

That being said, you might want to go with structured data and serialize to CSV or JSON, both of which can be done using Python built-ins.
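As a rough illustration of the structured-data suggestion, the sketch below serializes benchmark records to both JSON and CSV using only the standard library. The `BenchmarkRecord` fields, the helper names, and the sample values are assumptions made up for the example; they are placeholders, not measurements from this benchmark.

```python
import csv
import io
import json
from dataclasses import asdict, dataclass
from typing import List

FIELDS = ["tool", "toolchain", "phase", "iteration", "seconds"]


@dataclass
class BenchmarkRecord:
    # Hypothetical fields; adjust to whatever the benchmark actually tracks.
    tool: str
    toolchain: str
    phase: str
    iteration: int
    seconds: float


def to_json(records: List[BenchmarkRecord]) -> str:
    """Serialize the records as a JSON array of objects."""
    return json.dumps([asdict(r) for r in records], indent=2)


def to_csv(records: List[BenchmarkRecord]) -> str:
    """Serialize the records as CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    for record in records:
        writer.writerow(asdict(record))
    return buffer.getvalue()


# Placeholder values for illustration only.
records = [
    BenchmarkRecord("cmake+ninja", "example-toolchain", "configure", 0, 12.5),
    BenchmarkRecord("cmake-re", "example-toolchain", "configure", 0, 9.25),
]
```

JSON keeps nested metadata easy to extend later, while CSV is convenient for loading into a spreadsheet; both can be produced from the same records.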

It wasn't a completely finished script yet, but I had always intended to use JSON to store that information in the end.

So this will be done with JSON.

pysco68 · 2026-04-28T13:07:29Z

+def start_docker(image, source_dir, container_name=None):
+    """Start a detached docker container with source_dir mounted. Returns the container name."""
+    if container_name is None:
+        container_name = str(uuid.uuid4())
+    uid = os.getuid()
+    gid = os.getgid()
+    username = getpass.getuser()
+    home = os.environ["HOME"]
+
+    print(f"Starting container {container_name}...")
+    subprocess.run([
+        "docker", "run",
+        "--platform", "linux/amd64",
+        "--rm", "--init",
+        "--name", container_name,
+        f"-u{uid}:{gid}", "--group-add", "tipi",
+        "--ulimit", "nofile=65535:65535",   #
+        "-e", "TIPI_DISABLE_AR_RANLIB_DRIVER=ON",
+        "-e", "TIPI_CACHE_CONSUME_ONLY=ON",
+        "-e", "TIPI_CACHE_FORCE_ENABLE=OFF",
+        "-e", "HOME",
+        "-e", "RBE_service=kernite.cluster.engflow.com:443",
+        "-e", f"RBE_tls_client_auth_key={home}/engflow-mTLS/engflow.key",
+        "-e", f"RBE_tls_client_auth_cert={home}/engflow-mTLS/engflow.crt",
+        "-v", f"{home}:{home}:rw",
+        "-v", f"{source_dir}:{source_dir}:rw",
+        "-w", str(source_dir),
+        "-d",
+        image,
+        "sleep", "infinity",
+    ], check=True)
+
+    # Create the user inside the container
+    subprocess.run([
+        "docker", "exec", "-u", "0", container_name,
+        "useradd", "-d", home, "-u", str(uid), username,
+    ], check=False)
+
+    print(f"Container running: {container_name}")
+    return container_name
+
+
+def stop_docker(container_name):
+    """Stop a running docker container."""
+    print(f"Stopping container {container_name[:8]}...")
+    subprocess.run(["docker", "stop", "-t0", container_name], check=True)
+    print("Container stopped.")
+
+
+def docker_exec(container_name, cmd):
+    """Run a command inside a running docker container and return elapsed time."""
+    print(f"  [{container_name[:8]}] Running: {cmd}")
+    start = time.perf_counter()
+    subprocess.run(["docker", "exec", container_name, "bash", "-c", cmd], check=True)
+    elapsed = time.perf_counter() - start
+    print(f"  Done in {elapsed:.2f}s")
+    return elapsed


Nitpick: this should be in a Python RAII-style wrapper that manages the lifecycle of the container, IMHO: a class that defines __enter__(self) and __exit__(self, exc_type, exc_val, exc_tb), which then allows using it in with ... as x: ... blocks.
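A minimal sketch of such a wrapper could look like the following. The class name `DockerContainer` and the injectable `run` parameter are hypothetical (the latter exists only so the class can be exercised without Docker), and the `docker run` arguments are reduced to a subset of those in the diff above.

```python
import subprocess
import uuid
from typing import Callable, List, Optional

Runner = Callable[[List[str]], object]


def _run_checked(cmd: List[str]) -> object:
    """Default runner: execute the command and raise on a non-zero exit code."""
    return subprocess.run(cmd, check=True)


class DockerContainer:
    """Context manager sketch: start a detached container on entry, stop it on exit."""

    def __init__(self, image: str, name: Optional[str] = None, run: Runner = _run_checked) -> None:
        self.image = image
        self.name = name or str(uuid.uuid4())
        self._run = run  # injectable so the class can be tested without Docker

    def __enter__(self) -> "DockerContainer":
        self._run(["docker", "run", "--rm", "--init", "-d",
                   "--name", self.name, self.image, "sleep", "infinity"])
        return self

    def __exit__(self, exc_type, exc_val, exc_tb) -> bool:
        # Always stop the container, even when the block raised.
        self._run(["docker", "stop", "-t0", self.name])
        return False  # do not swallow exceptions from the with-block

    def exec(self, cmd: str) -> object:
        """Run a shell command inside the running container."""
        return self._run(["docker", "exec", self.name, "bash", "-c", cmd])
```

With this shape, a benchmark would be written as `with DockerContainer(image) as container: container.exec(...)`, and the container is stopped even if a build step fails.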

pysco68 · 2026-04-28T13:08:47Z

+    """Run a command inside a running docker container and return elapsed time."""
+    print(f"  [{container_name[:8]}] Running: {cmd}")
+    start = time.perf_counter()
+    subprocess.run(["docker", "exec", container_name, "bash", "-c", cmd], check=True)


You should capture the output and log it somewhere, because we will certainly want to know about the error messages, other details, the RBE invocation ID for profiles, and so on.
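One possible way to do this is to let `subprocess.run` capture stdout and stderr and forward them to the logging framework. The sketch below assumes a hypothetical helper named `run_and_log` and a logger named `kitwarebench`; a plain Python command stands in for the `docker exec` invocation so that it runs anywhere.

```python
import logging
import subprocess
import sys
import time
from typing import List, Tuple

logger = logging.getLogger("kitwarebench")  # hypothetical logger name


def run_and_log(argv: List[str]) -> Tuple[float, subprocess.CompletedProcess]:
    """Run argv, log its stdout/stderr, and return (elapsed seconds, completed process)."""
    start = time.perf_counter()
    proc = subprocess.run(argv, capture_output=True, text=True, check=False)
    elapsed = time.perf_counter() - start
    if proc.stdout:
        logger.info("stdout of %s:\n%s", argv, proc.stdout)
    if proc.stderr:
        logger.warning("stderr of %s:\n%s", argv, proc.stderr)
    if proc.returncode != 0:
        logger.error("%s exited with code %d", argv, proc.returncode)
    return elapsed, proc


# Stand-in command so the sketch runs without Docker; in the script this would be
# ["docker", "exec", container_name, "bash", "-c", cmd].
elapsed, proc = run_and_log([sys.executable, "-c", "print('hello')"])
```

Returning the completed process alongside the elapsed time leaves it to the caller to decide whether a failed command should abort the benchmark, while the captured output is already in the log for later inspection.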

pysco68 · 2026-04-28T13:09:55Z

+
+                # Second run: with warm RBE cache
+                print("  [with-cache] cmake-re configure + build...")
+                docker_exec(container, f"cmake-re -GNinja -S . -B ./build -DCMAKE_TOOLCHAIN_FILE={toolchain} --host --distributed")


Track the configure time too, as this was part of the tracked data in Bill's initial benchmark.

pysco68 · 2026-04-28T13:12:56Z

+    benchmark_vtk_project_cmake(source_dir, image, args.iterations, toolchains)
+    benchmark_vtk_project_cmake_re(source_dir, image, args.iterations, toolchains)


Nitpick: it feels like the loop over toolchains and iterations could be defined here, so you don't repeat yourself so often.
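A shared loop along these lines might look like the sketch below. The `run_all` function, the `Benchmark` callable signature, and the result keys are assumptions for illustration; the lambdas are stand-ins for the real benchmark functions.

```python
from itertools import product
from typing import Callable, Dict, Iterable, List

# A benchmark takes (toolchain, iteration) and returns elapsed seconds.
Benchmark = Callable[[str, int], float]


def run_all(benchmarks: Dict[str, Benchmark], toolchains: Iterable[str], iterations: int) -> List[dict]:
    """Run every benchmark for every toolchain and iteration in one shared loop."""
    results = []
    for (name, bench), toolchain, iteration in product(
        benchmarks.items(), toolchains, range(iterations)
    ):
        results.append({
            "benchmark": name,
            "toolchain": toolchain,
            "iteration": iteration,
            "seconds": bench(toolchain, iteration),
        })
    return results


# Stand-in benchmark functions so the sketch runs on its own.
results = run_all(
    {"cmake": lambda tc, i: 0.0, "cmake-re": lambda tc, i: 0.0},
    toolchains=["a.cmake", "b.cmake"],
    iterations=2,
)
```

Each benchmark function then only has to handle a single toolchain and iteration, and the iteration order is defined in exactly one place.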

Orphis · 2026-04-29T13:14:43Z

Just a general comment about the Python code: you should now use type annotations on your functions whenever possible. It's been supported by Python 3 for a while now.
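For illustration, annotated versions of two small helpers could look like this. The function names `output_file_for` and `container_label` are hypothetical; their bodies mirror expressions that appear in the diff (`Path(toolchain).stem` and `container_name[:8]`).

```python
from pathlib import Path
from typing import Optional, Union


def output_file_for(toolchain: Union[str, Path], prefix: str = "cmake-run") -> str:
    """Derive the per-toolchain output file name from the toolchain file path."""
    tc_name = Path(toolchain).stem
    return f"{prefix}_{tc_name}.txt"


def container_label(container_name: Optional[str]) -> str:
    """Short label for log lines; falls back when no name has been assigned yet."""
    return container_name[:8] if container_name else "<unnamed>"
```

The annotations make it explicit that a toolchain may be passed as a string or a `Path`, and that a container name may be absent, which a type checker can then verify at every call site.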

On the benchmark side, it might be interesting to check the cache hit rate on rebuild by extracting it from the reproxy log files. I don't think we have much cache invalidation in this project, but if we were to reuse some of this, we'd definitely need it.

Lambourl · 2026-04-29T13:21:24Z

On the benchmark side, it might be interesting to check the cache hit rate on rebuild by extracting it from the reproxy log files. I don't think we have much cache invalidation in this project, but if we were to reuse some of this, we'd definitely need it.

I'm currently working on a way to retrieve all the logs related to the Docker container at the end of the process, which will allow us to review them.

Lambourl added 3 commits April 28, 2026 10:01

🆕 add python script to launch vtk benchmark

88670ca

🔧 increase pipe limit

5db6bc8

🔧 add args to change -j

c24adc3

pysco68 requested changes Apr 28, 2026

Lambourl added 18 commits April 28, 2026 16:02

🔧 add cleaning and chaneg of version

2de2089

🔧 relaunch cmake too

a0b9385

🔧 add function to clean repo after change

a149b88

🔧 use json

0435dd1

🔧 manage docker lifetime with class

be9c2c4

🔧 log are strore in log file

5ebc4fb

🔧 use a common loop

25e964e

🔧 improve configuration

7907232

🔧 improve help data

08b1d4c

🔧 the touch file is configurable

0832683

🔧 use a struct to call run_benchmarks to avoid to have a lot of argum…

b1d0d9a

…ents

🔧 rbe service is now configurable

25ac422

💄 jobs go in config

8beea09

🔧 use the full cfg

349e627

🔧 avoid issue with space in toolchain or touch_file

ebd8398

🔧 add rbe mode to change beetween racing and remote

f6138ce

🔧 update config

a5ee89e

🔧 update benchamrk to handle url adress with .git

6b26041

Lambourl added 6 commits April 29, 2026 15:22

🔧 add reclient log zip at the end of the docker

d848670

🔧 respect cmake-re name

4a72911

🔧 produce cvs at the end

6e8fdb6

🔧 remove some bad environment variable

3ade827

🔧 install zip if not present in the docker

b7a9d98

🔧 better cvs at the end

84c6d58

Lambourl added 4 commits April 30, 2026 11:12

🔧 add profile download and search pattern in the benchmark

1037910

🔧 add explain of the sleep

e0bfcaf

🔧 do all download at the end to let finalize all file

2e0868a

🔧 be able to chosse credential folder name

781b68b

Lambourl force-pushed the feature/add-benchmark branch from ce105fc to 781b68b Compare April 30, 2026 09:43

pysco68 and others added 9 commits April 30, 2026 18:13

🔧 use tar instead of zip to not have to install something in the cont…

c7072be

…ainer on every run

🔧 changed the order of the "modified_file_rebuild vs rebuild" to give…

73a1236

… the cluster less opportunity to scale back

Implemented cluster preheating for the cmake-re bench

6727fdb

🔧 adapt comment

d6c3cb5

Optimizing order of preheating run to minimize long tail

db82520

Fixing typo

4277119

🔧 introduce some fisrt readme

423af85

Merge branch 'feature/add-benchmark' of https://github.com/tipi-build…

7fa5efd

…/Kitware-benchmark into feature/add-benchmark

🔧 rename and add sheet compare

4249615

Lambourl force-pushed the feature/add-benchmark branch from be2552f to 4249615 Compare May 4, 2026 13:11

🔧 change in asset

212b45f
