feat: Add memory profiling to TPC benchmarks [WIP]#3539
Draft
andygrove wants to merge 3 commits intoapache:mainfrom
Draft
feat: Add memory profiling to TPC benchmarks [WIP]#3539andygrove wants to merge 3 commits intoapache:mainfrom
andygrove wants to merge 3 commits intoapache:mainfrom
Conversation
7 tasks
07ec5c4 to
1cad3be
Compare
859ee45 to
43a7612
Compare
Single script that orchestrates the full benchmark lifecycle: starts the Spark cluster, runs the benchmark, and tears down on exit (including Ctrl-C). Supports --laptop flag for single-worker mode. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
--profile/--profile-intervalflags) that polls executor memory stats during benchmark runs and writes time-series CSV outputcollect-metrics.shscript, with automatic per-engine snapshots when profiling is enabledvisualize-metrics.pyto generate memory charts from JVM executor and cgroup metrics CSVsrequirements.txtwithmatplotlibdependency for the visualization scriptprofiling.py,visualize-metrics.py, andcollect-metrics.shinto the benchmark imagetpcbench.pyandrun.pyto wire through profiling options