Skip to content

Commit 8c4c755

Browse files
add table details
1 parent 33b7ae6 commit 8c4c755

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

index.html

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -332,6 +332,8 @@ <h2 id="results">Results</h2>
332332
<figcaption style="text-align: center;">Table 3: Refactoring results for LIBRARIAN (w/ K = 8) averaged over 10 Code Contests collections</figcaption>
333333
</figure>
334334

335+
We also present the results on the small repo split, which consists of repositories generated by o4-mini.
336+
We experiment with Claude Sonnet 3.7 as a planner, and either Sonnet 3.7 as the implementer or o4-mini.
335337
<figure class="table-figure">
336338
<table class="table-styled">
337339
<thead>
@@ -362,6 +364,7 @@ <h2 id="results">Results</h2>
362364
<figcaption style="text-align: center;">Table 4: Average results on MiniCode-repositories small, using Codex with o4-mini and Claude Code with Claude Sonnet 3.7</figcaption>
363365
</figure>
364366

367+
Finally, we present resulst on the large repo split. Due to the stronger performance of Sonnet models, we evaluate only Sonnet models to minimize cost.
365368
<figure class="table-figure">
366369
<table class="table-styled">
367370
<thead>

0 commit comments

Comments
 (0)