<p><strong>LIBRARIAN</strong> is a novel sample-and-rerank method that refactors codebases into reusable libraries. It clusters code to find shared structures, samples refactorings, and ranks them by simplicity and correctness. It achieves 1.6-2x better compression than top code agents while boosting accuracy.</p>
+
<p><strong>Librarian</strong> is a novel sample-and-rerank method that refactors codebases into reusable libraries. It clusters code to find shared structures, samples refactorings, and ranks them by simplicity and correctness. It achieves 1.6-2x better compression than top code agents while boosting accuracy.</p>
<p><strong>MINiCODE</strong> is a new benchmark for testing code agents' ability to create unified libraries from multiple code sources. It emphasizes open-ended design, verifiable correctness via unit tests, and large-context synthesis. It spans competition coding and synthesized repositories, filling gaps in existing benchmarks.</p>
+
<p><strong>Minicode</strong> is a benchmark for testing code agents' ability to create unified libraries from multiple code sources, such as competition coding programs and Python repositories. It requires open-ended design and large-context understanding in order to craft simple libraries.</p>