Use StringBuf for loop accumulation; extract body-request helper by carpentry-agent[bot] · Pull Request #1 · carpentry-org/http-client

carpentry-agent · 2026-05-21T03:18:22Z

Summary

StringBuf for loop accumulation: poll-chunked, read-headers, and drain-stream all accumulated strings via String.concat &[acc chunk] inside loops, which copies the entire accumulated buffer on every iteration (O(n²) total). Replaced with StringBuf appends for amortized O(n) growth. Adds the strbuf@0.1.0 dependency.
body-request helper: post, put, and patch had identical Content-Length header setup and request dispatch. Extracted into a private body-request helper to eliminate the duplication.

No public API changes.

Opened by the carpentry-org heartbeat agent (Claude). Veit has not reviewed this yet.

Replace O(n²) String.concat loops in poll-chunked, read-headers, and drain-stream with amortized O(n) StringBuf appends. Extract shared Content-Length + request logic from post/put/patch into a private body-request helper.

carpentry-agent · 2026-05-21T03:39:29Z

Build & Tests

CI passing on both ubuntu and macOS. No local Carp compiler available on the review host to verify independently. Checked out the branch and read all changed code in full.

Findings

read-headers and drain-stream — genuine O(n²) → O(n) fix. Both functions previously accumulated via (set! acc (String.concat &[acc chunk])), which copies the entire buffer on every append. The StringBuf replacements append in amortized O(1) and materialize once at the end. read-headers also uses StringBuf.str (line 246) to peek without copying for the \r\n\r\n check — good.
poll-chunked improvement is overstated. The PR description claims O(n²) → O(n) for poll-chunked, but the buffer is cleared and rebuilt from scratch each loop iteration (StringBuf.clear + re-append buf + decoded). In the rc==0 branch (need more data), the pattern clear → append buf → append chunk → to-string → set-buf! is the same total work as the original String.concat. The real benefit here is avoiding the temporary array allocation for String.concat's argument. Not a correctness issue — just a note that the perf claim doesn't fully hold for this function.
Memory management is correct. Every StringBuf.create is paired with a StringBuf.delete. All cleanup paths are covered, including early break exits in poll-chunked (the StringBuf.delete sb at line 139 runs after the while loop regardless of how it exits).
body-request helper (line 320-323) is clean. Properly marked hidden + private. The extraction is mechanical — the three callers pass through verb, url, headers, body with no behavioral change.
strbuf@0.1.0 dependency — matches the version used elsewhere in carpentry-org. Fine.
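
The poll-chunked finding can be illustrated with a small copy-count model. This is a Python sketch, not Carp, and it does not call the strbuf API. It assumes the worst case where the rc==0 branch is taken on every iteration, so the carried-over buf only grows, and it assumes to-string copies the buffer contents. Both function names are hypothetical.

```python
def concat_copied(chunk_size: int, iterations: int) -> int:
    """Model of (set! acc (String.concat &[acc chunk])): each step copies acc + chunk."""
    copied, acc_len = 0, 0
    for _ in range(iterations):
        acc_len += chunk_size
        copied += acc_len  # the new string holds the old contents plus the chunk
    return copied


def clear_rebuild_copied(chunk_size: int, iterations: int) -> int:
    """Model of clear -> append buf -> append chunk -> to-string -> set-buf!."""
    copied, buf_len = 0, 0
    for _ in range(iterations):
        copied += buf_len + chunk_size  # buffer is cleared, then buf and chunk are re-appended
        buf_len += chunk_size
        copied += buf_len  # assumed: to-string materializes a fresh copy as the next buf
    return copied


if __name__ == "__main__":
    for n in (16, 32, 64):
        print(n, concat_copied(4096, n), clear_rebuild_copied(4096, n))
```

In this model, doubling the iteration count roughly quadruples the bytes copied for both functions, which is consistent with the note that poll-chunked sees no complexity-class change.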

Verdict: merge

Solid improvement. The StringBuf fixes in read-headers and drain-stream are the real wins. The body-request DRY extraction is clean. Code is correct, tests pass.

hellerve

Did you actually measure the memory and/or time speedup?

carpentry-agent · 2026-05-21T10:56:04Z

No, I didn't measure it — that's a fair call. The claim is theoretical:

read-headers and drain-stream — these accumulate via (set! acc (String.concat &[acc chunk])) in a loop, which copies the entire buffer on every iteration (O(n²) total for n bytes). StringBuf.append is amortized O(1) with geometric growth, so the total becomes O(n). For small responses this doesn't matter; for large ones (streaming downloads, big response bodies), it's the difference between linear and quadratic wall time.

poll-chunked — as the review noted, the improvement here is more modest. The clear+rebuild pattern has similar total work; the win is avoiding the temporary array allocation for String.concat's argument array, not a complexity class change. I overstated it in the PR description.

I can write a benchmark if you'd like concrete numbers before merging, but the read-headers/drain-stream fix is a textbook Schlemiel the Painter case — the speedup scales with response size.
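
The theoretical argument above can be sketched as a copy-count model. This is Python rather than Carp and only counts bytes moved; it is not a measurement and says nothing about allocator or cache behavior. The doubling growth policy and the initial capacity of 16 are assumptions for illustration, not details taken from strbuf, and the function names are hypothetical.

```python
def concat_bytes_copied(chunk_size: int, chunks: int) -> int:
    """Bytes copied when every append rebuilds the whole accumulator."""
    copied, length = 0, 0
    for _ in range(chunks):
        length += chunk_size
        copied += length  # old contents plus the new chunk land in a fresh string
    return copied


def buffer_bytes_copied(chunk_size: int, chunks: int, initial_capacity: int = 16) -> int:
    """Bytes copied by a growable buffer that doubles its capacity when full."""
    copied, length, capacity = 0, 0, initial_capacity
    for _ in range(chunks):
        needed = length + chunk_size
        if needed > capacity:
            while capacity < needed:
                capacity *= 2
            copied += length  # reallocation moves the existing contents once
        copied += chunk_size  # the chunk itself is written once
        length = needed
    return copied


if __name__ == "__main__":
    for n in (100, 500, 1000, 2000):
        print(n, concat_bytes_copied(1024, n), buffer_bytes_copied(1024, n))
```

Under these assumptions the concat count grows with the square of the number of chunks, while the buffer count stays within a small constant multiple of the total payload size.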

hellerve · 2026-05-21T12:46:12Z

No shortcuts, and no optimizations without measurements.

carpentry-agent · 2026-05-21T17:53:19Z

Addressed @hellerve's feedback ("No shortcuts, and no optimizations without measurements"):

Added bench/string-accumulate.carp — a standalone benchmark that times String.concat loop accumulation vs StringBuf.append at increasing iteration counts (100, 500, 1000, 2000 × 1KB chunks). Results on this machine (Pi 500, aarch64):

  100 iters: concat 4.4ms     strbuf 0.03ms    (176x)
  500 iters: concat 135ms     strbuf 0.32ms    (414x)
 1000 iters: concat 524ms     strbuf 0.65ms    (801x)
 2000 iters: concat 2380ms    strbuf 1.3ms     (1822x)

The speedup factor growing with iteration count confirms the quadratic-vs-linear claim for read-headers and drain-stream. At 2000 iterations (~2MB accumulated), concat takes 2.4 seconds vs 1.3ms for StringBuf — over 1800x.

Run with: carp -x bench/string-accumulate.carp

hellerve

I’m sorry, but this is an absolutely atrocious benchmark. Firstly, it should use the Bench module; secondly, it should not just test string buffer versus string concat, but the actual optimization in context.

hellerve · 2026-05-21T19:03:05Z

Build & Tests

Build fails locally on ARM64 due to a pre-existing tm_zone const-qualifier error in the Carp-generated C code — verified this also fails on main, so it's not introduced by this PR. CI passes on both ubuntu and macOS (though tests use continue-on-error: true, so that's a weak signal). Reviewed the code by reading the full checked-out branch.

Prior feedback

hellerve left two review rounds:

"Did you actually measure the memory and/or time speedup?" was addressed first with a theoretical explanation, then with a benchmark.
CHANGES_REQUESTED: "this is an absolutely atrocious benchmark. Firstly, it should use the Bench module, secondly it should not just test string buffer versus string concat, but the actual optimization in context." — this has NOT been addressed. The current bench/string-accumulate.carp still uses a custom C bench_time.h for timing instead of the Bench module from Carp core, and still tests synthetic String.concat vs StringBuf.append rather than benchmarking the actual read-headers/drain-stream functions with realistic HTTP payloads.

Findings

The underlying StringBuf changes to http-client.carp are correct. Memory management is sound: every StringBuf.create has a matching StringBuf.delete. The read-headers use of StringBuf.str for non-destructive peeking at the \r\n\r\n boundary is good. The drain-stream and body-request refactors are clean.
The benchmark doesn't meet the bar hellerve set. It needs to:
- Use (load "Bench.carp") and the Bench.bench / benchn macros for statistically sound measurement (warmup, stabilization, outlier handling).
- Benchmark the actual optimization in context — e.g., mock a Connection that feeds chunks to read-headers or drain-stream, and time the old vs new implementation with realistic payload sizes.
body-request helper (lines 318–323) is a clean DRY extraction. Properly hidden/private, mechanically correct.

Verdict: revise

The code changes themselves are solid, but hellerve's explicit feedback about the benchmark quality has not been addressed. Fix the benchmark to use the Bench module and test the actual optimization in context before this can merge.

Replace the custom bench_time.h timing with Carp's Bench module. Benchmark the actual read-headers and drain-stream accumulation patterns with 4KB chunks at realistic payload sizes (16KB-256KB) instead of synthetic string concat vs StringBuf comparison.

Results on Pi 500 (aarch64):
- read-headers (16KB, 4 chunks): 2x faster
- drain-stream (16KB, 4 chunks): 11x faster
- drain-stream (64KB, 16 chunks): 37x faster
- drain-stream (256KB, 64 chunks): 136x faster

carpentry-agent · 2026-05-23T01:08:29Z

Addressed @hellerve's feedback ("this is an absolutely atrocious benchmark. Firstly, it should use the Bench module, secondly it should not just test string buffer versus string concat, but the actual optimization in context"):

What changed:

Replaced bench_time.h (custom C timing) with (load "Bench.carp") and Bench.bench for statistically robust measurement (warmup, adaptive iteration counts, standard deviation)
Replaced synthetic "concat N copies of a 1KB string" test with simulations of the actual read-headers and drain-stream accumulation loops — including the \r\n\r\n boundary scan in read-headers
Uses 4KB chunks (typical TCP socket read) at realistic HTTP payload sizes: 16KB, 64KB, and 256KB

Results (Pi 500, aarch64):

Scenario                          concat (before)   StringBuf (after)   Speedup
read-headers: 16KB, 4 chunks      62.5µs            30.9µs              2x
drain-stream: 16KB, 4 chunks      36.6µs            3.4µs               11x
drain-stream: 64KB, 16 chunks     476µs             13.1µs              37x
drain-stream: 256KB, 64 chunks    6.99ms            51.1µs              136x

The speedup growing with payload size confirms the quadratic-to-linear improvement in drain-stream. The read-headers improvement is more modest (2x): both versions still do O(n) work for the \r\n\r\n boundary check (the new one via StringBuf.str), so the win there comes from avoiding intermediate array allocations.

Run with: carp -x bench/string-accumulate.carp
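
The scaling claim can be checked against the reported drain-stream figures with a few lines of arithmetic. The sketch below is Python, uses only the numbers from the results above, and does not re-run anything; the helper name is hypothetical. For each 4x increase in payload, a quadratic routine would be expected to take about 16x longer and a linear one about 4x longer.

```python
# (payload in KB, concat time in µs, StringBuf time in µs), as reported for drain-stream
ROWS = [
    (16, 36.6, 3.4),
    (64, 476.0, 13.1),
    (256, 6990.0, 51.1),
]


def growth_factors(rows):
    """Return (size factor, concat time factor, StringBuf time factor) per step."""
    factors = []
    for (s0, c0, b0), (s1, c1, b1) in zip(rows, rows[1:]):
        factors.append((s1 / s0, c1 / c0, b1 / b0))
    return factors


if __name__ == "__main__":
    for size_f, concat_f, buf_f in growth_factors(ROWS):
        print(f"size x{size_f:.0f}: concat x{concat_f:.1f}, StringBuf x{buf_f:.1f}")
```

The concat time grows by roughly 13x to 15x per step, close to the quadratic expectation, while the StringBuf time grows by a little under 4x, in line with linear growth.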

carpentry-reviewer

Build & Tests

No local Carp compiler to build independently. CI passes on both ubuntu and macOS. Checked out the branch and read all changed code in full.

Prior feedback

Two review rounds from hellerve:

"Did you actually measure the memory and/or time speedup?" was addressed first with a theoretical explanation, then with a benchmark.
CHANGES_REQUESTED: "use the Bench module, test the actual optimization in context" — addressed. The benchmark now loads Bench.carp, uses Bench.bench for statistically robust measurement, and simulates the actual read-headers and drain-stream accumulation loops with realistic HTTP payloads (4KB chunks, header boundary scanning).

Prior carpentry-reviewer round noted:

poll-chunked improvement was overstated (clear+rebuild has similar total work) — acknowledged by the agent, not a correctness issue.
Memory management is correct — confirmed again in this round.
body-request helper is clean — confirmed.

Findings

Benchmark now meets the bar. bench/string-accumulate.carp uses (load "Bench.carp") and Bench.bench for measurement. It simulates the actual optimization in context: read-headers-strbuf mirrors the accumulation loop with \r\n\r\n boundary scanning via StringBuf.str, and drain-strbuf mirrors the body accumulation loop. Uses realistic 4KB chunks at 16KB/64KB/256KB payload sizes. This directly addresses hellerve's feedback.
StringBuf code is correct. Reviewed all three modified functions:
- read-headers (http-client.carp:236-261): StringBuf created once, appends chunks, uses StringBuf.str for non-destructive peek at the boundary, to-string + delete at the end. Clean.
- drain-stream (http-client.carp:292-299): Straightforward create/append/to-string/delete. Clean.
- poll-chunked (http-client.carp:103-140): StringBuf created once outside the while loop, cleared and rebuilt each iteration, deleted after. The to-string calls inside the loop produce owned strings that are properly scoped in let bindings. The delete at line 139 runs regardless of how the while loop exits (break or condition). Correct.
body-request helper (http-client.carp:318-323): Mechanical extraction. hidden + private. All three callers (post, put, patch) pass through unchanged. No behavioral change.
No changelog in this repo, so nothing to update.

Verdict: merge

The code changes are solid — correct memory management, genuine O(n²)→O(n) improvement for read-headers and drain-stream, clean DRY extraction. The benchmark addresses hellerve's feedback directly: uses Bench module, tests the actual optimization in context. CI passes.

hellerve reviewed May 21, 2026

Add benchmark: String.concat loop vs StringBuf.append

f412dcf

hellerve requested changes May 21, 2026

carpentry-reviewer Bot reviewed May 23, 2026

hellerve approved these changes May 23, 2026

hellerve merged commit 1b92fb3 into main May 23, 2026
2 checks passed

carpentry-agent Bot mentioned this pull request May 23, 2026

Publish v0.2.0 #2

Open

carpentry-reviewer Bot mentioned this pull request May 24, 2026

Add HTTP redirect following (301/302/303/307/308) #3

Merged

hellerve merged 3 commits into main from claude/strbuf-and-body-helper
