initial OCI support #2
Conversation
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com> Assisted-by: claude-opus-4.5
📝 Walkthrough
Adds full OCI image flashing support and streaming pipelines: new OCI modules (reference, manifest, auth, registry, from_oci), content-detection and compression utilities, stream bridging, refactored options (FlashOptions/BlockFlashOptions/OciOptions/HttpClientOptions), progress final-stat APIs, Cargo dependency updates, and .gitignore additions.
Sequence Diagram(s)
sequenceDiagram
autonumber
participant CLI as CLI
participant Reg as RegistryClient
participant Auth as Auth module
participant HTTP as Registry HTTP
participant Manifest as Manifest parser
participant Stream as Stream/Detector
participant Decomp as Decompressor
participant Block as BlockWriter
CLI->>Reg: RegistryClient::new(image_ref, OciOptions)
CLI->>Reg: authenticate()
Reg->>HTTP: GET /v2/<repo>/manifests (anonymous)
HTTP-->>Reg: 401 + WWW-Authenticate
Reg->>Auth: parse WWW-Authenticate
Auth->>HTTP: request token (optional Basic)
HTTP-->>Auth: token response
Auth-->>Reg: bearer token
Reg->>HTTP: GET manifest (with Authorization)
HTTP-->>Reg: manifest JSON
Reg->>Manifest: parse & select manifest/layer
Reg->>HTTP: GET /v2/<repo>/blobs/<digest>
HTTP-->>Stream: response stream
Stream->>Stream: pre-buffer for magic detection
alt Raw disk or passthrough compression
Stream->>Block: stream -> write directly
else Tar archive
Stream->>Decomp: decompress (if needed)
Decomp-->>Stream: decompressed tar stream
Stream->>Block: extract matching entry -> write
end
Block-->>CLI: progress updates / final stats
Estimated code review effort: 🎯 5 (Critical) | ⏱️ ~120 minutes
🚥 Pre-merge checks: ✅ 3 passed
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
@coderabbitai review
✅ Actions performed: Review triggered.
Actionable comments posted: 8
🤖 Fix all issues with AI Agents
In @Cargo.toml:
- Around line 12-16: The tar dependency in Cargo.toml permits vulnerable 0.4.x
versions; update the tar version constraint to require at least 0.4.36 (e.g.,
change tar = "0.4" to tar = "0.4.36" or use a semver constraint like ">=
0.4.36") to mitigate CVE-2021-38511, and audit/update any xz2 dependency (and
its linked/liblzma vendoring) to a release that uses liblzma >= 5.8.1 or
otherwise confirm the linked liblzma is patched to mitigate CVE-2025-31115.
Ensure Cargo.toml entries for tar and xz2 are updated accordingly so cargo.lock
will pick the fixed versions.
In @src/fls/oci/from_oci.rs:
- Around line 975-980: There is an extra blank line between the end of the impl
Read for DebugGzReader and the start of the impl<R: Read> DebugReader<R>; remove
the stray blank line so the closing brace of DebugGzReader's Read impl is
immediately followed by the impl<R: Read> DebugReader<R> declaration, then run
rustfmt (cargo fmt) to ensure formatting consistency.
- Around line 760-799: The MagicDetectingReader::peek_bytes method currently
emits unconditional debug output via eprintln!, which is noisy; remove those
eprintln! calls or gate them behind a proper logging mechanism or feature flag.
Replace the raw eprintln! calls in peek_bytes with either calls to the crate
logging macros (e.g., log::debug!) or conditionally compile them behind
cfg!(debug_assertions) or a debug field on MagicDetectingReader, keeping the
logic intact (peek_bytes, self.peeked, self.peek_pos, self.inner remain
unchanged) and ensure no stray eprintln! calls remain (see the sketch after this list).
- Around line 924-927: There is an extra blank line after the end of the
DebugGzReader::new function causing CI formatting failures; remove the stray
blank line so the closing brace of DebugGzReader::new is immediately followed by
the next token/file end as per project formatting rules (ensure no extra newline
between the function's closing brace and the next line).
- Around line 733-736: There is an extra blank line after the matches_pattern
function causing a formatting failure; remove the stray empty line immediately
following the end of the matches_pattern function (the trailing blank between
the function closing brace and the next token/EOF) so the file matches the
project's formatter/CI expectations.
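For the peek_bytes prompt above, a minimal sketch of the gated logging (a simplified stand-in for MagicDetectingReader with only the fields the prompt names; the cfg!(debug_assertions) route is one of the two suggested options):

use std::io::Read;

struct MagicDetectingReader<R: Read> {
    inner: R,
    peeked: Vec<u8>,
}

impl<R: Read> MagicDetectingReader<R> {
    /// Pre-buffer up to `n` bytes so magic numbers can be inspected
    /// without consuming the stream.
    fn peek_bytes(&mut self, n: usize) -> std::io::Result<&[u8]> {
        while self.peeked.len() < n {
            let mut chunk = [0u8; 4096];
            let read = self.inner.read(&mut chunk)?;
            if read == 0 {
                break; // EOF before n bytes were available
            }
            self.peeked.extend_from_slice(&chunk[..read]);
        }
        // Debug output is gated rather than printed unconditionally.
        if cfg!(debug_assertions) {
            eprintln!("[DEBUG] peeked {} bytes", self.peeked.len());
        }
        Ok(&self.peeked[..self.peeked.len().min(n)])
    }
}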
In @src/fls/oci/reference.rs:
- Around line 109-139: split_name_and_tag currently treats any numeric-only
suffix ≤5 digits as a port, causing valid numeric tags like "myrepo:12345" to be
misparsed; update the heuristic in split_name_and_tag: only consider an
all-digit short suffix a registry port when the authority/hostname portion (the
segment left of the colon up to the previous '/') looks like a host (contains a
dot '.' or is "localhost" or is an IP literal), otherwise treat the suffix as a
tag; add a unit test that asserts "myrepo:12345" yields tag "12345" and another
test confirming "registry:5000/repo" yields no tag.
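A sketch of that heuristic under the prompt's assumptions (helper names mirror the prompt; this is illustrative, not the PR's exact code):

/// True when the part left of the colon plausibly names a registry host.
fn looks_like_hostname(authority: &str) -> bool {
    authority == "localhost"
        || authority.contains('.')
        || authority.parse::<std::net::IpAddr>().is_ok()
}

/// Split "name:suffix", treating an all-digit suffix as a port only
/// when the authority looks like a host; otherwise it is a tag.
fn split_name_and_tag(input: &str) -> (&str, Option<&str>) {
    if let Some((name, suffix)) = input.rsplit_once(':') {
        if !suffix.contains('/') {
            let is_port = suffix.len() <= 5
                && suffix.chars().all(|c| c.is_ascii_digit())
                && looks_like_hostname(name);
            return if is_port { (input, None) } else { (name, Some(suffix)) };
        }
    }
    (input, None)
}

#[cfg(test)]
mod tests {
    use super::*;
    #[test]
    fn numeric_tag_is_a_tag() {
        assert_eq!(split_name_and_tag("myrepo:12345"), ("myrepo", Some("12345")));
    }
    #[test]
    fn registry_port_is_not_a_tag() {
        assert_eq!(split_name_and_tag("registry:5000/repo"), ("registry:5000/repo", None));
    }
}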
🧹 Nitpick comments (4)
src/main.rs (1)
146-156: Consider extracting duplicate exit handling logic. Both the OCI and HTTP paths have identical success/error handling with FLASH_COMPLETED/FLASH_FAILED output and exit codes. While functional, this could be extracted into a helper function.
🔎 Suggested refactor:
fn handle_flash_result(result: Result<(), Box<dyn std::error::Error>>) {
    match result {
        Ok(_) => {
            println!("Result: FLASH_COMPLETED");
            std::process::exit(0);
        }
        Err(e) => {
            eprintln!("Error: {}", e);
            println!("Result: FLASH_FAILED");
            std::process::exit(1);
        }
    }
}
src/fls/oci/auth.rs (1)
163-182: Consider using the urlencoding crate instead of the custom implementation. The custom urlencoding_encode function only handles a subset of characters. While it covers common cases for OCI scope strings, it may miss edge cases. The urlencoding crate provides complete RFC 3986 compliant encoding.
#!/bin/bash
# Check if urlencoding crate is already a dependency
rg -n 'urlencoding' Cargo.toml
src/fls/oci/registry.rs (1)
80-86: Header case lookup is redundant. HTTP headers are case-insensitive per RFC 7230, and reqwest normalizes header names. The dual lookup for www-authenticate and WWW-Authenticate is unnecessary.
🔎 Simplified header lookup:
- let www_auth_header = response
-     .headers()
-     .get("www-authenticate")
-     .or_else(|| response.headers().get("WWW-Authenticate"))
-     .ok_or("No WWW-Authenticate header in 401 response")?
+ let www_auth_header = response
+     .headers()
+     .get("www-authenticate")
+     .ok_or("No WWW-Authenticate header in 401 response")?
src/fls/oci/manifest.rs (1)
132-151: Multi-layer handling returns last layer - document assumption. For multi-layer images, get_single_layer returns the last layer with the comment "which typically contains the actual content". This is a reasonable heuristic for disk images but may not hold for all OCI artifacts. Consider adding a warning log when multiple layers exist, or documenting this behavior in the function's docstring.
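One shape the suggested warning could take - a sketch against hypothetical Layer/ImageManifest types, not the PR's exact ones:

struct Layer {
    digest: String,
}

struct ImageManifest {
    layers: Vec<Layer>,
}

impl ImageManifest {
    /// Select the layer to flash. Assumption (as stated in the review):
    /// for multi-layer images the last layer typically holds the content.
    fn get_single_layer(&self) -> Result<&Layer, String> {
        match self.layers.as_slice() {
            [] => Err("manifest has no layers".to_string()),
            [only] => Ok(only),
            all => {
                eprintln!(
                    "Warning: manifest has {} layers; assuming the last one holds the disk image",
                    all.len()
                );
                Ok(all.last().expect("slice is non-empty"))
            }
        }
    }
}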
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)
Cargo.lock is excluded by !**/*.lock
📒 Files selected for processing (13)
.gitignore, Cargo.toml, src/fls/block_writer.rs, src/fls/from_url.rs, src/fls/http.rs, src/fls/mod.rs, src/fls/oci/auth.rs, src/fls/oci/from_oci.rs, src/fls/oci/manifest.rs, src/fls/oci/mod.rs, src/fls/oci/reference.rs, src/fls/oci/registry.rs, src/main.rs
🧰 Additional context used
🧬 Code graph analysis (7)
src/fls/mod.rs (1)
src/fls/oci/from_oci.rs (1)
flash_from_oci(37-491)
src/fls/oci/mod.rs (1)
src/fls/oci/from_oci.rs (1)
flash_from_oci(37-491)
src/fls/oci/from_oci.rs (6)
src/fls/from_url.rs (5)
mpsc(27-27), mpsc(28-28), mpsc(29-29), mpsc(112-112), mpsc(116-116)
src/fls/decompress.rs (2)
spawn_stderr_reader(52-69), start_decompressor_process(33-50)
src/fls/error_handling.rs (1)
process_error_messages(3-7)
src/fls/oci/manifest.rs (2)
parse(103-130), compression(196-204)
src/fls/oci/reference.rs (1)
parse(55-83)
src/fls/progress.rs (4)
download_time_formatted(38-40), decompress_time_formatted(43-45), write_time_formatted(48-50), total_time_formatted(53-55)
src/fls/oci/auth.rs (1)
src/fls/oci/registry.rs (1)
new(25-56)
src/main.rs (3)
src/fls/oci/registry.rs (1)
image_ref(254-256)
src/fls/oci/from_oci.rs (1)
flash_from_oci(37-491)
src/fls/from_url.rs (1)
flash_from_url(13-621)
src/fls/from_url.rs (1)
src/fls/http.rs (1)
start_download(77-168)
src/fls/oci/registry.rs (4)
src/fls/oci/auth.rs (3)
request_token(201-245), new(20-28), parse(55-74)
src/fls/http.rs (1)
setup_http_client(6-75)
src/fls/oci/manifest.rs (1)
parse(103-130)
src/fls/oci/reference.rs (4)
parse(55-83), registry_url(99-106), api_repository(86-88), reference_string(91-96)
🪛 GitHub Actions: CI
src/fls/oci/from_oci.rs
[error] 733-733: cargo fmt -- --check found formatting differences in from_oci.rs; run 'cargo fmt' to format.
[error] 924-924: cargo fmt -- --check found formatting differences in from_oci.rs; run 'cargo fmt' to format.
[error] 975-975: cargo fmt -- --check found formatting differences in from_oci.rs; run 'cargo fmt' to format.
🔇 Additional comments (31)
.gitignore (1)
1-21: LGTM! Standard Rust .gitignore with appropriate entries for cargo build outputs, rustfmt backups, MSVC debug files, mutation testing artifacts, and IDE configuration. The commented .idea/ entry with guidance to use a global gitignore is a good practice.
Cargo.toml (3)
7-8: Dependencies updated appropriately for OCI support. The env feature for clap enables environment variable support, and the json feature for reqwest is necessary for OCI registry API interactions.
23-23: xz2 correctly promoted to production dependency. This aligns with the XZ decompression support needed for OCI layers.
32-36: TLS testing dependencies are appropriate. The tokio-rustls, rustls, rustls-pemfile, and rcgen additions enable proper HTTPS/TLS testing for registry interactions.
src/fls/http.rs (1)
47-50: LGTM! Formatting changes only - the debug output statements are reformatted for better readability with no functional changes.
Also applies to: 128-132
src/fls/from_url.rs (1)
162-170: LGTM! Formatting change only - the start_download call is reformatted across multiple lines for better readability with no functional changes.
src/fls/mod.rs (2)
7-7: Appropriate visibility for internal HTTP module. Making the http module pub(crate) allows other modules within the crate (like the new OCI module) to reuse HTTP client functionality while keeping it internal to the crate.
9-9: Clean public API surface for OCI support. The new oci module is properly exposed as public, and the re-exports of flash_from_oci and OciOptions follow the existing pattern established for flash_from_url and BlockFlashOptions. Also applies to: 15-15
src/fls/oci/mod.rs (1)
1-14: Well-structured OCI module with clear separation of concerns. The module organization is clean:
- auth: Authentication handling
- from_oci: Main flash functionality
- manifest: OCI manifest parsing
- reference: Image reference parsing
- registry: Registry client
The public API surface (flash_from_oci, ImageReference, OciOptions) is appropriately minimal.
src/fls/block_writer.rs (3)
205-211: Good defensive improvement for macOS device detection. Gracefully handling missing files by defaulting to (false, false) is the correct approach - the file may not exist yet when flashing to a regular file path. This prevents potential panics when metadata retrieval fails.
218-218: Corrected debug output for device type. Using !is_device to display "Regular file" status is semantically clearer and aligns with the updated device classification logic.
187-187: LGTM! Formatting changes for improved readability - OpenOptions chains and multi-line error messages are reformatted with no functional changes.
Also applies to: 223-224, 227-229, 285-285, 476-481, 484-487
src/main.rs (2)
59-67: OCI authentication options look good. The new CLI arguments for OCI authentication follow secure patterns:
- Password can be provided via environment variable (FLS_REGISTRY_PASSWORD), which is preferred over CLI args
- Optional file pattern for selecting disk images from tar archives
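For illustration, a hedged sketch of how such an argument is typically declared with clap's derive API and the env feature (field names here are assumptions, not necessarily the PR's):

use clap::Parser;

#[derive(Parser)]
struct Cli {
    /// Registry username for authenticated pulls.
    #[arg(long)]
    registry_username: Option<String>,

    /// Registry password; read from the environment so it does not
    /// appear in shell history or `ps` output.
    #[arg(long, env = "FLS_REGISTRY_PASSWORD", hide_env_values = true)]
    registry_password: Option<String>,
}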
102-128: OCI URL handling and options setup is well-structured. The scheme detection and prefix stripping are correct. The unwrap() on line 107 is safe since is_oci guarantees one of the prefixes matched. The informational output provides good visibility into the operation configuration.
src/fls/oci/from_oci.rs (3)
36-93: Well-structured OCI flash entry point. The function properly:
- Parses image references
- Creates authenticated registry client
- Handles multi-platform manifest indexes
- Validates compression support
- Routes to appropriate handler based on content type
600-656: Content and compression detection logic is sound. The detection function properly:
- Handles gzip magic bytes (0x1f 0x8b)
- Handles XZ magic bytes (0xfd 0x37 0x7a 0x58)
- Falls back gracefully when decompression sampling fails
- Correctly identifies tar archives via ustar/posix magic at offset 257
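As a concrete illustration of those checks (a sketch, not the crate's code; detect_compression and is_tar_archive are hypothetical names):

#[derive(Debug, PartialEq)]
enum Compression {
    Gzip,
    Xz,
    None,
}

/// Inspect leading magic bytes of a pre-buffered sample.
fn detect_compression(buf: &[u8]) -> Compression {
    if buf.starts_with(&[0x1f, 0x8b]) {
        Compression::Gzip
    } else if buf.starts_with(&[0xfd, 0x37, 0x7a, 0x58]) {
        Compression::Xz
    } else {
        Compression::None
    }
}

/// Tar archives carry "ustar" at offset 257 (POSIX and GNU variants),
/// so 262 bytes are enough to check the signature.
fn is_tar_archive(buf: &[u8]) -> bool {
    buf.len() >= 262 && &buf[257..262] == b"ustar"
}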
989-1010: This concern is not applicable. The DebugReader struct is already private (no pub keyword at line 846) and is only instantiated when debug is true (lines 546-550). The unconditional eprintln! calls in the Read implementation are appropriate since the struct only exists in debug mode.
src/fls/oci/auth.rs (3)
107-161: Auth parameter parsing handles quoted values correctly. The state machine parser properly handles:
- Quoted values with embedded commas
- Unquoted values
- Whitespace between parameters
- Standard Docker/OCI WWW-Authenticate header formats
200-245: Token request implementation is correct. It properly:
- Builds token URL with scope
- Applies basic auth when credentials provided
- Handles both token and access_token response fields (registry compatibility)
- Returns clear error messages
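A sketch of how both response shapes can be accepted with serde (TokenResponse is a hypothetical struct; serde is already a dependency per the review above):

use serde::Deserialize;

#[derive(Deserialize)]
struct TokenResponse {
    /// Field used by most OCI registries.
    token: Option<String>,
    /// Some registries (OAuth2-style) return `access_token` instead.
    access_token: Option<String>,
}

impl TokenResponse {
    fn bearer(self) -> Result<String, String> {
        self.token
            .or(self.access_token)
            .ok_or_else(|| "token response had neither `token` nor `access_token`".to_string())
    }
}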
247-289: Good test coverage for authentication module. Tests cover the key parsing scenarios including full headers, minimal headers, token URL construction, and basic auth encoding.
src/fls/oci/reference.rs (3)
141-167: Registry detection heuristics are reasonable. The logic correctly identifies registries by:
- Presence of dots (e.g., ghcr.io)
- Presence of colons (ports)
- Literal localhost
This covers standard OCI registry patterns and defaults to Docker Hub appropriately.
98-107: Docker Hub registry URL translation is correct. The registry_url method properly translates docker.io to registry-1.docker.io, which is required for Docker Hub API access.
169-217: Comprehensive unit tests for reference parsing. Tests cover all major formats: full references, Docker Hub library images, namespaced images, ported registries, digests, and default tags.
src/fls/oci/registry.rs (4)
29-45: HTTP client initialization reuses BlockFlashOptions correctly. The client setup properly maps OciOptions to BlockFlashOptions, reusing the existing TLS and connection pool configuration. The hardcoded defaults for retry and other HTTP-specific options are appropriate since OCI layer downloads may have different retry semantics.
58-112: Authentication flow correctly implements the OCI distribution spec. The authenticate method properly:
- Checks for anonymous access first
- Parses WWW-Authenticate on 401
- Requests bearer token via auth service
- Stores token for subsequent requests
114-123: Auth header priority is correct. Bearer token takes priority over basic auth, which aligns with the OCI distribution spec where bearer tokens are the primary authentication mechanism after initial token exchange.
259-295: OciOptions struct is well-designed with sensible defaults. The struct captures all necessary configuration for OCI operations. The Default implementation provides reasonable values that match common use cases.
src/fls/oci/manifest.rs (4)
108-130: Manifest parsing order may misidentify edge cases. The parsing logic first tries to parse as ImageIndex, checking if manifests is non-empty. An ImageManifest might accidentally parse as an empty ImageIndex if the JSON structure allows it, though serde's strictness likely prevents this. The current approach is pragmatic and handles common cases. Consider whether strict media-type based parsing should be prioritized when content-type is available.
175-181: Platform preference documented but may need configuration. find_linux_manifest prefers arm64 over amd64. While documented as "common for embedded", some users may need the opposite preference. Consider making this configurable via OciOptions in the future.
183-205: Layer compression detection is correct. It properly identifies gzip (OCI and Docker formats) and zstd layers based on media types.
215-277: Good test coverage for manifest parsing. Tests validate parsing of both image manifests and indexes, including platform lookup functionality.
serde = { version = "1.0", features = ["derive"] }
serde_json = "1.0"
base64 = "0.22"
tar = "0.4"
flate2 = "1.0"
🧩 Analysis chain
🏁 Script executed:
#!/bin/bash
# First, let's examine the Cargo.toml file to verify the dependencies
if [ -f "Cargo.toml" ]; then
echo "=== Full Cargo.toml ==="
cat -n Cargo.toml
else
echo "Cargo.toml not found"
find . -name "Cargo.toml" -type f | head -5
fi
Repository: jumpstarter-dev/fls
Length of output: 1693
🌐 Web query:
security vulnerabilities serde 1.0 rust crate CVE
💡 Result:
Here are the publicly disclosed security issues (CVEs/advisories) related to Serde and Serde-related crates that have affected the 1.0-era ecosystem:
- CVE-2019-25001 — serde_cbor: CBOR deserializer allowed deeply nested semantic tags causing stack consumption / possible DoS; fixed in serde_cbor 0.10.2. [1][2]
- CVE-2021-45687 (RUSTSEC-2021-0089) — raw-cpuid: when the non-default serialize feature is enabled, generated serde::Deserialize impls lacked validation (memory-corruption / panics); fixed in raw-cpuid >= 9.1.1. [3][4]
- Cargo name confusion (CVE-2019-16760) — not a serde code bug but a Cargo behavior issue around the package renaming feature that could cause downloading the wrong crate (attacker-squatting names like serde1); affects builds using older Cargo/Rust. Upgrade Cargo/Rust to eliminate the issue. [5]
- CVE-2024-58264 — serde-json-wasm: stack-consumption via deeply nested JSON in versions before 1.0.1 (upgrade to >= 1.0.1). [6][7]
Recommended action: audit your dependency tree for crates that depend on serde (and serde-* formats), and upgrade affected crates to the fixed versions above (or to the latest releases). For crates you maintain, avoid enabling non-default features that introduce unchecked Deserialize implementations until you’ve reviewed input validation. Use cargo audit / cargo-deny and monitor RustSec/NVD advisories for ongoing updates.
Sources:
- NVD — CVE-2019-25001 (serde_cbor).
- OSV / RustSec — RUSTSEC-2019-0025 (serde_cbor).
- NVD — CVE-2021-45687.
- RustSec advisory RUSTSEC-2021-0089 (raw-cpuid).
- RustSec / CVE-2019-16760 (Cargo package rename issue).
- Wiz / vulnerability data for CVE-2024-58264 (serde-json-wasm).
- RustSec / GitHub advisory entries for serde-json-wasm.
🌐 Web query:
security vulnerabilities base64 0.22 rust crate
💡 Result:
Short answer: there are no public RustSec/CVE advisories specific to base64 crate v0.22.x. The notable security issue for the crate is an older integer-overflow / heap-buffer-overflow advisory (RUSTSEC-2017-0004 / CVE-2017-1000430) that affected much earlier versions and was fixed long ago; v0.22.x is not listed as affected. See sources below.
Sources:
- RUSTSEC/CVE advisory for the historical base64 bug (RUSTSEC-2017-0004 / CVE-2017-1000430). [1]
- crate metadata / release docs for base64 0.22.x (no listed advisory for 0.22). [2]
- Related but separate: a 2024 advisory about non-constant-time base64 use in the vodozemac crate (not the base64 crate itself). [3]
References:
[1] rustsec / OSV RUSTSEC-2017-0004 (CVE-2017-1000430).
[2] base64 0.22.0 crate info (deps.rs / docs.rs).
[3] RUSTSEC-2024-0354 (vodozemac: non-constant-time base64 usage).
🌐 Web query:
security vulnerabilities tar 0.4 flate2 1.0 rust crate CVE
💡 Result:
Short answer:
- The tar crate has a known CVE: CVE-2021-38511 — directory traversal via symlink extraction affecting tar versions before 0.4.36 (update to >= 0.4.36). [1][2]
- I found no published CVE for the flate2 1.0 series. Check RustSec / cargo-audit for any advisory affecting your exact flate2 version and run cargo-audit on your Cargo.lock. [2][3]
Sources:
- CVE-2021-38511 details (tar before 0.4.36). [CVE/NVD summary]. [1]
- RustSec / advisory database & guidance (use cargo-audit). [2]
- flate2 crate / repo (no CVE listed in RustSec search results). [3]
🌐 Web query:
security vulnerabilities xz2 0.1 rust crate
💡 Result:
Short answer: yes — xz2 (0.1.x) is a wrapper around liblzma, so security issues in liblzma/xz-utils affect it. Two high‑impact incidents to be aware of:
- March–April 2024 supply‑chain/backdoor incident (malicious code in xz/liblzma distribution). Affected downstream packages and bindings; advisories recommend using fixed/clean upstream sources. [1][2]
- CVE‑2025‑31115 (disclosed Apr 3, 2025): a use‑after‑free / NULL‑write bug in the multithreaded liblzma .xz decoder (lzma_stream_decoder_mt) affecting XZ Utils 5.3.3alpha–5.8.0; fixed in 5.8.1. Any crate that links the affected liblzma (including xz2) is potentially impacted if it uses the multithreaded decoder. [3][4]
What to do:
- If you use xz2, ensure the underlying liblzma is a patched/clean version (liblzma / xz-utils ≥ fixed releases) and/or upgrade xz2 to a release built against a fixed liblzma. Verify your distro/package vendor’s liblzma version or rebuild linking a safe liblzma. [3][4][1]
Sources:
[1] Snyk advisory on embedded malicious code in liblzma / xz (disclosed Mar 29, 2024).
[2] Tukaani (xz) backdoor / incident pages and timeline.
[3] Tukaani advisory for CVE‑2025‑31115 (Apr 3, 2025).
[4] NVD entry for CVE‑2025‑31115.
Fix tar dependency version constraint to address CVE-2021-38511.
The tar 0.4 dependency allows versions below 0.4.36, which are vulnerable to directory traversal via symlink extraction (CVE-2021-38511), directly relevant to the stated layer extraction functionality. Update to tar = "0.4.36" (or, with features, tar = { version = "0.4.36", features = [...] }).
Additionally, xz2 0.1 wraps liblzma which has CVE-2025-31115 (use-after-free in multithreaded decoder affecting liblzma versions before 5.8.1). Verify the vendored/linked liblzma version is patched, or upgrade xz2 if a fixed release is available.
The other dependencies (serde, serde_json, base64, flate2) have no known public vulnerabilities in the specified versions.
🤖 Prompt for AI Agents
In @Cargo.toml around lines 12 - 16, The tar dependency in Cargo.toml permits
vulnerable 0.4.x versions; update the tar version constraint to require at least
0.4.36 (e.g., change tar = "0.4" to tar = "0.4.36" or use a semver constraint
like ">= 0.4.36") to mitigate CVE-2021-38511, and audit/update any xz2
dependency (and its linked/liblzma vendoring) to a release that uses liblzma >=
5.8.1 or otherwise confirm the linked liblzma is patched to mitigate
CVE-2025-31115. Ensure Cargo.toml entries for tar and xz2 are updated
accordingly so cargo.lock will pick the fixed versions.
Actionable comments posted: 2
🤖 Fix all issues with AI Agents
In @src/fls/oci/from_oci.rs:
- Around line 1147-1159: The unsafe duplication of the file descriptor must
validate libc::dup and the non-unix branch must not abort compilation: after
calling libc::dup(raw_fd) check if dup_fd == -1 and convert errno into a proper
error (e.g., via std::io::Error::last_os_error() or your crate's error type) and
return that error instead of proceeding to FromRawFd; only call
std::fs::File::from_raw_fd(dup_fd) when dup succeeded. Replace the
#[cfg(not(unix))] compile_error! branch with a runtime path that either returns
a clear UnsupportedPlatform/NotImplemented error from the current function or
implements the synchronous-channel fallback (using the same public symbols:
decompressor_stdin and stdin_fd) so non-unix platforms fail gracefully at
runtime rather than during compile (see the sketch after this list).
- Around line 1044-1051: The unconditional debug eprintln inside ChannelReader
(the static FIRST_READ.once block) produces noise; remove that eprintln or gate
it behind a proper debug/log mechanism—replace the eprintln! call in the
FIRST_READ closure with a log::debug! (or cfg!(debug_assertions) conditional) or
pass a debug flag into ChannelReader so the message only prints when enabled;
update the FIRST_READ closure referenced in ChannelReader to use log::debug! (or
wrap the eprintln in if cfg!(debug_assertions) { ... }) and ensure the log crate
is initialized or the debug flag is propagated to avoid unconditional output.
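For the fd-duplication prompt above, a sketch of the checked dup under those assumptions (unix gating as described; the error type is simplified to io::Error, and duplicate_fd is a hypothetical helper name):

#[cfg(unix)]
fn duplicate_fd(raw_fd: std::os::unix::io::RawFd) -> std::io::Result<std::fs::File> {
    use std::os::unix::io::FromRawFd;

    // SAFETY: dup returns a new descriptor we exclusively own, or -1.
    let dup_fd = unsafe { libc::dup(raw_fd) };
    if dup_fd == -1 {
        // Convert errno into a proper error instead of proceeding.
        return Err(std::io::Error::last_os_error());
    }
    // SAFETY: dup_fd is valid and becomes owned by the new File.
    Ok(unsafe { std::fs::File::from_raw_fd(dup_fd) })
}

#[cfg(not(unix))]
fn duplicate_fd(_raw_fd: i32) -> std::io::Result<std::fs::File> {
    // Fail at runtime instead of aborting compilation with compile_error!.
    Err(std::io::Error::new(
        std::io::ErrorKind::Unsupported,
        "XZ streaming decompression is not supported on this platform",
    ))
}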
🧹 Nitpick comments (3)
src/fls/oci/from_oci.rs (2)
722-734: Limited glob pattern support. The matches_pattern function only supports patterns with * at the start or end (e.g., *.img or disk*). Patterns like disk*.img or *disk* won't work as expected. Consider documenting this limitation or extending support.
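A sketch reconstructing the described behavior, with the limitation documented (a hypothetical reconstruction, not the PR's exact code):

/// Simple glob matching: only `*.ext` (suffix), `prefix*` (prefix),
/// and exact names are supported; `disk*.img` or `*disk*` are not.
fn matches_pattern(name: &str, pattern: &str) -> bool {
    if let Some(suffix) = pattern.strip_prefix('*') {
        name.ends_with(suffix)
    } else if let Some(prefix) = pattern.strip_suffix('*') {
        name.starts_with(prefix)
    } else {
        name == pattern
    }
}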
872-891: Hardcoded debug file path may cause conflicts. TeeReader writes to /tmp/debug_gzip_data.gz, which could conflict with concurrent runs or leave debug artifacts. Consider using a unique filename (e.g., with timestamp or PID) or making the path configurable.
src/fls/oci/reference.rs (1)
64-67: Digest format validation is minimal. The digest validation only checks for the sha256: or sha512: prefix but doesn't validate the hash format (e.g., sha256:abc123 passes even though it's not a valid 64-character hex string). Consider whether stricter validation is needed, or document that callers should expect potential registry errors for malformed digests.
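If stricter checking were wanted, a sketch of what it could enforce, assuming the OCI image spec's digest grammar (lowercase hex; 64 chars for sha256, 128 for sha512):

fn validate_digest(digest: &str) -> Result<(), String> {
    let (algo, hex) = digest
        .split_once(':')
        .ok_or_else(|| format!("invalid digest, expected algo:hex: {digest}"))?;
    let expected_len = match algo {
        "sha256" => 64,
        "sha512" => 128,
        other => return Err(format!("unsupported digest algorithm: {other}")),
    };
    if hex.len() != expected_len {
        return Err(format!("expected {expected_len} hex chars, got {}", hex.len()));
    }
    if !hex.chars().all(|c| c.is_ascii_hexdigit() && !c.is_ascii_uppercase()) {
        return Err("digest must be lowercase hexadecimal".to_string());
    }
    Ok(())
}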
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
src/fls/oci/from_oci.rs, src/fls/oci/reference.rs
🧰 Additional context used
🧬 Code graph analysis (1)
src/fls/oci/from_oci.rs (9)
src/fls/block_writer.rs (5)
mpsc(458-458), new(40-52), new(93-312), new(437-504), bytes_written(424-426)
src/fls/from_url.rs (5)
mpsc(27-27), mpsc(28-28), mpsc(29-29), mpsc(112-112), mpsc(116-116)
src/fls/decompress.rs (2)
spawn_stderr_reader(52-69), start_decompressor_process(33-50)
src/fls/error_handling.rs (1)
process_error_messages(3-7)
src/fls/oci/registry.rs (2)
image_ref(254-256), new(25-56)
src/fls/oci/reference.rs (1)
parse(55-83)
src/fls/oci/manifest.rs (2)
parse(103-130), compression(196-204)
src/fls/oci/auth.rs (2)
parse(55-74), new(20-28)
src/fls/progress.rs (4)
download_time_formatted(38-40), decompress_time_formatted(43-45), write_time_formatted(48-50), total_time_formatted(53-55)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (6)
src/fls/oci/from_oci.rs (3)
147-160: Verify XZ-compressed tar handling logic. XZ-compressed layers are bypassed for tar extraction and sent directly to flash_raw_disk_image_directly. This assumes XZ-compressed layers are always raw disk images, not tar archives containing disk images. If an XZ-compressed tar archive contains a disk image file that needs extraction, this path would fail silently by writing the wrong content to the device. Consider whether the detection logic should decompress XZ samples to verify content type, similar to the gzip handling at lines 631-640.
37-491: LGTM - Well-structured OCI flash pipeline. The flash_from_oci function implements a comprehensive streaming pipeline with proper:
- Authentication flow and manifest handling
- Content and compression detection
- Parallel processing with channels for download, extraction, decompression, and writing
- Progress tracking and error handling
- Clean shutdown sequence
The previous concern about unconditional debug logging in MagicDetectingReader has been addressed - debug output is now properly gated behind self.debug checks.
61-63: No action required. The error message accurately describes the platform selection logic in find_linux_manifest(). The function searches for "linux/arm64" first, then "linux/amd64", which matches the error message "No linux/arm64 or linux/amd64 manifest found".
src/fls/oci/reference.rs (3)
109-166: Good fix for numeric tag handling. The looks_like_hostname helper correctly addresses the previously flagged issue where numeric-only tags were misclassified as port numbers. The heuristic now properly distinguishes:
- myrepo:12345 → tag "12345" (myrepo doesn't look like a hostname)
- registry.local:5000/repo → port 5000 (registry.local contains a dot)
- localhost:5000/repo → port 5000 (localhost is explicitly handled)
1-194: LGTM - Clean image reference parsing implementation. The module provides well-structured OCI image reference parsing with:
- Clear type definitions for ImageReference and Reference
- Proper handling of Docker Hub defaults
- Good edge case handling for registry ports vs. tags (the previous issue has been addressed)
- Comprehensive test coverage
254-261: No issue found. The test test_parse_registry_port_no_tag correctly verifies that "registry:5000/repo" is parsed as registry="registry:5000" and repository="repo" with no tag (defaulting to "latest"). While the colon-slash rule at lines 141-143 prevents "5000/repo" from being treated as a tag, this is the correct and intended behavior. Additionally, when the name is parsed by parse_registry_and_repo, "registry:5000" is correctly identified as a registry due to containing a colon (line 182), which further validates the parsing. The test name accurately reflects what it tests: that a registry with a port number is correctly parsed without extracting a tag.
Actionable comments posted: 2
🤖 Fix all issues with AI Agents
In @src/fls/oci/from_oci.rs:
- Line 524: TeeReader currently writes debug output to a hardcoded predictable
path "/tmp/debug_gzip_data.gz" which is unsafe; replace that with a securely
created temporary file using the tempfile crate (e.g., create a
tempfile::NamedTempFile and use its path) and pass that temporary path to
TeeReader (or adapt TeeReader to accept a File/Write handle) to avoid
race/symlink attacks and ensure proper cleanup and permissions; update the code
around the magic_reader/TeeReader instantiation to use the NamedTempFile and
handle any ownership/return of the temp file so it isn't left insecurely
accessible.
- Around line 991-1021: DebugReader currently always prints to stderr; add a
boolean field debug to the DebugReader struct, update its constructor
(DebugReader::new) to accept a debug: bool (or provide a new_with_debug) and set
that field, then wrap every eprintln! in the Read impl in an if self.debug { ...
} so logging only happens when enabled; finally update the places that construct
DebugReader (where DebugReader::new(...) is called) to pass the appropriate
debug flag from the surrounding context (e.g., a config or existing debug
variable) so behavior matches MagicDetectingReader.
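A sketch of that change (names follow the prompt; simplified to the Read impl only):

use std::io::Read;

/// Wrapper that traces reads only when `debug` is set.
struct DebugReader<R: Read> {
    inner: R,
    debug: bool,
}

impl<R: Read> DebugReader<R> {
    fn new(inner: R, debug: bool) -> Self {
        Self { inner, debug }
    }
}

impl<R: Read> Read for DebugReader<R> {
    fn read(&mut self, buf: &mut [u8]) -> std::io::Result<usize> {
        let n = self.inner.read(buf)?;
        if self.debug {
            eprintln!("[DEBUG] DebugReader: read {} bytes", n);
        }
        Ok(n)
    }
}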
🧹 Nitpick comments (2)
src/fls/oci/from_oci.rs (2)
200-224: Simplify decompressor selection logic. The nested match/if statements for selecting the decompressor hint are complex and difficult to follow. The logic essentially maps (content_type, layer_compression, content_compression) to a decompressor hint string.
Consider extracting this to a separate helper function that returns the appropriate file pattern based on the detection results, improving readability and testability.
🔎 Example refactor:
fn determine_decompressor_hint(
    content_type: ContentType,
    layer_compression: LayerCompression,
    content_compression: CompressionType,
    file_pattern: Option<&str>,
    debug: bool,
) -> &str {
    match (content_type, layer_compression, content_compression) {
        (ContentType::TarArchive, LayerCompression::Gzip | LayerCompression::Zstd, _) => {
            if debug {
                eprintln!("[DEBUG] Layer is compressed (manifest), using 'cat' for extracted files");
            }
            "disk.img"
        }
        (ContentType::TarArchive, LayerCompression::None, CompressionType::Gzip) => {
            if debug {
                eprintln!("[DEBUG] Layer is gzip compressed (content detection), using 'cat' for extracted files");
            }
            "disk.img"
        }
        (ContentType::TarArchive, LayerCompression::None, CompressionType::Xz) => {
            if debug {
                eprintln!("[DEBUG] Layer is XZ compressed (content detection), using external 'xzcat' for the main pipeline");
            }
            "disk.img.xz"
        }
        _ => file_pattern.unwrap_or("disk.img.xz"),
    }
}
Then replace lines 200-224 with:
let initial_decompressor_hint = determine_decompressor_hint(
    content_type,
    compression,
    compression_type,
    options.file_pattern.as_deref(),
    options.debug,
);
1233-1233: Using Handle::current() in spawn_blocking may not work as expected. At line 1233, the code calls tokio::runtime::Handle::current() inside a spawn_blocking task. While this works if there's an active runtime, it's more idiomatic to pass the handle from outside the blocking task or use the handle's block_on method. However, since the code is already inside a tokio async context (called from flash_raw_disk_image_directly, which is async), this should work. The pattern is acceptable but could be clearer.
🔎 Alternative approach
Capture the handle before entering the blocking task:
+ let rt_handle = tokio::runtime::Handle::current();
  let reader_handle = tokio::task::spawn_blocking(move || {
-     // Create a runtime handle to send to the async channel
-     let rt = tokio::runtime::Handle::current();
      let reader = ChannelReader::new(http_rx);
      // Apply in-process gzip decompression if needed
      let processed_reader: Box<dyn std::io::Read + Send> = match compression_type {
          CompressionType::Gzip => {
              if debug {
                  eprintln!("[DEBUG] Applying in-process gzip decompression");
              }
              Box::new(flate2::read::GzDecoder::new(reader))
          }
          _ => Box::new(reader),
      };
      let mut reader = processed_reader;
      let mut buffer = vec![0u8; 1024 * 1024]; // 1MB chunks
      loop {
          match reader.read(&mut buffer) {
              Ok(0) => break, // EOF
              Ok(n) => {
                  // Send to async channel using block_on
-                 if rt.block_on(data_tx.send(buffer[..n].to_vec())).is_err() {
+                 if rt_handle.block_on(data_tx.send(buffer[..n].to_vec())).is_err() {
                      return Err("Data channel closed".to_string());
                  }
              }
              Err(e) => return Err(format!("Error reading/decompressing: {}", e)),
          }
      }
      Ok::<(), String>(())
  });
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
src/fls/oci/from_oci.rs, src/fls/oci/reference.rs
🧰 Additional context used
🧬 Code graph analysis (1)
src/fls/oci/from_oci.rs (7)
src/fls/from_url.rs (5)
mpsc(27-27), mpsc(28-28), mpsc(29-29), mpsc(112-112), mpsc(116-116)
src/fls/block_writer.rs (6)
mpsc(458-458), new(40-52), new(93-312), new(437-504), drop(64-68), bytes_written(424-426)
src/fls/decompress.rs (2)
spawn_stderr_reader(52-69), start_decompressor_process(33-50)
src/fls/error_handling.rs (1)
process_error_messages(3-7)
src/fls/oci/reference.rs (1)
parse(55-81)
src/fls/oci/manifest.rs (2)
parse(103-130), compression(196-204)
src/fls/progress.rs (4)
download_time_formatted(38-40), decompress_time_formatted(43-45), write_time_formatted(48-50), total_time_formatted(53-55)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-apple-darwin)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (2)
src/fls/oci/reference.rs (1)
1-340: LGTM! Well-structured OCI reference parsing implementation. The implementation correctly handles the various OCI image reference formats, including the previously-flagged numeric tag issue, which has been properly addressed. The code includes:
- Proper validation for digests per OCI spec
- Smart heuristics for disambiguating registry ports from numeric tags
- Comprehensive test coverage including edge cases
- Clear separation of concerns with helper functions
src/fls/oci/from_oci.rs (1)
61-63: No action needed. The error message at line 63 accurately reflects the implementation of find_linux_manifest(), which searches for exactly two platforms: linux/arm64 (tried first) and linux/amd64 (fallback). Likely an incorrect or invalid review comment.
Actionable comments posted: 2
🤖 Fix all issues with AI Agents
In @src/fls/oci/from_oci.rs:
- Around line 506-516: The debug branch currently writes to a hardcoded
"/tmp/debug_gzip_data.gz" via TeeReader, creating a predictable-path
symlink/TOCTOU risk; replace that with a secure temporary file created by the
tempfile crate (e.g. tempfile::NamedTempFile) and pass its path to TeeReader,
ensuring the NamedTempFile is kept alive for the lifetime of the reader (so file
isn't deleted prematurely) and propagate any temp-file creation errors (from the
NamedTempFile::new mapping to the function's error type); update the
DebugGzReader creation to wrap GzDecoder around the TeeReader backed by the
NamedTempFile path, and remove the hardcoded "/tmp/…" string while keeping
references to magic_reader, TeeReader, and DebugGzReader (see the sketch after this list).
- Around line 937-1004: The DebugGzReader currently emits unconditional
eprintln! debug logs; add a boolean field debug to the DebugGzReader struct,
accept it in DebugGzReader::new (or add a new constructor) and store it, then
wrap every eprintln! in DebugGzReader::new and DebugGzReader::read with if
self.debug { ... } (or if debug in new), and update the call site that
constructs DebugGzReader to pass the appropriate debug flag (matching how
MagicDetectingReader/DebugReader do it).
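For the tempfile prompt above, a sketch under its assumptions (TeeReader is reduced to a minimal stand-in that copies into a File; tee_to_tempfile is a hypothetical helper, and the returned NamedTempFile must be kept alive alongside the reader):

use std::io::Read;

/// Copies everything it reads into a side file for later inspection.
struct TeeReader<R: Read> {
    inner: R,
    copy: std::fs::File,
}

impl<R: Read> Read for TeeReader<R> {
    fn read(&mut self, buf: &mut [u8]) -> std::io::Result<usize> {
        use std::io::Write;
        let n = self.inner.read(buf)?;
        self.copy.write_all(&buf[..n])?;
        Ok(n)
    }
}

fn tee_to_tempfile<R: Read>(inner: R) -> std::io::Result<(TeeReader<R>, tempfile::NamedTempFile)> {
    // NamedTempFile creates the file with a random name and restrictive
    // permissions, avoiding the predictable /tmp path. Returning the
    // handle keeps the file alive for the reader's lifetime.
    let tmp = tempfile::NamedTempFile::new()?;
    let copy = tmp.reopen()?;
    Ok((TeeReader { inner, copy }, tmp))
}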
🧹 Nitpick comments (1)
src/fls/oci/from_oci.rs (1)
1169-1184: Consider runtime error for non-unix platforms. The Unix-specific fd duplication now includes proper error checking (lines 1175-1177), which addresses the previous concern. However, the #[cfg(not(unix))] branch at lines 1181-1184 still uses a compile-time error, preventing compilation on non-unix platforms entirely. Consider returning a runtime error instead:
🔎 Proposed refactor
🔎 Proposed refactor
#[cfg(not(unix))] -let stdin_fd: std::fs::File = { - return Err("XZ streaming decompression is not supported on non-unix platforms".into()); -}; +let stdin_fd = { + return Err("XZ streaming decompression is not supported on non-unix platforms".into()); +};This allows the code to compile on all platforms while still failing gracefully at runtime when XZ decompression is attempted on non-unix systems.
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
src/fls/oci/from_oci.rs, src/fls/oci/reference.rs
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (6)
src/fls/oci/from_oci.rs (2)
614-671: Well-designed content detection strategy. The two-phase detection approach (lines 614-671) is well-implemented:
- Pre-buffers 2MB for reliable detection
- Decompresses a sample when needed to inspect actual content
- Falls back gracefully when decompression fails
- Clear separation between compression type and content type
The decision to return early for XZ (line 656-662) rather than attempting partial decompression is pragmatic.
37-145: Solid main pipeline orchestration. The flash_from_oci function handles a complex multi-stage pipeline effectively:
- Clear progression: authenticate → fetch manifest → platform selection → layer download
- Proper validation of compression support (lines 91-93)
- Smart early detection of content type to choose the right pipeline (lines 136-160)
- Good separation between tar archive and raw disk image paths
The 2MB pre-buffer approach (lines 105-122) is a practical solution to the chicken-and-egg problem of needing to detect content type before setting up the full pipeline.
src/fls/oci/reference.rs (4)
10-105: Clean API design for image reference handling. The ImageReference struct and its methods provide a well-designed public API:
- Clear separation between tag and digest references (Reference enum)
- Proper Display implementations for both types
- Good documentation with multiple examples (lines 48-54)
- Correct handling of Docker Hub's special registry URL (lines 98-104)
The delegation to helper functions keeps the main parse method readable.
107-148: Comprehensive digest validation. The digest validation is thorough and spec-compliant:
- Validates algorithm and hash length per OCI requirements
- Enforces lowercase hex digits (lines 137-144) as required by the OCI spec
- Provides clear, actionable error messages
- Supports both sha256 and sha512
This prevents malformed digests from propagating into the registry client.
173-207: Numeric tag disambiguation is now correct. The previous issue where numeric-only tags like myrepo:12345 were misclassified as port numbers has been properly resolved:
- Lines 194-200: Now uses looks_like_hostname() to distinguish between registry:port and image:numeric-tag formats; all-digit suffixes are treated as ports only when the authority contains a dot, is "localhost", or is an IP literal
- Lines 290-305: Tests confirm both numeric tags and registry ports parse correctly
This correctly implements Docker/OCI tag specifications that allow numeric-only tags.
237-340: Excellent test coverage. The test suite comprehensively covers:
- All major reference formats (full, Docker Hub library, namespaces, ports, digests)
- Edge cases that were previously problematic (numeric tags, registry ports)
- Negative cases for digest validation (wrong length, uppercase, unsupported algorithm)
The tests at lines 290-305 specifically validate the numeric tag disambiguation fix, providing regression protection.
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
Actionable comments posted: 0
🧹 Nitpick comments (1)
src/fls/oci/reference.rs (1)
46-105: Consider adding tag validation. While digest validation is comprehensive (lines 111-148), there's no validation for tag format. Per the OCI spec, tags should match [a-zA-Z0-9_][a-zA-Z0-9._-]{0,127} (max 128 chars, cannot start with . or -). Adding validation would catch invalid tags earlier rather than failing at registry interaction time.
💡 Example tag validation function
Add this helper function after validate_digest:
/// Validate tag format according to OCI spec
fn validate_tag(tag: &str) -> Result<(), String> {
    if tag.is_empty() {
        return Err("Tag cannot be empty".to_string());
    }
    if tag.len() > 128 {
        return Err(format!("Tag exceeds maximum length of 128 characters: {}", tag.len()));
    }
    let first_char = tag.chars().next().unwrap();
    if first_char == '.' || first_char == '-' {
        return Err(format!("Tag cannot start with '{}' (must be alphanumeric or underscore)", first_char));
    }
    if !tag.chars().all(|c| c.is_ascii_alphanumeric() || c == '_' || c == '.' || c == '-') {
        return Err("Tag must contain only ASCII alphanumeric, underscore, period, or hyphen".to_string());
    }
    Ok(())
}
Then call it in the parse method when creating a Reference::Tag:
  } else {
      // Split by : for tag, but be careful about registry port
      let (name, tag) = split_name_and_tag(input)?;
-     (name, Reference::Tag(tag.unwrap_or("latest").to_string()))
+     let tag_str = tag.unwrap_or("latest");
+     validate_tag(tag_str)?;
+     (name, Reference::Tag(tag_str.to_string()))
  };
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
src/fls/oci/from_oci.rs, src/fls/oci/reference.rs
🚧 Files skipped from review as they are similar to previous changes (1)
- src/fls/oci/from_oci.rs
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-musl)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
🔇 Additional comments (8)
src/fls/oci/reference.rs (8)
1-9: LGTM! Clear documentation and minimal imports. The module documentation clearly describes the supported reference formats, making the API easy to understand.
10-44: LGTM! Clean data structures with correct formatting. The ImageReference and Reference types are well-designed, and the Display implementations correctly format tags with : and digests with @ per the OCI specification.
46-105: LGTM! Well-structured parsing with correct Docker Hub handling. The parsing logic correctly handles digest and tag references, defaults to "latest" appropriately, and includes the necessary Docker Hub API endpoint mapping.
107-148: LGTM! Thorough digest validation per OCI specification. The validation correctly enforces sha256/sha512 algorithms with proper length requirements and lowercase hex characters. Error messages are clear and helpful.
150-171: LGTM! Effective heuristic for hostname detection. The function correctly identifies likely hostnames through dot presence, "localhost", or IPv6 literals, enabling proper disambiguation of registry ports from numeric tags.
173-207: LGTM! Numeric tag vs. port disambiguation correctly implemented. The logic properly handles the edge case flagged in the previous review where numeric-only tags (e.g., myrepo:12345) were misclassified as port numbers. The looks_like_hostname heuristic at lines 194-200 ensures that numeric suffixes are only treated as ports when preceded by a recognizable hostname.
209-235: LGTM! Correct registry and namespace defaulting. The function properly implements Docker Hub conventions, defaulting single-part names to the library/ namespace and correctly identifying when the first component is a registry vs. a namespace.
237-340: LGTM! Comprehensive test coverage addressing all edge cases. The test suite thoroughly covers the parsing logic, including the specific edge cases from the previous review:
- Lines 290-296: Confirms myrepo:12345 is correctly parsed as a numeric tag
- Lines 299-305: Confirms registry:5000/repo is correctly handled as registry:port
The digest validation tests are also thorough, covering valid digests, invalid lengths, uppercase rejection, and unsupported algorithms.
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
src/fls/from_url.rs (1)
612-615: Potential division by zero when calculating compression ratio. If stats.mb_received is zero (e.g., the download fails very early, or an uncompressed file has no data), this will result in a division by zero producing inf or NaN.
🐛 Proposed fix
- println!(
-     "Compression ratio: {:.2}x",
-     stats.mb_decompressed / stats.mb_received
- );
+ if stats.mb_received > 0.0 {
+     println!(
+         "Compression ratio: {:.2}x",
+         stats.mb_decompressed / stats.mb_received
+     );
+ }
Alternatively, use the print_final_stats_with_ratio() method from ProgressTracker, which already includes this guard.
🧹 Nitpick comments (7)
src/fls/progress.rs (1)
274-332: Consider using these new helper methods to reduce duplication. These methods consolidate the final stats printing pattern, but they're currently marked #[allow(dead_code)] and not utilized. Both flash_from_url (lines 593-616) and flash_from_oci duplicate this printing logic inline.
♻️ Suggested usage in flash_from_url:
- let stats = progress.final_stats();
- println!(
-     "\nDownload complete: {:.2} MB in {} ({:.2} MB/s)",
-     stats.mb_received,
-     stats.download_time_formatted(),
-     stats.download_rate
- );
- println!(
-     "Decompression complete: {:.2} MB in {} ({:.2} MB/s)",
-     stats.mb_decompressed,
-     stats.decompress_time_formatted(),
-     stats.decompress_rate
- );
- println!(
-     "Write complete: {:.2} MB in {} ({:.2} MB/s)",
-     stats.mb_written,
-     stats.write_time_formatted(),
-     stats.write_rate
- );
- println!(
-     "Compression ratio: {:.2}x",
-     stats.mb_decompressed / stats.mb_received
- );
- println!("Total flash runtime: {}", stats.total_time_formatted());
+ progress.print_final_stats_with_ratio();
src/fls/compression.rs (1)
1-56: Clean compression type abstraction. The Compression enum provides a unified representation for compression handling. The methods are well-designed with consistent behavior. One observation: the OCI module defines its own LayerCompression enum (in src/fls/oci/manifest.rs) with similar variants (None, Gzip, Zstd). Consider whether these could be consolidated, or whether a conversion trait between them would help maintain consistency.
src/main.rs (1)
172-184: Consider warning on empty header value after split. The header parsing correctly handles malformed headers by checking parts.len() == 2, but a header like "Authorization:" (with a colon but empty value) would result in an empty string for the value. This may be intentional, but worth noting.
src/fls/oci/from_oci.rs (1)
634-646: Glob pattern matching is limited but functional. The matches_pattern function only supports * at the start or end of the pattern. Patterns like disk*.img or *.img.* won't work as expected. This is acceptable for the current use case but worth documenting.
📝 Consider documenting the limitation:
  /// Simple glob pattern matching
+ ///
+ /// Supports only simple patterns:
+ /// - `*.ext` - matches files ending with `.ext`
+ /// - `prefix*` - matches files starting with `prefix`
+ /// - `exact` - exact match
  fn matches_pattern(name: &str, pattern: &str) -> bool {
55-63: OciOptions Default may be misleading. Using #[derive(Default)] for OciOptions means common.device will be an empty string, which is invalid for actual use. While this is technically fine since callers should always provide a device, it could lead to confusing runtime errors if someone accidentally uses the default. This is a minor concern since the struct is primarily constructed from CLI args.
src/fls/stream_utils.rs (1)
63-70: StreamResult and functions are marked dead_code. The #[allow(dead_code)] annotations suggest these utilities are not yet used. Consider removing the annotations once integrated, or document that they're intended for future use.
src/fls/oci/registry.rs (1)
66-69: Remove redundant header case fallback - reqwest lookups are case-insensitive. reqwest normalizes header names internally, so .or_else(|| response.headers().get("WWW-Authenticate")) is unnecessary. A single .get("www-authenticate") call handles all case variants.
♻️ Simplified header lookup:
  let www_auth_header = response
      .headers()
      .get("www-authenticate")
-     .or_else(|| response.headers().get("WWW-Authenticate"))
      .ok_or("No WWW-Authenticate header in 401 response")?
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (14)
src/fls/compression.rs, src/fls/from_url.rs, src/fls/http.rs, src/fls/magic_bytes.rs, src/fls/mod.rs, src/fls/oci/from_oci.rs, src/fls/oci/mod.rs, src/fls/oci/registry.rs, src/fls/options.rs, src/fls/progress.rs, src/fls/stream_utils.rs, src/lib.rs, src/main.rs, tests/from_url.rs
🚧 Files skipped from review as they are similar to previous changes (1)
- src/fls/oci/mod.rs
🧰 Additional context used
🧬 Code graph analysis (7)
src/lib.rs (2)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-471)
src/fls/from_url.rs (1)
flash_from_url(13-619)
tests/from_url.rs (1)
src/fls/options.rs (2)
default(19-32), default(45-52)
src/main.rs (2)
src/fls/oci/registry.rs (1)
image_ref(240-242)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-471)
src/fls/stream_utils.rs (3)
src/fls/from_url.rs (5)
mpsc(28-28), mpsc(29-29), mpsc(30-30), mpsc(117-117), mpsc(121-121)
src/fls/oci/from_oci.rs (18)
mpsc(178-178), mpsc(181-181), mpsc(184-184), mpsc(185-185), mpsc(186-186), mpsc(965-965), mpsc(980-980), mpsc(981-981), mpsc(1004-1004), mpsc(1097-1097), new(657-664), new(779-797), new(837-847), new(908-914), read(717-760), read(801-825), read(851-904), read(918-941)
src/fls/progress.rs (1)
new(84-104)
src/fls/options.rs (1)
src/fls/oci/registry.rs (1)
new(25-42)
src/fls/mod.rs (3)
src/fls/oci/manifest.rs (1)
compression(196-204)
src/fls/from_url.rs (1)
flash_from_url(13-619)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-471)
src/fls/oci/registry.rs (4)
src/fls/oci/auth.rs (3)
request_token(201-245), new(20-28), parse(55-74)
src/fls/http.rs (1)
setup_http_client(6-75)
src/fls/oci/manifest.rs (1)
parse(103-130)
src/fls/oci/reference.rs (4)
parse(55-81), registry_url(97-104), api_repository(84-86), reference_string(89-94)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (30)
src/lib.rs (1)
7-7: LGTM! Public API expanded for OCI support. The re-exports correctly expose the new OCI flash functionality and associated option types. Note that HttpClientOptions is exported from src/fls/mod.rs but not re-exported here, which appears intentional if it's meant for internal use only.
src/fls/http.rs (3)
2-8: Clean refactoring to use HttpClientOptions. The signature change properly decouples the HTTP client setup from the specific flash options type, allowing it to be used by both URL and OCI flash flows via the From trait conversions.
47-51: Formatting improvements for debug output. Minor formatting change for better readability of the CA certificate path in debug logs.
128-132: Formatting improvements for response status logging. Multi-line formatting improves readability of the debug output.
src/fls/mod.rs (1)
3-19: Well-organized module structure for OCI support. The visibility modifiers are appropriate:
- pub(crate) for internal utilities (compression, http)
- pub for the new oci module exposing the public API
The expanded re-exports correctly expose the new option types alongside the existing ones.
tests/from_url.rs (2)
10-42: Good test helper abstraction. The test_options and test_options_with helpers effectively reduce boilerplate across tests while maintaining flexibility for customizing TLS and debug settings. The use of ..Default::default() ensures new fields added to the options structs won't require updates to every test.
64-64: Consistent adoption of test helper. Tests now use the centralized helper, improving maintainability.
src/fls/from_url.rs (2)
10-18: Clean separation of HTTP client options. The conversion from BlockFlashOptions to HttpClientOptions via the From trait properly decouples the HTTP client setup from the flash-specific options structure.
32-44: Consistent options access pattern. Accessing common options through options.common.* is consistent with the new nested options structure.
src/main.rs (3)
59-67: Credentials handling looks good. The password field correctly uses the env attribute to read from the FLS_REGISTRY_PASSWORD environment variable, which is a good security practice to avoid passing secrets on the command line where they could be visible in process lists.
95-100: URL scheme detection is clear and correct. The starts_with("oci://") check followed by strip_prefix (with a safe unwrap() due to the prior check) is a clean approach for routing between OCI and HTTP paths.
127-155: OCI options construction and flash invocation look correct. The OciOptions struct is properly populated with all relevant fields, and error handling correctly distinguishes between success (FLASH_COMPLETED) and failure (FLASH_FAILED) with appropriate exit codes.
src/fls/magic_bytes.rs (4)
11-32: Compression detection logic is correct. The magic byte signatures are accurate:
- Gzip: 1f 8b
- XZ: fd 37 7a 58 5a 00
- Zstd: 28 b5 2f fd
The early return for insufficient data (len < 2) is appropriate.
len < 2) is appropriate.
92-105: XZ/Zstd assumption of tar archive may cause issues with raw compressed disk images.When XZ or Zstd compression is detected, the code assumes the content is a tar archive without attempting to decompress and inspect. If a raw disk image is XZ-compressed (not inside a tar), this would misclassify it.
However, based on the calling code in `from_oci.rs`, which handles XZ-compressed layers specially (bypassing tar extraction), this seems intentional.
125-140: Gzip sample decompression is well-sized for tar detection. The 8KB buffer is more than sufficient for tar detection, which requires only 262 bytes to check the magic at offset 257. Good implementation.
142-176: Test coverage is adequate for the detection utilities. Tests cover the main compression formats and tar signature detection. Consider adding a test for `detect_content_and_compression` with actual gzip-compressed data in a follow-up.
src/fls/oci/from_oci.rs (5)
80-83: Zstd rejection is clear with a helpful error message. Good practice to explicitly reject unsupported compression formats early with a clear error rather than failing later in the pipeline.
1018-1033: Good error handling for fd duplication. The code now properly checks for `dup` failure (`dup_fd == -1`) and returns an appropriate error, addressing the previously flagged issue.
1030-1033: Non-unix platform handling is acceptable. Returning a runtime error instead of `compile_error!` is the correct fix. Users on non-unix platforms will get a clear error message rather than a compile failure.
222-252: Writer task correctly handles EOF and errors. The decompressor-stdout-to-block-writer task properly handles EOF (`Ok(0)`), propagates progress, and reports errors through the error channel. The 8MB buffer is appropriate for throughput.
392-419: Decompressor wait loop properly handles process completion. The polling loop with 100ms sleep is reasonable for waiting on the decompressor process while keeping progress updated. The exit status check ensures decompressor failures are caught.
src/fls/options.rs (2)
3-16: Clean option struct design with good field organization. The `FlashOptions` struct appropriately groups common configuration shared between URL and OCI flash operations. The field names are clear and self-documenting.
74-93: From trait implementations provide clean conversion ergonomics. The `From<&FlashOptions>`, `From<&BlockFlashOptions>`, and `From<&OciOptions>` implementations for `HttpClientOptions` avoid code duplication and enable clean conversion patterns like `let http_options: HttpClientOptions = options.into();`.
src/fls/stream_utils.rs (3)
34-61: ChannelReader correctly implements the Read trait. The implementation properly handles:
- Partial reads from buffered data
- Blocking receive for new chunks
- EOF signaling when channel closes
Using `blocking_recv()` is the correct choice since this reader is intended for sync consumers like `tar::Archive`.
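For illustration, a minimal sketch of such a channel-backed reader (struct shape and buffering details simplified; this is not the crate's exact implementation):

```rust
use std::io::Read;
use tokio::sync::mpsc;

/// Bridges an async byte channel to a synchronous `Read` consumer.
/// Must only be used from a blocking context (e.g. inside spawn_blocking).
struct ChannelReader {
    rx: mpsc::Receiver<Vec<u8>>,
    buf: Vec<u8>,
    pos: usize,
}

impl Read for ChannelReader {
    fn read(&mut self, out: &mut [u8]) -> std::io::Result<usize> {
        // Refill the local buffer from the channel once it is drained.
        if self.pos >= self.buf.len() {
            match self.rx.blocking_recv() {
                Some(chunk) => {
                    self.buf = chunk;
                    self.pos = 0;
                }
                None => return Ok(0), // channel closed => EOF
            }
        }
        let n = (self.buf.len() - self.pos).min(out.len());
        out[..n].copy_from_slice(&self.buf[self.pos..self.pos + n]);
        self.pos += n;
        Ok(n)
    }
}

#[tokio::main]
async fn main() {
    let (tx, rx) = mpsc::channel(4);
    tx.send(b"hello ".to_vec()).await.unwrap();
    tx.send(b"world".to_vec()).await.unwrap();
    drop(tx); // close the channel so the reader sees EOF

    // Consume from a blocking task, mirroring how sync readers are driven.
    let text = tokio::task::spawn_blocking(move || {
        let mut reader = ChannelReader { rx, buf: Vec::new(), pos: 0 };
        let mut s = String::new();
        reader.read_to_string(&mut s).unwrap();
        s
    })
    .await
    .unwrap();
    assert_eq!(text, "hello world");
}
```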
91-132: Stream-to-channel function has good timeout and error handling. The 30-second per-chunk timeout is appropriate for preventing indefinite hangs. The early break on channel closure correctly handles backpressure from the consumer side.
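A sketch of the per-chunk timeout idea, assuming a `reqwest::Response` drained into a tokio mpsc channel (function shape and error type are simplified assumptions, not the crate's signature):

```rust
use std::time::Duration;
use tokio::sync::mpsc;

/// Drains an HTTP response body into a channel, failing if any single
/// chunk takes longer than `chunk_timeout` to arrive.
async fn stream_to_channel(
    mut response: reqwest::Response,
    tx: mpsc::Sender<Vec<u8>>,
    chunk_timeout: Duration,
) -> Result<(), String> {
    loop {
        match tokio::time::timeout(chunk_timeout, response.chunk()).await {
            Err(_) => return Err("timed out waiting for next chunk".into()),
            Ok(Err(e)) => return Err(format!("download error: {e}")),
            Ok(Ok(None)) => return Ok(()), // body finished
            Ok(Ok(Some(chunk))) => {
                // A closed receiver means the consumer stopped; end early.
                if tx.send(chunk.to_vec()).await.is_err() {
                    return Ok(());
                }
            }
        }
    }
}
```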
145-168: wait_with_progress correctly drains progress channels. The function properly polls both decompressed and written progress channels while waiting for task completion. The 100ms sleep interval balances responsiveness with CPU usage.
src/fls/oci/registry.rs (4)
44-98: Authentication flow correctly implements OCI distribution spec. The authenticate method properly:
- Attempts anonymous access first
- Handles 401 by parsing WWW-Authenticate header
- Requests bearer token when needed
- Stores token for subsequent requests
This follows the OCI Distribution Specification authentication flow.
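For illustration, a minimal sketch of the token-request step under these assumptions: reqwest with the json feature, a simplified token response, and the WWW-Authenticate header already parsed (all names here are hypothetical, not the PR's API):

```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct TokenResponse {
    token: String,
}

/// Given a parsed WWW-Authenticate challenge such as
/// `Bearer realm="...",service="...",scope="repository:foo/bar:pull"`,
/// request a bearer token from the realm endpoint.
async fn fetch_bearer_token(
    client: &reqwest::Client,
    realm: &str,
    service: &str,
    scope: &str,
) -> Result<String, reqwest::Error> {
    let resp = client
        .get(realm)
        .query(&[("service", service), ("scope", scope)])
        .send()
        .await?
        .error_for_status()?;
    let body: TokenResponse = resp.json().await?;
    Ok(body.token)
}
```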
100-109: add_auth correctly prioritizes Bearer over Basic auth. The method properly checks for a token first (from successful bearer auth) before falling back to basic credentials. This ensures the most recent successful auth method is used.
124-133: Accept header correctly requests multiple manifest types. The Accept header includes both OCI and Docker manifest types, ensuring compatibility with different registries. Good practice for interoperability.
204-236: Blob streaming correctly returns Response for caller to consume. Returning the `Response` directly allows the caller to stream the blob without buffering the entire content in memory. The debug logging for blob size is helpful for troubleshooting.
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
src/fls/from_url.rs (1)
593-616: Guard against potential division by zero in compression ratio. Line 614 calculates the compression ratio without checking if `mb_received` is zero. While unlikely in practice (the download would have failed earlier), this could produce a meaningless ratio (infinity or NaN) on edge cases.
🛡️ Proposed fix
      println!(
          "Write complete: {:.2} MB in {} ({:.2} MB/s)",
          stats.mb_written,
          stats.write_time_formatted(),
          stats.write_rate
      );
-     println!(
-         "Compression ratio: {:.2}x",
-         stats.mb_decompressed / stats.mb_received
-     );
+     if stats.mb_received > 0.0 {
+         println!(
+             "Compression ratio: {:.2}x",
+             stats.mb_decompressed / stats.mb_received
+         );
+     }
      println!("Total flash runtime: {}", stats.total_time_formatted());
🧹 Nitpick comments (6)
src/fls/progress.rs (1)
279-332: Consider consolidating these near-identical methods. Both `print_final_stats` and `print_final_stats_with_ratio` share significant duplication. Consider refactoring to a single method with an optional parameter to control whether the compression ratio is displayed:
♻️ Proposed refactor
- #[allow(dead_code)]
- pub(crate) fn print_final_stats(&self) {
+ pub(crate) fn print_final_stats(&self, show_compression_ratio: bool) {
      let stats = self.final_stats();
      println!(
          "\nDownload complete: {:.2} MB in {} ({:.2} MB/s)",
@@ -293,20 +292,11 @@
          stats.write_time_formatted(),
          stats.write_rate
      );
+     if show_compression_ratio && stats.mb_received > 0.0 {
+         println!(
+             "Compression ratio: {:.2}x",
+             stats.mb_decompressed / stats.mb_received
+         );
+     }
      println!("Total flash runtime: {}", stats.total_time_formatted());
  }
-
- /// Print final statistics including compression ratio (for URL flash)
- #[allow(dead_code)]
- pub(crate) fn print_final_stats_with_ratio(&self) {
-     let stats = self.final_stats();
-     println!(
-         "\nDownload complete: {:.2} MB in {} ({:.2} MB/s)",
-         stats.mb_received,
-         stats.download_time_formatted(),
-         stats.download_rate
-     );
-     println!(
-         "Decompression complete: {:.2} MB in {} ({:.2} MB/s)",
-         stats.mb_decompressed,
-         stats.decompress_time_formatted(),
-         stats.decompress_rate
-     );
-     println!(
-         "Write complete: {:.2} MB in {} ({:.2} MB/s)",
-         stats.mb_written,
-         stats.write_time_formatted(),
-         stats.write_rate
-     );
-     if stats.mb_received > 0.0 {
-         println!(
-             "Compression ratio: {:.2}x",
-             stats.mb_decompressed / stats.mb_received
-         );
-     }
-     println!("Total flash runtime: {}", stats.total_time_formatted());
- }
Then update callers:
- For URL flash: `tracker.print_final_stats(true)`
- For OCI flash: `tracker.print_final_stats(false)`
src/fls/oci/from_oci.rs (2)
1083-1086: Consider checking stdin writer result. Lines 1083 and 1086 ignore the results of `stdin_writer_handle` and `decompressor.wait()`. If the stdin writer fails, this could be masked by downstream errors. While the error would likely manifest elsewhere, explicitly checking improves error reporting.
💡 Optional improvement
  // Wait for stdin writer to finish
- let _ = stdin_writer_handle.await;
+ if let Err(e) = stdin_writer_handle.await {
+     eprintln!("Warning: stdin writer task failed: {}", e);
+ }
  // Wait for decompressor to finish
- let _ = decompressor.wait().await;
+ if let Err(e) = decompressor.wait().await {
+     eprintln!("Warning: decompressor wait failed: {}", e);
+ }
  writer.close().await
1-1332: Consider adding high-level documentation. This file implements a complex streaming pipeline with multiple code paths (tar archives, raw disk images, XZ bypass). While the implementation is solid, adding module-level documentation explaining the overall architecture and the different paths would significantly improve maintainability.
📝 Suggested documentation structure
//! Flash from OCI image
//!
//! # Architecture
//!
//! This module implements three streaming pipelines depending on content detection:
//!
//! 1. **Tar Archive Path** (default):
//!    - HTTP download → Gzip decompress → Tar extract → XZ decompress → Block device
//!    - Used for standard OCI layers containing compressed disk images in tar
//!
//! 2. **Raw Disk Image Path**:
//!    - HTTP download → Optional decompress → Block device
//!    - Used when content detection identifies a raw disk image (not a tar)
//!
//! 3. **XZ Bypass Path**:
//!    - HTTP download → Tar extract (no decompress) → XZ decompress → Block device
//!    - Used for XZ-compressed layers to avoid double decompression
//!
//! All paths use channel-based streaming for memory efficiency and include
//! comprehensive progress tracking and error handling.
src/fls/oci/registry.rs (2)
66-72: Redundant header case check. HTTP headers are case-insensitive, and reqwest normalizes header names to lowercase internally. The `or_else` fallback for `"WWW-Authenticate"` is unnecessary since `"www-authenticate"` will always match.
♻️ Suggested simplification
  let www_auth_header = response
      .headers()
      .get("www-authenticate")
-     .or_else(|| response.headers().get("WWW-Authenticate"))
      .ok_or("No WWW-Authenticate header in 401 response")?
      .to_str()
      .map_err(|e| format!("Invalid WWW-Authenticate header: {}", e))?;
160-202: Consider extracting common manifest fetch logic.
`fetch_manifest` and `fetch_manifest_by_digest` share significant code structure. Consider extracting a private helper method that accepts the reference/digest and accept header to reduce duplication.
src/fls/stream_utils.rs (1)
145-168: Consider documenting that the caller must await the handle separately. The function takes `&JoinHandle<T>` and polls `is_finished()` but doesn't await the handle. The caller is responsible for subsequently awaiting the handle to retrieve the result and handle any panics. This is a valid design but worth documenting.
Also, `update_progress` errors are silently ignored on line 161. If progress updates can fail in meaningful ways, consider logging or propagating.
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (14)
- src/fls/compression.rs
- src/fls/from_url.rs
- src/fls/http.rs
- src/fls/magic_bytes.rs
- src/fls/mod.rs
- src/fls/oci/from_oci.rs
- src/fls/oci/mod.rs
- src/fls/oci/registry.rs
- src/fls/options.rs
- src/fls/progress.rs
- src/fls/stream_utils.rs
- src/lib.rs
- src/main.rs
- tests/from_url.rs
🚧 Files skipped from review as they are similar to previous changes (1)
- src/fls/oci/mod.rs
🧰 Additional context used
🧬 Code graph analysis (6)
tests/from_url.rs (2)
src/fls/from_url.rs (1)
flash_from_url(13-619)
src/fls/options.rs (2)
default(19-32)
default(45-52)
src/fls/from_url.rs (3)
src/fls/http.rs (1)
setup_http_client(6-75)
src/fls/oci/registry.rs (1)
new(25-42)
src/fls/progress.rs (1)
new(84-104)
src/fls/oci/from_oci.rs (9)
src/fls/stream_utils.rs (2)
read(35-60)
new(25-31)
src/fls/block_writer.rs (5)
mpsc(458-458)
new(40-52)
new(93-312)
new(437-504)
bytes_written(424-426)
src/fls/oci/manifest.rs (2)
compression(196-204)
parse(103-130)
src/fls/decompress.rs (2)
spawn_stderr_reader(52-69)
start_decompressor_process(33-50)
src/fls/magic_bytes.rs (1)
detect_content_and_compression(59-123)
src/fls/oci/registry.rs (2)
image_ref(240-242)
new(25-42)
src/fls/oci/reference.rs (1)
parse(55-81)
src/fls/progress.rs (5)
new(84-104)
download_time_formatted(38-40)
decompress_time_formatted(43-45)
write_time_formatted(48-50)
total_time_formatted(53-55)
src/fls/options.rs (3)
from(73-79)
from(83-85)
from(89-91)
src/lib.rs (2)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-471)
src/fls/from_url.rs (1)
flash_from_url(13-619)
src/main.rs (3)
src/fls/oci/registry.rs (1)
image_ref(240-242)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-471)
src/fls/from_url.rs (1)
flash_from_url(13-619)
src/fls/options.rs (1)
src/fls/oci/registry.rs (1)
new(25-42)
🔇 Additional comments (30)
src/lib.rs (1)
7-7: LGTM! The public API expansion appropriately exposes the new OCI functionality (`flash_from_oci`, `OciOptions`) alongside the existing URL-based flash operations and shared option types.
src/fls/compression.rs (1)
1-56: LGTM! Clean compression type abstraction. The `Compression` enum provides a well-structured, single source of truth for compression handling across OCI and URL-based operations. The implementation is clean with appropriate helper methods.
src/fls/http.rs (2)
2-2: LGTM! Type refactoring aligns with option structure updates. The function signature update to accept `HttpClientOptions` instead of `BlockFlashOptions` is consistent with the broader option type refactoring described in the PR, enabling unified HTTP client setup across OCI and URL-based flash operations.
Also applies to: 7-7
47-50: Debug formatting changes are cosmetic. The multi-line formatting of debug output for CA certificates and HTTP response status doesn't affect functionality. These changes maintain the same information while adjusting the presentation.
Also applies to: 128-132
tests/from_url.rs (3)
10-42: Excellent refactoring to centralize test setup. The new test helper functions (`test_options` and `test_options_with`) effectively reduce duplication across test cases and provide a consistent way to configure `BlockFlashOptions` for different test scenarios. This improves maintainability and makes test intent clearer.
64-64: LGTM! Test updates consistently use helpers. The migration from inline `BlockFlashOptions` construction to the `test_options` helper is applied consistently across tests. The pattern of creating options with the helper and then mutating specific fields (e.g., `retry_delay_secs`, `max_retries`) is clear and maintains test readability.
Also applies to: 123-123, 183-183, 247-249, 318-320, 394-396, 593-595
765-772: TLS test configuration properly uses parameterized helper. The TLS-related tests correctly use `test_options_with` to configure custom CA certificates and insecure TLS settings, demonstrating proper test coverage for these security-related features.
Also applies to: 936-937, 1033-1034
src/fls/mod.rs (1)
1-19: Module organization looks good. The visibility changes and new module additions are well-structured. Making `compression` and `http` `pub(crate)` appropriately restricts internal access, while the new `oci` module and expanded re-exports cleanly extend the public API surface.
src/fls/from_url.rs (1)
17-18: LGTM: Clean option conversion pattern. The conversion from `BlockFlashOptions` to `HttpClientOptions` via the `From` trait cleanly separates HTTP client configuration from flash-specific options. This aligns well with the new option structure shown in the registry code.
src/main.rs (2)
96-156: LGTM: OCI path implementation is clean and secure. The OCI routing logic properly:
- Detects the `oci://` scheme
- Strips the prefix before passing to the image parser
- Avoids leaking credentials in logs (only prints "Using provided credentials")
- Constructs options with the new nested structure
157-222: LGTM: HTTP path handling is robust. The HTTP/HTTPS branch properly:
- Parses headers with defensive error handling (warns on invalid format)
- Constructs `BlockFlashOptions` with nested `FlashOptions`
- Maintains consistency with the OCI path structure
src/fls/magic_bytes.rs (3)
11-32: LGTM: Magic byte detection is correct. The compression detection correctly identifies gzip, XZ, and Zstd formats using their standard magic byte signatures. The minimum length check prevents out-of-bounds access.
37-44: LGTM: Tar archive detection is correct. The tar magic signature check at offset 257 is correct per the POSIX tar specification. Supporting both "ustar" and "posix" variants is appropriate.
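A minimal sketch of the offset-257 check (this assumes only the common "ustar" magic; the PR reportedly handles another variant as well):

```rust
/// Returns true if `data` looks like a tar archive: the POSIX ustar
/// header stores its magic string at byte offset 257.
fn looks_like_tar(data: &[u8]) -> bool {
    const MAGIC_OFFSET: usize = 257;
    data.len() > MAGIC_OFFSET && data[MAGIC_OFFSET..].starts_with(b"ustar")
}

fn main() {
    // A 512-byte header block with the magic in place.
    let mut header = vec![0u8; 512];
    header[257..262].copy_from_slice(b"ustar");
    assert!(looks_like_tar(&header));
    assert!(!looks_like_tar(&[0u8; 100]));
}
```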
59-123: LGTM: Content detection logic is well-designed. The approach of decompressing a gzip sample to inspect the underlying content is clever and handles the case where registries don't set media types correctly. The fallback to `TarArchive` on decompression failure is a safe default.
src/fls/oci/from_oci.rs (6)
27-84: LGTM: OCI setup phase is well-structured. The initialization flow is clean and logical:
- Parse image reference
- Authenticate with registry
- Handle multi-platform manifests (selecting Linux)
- Validate layer compression
The explicit rejection of Zstd with a clear error message is appropriate given current limitations.
93-150: LGTM: Content detection strategy is sound. Downloading 2MB upfront for content detection is a reasonable trade-off between accuracy and efficiency. The special handling for XZ-compressed content (bypassing tar extraction) is a good optimization.
155-220: LGTM: Pipeline setup is well-designed. The buffer capacity calculation with enhanced buffering for compressed layers (minimum 10,000 chunks) shows good understanding of streaming decompression requirements. The channel setup cleanly separates different data flows.
222-274: LGTM: Task spawning and orchestration is well-structured. The pipeline properly separates concerns with dedicated tasks for:
- Writing decompressed data to the block device
- Reading decompressor stderr for errors
- Processing error messages
- Feeding tar-extracted data to the decompressor
Proper resource cleanup (dropping stdin at line 272) ensures EOF signaling.
1018-1033: LGTM: File descriptor duplication is now safe. The unsafe fd duplication correctly:
- Checks for dup failure (line 1024-1025)
- Converts OS error on failure
- Only proceeds with `from_raw_fd` after validation
- Handles non-Unix platforms gracefully with a runtime error
608-632: LGTM: Disk image detection is comprehensive. The function appropriately (see the sketch after this list):
- Skips hidden files (common convention)
- Supports custom glob patterns
- Falls back to a comprehensive list of common disk image extensions
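To illustrate the filtering described in the list above, a small sketch (the extension list and the substring matching here are assumptions; the PR reportedly uses glob patterns):

```rust
/// Decide whether a tar entry name looks like a flashable disk image.
/// The extension list is a plausible subset, not the PR's exact list.
fn is_disk_image_candidate(name: &str, pattern: Option<&str>) -> bool {
    let base = name.rsplit('/').next().unwrap_or(name);
    // Skip hidden files such as dotfiles or whiteout entries.
    if base.starts_with('.') {
        return false;
    }
    // A user-supplied pattern takes precedence (shown as a substring
    // match here for brevity).
    if let Some(p) = pattern {
        return base.contains(p);
    }
    const EXTENSIONS: &[&str] = &[".img", ".raw", ".iso", ".qcow2", ".wic"];
    EXTENSIONS.iter().any(|ext| base.ends_with(ext))
}

fn main() {
    assert!(is_disk_image_candidate("images/rootfs.img", None));
    assert!(!is_disk_image_candidate(".hidden.img", None));
    assert!(is_disk_image_candidate("board-v2.bin", Some("board")));
}
```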
src/fls/oci/registry.rs (4)
100-109: LGTM! The authorization logic correctly prioritizes bearer token over basic auth, with appropriate fallback behavior.
111-158: LGTM! The manifest fetch implementation correctly handles multiple manifest media types via the Accept header and provides good error context including the response body.
204-236: LGTM! Returning the raw `Response` allows for efficient streaming of potentially large blobs without buffering the entire content in memory.
238-243: LGTM! Simple accessor method. The `#[allow(dead_code)]` annotation is appropriate for a public API method that may be used by consumers.
src/fls/stream_utils.rs (2)
72-138: LGTM! Well-structured streaming function with appropriate timeout handling, progress tracking, and graceful handling of channel closure. The 30-second per-chunk timeout provides reasonable protection against stalled downloads.
34-61: ChannelReader is correctly used only from blocking contexts. All three instantiations of `ChannelReader::new` are properly placed within `tokio::task::spawn_blocking` blocks, which is the correct pattern for bridging async HTTP streaming with synchronous readers like `tar::Archive` or `flate2::GzDecoder`. The use of `blocking_recv()` is safe in this context.
src/fls/options.rs (4)
3-33: LGTM! Clean separation of common flash options. The default buffer sizes (128MB) and progress interval (0.5s) are reasonable choices for streaming flash operations.
35-53: LGTM! The composition pattern with `FlashOptions` as the `common` field is a clean approach. Default retry settings (10 retries, 2s delay) are reasonable for resilient HTTP operations.
55-62: LGTM! The `OciOptions` struct properly composes with `FlashOptions` and uses `#[derive(Default)]`, which correctly leverages `FlashOptions::default()`. Optional credentials support both anonymous and authenticated registry access.
64-92: LGTM! The `HttpClientOptions` struct cleanly extracts the HTTP-client-relevant subset of options. The `From` implementations use proper delegation to avoid code duplication.
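As an illustration of that delegation pattern, a compilable miniature (the field set is reduced to a single debug flag, so this is not the PR's actual definition):

```rust
/// Simplified stand-ins for the real option structs.
struct FlashOptions {
    debug: bool,
}

struct BlockFlashOptions {
    common: FlashOptions,
}

struct HttpClientOptions {
    debug: bool,
}

impl From<&FlashOptions> for HttpClientOptions {
    fn from(opts: &FlashOptions) -> Self {
        HttpClientOptions { debug: opts.debug }
    }
}

// The composed type delegates through its `common` field, so the
// field mapping lives in exactly one place.
impl From<&BlockFlashOptions> for HttpClientOptions {
    fn from(opts: &BlockFlashOptions) -> Self {
        (&opts.common).into()
    }
}

fn main() {
    let block = BlockFlashOptions {
        common: FlashOptions { debug: true },
    };
    let http: HttpClientOptions = (&block).into();
    assert!(http.debug);
}
```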
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
Actionable comments posted: 2
🤖 Fix all issues with AI agents
In @src/main.rs:
- Around line 95-104: After detecting is_oci and obtaining image_ref via
url.strip_prefix("oci://").unwrap(), validate that image_ref is not empty and
return or error early if it is; update the OCI branch (where image_ref is used
and passed to flash_from_oci) to check image_ref.is_empty() and produce a clear
error/exit instead of calling flash_from_oci with an empty string so downstream
code cannot receive an invalid image reference.
🧹 Nitpick comments (3)
src/main.rs (1)
173-195: Consider rejecting headers with empty values. The current implementation warns about empty header values but still includes them (line 186). While some HTTP headers can technically have empty values, this might not be the intended behavior and could cause confusion or issues with certain servers.
♻️ Alternative: reject empty values
  if header_name.is_empty() {
      eprintln!("Warning: Ignoring header with empty name: {}", h);
      None
  } else if header_value.is_empty() {
-     eprintln!("Warning: Header '{}' has empty value", header_name);
-     Some((header_name, header_value))
+     eprintln!("Warning: Ignoring header '{}' with empty value", header_name);
+     None
  } else {
      Some((header_name, header_value))
  }
src/fls/oci/from_oci.rs (1)
181-181: Consider making tar channel buffer size configurable. The tar channel buffer capacity is hardcoded to 16 chunks (128MB total at 8MB per chunk). This could be configurable via options, similar to how the HTTP download buffer is configurable via `buffer_size_mb`.
💡 Optional improvement
Consider adding a field to `OciOptions` or `FlashOptions`:
tar_buffer_chunks: usize, // default: 16
Then use it here:
- let (tar_tx, mut tar_rx) = mpsc::channel::<Vec<u8>>(16); // 16 * 8MB = 128MB buffer
+ let tar_buffer = options.common.tar_buffer_chunks.unwrap_or(16);
+ let (tar_tx, mut tar_rx) = mpsc::channel::<Vec<u8>>(tar_buffer);
src/fls/oci/manifest.rs (1)
134-151: Multi-layer image handling silently picks last layer. When an image has multiple layers, `get_single_layer()` silently returns the last layer based on the assumption that it "typically contains the actual content." This assumption may not hold for all OCI images, and the silent selection could lead to flashing the wrong layer.
📋 Suggested improvement
Consider one of these approaches:
Option 1: Log a warning (minimal change)
  } else if m.layers.len() > 1 {
-     // For multi-layer images, we take the last layer
-     // which typically contains the actual content
+     eprintln!(
+         "Warning: Image has {} layers. Selecting last layer: {}",
+         m.layers.len(),
+         m.layers.last().unwrap().digest
+     );
      Ok(m.layers.last().unwrap())
Option 2: Return error and require explicit layer selection (safer)
  } else if m.layers.len() > 1 {
-     // For multi-layer images, we take the last layer
-     // which typically contains the actual content
-     Ok(m.layers.last().unwrap())
+     Err(format!(
+         "Image has {} layers. Please specify which layer to use with --layer-index or --layer-pattern",
+         m.layers.len()
+     ))
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (15)
- src/fls/compression.rs
- src/fls/from_url.rs
- src/fls/http.rs
- src/fls/magic_bytes.rs
- src/fls/mod.rs
- src/fls/oci/from_oci.rs
- src/fls/oci/manifest.rs
- src/fls/oci/mod.rs
- src/fls/oci/registry.rs
- src/fls/options.rs
- src/fls/progress.rs
- src/fls/stream_utils.rs
- src/lib.rs
- src/main.rs
- tests/from_url.rs
🚧 Files skipped from review as they are similar to previous changes (7)
- src/fls/oci/mod.rs
- src/fls/compression.rs
- tests/from_url.rs
- src/fls/oci/registry.rs
- src/fls/http.rs
- src/fls/magic_bytes.rs
- src/fls/progress.rs
🧰 Additional context used
🧬 Code graph analysis (4)
src/fls/from_url.rs (4)
src/fls/http.rs (2)
setup_http_client(6-75)
start_download(77-168)
src/fls/oci/registry.rs (1)
new(25-42)
src/fls/progress.rs (1)
new(84-104)
src/fls/block_writer.rs (3)
new(40-52)
new(93-312)
new(437-504)
src/fls/mod.rs (3)
src/fls/oci/manifest.rs (1)
compression(196-204)
src/fls/from_url.rs (1)
flash_from_url(13-596)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-452)
src/main.rs (3)
src/fls/oci/registry.rs (1)
image_ref(239-241)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-452)
src/fls/from_url.rs (1)
flash_from_url(13-596)
src/fls/options.rs (1)
src/fls/oci/registry.rs (1)
new(25-42)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (13)
src/fls/stream_utils.rs (1)
1-58: LGTM! Well-designed async-to-sync bridge. The
`ChannelReader` implementation correctly bridges async Tokio channels to synchronous `Read` trait consumers. The use of `blocking_recv()` is appropriate here since this is explicitly designed to enable synchronous code (like tar extractors or decompressors) to consume async streaming data.
7-7: LGTM! Public API surface expansion is well-aligned.The re-exports appropriately expose the new OCI functionality alongside the existing URL-based flash operations.
src/fls/mod.rs (1)
3-19: LGTM! Module organization supports the new OCI functionality.The module structure cleanly separates OCI-related functionality while maintaining backward compatibility with existing HTTP-based flash operations.
src/fls/from_url.rs (3)
10-18: LGTM! Clean options refactoring.The conversion from
BlockFlashOptionstoHttpClientOptionsfollows the new options structure pattern and maintains backward compatibility.
32-44: LGTM! Consistent use of nested options structure.All references correctly use the
options.common.*pattern, which provides better separation between common flash options and HTTP-specific options.
167-203: LGTM! Enhanced retry logic with error-specific delays.The retry logic now respects per-error suggested delays (e.g., for rate limiting) while falling back to configured defaults. This is a good improvement for handling different types of transient failures.
src/main.rs (2)
59-67: LGTM! Good security practice for password handling.Using the
FLS_REGISTRY_PASSWORDenvironment variable for the password (line 63) follows security best practices by avoiding password exposure in command history or process listings.
102-155: LGTM! Clean OCI path implementation.The OCI-specific flow is well-structured with appropriate credential handling, debug output, and option construction. The separation from the HTTP path enhances maintainability.
src/fls/oci/from_oci.rs (2)
1114-1114: Blocking task uses runtime handle to bridge to async - correct pattern.The blocking task correctly captures the runtime handle (line 1089) and uses
block_onto send to the async channel. This is the proper way to bridge synchronous decompression code with async I/O.
1261-1299: Well-structured compression handling with proper fallbacks.The layered approach to handling both manifest-declared compression (LayerCompression) and content-detected compression (Compression) is well-designed. XZ content is correctly passed through raw to the external decompressor, while gzip is decompressed in-process before tar extraction.
src/fls/oci/manifest.rs (1)
225-287: Test coverage is adequate for initial implementation.The tests validate basic parsing of image manifests and indices, including platform selection. Consider adding tests for multi-layer images and error cases in future iterations.
src/fls/options.rs (2)
55-62: OciOptions design is well-structured for OCI workflows.The composition pattern with
common: FlashOptionsenables code reuse, and the optional auth fields properly support both public and private registries. Thefile_patternfield provides flexibility for selecting specific disk images from multi-file tar archives.
64-92: HttpClientOptions abstraction enables clean HTTP client setup.The subset pattern for
HttpClientOptionsextracts only the relevant fields for HTTP client configuration, and theFromtrait implementations provide clean conversion from different option types. This avoids coupling HTTP client code to full option structs.
src/main.rs
Outdated
// Detect URL scheme to determine handler
let is_oci = url.starts_with("oci://");

if debug {
    eprintln!("[DEBUG] URL: '{}', is_oci: {}", url, is_oci);
}
println!("  Ignore certificates: {}", insecure_tls);
println!("  Buffer size: {} MB", buffer_size);
println!("  Write buffer size: {} MB", write_buffer_size);
println!("  Max retries: {}", max_retries);
println!("  Retry delay: {} seconds", retry_delay);
println!("  Debug: {}", debug);
println!("  O_DIRECT mode: {}", o_direct);

// Parse headers in the format "Header: value"
let parsed_headers: Vec<(String, String)> = headers
    .iter()
    .filter_map(|h| {
        let parts: Vec<&str> = h.splitn(2, ':').collect();
        if parts.len() == 2 {
            Some((parts[0].trim().to_string(), parts[1].trim().to_string()))
        } else {
            eprintln!("Warning: Ignoring invalid header format: {}", h);
            None
        }
    })
    .collect();
if is_oci {
    // OCI image - strip scheme prefix
    let image_ref = url.strip_prefix("oci://").unwrap();
Validate that image reference is non-empty.
After stripping the oci:// prefix, the image reference could be empty if the URL is exactly "oci://". This would pass the starts_with check but result in an empty string that might cause issues downstream in flash_from_oci.
🛡️ Proposed validation
if is_oci {
// OCI image - strip scheme prefix
let image_ref = url.strip_prefix("oci://").unwrap();
+
+ if image_ref.is_empty() {
+ eprintln!("Error: OCI image reference cannot be empty");
+ println!("Result: FLASH_FAILED");
+ std::process::exit(1);
+ }
    println!("OCI flash command:");
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
Original:
// Detect URL scheme to determine handler
let is_oci = url.starts_with("oci://");
if debug {
    eprintln!("[DEBUG] URL: '{}', is_oci: {}", url, is_oci);
}
println!("  Ignore certificates: {}", insecure_tls);
println!("  Buffer size: {} MB", buffer_size);
println!("  Write buffer size: {} MB", write_buffer_size);
println!("  Max retries: {}", max_retries);
println!("  Retry delay: {} seconds", retry_delay);
println!("  Debug: {}", debug);
println!("  O_DIRECT mode: {}", o_direct);
// Parse headers in the format "Header: value"
let parsed_headers: Vec<(String, String)> = headers
    .iter()
    .filter_map(|h| {
        let parts: Vec<&str> = h.splitn(2, ':').collect();
        if parts.len() == 2 {
            Some((parts[0].trim().to_string(), parts[1].trim().to_string()))
        } else {
            eprintln!("Warning: Ignoring invalid header format: {}", h);
            None
        }
    })
    .collect();
if is_oci {
    // OCI image - strip scheme prefix
    let image_ref = url.strip_prefix("oci://").unwrap();

Suggested:
// Detect URL scheme to determine handler
let is_oci = url.starts_with("oci://");
if debug {
    eprintln!("[DEBUG] URL: '{}', is_oci: {}", url, is_oci);
}
if is_oci {
    // OCI image - strip scheme prefix
    let image_ref = url.strip_prefix("oci://").unwrap();
    if image_ref.is_empty() {
        eprintln!("Error: OCI image reference cannot be empty");
        println!("Result: FLASH_FAILED");
        std::process::exit(1);
    }
🤖 Prompt for AI Agents
In @src/main.rs around lines 95 - 104, After detecting is_oci and obtaining
image_ref via url.strip_prefix("oci://").unwrap(), validate that image_ref is
not empty and return or error early if it is; update the OCI branch (where
image_ref is used and passed to flash_from_oci) to check image_ref.is_empty()
and produce a clear error/exit instead of calling flash_from_oci with an empty
string so downstream code cannot receive an invalid image reference.
Actionable comments posted: 3
🤖 Fix all issues with AI agents
In @src/fls/from_url.rs:
- Line 593: Replace the previous inline final statistics printing in
src/fls/from_url.rs with a single call to
progress.print_final_stats_with_ratio(); ensure all duplicated inline printing
logic is removed and consolidated to use the method in src/fls/progress.rs so
final stats are printed via progress.print_final_stats_with_ratio().
In @src/fls/progress.rs:
- Around line 275-330: No changes required: the new print_final_stats and
print_final_stats_with_ratio methods correctly consolidate final-stat printing
and handle the compression-ratio division-by-zero via the guard using
stats.mb_received > 0.0, so leave flash.rs as-is.
🧹 Nitpick comments (3)
src/fls/oci/manifest.rs (2)
139-142: Multi-layer handling assumes last layer contains the disk image. For images with multiple layers, `get_single_layer` returns the last layer based on the assumption that it "typically contains the actual content" (line 141). This may not always be correct for all OCI image formats.
Consider adding a user-facing warning when multiple layers are detected, or provide a CLI option to select a specific layer by index.
💡 Suggested improvement
  Manifest::Image(ref m) => {
      if m.layers.is_empty() {
          Err("Manifest has no layers".to_string())
      } else if m.layers.len() > 1 {
+         eprintln!(
+             "Warning: Image has {} layers. Using the last layer. \
+              If this is incorrect, please specify a file pattern.",
+             m.layers.len()
+         );
          // For multi-layer images, we take the last layer
          // which typically contains the actual content
          Ok(m.layers.last().unwrap())
176-180: Platform selection prefers arm64 over amd64. The `find_linux_manifest` method prioritizes `linux/arm64` over `linux/amd64` (line 178), which may be unexpected for users on x86_64 systems. While the comment notes this is "common for embedded," desktop/server users might expect amd64 to be tried first.
Consider either:
- Detecting the host architecture and preferring that platform
- Adding a CLI option to specify the desired platform
- Documenting this preference clearly in the help text
🔄 Suggested improvement
  /// Find a manifest for linux/amd64 or linux/arm64
+ ///
+ /// Note: Prefers arm64 first, as this is common for embedded devices.
+ /// Desktop/server users on x86_64 may want to specify platform explicitly.
  pub fn find_linux_manifest(&self) -> Option<&Descriptor> {
-     // Try arm64 first (common for embedded), then amd64
-     self.find_platform("linux", "arm64")
-         .or_else(|| self.find_platform("linux", "amd64"))
+     // Try detecting host architecture
+     #[cfg(target_arch = "aarch64")]
+     {
+         self.find_platform("linux", "arm64")
+             .or_else(|| self.find_platform("linux", "amd64"))
+     }
+     #[cfg(not(target_arch = "aarch64"))]
+     {
+         self.find_platform("linux", "amd64")
+             .or_else(|| self.find_platform("linux", "arm64"))
+     }
  }
src/fls/options.rs (1)
55-62: Add documentation for the `file_pattern` field. The `file_pattern` field lacks documentation explaining:
- What it's used for (filtering files in OCI layers?)
- Expected format/syntax (regex, glob, exact match?)
- Whether it's required or optional behavior when None
Consider adding inline documentation or expanding the struct-level doc comment.
Verify: Is a Default implementation needed for OciOptions?
`BlockFlashOptions` provides a `Default` implementation, but `OciOptions` does not. This inconsistency may be intentional if credentials are always required, but it could also make the API less ergonomic.
Should `OciOptions` have a sensible default (with `None` for optional fields), or is the current design intentional?
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (15)
- src/fls/compression.rs
- src/fls/from_url.rs
- src/fls/http.rs
- src/fls/magic_bytes.rs
- src/fls/mod.rs
- src/fls/oci/from_oci.rs
- src/fls/oci/manifest.rs
- src/fls/oci/mod.rs
- src/fls/oci/registry.rs
- src/fls/options.rs
- src/fls/progress.rs
- src/fls/stream_utils.rs
- src/lib.rs
- src/main.rs
- tests/from_url.rs
🚧 Files skipped from review as they are similar to previous changes (4)
- src/fls/stream_utils.rs
- tests/from_url.rs
- src/fls/oci/mod.rs
- src/lib.rs
🧰 Additional context used
🧬 Code graph analysis (7)
src/main.rs (3)
src/fls/oci/registry.rs (1)
image_ref(239-241)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-452)
src/fls/from_url.rs (1)
flash_from_url(13-596)
src/fls/from_url.rs (3)
src/fls/http.rs (2)
setup_http_client(6-75)
start_download(77-168)
src/fls/oci/registry.rs (1)
new(25-42)
src/fls/progress.rs (1)
new(84-104)
src/fls/oci/registry.rs (4)
src/fls/oci/auth.rs (3)
request_token(201-245)
new(20-28)
parse(55-74)
src/fls/http.rs (1)
setup_http_client(6-75)
src/fls/oci/manifest.rs (1)
parse(103-130)
src/fls/oci/reference.rs (4)
parse(55-81)
registry_url(97-104)
api_repository(84-86)
reference_string(89-94)
src/fls/options.rs (1)
src/fls/oci/registry.rs (1)
new(25-42)
src/fls/magic_bytes.rs (1)
src/fls/oci/manifest.rs (1)
compression(196-204)
src/fls/mod.rs (3)
src/fls/oci/manifest.rs (1)
compression(196-204)
src/fls/from_url.rs (1)
flash_from_url(13-596)
src/fls/oci/from_oci.rs (1)
flash_from_oci(27-452)
src/fls/oci/manifest.rs (3)
src/fls/oci/auth.rs (2)
parse(55-74)
new(20-28)
src/fls/oci/reference.rs (1)
parse(55-81)
src/fls/stream_utils.rs (1)
new(21-27)
🔇 Additional comments (12)
src/fls/compression.rs (1)
1-28: LGTM! Clean compression type abstraction. The new Compression enum provides a clean, unified way to handle compression types across OCI and URL-based operations. The implementation is straightforward with appropriate derives and a clear Display implementation.
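As a rough illustration of such an enum (the variant names are assumed from the formats discussed in this PR, not copied from the file):

```rust
use std::fmt;

/// Illustrative compression-type enum; the real module may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Compression {
    None,
    Gzip,
    Xz,
    Zstd,
}

impl fmt::Display for Compression {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        let name = match self {
            Compression::None => "none",
            Compression::Gzip => "gzip",
            Compression::Xz => "xz",
            Compression::Zstd => "zstd",
        };
        write!(f, "{}", name)
    }
}

fn main() {
    println!("layer compression: {}", Compression::Gzip);
}
```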
src/fls/mod.rs (1)
1-38: LGTM! Well-organized module structure for OCI support. The module organization cleanly integrates the new OCI functionality with appropriate visibility controls. The expanded re-exports (line 19) properly expose the new `HttpClientOptions` and `OciOptions` types alongside the existing `BlockFlashOptions`, supporting the refactored options architecture.
src/fls/http.rs (1)
1-168: Clean refactoring to HttpClientOptions. The transition from `BlockFlashOptions` to `HttpClientOptions` at lines 2 and 7 properly separates HTTP-specific configuration from block device options. The debug output formatting improvements (lines 47-50, 128-132) enhance readability. All field accesses are updated consistently throughout the function.
src/fls/from_url.rs (4)
10-18: Well-integrated options refactoring. The transition to `HttpClientOptions` (lines 10, 17-18) cleanly separates HTTP client configuration from the broader `BlockFlashOptions`. The `Into` trait conversion at line 17 properly leverages the `From` implementation mentioned in the AI summary.
32-44: Consistent use of hierarchical options structure. The refactoring to access device and writer configuration through `options.common.*` is applied consistently. This properly reflects the new FlashOptions structure with common settings separated from operation-specific options.
92-104: Progress and buffer options properly accessed. All progress tracking and buffering configuration correctly uses the `options.common.*` pattern, maintaining consistency with the refactored options architecture.
167-203: Appropriate separation of retry and header options. The code correctly accesses `options.headers`, `options.max_retries`, and `options.retry_delay_secs` directly (not through `.common`), indicating these are operation-specific settings that belong at the `BlockFlashOptions` level rather than in shared `FlashOptions`.
src/fls/oci/from_oci.rs (1)
767-786: TeeReader path construction is only used in debug mode. The `TeeReader` at line 768 creates a file at a hardcoded path. However, this is only instantiated when `debug` mode is enabled (line 482), which limits the security risk to debug/development scenarios.
While a past review comment flagged the predictable path as a security concern, the limited scope (debug mode only) makes this acceptable for the current implementation. For production hardening, consider using secure temporary files even in debug mode.
src/fls/options.rs (4)
3-33: LGTM! Clean base options structure. The `FlashOptions` struct provides a solid foundation with sensible defaults for common flash operation parameters.
35-53: LGTM! Good composition pattern. The `BlockFlashOptions` struct cleanly separates HTTP-specific concerns from common options using composition. Default values for retries are sensible.
64-70: LGTM! Clean separation of HTTP client concerns. The `HttpClientOptions` struct appropriately extracts only the fields needed for HTTP client setup, following good separation of concerns.
72-92: LGTM! Excellent use of delegation pattern. The `From` trait implementations cleanly delegate through the `common` field for composed types, avoiding code duplication while maintaining consistency.
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
Actionable comments posted: 0
🧹 Nitpick comments (2)
src/fls/oci/from_oci.rs (2)
27-462: Consider decomposing this large function for maintainability. The `flash_from_oci` function spans 435 lines and orchestrates multiple complex workflows (authentication, manifest resolution, pipeline setup, download coordination, progress tracking, and cleanup). While functionally correct, extracting logical subsections into helper functions would improve testability and readability.
For example, consider extracting:
- Manifest resolution logic (lines 40-64)
- Pipeline setup (lines 156-265)
- Download coordination (lines 309-379)
943-1259: Consider extracting helper functions to reduce complexity. The `flash_raw_disk_image_directly` function is 315 lines long and handles multiple concerns: progress setup, pipeline configuration, external decompressor management, in-process decompression, and download coordination.
Consider extracting logical subsections such as:
- External decompressor pipeline setup (lines 994-1088)
- In-process decompression pipeline setup (lines 1089-1146)
- Download loop coordination (lines 1157-1213)
This would improve testability and make the code easier to maintain.
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
src/fls/oci/from_oci.rs
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
- GitHub Check: Build (aarch64-unknown-linux-musl)
🔇 Additional comments (4)
src/fls/oci/from_oci.rs (4)
571-645: LGTM! The decompressor hint selection logic correctly handles various compression and content type combinations. The pattern matching functions have appropriate guards (skipping hidden files) and documented limitations.
647-941: LGTM! Debug logging properly gated. All reader implementations correctly gate debug output behind the `debug` field, addressing previous review concerns. The `MagicDetectingReader` peek buffer management is correct, and all readers properly implement the `Read` trait.
1017-1032: LGTM! Proper error handling for fd duplication. The unsafe `libc::dup` call now correctly checks for errors (lines 1023-1025) and returns a proper error instead of proceeding with an invalid file descriptor. The non-unix platform handling also appropriately returns an error rather than causing a compile-time failure.
1261-1315: LGTM! Correct compression handling before tar extraction. The function correctly handles both manifest-declared compression (`LayerCompression`) and content-detected compression, applying appropriate decompression before tar extraction. The special handling for XZ (passing raw data to the main pipeline) aligns with the overall architecture.
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com> Assisted-by: claude-sonnet-4.5
Actionable comments posted: 2
🤖 Fix all issues with AI agents
In @src/fls/oci/from_oci.rs:
- Around line 832-846: The multiplication (buffer_size_mb * 1024) can overflow;
update the computation of buffer_capacity so it uses a checked or saturating
multiply and then division to avoid overflow (e.g., use usize::saturating_mul or
checked_mul followed by unwrap_or(MAX) and then divide), assign the safe result
to buffer_capacity, and keep the existing max() logic and debug branch
(references: options.common.buffer_size_mb, avg_chunk_size_kb, buffer_capacity,
LayerCompression::None, options.common.debug).
🧹 Nitpick comments (2)
src/fls/oci/from_oci.rs (2)
244-423: Consider extracting common download coordination logic. The functions `coordinate_download_and_processing` and `coordinate_raw_disk_download` share substantial logic for download loop, progress tracking, timeout handling, and cleanup. While the different parameter types make direct unification challenging, consider extracting shared logic into helper functions to reduce duplication and improve maintainability.
Common patterns that could be extracted:
- Download loop with timeout and progress updates (lines 277-340 and 622-677)
- Progress channel draining (lines 307-318 and 646-655)
- Final stats and cleanup (lines 413-420 and 716-720)
Also applies to: 595-723
772-788: Consider making detection buffer size configurable. The 2MB detection buffer size is hardcoded, which is reasonable for most cases but may be insufficient for certain file formats or excessive for others. Consider adding a configuration option for this value.
This would allow users to tune the detection behavior based on their specific use case and memory constraints.
📜 Review details
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
src/fls/oci/from_oci.rs
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Build (aarch64-unknown-linux-musl)
- GitHub Check: Build (aarch64-unknown-linux-gnu)
🔇 Additional comments (1)
src/fls/oci/from_oci.rs (1)
1-1529: Well-architected streaming pipeline implementation. This is a complex piece of code that handles OCI image streaming, decompression, tar extraction, and block device writing with proper async/sync boundaries, comprehensive error handling, and progress tracking. The overall architecture is sound:
- Clean separation between tar archive and raw disk image paths
- Proper channel-based coordination between async and blocking tasks
- Good error propagation through dedicated error channels
- Debug logging properly gated behind configuration flags
- Appropriate use of timeouts for network operations
The complexity is justified by the requirements, and the code is well-organized despite its length.
let buffer_size_mb = options.common.buffer_size_mb;
let avg_chunk_size_kb = 16;
let mut buffer_capacity = (buffer_size_mb * 1024) / avg_chunk_size_kb;
buffer_capacity = buffer_capacity.max(1000);

// For compressed layers, use much larger buffering to ensure gzip decoder gets enough data
if compression != LayerCompression::None {
    buffer_capacity = buffer_capacity.max(10000); // Ensure substantial buffering for compression
    if options.common.debug {
        eprintln!(
            "[DEBUG] Using enhanced buffering for compressed layer: {} chunks",
            buffer_capacity
        );
    }
}
Buffer capacity calculation could overflow.
The buffer capacity calculation at line 834 performs (buffer_size_mb * 1024) / avg_chunk_size_kb which could theoretically overflow if buffer_size_mb is very large (e.g., close to usize::MAX / 1024). While unlikely in practice, consider adding overflow protection.
🔎 Proposed fix
let buffer_size_mb = options.common.buffer_size_mb;
let avg_chunk_size_kb = 16;
- let mut buffer_capacity = (buffer_size_mb * 1024) / avg_chunk_size_kb;
+ let mut buffer_capacity = buffer_size_mb
+ .saturating_mul(1024)
+ .checked_div(avg_chunk_size_kb)
+ .unwrap_or(1000);
buffer_capacity = buffer_capacity.max(1000);🤖 Prompt for AI Agents
In @src/fls/oci/from_oci.rs around lines 832 - 846, The multiplication
(buffer_size_mb * 1024) can overflow; update the computation of buffer_capacity
so it uses a checked or saturating multiply and then division to avoid overflow
(e.g., use usize::saturating_mul or checked_mul followed by unwrap_or(MAX) and
then divide), assign the safe result to buffer_capacity, and keep the existing
max() logic and debug branch (references: options.common.buffer_size_mb,
avg_chunk_size_kb, buffer_capacity, LayerCompression::None,
options.common.debug).
let reader: Box<dyn Read + Send> = if is_gzipped {
    if debug {
        eprintln!("[DEBUG] Auto-detected gzip compression from magic bytes");
        // Save first 1MB of raw gzip data for debugging
        let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024);
        let gz_decoder = GzDecoder::new(tee_reader);
        Box::new(DebugGzReader::new(gz_decoder, debug))
    } else {
        let gz_decoder = GzDecoder::new(magic_reader);
        Box::new(gz_decoder)
    }
Security: Hardcoded /tmp path still present.
The TeeReader at line 944 still uses the hardcoded path /tmp/debug_gzip_data.gz, which creates a symlink attack vector on multi-user systems. While this path is only used in debug mode, it still poses a security risk.
A past review comment indicated this was addressed, but the hardcoded path remains in the code.
🔒 Recommended fix using tempfile crate
let reader: Box<dyn Read + Send> = if is_gzipped {
if debug {
eprintln!("[DEBUG] Auto-detected gzip compression from magic bytes");
- // Save first 1MB of raw gzip data for debugging
- let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024);
+ // Save first 1MB of raw gzip data for debugging (use secure temp file)
+ use std::sync::OnceLock;
+ static TEMP_PATH: OnceLock<String> = OnceLock::new();
+ let temp_path = TEMP_PATH.get_or_init(|| {
+ let temp_dir = std::env::temp_dir();
+ temp_dir.join(format!("fls_debug_gzip_{}.gz", std::process::id()))
+ .to_string_lossy().to_string()
+ });
+ let tee_reader = TeeReader::new(magic_reader, temp_path, 1024 * 1024);
let gz_decoder = GzDecoder::new(tee_reader);
Box::new(DebugGzReader::new(gz_decoder, debug))Note: Using tempfile::NamedTempFile would be better but requires careful lifetime management since the TeeReader needs the path to remain valid.
Actionable comments posted: 0
🧹 Nitpick comments (5)
src/fls/progress.rs (1)
251-306: LGTM! Consider consolidating the two print methods. Both methods work correctly and the compression ratio division is properly guarded. The functionality is sound.
The two methods share significant code (printing download, decompression, write, and total stats). Consider refactoring to reduce duplication, such as adding a parameter to control whether to print the compression ratio, or extracting common logic to a helper.
The two methods share significant code (printing download, decompression, write, and total stats). Consider refactoring to reduce duplication, such as adding a parameter to control whether to print the compression ratio, or extracting common logic to a helper.
♻️ Optional refactoring to reduce code duplication
- /// Print final statistics to stdout - /// - /// This consolidates the common stats printing pattern used by both - /// URL and OCI flash operations. - pub(crate) fn print_final_stats(&self) { + /// Print final statistics to stdout + /// + /// This consolidates the common stats printing pattern used by both + /// URL and OCI flash operations. + pub(crate) fn print_final_stats(&self) { + self.print_final_stats_impl(false); + } + + /// Print final statistics including compression ratio (for URL flash) + pub(crate) fn print_final_stats_with_ratio(&self) { + self.print_final_stats_impl(true); + } + + fn print_final_stats_impl(&self, include_ratio: bool) { let stats = self.final_stats(); println!( "\nDownload complete: {:.2} MB in {} ({:.2} MB/s)", stats.mb_received, stats.download_time_formatted(), stats.download_rate ); println!( "Decompression complete: {:.2} MB in {} ({:.2} MB/s)", stats.mb_decompressed, stats.decompress_time_formatted(), stats.decompress_rate ); println!( "Write complete: {:.2} MB in {} ({:.2} MB/s)", stats.mb_written, stats.write_time_formatted(), stats.write_rate ); - println!("Total flash runtime: {}", stats.total_time_formatted()); - } - - /// Print final statistics including compression ratio (for URL flash) - pub(crate) fn print_final_stats_with_ratio(&self) { - let stats = self.final_stats(); - println!( - "\nDownload complete: {:.2} MB in {} ({:.2} MB/s)", - stats.mb_received, - stats.download_time_formatted(), - stats.download_rate - ); - println!( - "Decompression complete: {:.2} MB in {} ({:.2} MB/s)", - stats.mb_decompressed, - stats.decompress_time_formatted(), - stats.decompress_rate - ); - println!( - "Write complete: {:.2} MB in {} ({:.2} MB/s)", - stats.mb_written, - stats.write_time_formatted(), - stats.write_rate - ); - if stats.mb_received > 0.0 { + if include_ratio && stats.mb_received > 0.0 { println!( "Compression ratio: {:.2}x", stats.mb_decompressed / stats.mb_received ); } println!("Total flash runtime: {}", stats.total_time_formatted()); }src/fls/oci/from_oci.rs (3)
544-578: Calling `block_on` from a blocking task on the same runtime. At line 544, the current runtime handle is captured, and at line 569, `rt.block_on()` is called from within a `spawn_blocking` task to send data to an async channel. While this generally works with Tokio's multi-threaded runtime, it's not the recommended pattern and could cause issues in certain runtime configurations (e.g., a current-thread runtime or when the blocking thread pool is exhausted).
♻️ Consider using `Handle::block_on` or a sync channel
rt.block_on()is called from within aspawn_blockingtask to send data to an async channel. While this generally works with Tokio's multi-threaded runtime, it's not the recommended pattern and could cause issues in certain runtime configurations (e.g., current-thread runtime or when blocking thread pool is exhausted).♻️ Consider using `Handle::block_on` or a sync channel
Option 1: Use a synchronous crossbeam channel instead of async mpsc:
// At function signature level, use crossbeam_channel instead let (data_tx, data_rx) = crossbeam_channel::bounded(16); // In blocking task: if data_tx.send(buffer[..n].to_vec()).is_err() { return Err("Data channel closed".to_string()); }Option 2: Keep the current approach but document the runtime assumption:
// Capture the runtime handle before entering the blocking task
let rt = tokio::runtime::Handle::current();

// In the blocking task, add a comment:
// NOTE: Using block_on from blocking context. This is safe with
// multi-threaded runtime but requires blocking thread pool to have capacity.
if rt.block_on(data_tx.send(buffer[..n].to_vec())).is_err() {
    return Err("Data channel closed".to_string());
}
364-390: Indefinite wait for decompressor process without timeout.The loop waits for the decompressor process to exit using
try_wait()with 100ms sleeps. If the decompressor process hangs or blocks indefinitely, this loop will never complete. Consider adding a timeout to prevent indefinite hangs.⏱️ Add timeout for decompressor wait
+ let wait_start = std::time::Instant::now();
+ let max_wait = Duration::from_secs(300); // 5 minutes timeout
+
  loop {
      while let Ok(byte_count) = params.decompressed_progress_rx.try_recv() {
          progress.bytes_decompressed += byte_count;
      }
      while let Ok(written_bytes) = params.written_progress_rx.try_recv() {
          progress.bytes_written = written_bytes;
      }
      let _ = progress.update_progress(None, update_interval, false);
      match decompressor.try_wait() {
          Ok(Some(status)) => {
              if !status.success() {
                  return Err(format!(
                      "{} failed with status: {:?}",
                      context.decompressor_name,
                      status.code()
                  )
                  .into());
              }
              break;
          }
          Ok(None) => {
+             if wait_start.elapsed() > max_wait {
+                 let _ = decompressor.kill();
+                 return Err(format!(
+                     "{} timed out after {} seconds",
+                     context.decompressor_name,
+                     max_wait.as_secs()
+                 ).into());
+             }
              tokio::time::sleep(Duration::from_millis(100)).await;
          }
          Err(e) => return Err(e.into()),
      }
  }
944-944: Hardcoded debug file path may not be portable. The debug TeeReader writes to `/tmp/debug_gzip_data.gz`, which won't work on Windows and may fail if `/tmp` doesn't exist or isn't writable. While the code handles errors gracefully (lines 1229-1232), consider using a more portable path.
🔧 Use std::env::temp_dir() for portability
let reader: Box<dyn Read + Send> = if is_gzipped { if debug { eprintln!("[DEBUG] Auto-detected gzip compression from magic bytes"); - // Save first 1MB of raw gzip data for debugging - let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024); + // Save first 1MB of raw gzip data for debugging + let debug_path = std::env::temp_dir().join("fls_debug_gzip_data.gz"); + let tee_reader = TeeReader::new( + magic_reader, + &debug_path.to_string_lossy(), + 1024 * 1024 + ); let gz_decoder = GzDecoder::new(tee_reader); Box::new(DebugGzReader::new(gz_decoder, debug)) } else {This uses the system's temporary directory which works across platforms.
src/main.rs (1)
102-105: Use `.expect()` instead of `.unwrap()` for better error context.

While the `.unwrap()` at line 104 is safe (because line 96 guarantees the prefix exists), using `.expect()` with a descriptive message makes the code more maintainable and provides better error context if the logic changes.

📝 Suggested improvement

```diff
-            // OCI image - strip scheme prefix
-            let image_ref = url.strip_prefix("oci://").unwrap();
+            // OCI image - strip scheme prefix (guaranteed by check above)
+            let image_ref = url
+                .strip_prefix("oci://")
+                .expect("BUG: oci:// prefix should exist after starts_with check");
```
🔇 Additional comments (8)
src/fls/progress.rs (1)
223-249: LGTM! Clean refactoring improves readability.

The refactoring to use named variables and a unified rate helper closure makes the code more readable and maintainable. The division-by-zero protection in the rate helper ensures safe calculations.
src/fls/oci/from_oci.rs (2)
757-759: Clear error message for unsupported compression.

The code explicitly rejects Zstd-compressed layers with a clear error message. This is good practice for unsupported features.

452-495: Potential file descriptor leak on error path.

Between duplicating the fd at line 457 and spawning the blocking task at line 473, if an error occurs (e.g., `spawn_blocking` panics or fails to schedule), the duplicated fd (`dup_fd`) may leak. The original async `decompressor_stdin` is dropped at line 470, but the `dup_fd` is moved into the closure and may not be properly cleaned up if the spawn fails.

🔧 Suggested improvement: Wrap fd in RAII guard

Consider wrapping the duplicated fd in an RAII guard (like `OwnedFd` from `std::os::unix::io`) to ensure it's closed even if `spawn_blocking` fails:

```diff
 #[cfg(unix)]
 let stdin_fd = {
     use std::os::unix::io::{AsRawFd, FromRawFd};
     let raw_fd = decompressor_stdin.as_raw_fd();
-    // Duplicate the fd so we can use it in blocking context
     let dup_fd = unsafe { libc::dup(raw_fd) };
     if dup_fd == -1 {
         return Err(std::io::Error::last_os_error().into());
     }
-    // SAFETY: dup_fd is a valid file descriptor (we checked above)
-    unsafe { std::fs::File::from_raw_fd(dup_fd) }
+    // SAFETY: dup_fd is valid (we checked != -1)
+    // Using File ensures the fd is closed on drop even if spawn_blocking fails
+    unsafe { std::fs::File::from_raw_fd(dup_fd) }
 };
```

The current code is mostly safe because `spawn_blocking` typically doesn't fail, but explicit RAII is more robust.

Likely an incorrect or invalid review comment.
src/main.rs (1)
172-199: Well-structured header parsing with appropriate validation.

The header parsing logic correctly handles various edge cases (invalid format, empty names, empty values) with appropriate warnings. Allowing empty header values (lines 188-190) is correct, as they're valid in HTTP.
src/fls/oci/reference.rs (4)
107-148: Excellent digest validation following OCI specification.

The digest validation is thorough and correctly enforces OCI spec requirements:
- Validates algorithm (sha256/sha512)
- Enforces exact hex length (64 for sha256, 128 for sha512)
- Requires lowercase hex characters per OCI spec
This prevents digest-related vulnerabilities and ensures compliance.
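For illustration, those rules boil down to something like the following sketch (a simplified stand-in, not the crate's actual implementation):

```rust
// Simplified sketch of the OCI digest rules described above.
fn check_digest(digest: &str) -> Result<(), String> {
    let (algo, hex) = digest
        .split_once(':')
        .ok_or_else(|| "digest must be <algorithm>:<hex>".to_string())?;
    let expected_len = match algo {
        "sha256" => 64,
        "sha512" => 128,
        other => return Err(format!("unsupported digest algorithm: {other}")),
    };
    if hex.len() != expected_len {
        return Err(format!("{algo} digest must be {expected_len} hex chars"));
    }
    // OCI requires lowercase hex: [a-f0-9] only
    if !hex.bytes().all(|b| b.is_ascii_digit() || (b'a'..=b'f').contains(&b)) {
        return Err("digest hex must be lowercase [a-f0-9]".to_string());
    }
    Ok(())
}
```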
156-189: Well-designed heuristics for parsing registry ports vs. image tags.

The function handles the ambiguous case of distinguishing `registry:port/repo` from `image:tag` using reasonable heuristics:
- Slash after colon indicates a port
- Slash before colon indicates a tag
- Short all-digit suffix with hostname-like prefix indicates a port
The logic is well-tested (lines 247-287) and handles edge cases correctly.
192-217: Correct implementation of Docker Hub naming conventions.

The function properly implements Docker Hub defaults:

- Bare names → `docker.io/library/name`
- Namespace/repo → `docker.io/namespace/repo`
- Registry detection via dots, colons, or "localhost"
This matches Docker's standard behavior.
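A quick illustration of how those defaults play out, assuming the parser type here is called `ImageReference` and its fields are readable (the crate's actual accessors may differ):

```rust
// Illustrative only: expected normalization per the conventions above.
let r = ImageReference::parse("nginx").unwrap();
assert_eq!(r.registry, "docker.io");
assert_eq!(r.repository, "library/nginx");

let r = ImageReference::parse("quay.io/org/app").unwrap();
assert_eq!(r.registry, "quay.io");
assert_eq!(r.repository, "org/app");
```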
219-322: Comprehensive test coverage for image reference parsing.

The test suite covers:
- Various reference formats (full, Docker Hub, with ports)
- Digest parsing and validation
- Default tag behavior
- Error cases (invalid digests, wrong algorithm)
- Edge cases (numeric tags, registry ports)
This provides good confidence in the parsing logic.
Actionable comments posted: 3
🤖 Fix all issues with AI agents
In @src/fls/oci/from_oci.rs:
- Around line 516-517: The code currently discards the result of awaiting
stdin_writer_handle (let _ = stdin_writer_handle.await), which hides any errors
from the stdin writer task; change this to explicitly await and handle the
JoinHandle result and the inner task result (e.g., match or use ?), converting
JoinError or the inner Result::Err into a proper Err return so failures (broken
pipe, write errors) propagate instead of being ignored; update the call site
that spawns the stdin writer (stdin_writer_handle) to ensure the task returns a
Result type if it doesn't already and propagate that error up from the enclosing
function.
- Around line 813-825: The XZ-branch unconditionally bypasses tar extraction;
change the condition so it only bypasses when the detected content is NOT a tar
archive. Specifically, in the block that currently checks compression_type ==
Compression::Xz (around flash_raw_disk_image_directly call), add a guard
checking the content type (e.g., content_type != ContentType::TarArchive or
similar symbol used for tar detection from content_detection_buffer) and only
call flash_raw_disk_image_directly when the content is not a tar archive;
otherwise let the code proceed to decompress XZ then run tar extraction to
locate the disk image.
- Around line 950-962: The code incorrectly bypasses tar extraction for
Compression::Xz and passes raw XZ data into tar::Archive::new() via
magic_reader; either remove that bypass so extract_tar_stream_impl is invoked
for Compression::Xz, or explicitly decompress XZ before creating the tar
archive; concretely, update the Compression::Xz branch (where
extract_tar_stream_impl is skipped) to treat XZ like gzip, and in the path that
currently does Box::new(magic_reader) when is_xz is true, wrap magic_reader with
an XZ decoder (e.g., xz2::read::XzDecoder or equivalent) before passing to
tar::Archive::new() (or call extract_tar_stream_impl with a decompressed
reader), and ensure any debug messages reflect that XZ tar extraction is
performed; alternatively, if XZ-supported tar.xz is intentionally unsupported,
make the bypass explicit in the Compression::Xz handling and add a failing
error/log there instead of leaking raw XZ into tar::Archive::new().
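For the tar.xz prompt above, the fix could look roughly like this sketch (using the `xz2` crate the prompt already names; `magic_reader` and `is_xz` stand in for the surrounding context):

```rust
use std::io::Read;

// Sketch: decompress XZ before handing the stream to the tar reader,
// mirroring how the gzip path already works.
fn open_archive<R: Read + Send + 'static>(
    magic_reader: R,
    is_xz: bool,
) -> tar::Archive<Box<dyn Read + Send>> {
    let reader: Box<dyn Read + Send> = if is_xz {
        Box::new(xz2::read::XzDecoder::new(magic_reader))
    } else {
        Box::new(magic_reader)
    };
    tar::Archive::new(reader)
}
```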
🧹 Nitpick comments (2)
src/fls/oci/from_oci.rs (2)
1399-1404: Remove unused progress tracking variables.

Lines 1399-1404 initialize `_progress` and `_update_interval`, but these variables are never used. Progress tracking is handled within `coordinate_raw_disk_download` (called at line 1467). Remove these dead code lines to improve clarity.

♻️ Proposed cleanup

```diff
     println!("Raw disk image detected - streaming directly to device");
     println!("Compression: {:?}", compression_type);
     println!("Opening device: {}", options.common.device);

-    // Create progress tracking
-    let mut _progress =
-        ProgressTracker::new(options.common.newline_progress, options.common.show_memory);
-    _progress.set_content_length(Some(layer_size));
-    _progress.set_is_compressed(compression_type != Compression::None);
-    let _update_interval = Duration::from_secs_f64(options.common.progress_interval_secs);
-
     // Set up single-purpose block writer with its own progress channel
     let (raw_written_progress_tx, raw_written_progress_rx) = mpsc::unbounded_channel::<u64>();
```
1-1524: Consider splitting this file for better maintainability.

This file is 1524 lines and handles multiple concerns:
- Pipeline coordination
- Tar extraction
- Decompression strategies
- Debug instrumentation (readers/wrappers)
- Progress tracking
Consider splitting into separate modules (a hypothetical layout is sketched below):

- `from_oci/pipeline.rs` - Pipeline setup and coordination
- `from_oci/tar.rs` - Tar extraction logic
- `from_oci/debug_readers.rs` - Debug reader wrappers (MagicDetectingReader, DebugReader, etc.)

This would improve navigability and make each module more focused.
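The corresponding `mod.rs` might look like this (module names are illustrative, not existing code; a local `tar` module is distinct from the `tar` crate in `use` paths, though a different name such as `extract` would avoid any confusion):

```rust
// Hypothetical src/fls/oci/from_oci/mod.rs for the split suggested above.
mod pipeline;      // pipeline setup and coordination
mod tar;           // tar extraction logic
mod debug_readers; // debug reader wrappers

pub use pipeline::flash_from_oci;
```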
🔇 Additional comments (5)
src/main.rs (1)
174-194: Good improvement to header parsing.

The new header parsing logic using `split_once` is cleaner than the previous approach. The validation for empty header names and the warning for empty values provide better user feedback for configuration issues.

src/fls/oci/reference.rs (3)
111-148: Well-implemented digest validation.

The digest validation correctly enforces OCI specification requirements:
- Supported algorithms (sha256, sha512)
- Exact hex character lengths
- Lowercase hex digits
The error messages are clear and actionable.
156-189: Robust colon disambiguation logic.

The `split_name_and_tag` function handles the tricky problem of distinguishing between `registry:port/repo` and `image:tag` formats. The heuristics are well-reasoned:
- Slash after colon indicates port
- All-numeric short suffix with hostname-like prefix indicates port
- Otherwise treat as tag
The logic appears sound for common cases, though edge cases like single-label registries might behave unexpectedly.
219-322: Excellent test coverage.

The test suite comprehensively covers:
- Various reference formats (registry, Docker Hub, ports, digests)
- Edge cases (numeric tags, port disambiguation)
- Validation failure scenarios (invalid digests)
This provides strong confidence in the parsing implementation.
src/fls/oci/from_oci.rs (1)
452-467: The file descriptor duplication pattern at lines 452-467 is correctly implemented and safe.

The code properly manages the file descriptor lifecycle: it duplicates the fd while `decompressor_stdin` is held, checks for errors via `dup_fd == -1`, takes ownership via `from_raw_fd`, and only then explicitly drops the original handle. The single use of these unsafe patterns in the codebase and the comprehensive error handling confirm there are no leak or premature-closure risks.
Actionable comments posted: 8
🤖 Fix all issues with AI agents
In @src/fls/magic_bytes.rs:
- Around line 28-40: The TAR magic check is wrong: remove the invalid
TAR_MAGIC_POSIX constant and update TAR_MAGIC_USTAR to the 6-byte pattern
b"ustar\0"; then change the slice in is_tar_archive from
TAR_MAGIC_OFFSET..TAR_MAGIC_OFFSET+5 to TAR_MAGIC_OFFSET..TAR_MAGIC_OFFSET+6 and
compare the extracted bytes only against TAR_MAGIC_USTAR (remove the "posix"
alternative) so the function validates the correct "ustar\0" magic.
In @src/fls/oci/from_oci.rs:
- Around line 426-530: The external decompressor child process started in
setup_external_decompressor_pipeline (returned inside
ExternalDecompressorPipeline) is never awaited or its exit status checked, so
coordinate_raw_disk_download (which keeps an external_decompressor in
RawDiskDownloadParams) can silently succeed with corrupted output; fix by
awaiting the child process after the writer_handle completes and verify its exit
code (e.g., after writer_handle.await and before returning success) to surface
non-zero exits or termination errors, returning an Err if the decompressor
process exited unsuccessfully; locate and update usages in
setup_external_decompressor_pipeline, ExternalDecompressorPipeline, and
coordinate_raw_disk_download to join the child (decompressor) and propagate any
non-successful exit.
- Around line 1023-1054: get_decompressor_hint currently returns "disk.img" for
ContentType::TarArchive whenever the layer is gzip (LayerCompression::Gzip)
which ignores the extracted file's extension and risks writing compressed bytes
to the device; update get_decompressor_hint to prefer the actual extracted file
extension when file_pattern is provided (if file_pattern ends_with ".xz" return
"disk.img.xz", if ".gz" return "disk.img.gz", otherwise fall back to "disk.img"
when you know the extracted file is uncompressed or layer/content compression
indicates decompression already happened, and use a conservative default
"disk.img.xz" when unknown) so that setup_tar_processing_pipeline /
start_decompressor_process will select the correct decompressor for embedded
*.img.xz inside gzipped tar layers.
- Around line 532-596: The spawned blocking reader task (reader_handle) can
return a decompression/read error which is currently ignored by the async writer
task (the line "let _ = reader_handle.await;"), causing read failures to be
dropped and the writer to close successfully; change the async task inside
setup_inprocess_decompression_pipeline to await reader_handle and handle both
tokio::task::JoinError and the inner Result::<(), String>: if
reader_handle.await returns an Err or an Ok(Err(String)), convert that into an
std::io::Error (e.g., io::Error::new(io::ErrorKind::Other, msg)) and return it
from the async task immediately instead of proceeding to writer.close(); this
ensures reader read/decompression errors from reader_handle are propagated as
Err(u64, std::io::Error) from the spawned handle.
- Around line 453-468: Replace the unstable std::io::Error::other(...) usage
with the stable alternative: construct errors via
std::io::Error::new(std::io::ErrorKind::Other, ...) where std::io::Error::other
is currently used (e.g., the failing-return sites around the decompressor stdin
handling); alternatively, if you prefer to require Rust 1.76+, set rust-version
= "1.76" in Cargo.toml, but the minimal change is to swap calls to
std::io::Error::other into std::io::Error::new(std::io::ErrorKind::Other,
<message-or-error>) so older toolchains build successfully.
In @src/main.rs:
- Around line 59-67: Update the auth logging so it only claims "Using provided
credentials" when both username and password are present: change the condition
that checks username.is_some() to require username.is_some() &&
password.is_some(), and adjust the alternative message when only one (e.g.,
username present but password None) or neither is present to accurately reflect
that full credentials were not provided; refer to the username and password
fields used in the current auth/logging block and update those messages
accordingly.
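The magic_bytes.rs prompt above amounts to a check like the following sketch (constant names are taken from the prompt; the offset follows the POSIX ustar header layout, with the 6-byte magic "ustar\0" at byte offset 257):

```rust
// Sketch of the corrected tar detection described in the first prompt.
const TAR_MAGIC_OFFSET: usize = 257;
const TAR_MAGIC_USTAR: &[u8; 6] = b"ustar\0";

fn is_tar_archive(buf: &[u8]) -> bool {
    buf.len() >= TAR_MAGIC_OFFSET + TAR_MAGIC_USTAR.len()
        && &buf[TAR_MAGIC_OFFSET..TAR_MAGIC_OFFSET + 6] == TAR_MAGIC_USTAR
}
```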
🧹 Nitpick comments (3)
src/fls/oci/reference.rs (1)
83-86: Fix misleading docstring for `api_repository()` (it does not include the registry).

Line 83 says "(registry/repository)" but lines 84-86 return only `self.repository` (which is correct for `/v2/<repo>/...` paths).

Proposed fix

```diff
-    /// Get the full repository path for API calls (registry/repository)
+    /// Get the repository path for registry API calls (e.g., /v2/<repository>/...)
     pub fn api_repository(&self) -> &str {
         &self.repository
     }
```

src/main.rs (1)
47-49: Consider redacting sensitive headers in CLI output.

Lines 196-201 print header values verbatim; if a user passes `Authorization: Bearer ...`, it'll be echoed to stdout.

Proposed fix (simple redaction)

```diff
 if !parsed_headers.is_empty() {
     println!("  Custom headers:");
     for (name, value) in &parsed_headers {
-        println!("    {}: {}", name, value);
+        let redacted = matches!(
+            name.to_ascii_lowercase().as_str(),
+            "authorization" | "proxy-authorization"
+        );
+        println!("    {}: {}", name, if redacted { "<redacted>" } else { value });
     }
 }
```

Also applies to: 174-201
src/fls/oci/from_oci.rs (1)
943-966: Avoid writing to `/tmp` from library code by default (even in debug).

Lines 947-949 (and TeeReader) write a debug artifact to a fixed path. This is a surprising side effect for a library API and can fail on non-Unix or restricted environments.
Also applies to: 1225-1273
🔇 Additional comments (4)
src/fls/progress.rs (2)
223-249: LGTM! Clean refactoring with good defensive programming.

The `final_stats` method is well-structured:

- Safe duration extraction using `unwrap_or_default()` provides sensible fallbacks
- The inline `rate` helper ensures consistent division-by-zero protection across all rate calculations
- Variable naming (`mb_received`, `mb_decompressed`, `mb_written`) is clear and consistent
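For illustration, the guarded-rate idea referenced above reduces to a one-line helper like this (a sketch, not the crate's exact code):

```rust
// Returns 0.0 instead of dividing by zero when elapsed time is zero.
fn rate(mb: f64, secs: f64) -> f64 {
    if secs > 0.0 { mb / secs } else { 0.0 }
}

fn main() {
    assert_eq!(rate(10.0, 2.5), 4.0); // 4.0 MB/s
    assert_eq!(rate(10.0, 0.0), 0.0); // safe fallback
}
```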
250-292: Excellent consolidation of printing logic.

The new `print_final_stats` methods provide a clean API for final statistics reporting:

- Well-documented with clear purpose
- Proper visibility (`pub(crate)`) for internal crate usage
- Division-by-zero protection on line 285 ensures safe compression ratio calculation
- The pattern of delegating to `print_stats` with a boolean flag is appropriate for this use case

src/main.rs (1)
62-64: The clap "env" feature is already properly enabled.

The `clap` dependency in Cargo.toml already includes the "env" feature: `clap = { version = "4.4", features = ["derive", "env"] }`. No changes are needed.

src/fls/oci/reference.rs (1)
107-148: The digest validation in this function correctly implements OCI Image Spec requirements.

The OCI spec mandates that digests use lowercase algorithms and hex characters only: sha256 must match `/[a-f0-9]{64}/` and sha512 must match `/[a-f0-9]{128}/`. The code's restrictions (sha256/sha512 only, lowercase hex required) are not optional strictness but are required by the specification, so no changes are needed.
```rust
#[cfg(unix)]
let stdin_fd = {
    use std::os::unix::io::{AsRawFd, FromRawFd};
    let raw_fd = decompressor_stdin.as_raw_fd();
    // Duplicate the fd so we can use it in blocking context
    let dup_fd = unsafe { libc::dup(raw_fd) };
    if dup_fd == -1 {
        return Err(std::io::Error::last_os_error().into());
    }
    // SAFETY: dup_fd is a valid file descriptor (we checked above)
    unsafe { std::fs::File::from_raw_fd(dup_fd) }
};
#[cfg(not(unix))]
let stdin_fd: std::fs::File = {
    return Err("XZ streaming decompression is not supported on non-unix platforms".into());
};
```
🧩 Analysis chain
🏁 Script executed:

```bash
#!/bin/bash
set -euo pipefail

# Check tokio version/features and libc dependency
echo "=== Checking Cargo.toml for dependencies ==="
rg -n --hidden -S 'tokio\s*=\s*\{[^}]*\}' Cargo.toml || true
rg -n --hidden -S 'tokio\s*=' Cargo.toml || true
rg -n --hidden -S 'libc\s*=' Cargo.toml || true

# Locate blocking_send usage
echo "=== Checking for blocking_send usage in from_oci.rs ==="
rg -n --hidden -S '\.blocking_send\(' src/fls/oci/from_oci.rs || echo "No blocking_send found"

# Locate std::io::Error::other usage
echo "=== Checking for std::io::Error::other usage ==="
rg -n --hidden -S 'std::io::Error::other' src/fls/oci/from_oci.rs || echo "No std::io::Error::other found"

# Check the actual context around the flagged lines
echo "=== Context around lines 453-495 ==="
sed -n '450,500p' src/fls/oci/from_oci.rs || true
```

Repository: jumpstarter-dev/fls

Length of output: 2756
🏁 Script executed:

```bash
#!/bin/bash
set -euo pipefail

# Check for MSRV in Cargo.toml
echo "=== Checking for MSRV specification ==="
grep -n "rust-version\|msrv" Cargo.toml || echo "No explicit MSRV found"

# Get context around lines 520-521 where std::io::Error::other is used
echo "=== Context around std::io::Error::other usage (lines 515-530) ==="
sed -n '515,530p' src/fls/oci/from_oci.rs

# Check what error handling is used at lines 485-495
echo "=== Context around lines 485-495 (also mentioned in review) ==="
sed -n '485,495p' src/fls/oci/from_oci.rs
```

Repository: jumpstarter-dev/fls

Length of output: 1080
Add MSRV constraint or use stable alternative for std::io::Error::other().
The code uses std::io::Error::other() at lines 520-521, which requires Rust 1.76+. Since the project declares no MSRV, this can break builds for users on older Rust versions. Either set a minimum rust-version = "1.76" in Cargo.toml or replace with the stable alternative std::io::Error::new(std::io::ErrorKind::Other, ...).
The libc::dup() usage (line 460) and tokio::blocking_send() calls are properly supported by the declared dependencies.
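For reference, the two spellings side by side (a throwaway sketch, not project code):

```rust
use std::io;

fn main() {
    // Newer-toolchain spelling flagged by the review:
    let e1 = io::Error::other("decompressor stdin closed");
    // Stable-everywhere alternative suggested above:
    let e2 = io::Error::new(io::ErrorKind::Other, "decompressor stdin closed");
    assert_eq!(e1.kind(), e2.kind());
}
```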
🤖 Prompt for AI Agents
In @src/fls/oci/from_oci.rs around lines 453 - 468, Replace the unstable
std::io::Error::other(...) usage with the stable alternative: construct errors
via std::io::Error::new(std::io::ErrorKind::Other, ...) where
std::io::Error::other is currently used (e.g., the failing-return sites around
the decompressor stdin handling); alternatively, if you prefer to require Rust
1.76+, set rust-version = "1.76" in Cargo.toml, but the minimal change is to
swap calls to std::io::Error::other into
std::io::Error::new(std::io::ErrorKind::Other, <message-or-error>) so older
toolchains build successfully.
```rust
pub fn parse(input: &str) -> Result<Self, String> {
    let input = input.trim();

    if input.is_empty() {
        return Err("Empty image reference".to_string());
    }

    // Split by @ for digest references
    let (name_part, reference) = if let Some(at_pos) = input.rfind('@') {
        let digest = &input[at_pos + 1..];
        validate_digest(digest)?;
        (&input[..at_pos], Reference::Digest(digest.to_string()))
    } else {
        // Split by : for tag, but be careful about registry port
        let (name, tag) = split_name_and_tag(input)?;
        (name, Reference::Tag(tag.unwrap_or("latest").to_string()))
    };

    // Parse registry and repository from name
    let (registry, repository) = parse_registry_and_repo(name_part)?;

    Ok(Self {
        registry,
        repository,
        reference,
    })
}
```
Reject "tag + digest" mixed references (currently parsed into an invalid repository).

With the current flow (lines 62-71), something like `repo:tag@sha256:...` will accept the digest and treat `repo:tag` as the name part, which then flows into `parse_registry_and_repo()` and yields a bogus repository.

Proposed fix (minimal validation for digest refs)

```diff
-        let (name_part, reference) = if let Some(at_pos) = input.rfind('@') {
+        let (name_part, reference) = if let Some(at_pos) = input.rfind('@') {
             let digest = &input[at_pos + 1..];
             validate_digest(digest)?;
-            (&input[..at_pos], Reference::Digest(digest.to_string()))
+            let name_part = &input[..at_pos];
+            // Disallow tag mixed with digest (e.g. repo:tag@sha256:...)
+            // but allow registry ports (e.g. registry:5000/repo@sha256:...)
+            if let Some(last_colon) = name_part.rfind(':') {
+                if let Some(last_slash) = name_part.rfind('/') {
+                    if last_colon > last_slash {
+                        return Err("Invalid reference: cannot combine tag and digest".to_string());
+                    }
+                } else {
+                    return Err("Invalid reference: cannot combine tag and digest".to_string());
+                }
+            }
+            (name_part, Reference::Digest(digest.to_string()))
         } else {
             // Split by : for tag, but be careful about registry port
             let (name, tag) = split_name_and_tag(input)?;
             (name, Reference::Tag(tag.unwrap_or("latest").to_string()))
         };
```

Also applies to: 62-71
Signed-off-by: Benny Zlotnik <bzlotnik@redhat.com>
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In @src/fls/oci/from_oci.rs:
- Around line 978-980: The hardcoded debug file path "/tmp/debug_gzip_data.gz"
used when constructing TeeReader causes collisions and permission issues; change
the DebugGzReader/TeeReader creation to write to a unique temp file instead
(e.g., use std::env::temp_dir() or tempfile::NamedTempFile and include
PID/timestamp or rely on tempfile to create a unique name), pass that temp file
path to TeeReader, and ensure the temp file is created in a writable temp
directory and removed/closed when DebugGzReader is dropped.
🧹 Nitpick comments (3)
src/fls/magic_bytes.rs (2)
76-97: Fallback behavior silently masks decompression errors.

When gzip or XZ decompression fails during content detection (lines 78-84, 91-97), the code falls back to assuming `TarArchive`. While debug mode logs the error, this could mask data corruption or invalid compression in production. The assumption may be reasonable for detection purposes, but consider whether a partial decompression failure should be surfaced to the caller in some scenarios.
100-107: Add explicit documentation for the Zstd limitation or synchronize with future support changes.

The code assumes `TarArchive` for Zstd with the comment that "Zstd layers are rejected by from_oci.rs anyway." This assumption is currently accurate: from_oci.rs explicitly rejects Zstd at multiple points (lines 791-793, 1530-1532, 1555-1557). However, this creates an implicit coupling: if Zstd support is added to from_oci.rs later, the sample decompression logic here will silently continue to assume `TarArchive`, potentially leading to incorrect content-type detection for Zstd-compressed data.

To prevent this, consider either:

- Adding a prominent comment linking to the rejection points in from_oci.rs, or
- Implementing a placeholder `decompress_zstd_sample()` function that returns an error referencing the from_oci.rs rejection, ensuring the functions stay synchronized during future refactors (a sketch follows below).
src/fls/oci/from_oci.rs (1)

790-793: Zstd rejection is hardcoded.

While consistent with the assumptions in magic_bytes.rs, this hardcoded rejection limits future extensibility. Consider using a feature flag or configuration option to enable Zstd support when implemented.
🔇 Additional comments (7)
src/fls/progress.rs (2)
224-249: LGTM! Clean refactoring with proper safeguards.

The refactored `final_stats` method simplifies variable naming and consolidates rate calculations with proper division-by-zero protection.

251-292: LGTM! Well-structured final stats printing.

The new print methods provide a clean API for printing final statistics with optional compression ratio output, consolidating the printing logic effectively.
src/fls/oci/reference.rs (1)
173-206: Complex but well-tested port vs. tag disambiguation logic.

The logic for distinguishing registry ports (e.g., `registry:5000/repo`) from image tags (e.g., `myrepo:12345`) is intricate, relying on multiple heuristics including slash position, digit patterns, and hostname detection. The test coverage (lines 236-363) validates the key edge cases, giving confidence in the implementation.

src/main.rs (1)

95-234: LGTM! Clean separation of OCI and HTTP paths.

The URL scheme detection and branching logic clearly separates OCI and HTTP flows, with proper option construction and consistent error handling for both paths.

src/fls/oci/from_oci.rs (3)
src/fls/oci/from_oci.rs (3)
453-468: Verify Unix-specific unsafe fd duplication is correct.The unsafe
dupandfrom_raw_fdoperations at lines 458-463 properly check for errors (line 459-461) and have appropriate platform guards. Non-Unix platforms return a clear error (lines 465-468).
246-424: LGTM! Well-orchestrated download coordination.The download coordination logic properly manages streaming, progress tracking, and cleanup across multiple concurrent tasks (tar extractor, decompressor writer, block writer, error processor). The timeout handling (line 279) and proper channel cleanup (line 344) prevent resource leaks.
760-945: LGTM! Comprehensive OCI flash implementation.The main
flash_from_ocifunction properly orchestrates authentication, manifest resolution, content detection, and pipeline setup. The separation between tar archives and raw disk images (lines 836-860) provides clear execution paths based on content type.
```rust
let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024);
let gz_decoder = GzDecoder::new(tee_reader);
Box::new(DebugGzReader::new(gz_decoder, debug))
```
Hardcoded debug path could cause conflicts.
The debug mode saves gzip data to /tmp/debug_gzip_data.gz, which could cause issues:
- Concurrent runs would overwrite each other's debug files
- Permission errors in restricted environments
- No cleanup mechanism
Since this is debug-only, the impact is low, but consider using a unique filename with PID or timestamp for concurrent safety.
💡 Suggested improvement for debug file path

```diff
-        let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024);
+        let debug_path = format!("/tmp/debug_gzip_data_{}.gz", std::process::id());
+        let tee_reader = TeeReader::new(magic_reader, &debug_path, 1024 * 1024);
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```diff
-let tee_reader = TeeReader::new(magic_reader, "/tmp/debug_gzip_data.gz", 1024 * 1024);
+let debug_path = format!("/tmp/debug_gzip_data_{}.gz", std::process::id());
+let tee_reader = TeeReader::new(magic_reader, &debug_path, 1024 * 1024);
 let gz_decoder = GzDecoder::new(tee_reader);
 Box::new(DebugGzReader::new(gz_decoder, debug))
```
🤖 Prompt for AI Agents
In @src/fls/oci/from_oci.rs around lines 978 - 980, The hardcoded debug file
path "/tmp/debug_gzip_data.gz" used when constructing TeeReader causes
collisions and permission issues; change the DebugGzReader/TeeReader creation to
write to a unique temp file instead (e.g., use std::env::temp_dir() or
tempfile::NamedTempFile and include PID/timestamp or rely on tempfile to create
a unique name), pass that temp file path to TeeReader, and ensure the temp file
is created in a writable temp directory and removed/closed when DebugGzReader is
dropped.
```rust
/// from magic bytes (file signatures).
use crate::fls::compression::Compression;

/// Magic byte signatures for compression formats
```
neat!
Summary by CodeRabbit
New Features
Bug Fixes
Refactor
Chores
Tests
✏️ Tip: You can customize this high-level summary in your review settings.