Enhance health check functionality with caching by nahimterrazas · Pull Request #291 · openintentsframework/oif-solver

nahimterrazas · 2026-02-10T21:23:45Z

Summary

Testing Process

Checklist

Add a reference to related issues in the PR description.
Add unit tests if applicable.

Summary by CodeRabbit

Refactor
- Improved storage backend initialization and admin API startup for more reliable configuration loading.
Bug Fixes
- Ensure operator admin configuration is seeded if missing and surface clearer errors during startup.
Tests
- Added tests covering storage initialization and admin config/nonce store behaviors to prevent regressions.

coderabbitai · 2026-02-10T21:23:59Z

📝 Walkthrough

Walkthrough

A shared admin storage backend (Redis) is created once during server startup and reused to construct the operator config store and nonce store; startup now seeds operator config if missing and surfaces clearer IO errors and logs for storage/config/nonce creation failures.

Changes

Cohort / File(s)	Summary
Server storage & helpers `crates/solver-service/src/server.rs`	Added `create_admin_storage_backend`, `create_admin_config_store`, `create_admin_nonce_store`. Start-up flow now creates a single admin storage backend, reuses it for OperatorConfig and Nonce stores, seeds operator config if missing, and adds descriptive IO error mapping and tracing logs.
Tests `crates/solver-service/src/tests/*`	Added tests validating error on invalid Redis URL and that config/nonce stores use the shared admin storage backend; updated existing tests to call new helpers and shared-backend paths.

Sequence Diagram(s)

sequenceDiagram
  participant Starter as StartServer
  participant AdminStore as create_admin_storage_backend
  participant ConfigStore as create_admin_config_store
  participant NonceStore as create_admin_nonce_store
  participant Seeder as seed_operator_config
  participant AdminAPI as AdminApiState

  Starter->>AdminStore: initialize shared admin storage (redis_url)
  AdminStore-->>Starter: admin_storage (or error)
  Starter->>ConfigStore: create config store using admin_storage + solver_id
  ConfigStore-->>Starter: config_store (or error)
  Starter->>NonceStore: create nonce store using admin_storage + solver_id + ttl
  NonceStore-->>Starter: nonce_store (or error)
  Starter->>Seeder: ensure operator config exists via config_store
  Seeder-->>Starter: seeded / already present
  Starter->>AdminAPI: build AdminApiState(token_manager, config_store, nonce_store)
  AdminAPI-->>Starter: admin API ready
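
For readers who want the flow above in code form, here is a minimal sketch. It assumes an in-memory `AdminStorage` in place of the real Redis backend, and the struct fields, key formats, and the `seed_if_missing` and `shares_backend_with` methods are hypothetical names chosen for illustration. The helper names match the ones in the walkthrough, but their signatures here are simplified guesses. In particular the store helpers return values directly rather than a `Result`.

```rust
use std::collections::HashMap;
use std::io;
use std::sync::{Arc, Mutex};

/// Hypothetical in-memory stand-in for the Redis-backed admin storage.
pub struct AdminStorage {
    entries: Mutex<HashMap<String, String>>,
}

/// Creates the shared backend once; a malformed URL becomes a descriptive IO error.
pub fn create_admin_storage_backend(redis_url: String) -> io::Result<Arc<AdminStorage>> {
    if !redis_url.starts_with("redis://") {
        return Err(io::Error::new(
            io::ErrorKind::InvalidInput,
            format!("Admin storage error: invalid Redis URL '{redis_url}'"),
        ));
    }
    Ok(Arc::new(AdminStorage {
        entries: Mutex::new(HashMap::new()),
    }))
}

pub struct ConfigStore {
    storage: Arc<AdminStorage>,
    key: String,
}

pub struct NonceStore {
    storage: Arc<AdminStorage>,
    prefix: String,
    ttl_secs: u64,
}

/// Builds the config store on top of the shared backend (assumed key layout).
pub fn create_admin_config_store(storage: Arc<AdminStorage>, solver_id: &str) -> ConfigStore {
    ConfigStore {
        storage,
        key: format!("{solver_id}:operator-config"),
    }
}

/// Builds the nonce store on top of the same shared backend.
pub fn create_admin_nonce_store(
    storage: Arc<AdminStorage>,
    solver_id: &str,
    ttl_secs: u64,
) -> NonceStore {
    NonceStore {
        storage,
        prefix: format!("{solver_id}:nonce"),
        ttl_secs,
    }
}

impl ConfigStore {
    /// Writes `default_config` only when no config exists; returns true if it seeded.
    pub fn seed_if_missing(&self, default_config: &str) -> bool {
        let mut entries = self.storage.entries.lock().unwrap();
        if entries.contains_key(&self.key) {
            return false;
        }
        entries.insert(self.key.clone(), default_config.to_string());
        true
    }

    /// True when both stores point at the same backend instance.
    pub fn shares_backend_with(&self, nonce_store: &NonceStore) -> bool {
        Arc::ptr_eq(&self.storage, &nonce_store.storage)
    }
}
```

The point of the sketch is the `Arc::clone` at the call site: both stores hold a handle to one backend rather than each opening its own connection.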

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Adding Redis implementation #261 — Adds Redis backend implementation and registry used by create_admin_storage_backend (direct code dependency).

Suggested reviewers

NicoMolinaOZ
shahnami
zeljkoX

Poem

🐰 I forged one backend neat and tight,
Reused with care from morning to night,
Configs and nonces now sit side-by-side,
A hopping helper for the server's stride. 🥕

🚥 Pre-merge checks | ✅ 1 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The PR title 'Enhance health check functionality with caching' does not match the actual changes, which refactor admin storage helpers and startup flow—not health check caching.	Update the title to accurately reflect the main changes, such as 'Refactor admin storage helpers and startup flow' or 'Consolidate admin storage backend initialization'.
Description check	⚠️ Warning	The PR description only contains template headings with no actual implementation details, making it impossible to understand the purpose, rationale, or testing approach.	Fill in the Summary section with implementation details, explain the rationale for changes, and document the testing process performed. Check the two checklist items if applicable.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

No actionable comments were generated in the recent review. 🎉

🧹 Recent nitpick comments

crates/solver-service/src/server.rs (3)
76-99: Redundant empty solver_id validation.

The check at lines 83–87 duplicates the validation already present in create_config_store (see config_store.rs lines 178–181). It's not harmful, but it means two code paths need to stay in sync.

That said, it does produce a more contextual error message ("Config store error: solver ID cannot be empty" vs the upstream generic one), so keeping it is reasonable if intentional.
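
One way to keep the contextual message without duplicating the check is to map the upstream error instead of re-validating. The sketch below is only an illustration: `create_config_store` here is a hypothetical stand-in for the function in config_store.rs, its return type and its error text are assumptions, and the wrapper signature is simplified.

```rust
use std::io;

/// Hypothetical stand-in for the upstream validation in config_store.rs.
fn create_config_store(solver_id: &str) -> Result<String, String> {
    if solver_id.is_empty() {
        return Err("solver ID cannot be empty".to_string());
    }
    Ok(format!("{solver_id}-config"))
}

/// Adds call-site context by wrapping the upstream error, so the emptiness
/// check lives in exactly one place.
pub fn create_admin_config_store(solver_id: &str) -> io::Result<String> {
    create_config_store(solver_id).map_err(|e| {
        io::Error::new(
            io::ErrorKind::InvalidInput,
            format!("Config store error: {e}"),
        )
    })
}
```

With this shape the final message depends on the upstream wording, which may or may not be acceptable if the upstream text is too generic.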

215-216: Unnecessary .clone() on redis_url.

redis_url is not used after this line, so the clone is superfluous. You can pass it directly.
Suggested fix
-				let admin_storage = create_admin_storage_backend(redis_url.clone())?;
+				let admin_storage = create_admin_storage_backend(redis_url)?;
244-276: Duplicate error log for nonce store creation failure.

create_admin_nonce_store (line 112) already logs tracing::error!("Failed to initialize admin nonce store: ..."). The Err arm at line 274 logs the same message again. This results in a duplicated error log entry for a single failure, which can be noisy and confusing during incident triage.

Either remove the log inside the helper (to let callers decide) or remove the one here.
Option A: Remove the duplicate at the call site
 				Err(e) => {
-					tracing::error!("Failed to initialize admin nonce store: {}", e);
 					None
 				},
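
The other option (remove the log inside the helper and let the caller decide) could look roughly like the sketch below. It is not the code from this PR: the helper body, the TTL check, and `init_nonce_store` are hypothetical, and a `Vec<String>` stands in for `tracing::error!` so the example stays dependency-free.

```rust
/// Hypothetical helper: returns the error to the caller and does not log.
fn create_admin_nonce_store(ttl_secs: u64) -> Result<u64, String> {
    if ttl_secs == 0 {
        return Err("nonce TTL must be greater than zero".to_string());
    }
    Ok(ttl_secs)
}

/// The call site owns logging, so a single failure yields a single log entry.
/// `log` stands in for the tracing subscriber.
pub fn init_nonce_store(ttl_secs: u64, log: &mut Vec<String>) -> Option<u64> {
    match create_admin_nonce_store(ttl_secs) {
        Ok(store) => Some(store),
        Err(e) => {
            log.push(format!("Failed to initialize admin nonce store: {e}"));
            None
        }
    }
}
```

Either option works; what matters is that exactly one layer is responsible for the log line.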

shahnami

I’m a bit confused about the need for this change. Are we caching the health state for storage readiness? If so, what problem are we trying to solve?

My understanding is that a health check should always reflect the current state of the system. Returning a cached response from e.g. two minutes ago means we’re no longer reporting the actual (real-time) status, which somewhat defeats the purpose of having a health check in the first place?

If the goal is to prevent excessive calls (e.g. to avoid spamming an internal dependency), wouldn’t it be better to adjust the check frequency or caller behavior instead of introducing caching at the health endpoint level? Could you clarify the rationale behind this?

nahimterrazas · 2026-02-11T13:21:08Z

> I’m a bit confused about the need for this change. Are we caching the health state for storage readiness? If so, what problem are we trying to solve?
>
> My understanding is that a health check should always reflect the current state of the system. Returning a cached response from e.g. two minutes ago means we’re no longer reporting the actual (real-time) status, which somewhat defeats the purpose of having a health check in the first place?
>
> If the goal is to prevent excessive calls (e.g. to avoid spamming an internal dependency), wouldn’t it be better to adjust the check frequency or caller behavior instead of introducing caching at the health endpoint level? Could you clarify the rationale behind this?

The problem we are solving is that /health currently triggers a storage readiness check that can create Redis connection churn. Regarding the cache: yes, we can remove it. The default is 5 seconds, so it is definitely not an aggressive cache, but I prefer to remove it.
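
For context on the trade-off discussed in this thread, a time-based cache around a readiness probe can be sketched as follows. This is not the code that was in the PR (the caching was later removed). `CachedReadiness`, its fields, and the `check` signature are hypothetical, and the clock is passed in explicitly so the behaviour is easy to follow.

```rust
use std::time::{Duration, Instant};

/// Hypothetical TTL cache around a readiness probe.
pub struct CachedReadiness {
    ttl: Duration,
    last: Option<(Instant, bool)>,
}

impl CachedReadiness {
    pub fn new(ttl: Duration) -> Self {
        Self { ttl, last: None }
    }

    /// Returns the cached result while it is younger than `ttl`;
    /// otherwise runs `probe` and stores its result.
    pub fn check(&mut self, now: Instant, probe: impl FnOnce() -> bool) -> bool {
        if let Some((at, ready)) = self.last {
            if now.duration_since(at) < self.ttl {
                return ready;
            }
        }
        let ready = probe();
        self.last = Some((now, ready));
        ready
    }
}
```

With a 5 second TTL, a probe that would fail at second 2 still reports the earlier healthy result until the entry expires. That is the staleness concern raised above, and it is also why the cache reduces calls to the backend.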

codecov · 2026-02-11T13:40:15Z

Codecov Report

❌ Patch coverage is 89.42308% with 11 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
crates/solver-service/src/server.rs	89.4%	11 Missing ⚠️

shahnami

LGTM

Enhance health check functionality with caching

866f0a4

nahimterrazas requested review from NicoMolinaOZ, pepebndc and shahnami as code owners February 10, 2026 21:23

pepebndc approved these changes Feb 11, 2026

shahnami reviewed Feb 11, 2026

Comment thread crates/solver-service/src/server.rs Outdated

shahnami reviewed Feb 11, 2026

Remove health check caching configuration

8f34cfd

shahnami approved these changes Feb 11, 2026

nahimterrazas added 2 commits February 11, 2026 14:58

tests

ab0de15

Merge branch 'main' into redis-improve-healthcheck

4b361b8

nahimterrazas merged commit e366e06 into main Feb 13, 2026
8 checks passed

nahimterrazas deleted the redis-improve-healthcheck branch February 13, 2026 16:46

nahimterrazas merged 4 commits into main from redis-improve-healthcheck

3 participants
