feat(shard): Implement shard router #2853

Open
krishvishal wants to merge 5 commits into apache:master from krishvishal:shard-router

Conversation

@krishvishal
Contributor

In the shard router each shard gets an inbox (flume channel) and a vector of senders to all other shards (including itself). When a message arrives at any shard, the router:

  1. Decomposes the generic message to extract the operation and namespace
  2. Resolves the target shard - metadata ops go to shard 0, partition ops are looked up in the ShardsTable, unknown ops default to shard 0
  3. Enqueues a ShardFrame into the target shard's channel

The message pump (run_message_pump) drains the inbox and processes frames sequentially, ensuring all mutations on a given shard are serialized through a single async task.

The ShardsTable trait abstracts partition ownership lookup with two implementations:

  • (): a no-op for single-shard setups (the simulator, for now)
  • PapayaShardsTable: a lock-free concurrent map for multi-shard deployments

@codecov

codecov bot commented Mar 3, 2026

Codecov Report

❌ Patch coverage is 0% with 173 lines in your changes missing coverage. Please review.
✅ Project coverage is 67.79%. Comparing base (be23a35) to head (02683f0).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| core/shard/src/router.rs | 0.00% | 86 Missing ⚠️ |
| core/shard/src/lib.rs | 0.00% | 44 Missing ⚠️ |
| core/shard/src/shards_table.rs | 0.00% | 33 Missing ⚠️ |
| core/common/src/types/consensus/header.rs | 0.00% | 8 Missing ⚠️ |
| core/simulator/src/replica.rs | 0.00% | 2 Missing ⚠️ |

❌ Your patch check has failed because the patch coverage (0.00%) is below the target coverage (50.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff              @@
##             master    #2853      +/-   ##
============================================
- Coverage     67.94%   67.79%   -0.15%     
  Complexity      739      739              
============================================
  Files          1049     1051       +2     
  Lines         84385    84542     +157     
  Branches      60963    61131     +168     
============================================
- Hits          57336    57319      -17     
- Misses        24692    24851     +159     
- Partials       2357     2372      +15     
| Flag | Coverage Δ |
|---|---|
| csharp | 67.43% <ø> (-0.19%) ⬇️ |
| go | 6.33% <ø> (ø) |
| java | 54.83% <ø> (ø) |
| node | 92.26% <ø> (-0.15%) ⬇️ |
| python | 0.00% <ø> (ø) |
| rust | 70.04% <0.00%> (-0.20%) ⬇️ |

Flags with carried forward coverage won't be shown.

| Files with missing lines | Coverage Δ |
|---|---|
| core/simulator/src/replica.rs | 0.00% <0.00%> (ø) |
| core/common/src/types/consensus/header.rs | 23.57% <0.00%> (-0.70%) ⬇️ |
| core/shard/src/shards_table.rs | 0.00% <0.00%> (ø) |
| core/shard/src/lib.rs | 0.00% <0.00%> (ø) |
| core/shard/src/router.rs | 0.00% <0.00%> (ø) |

... and 15 files with indirect coverage changes

@hubcio
Contributor

hubcio commented Mar 3, 2026

Can you use crossfire instead of flume? It's much more battle-tested and seems to have better performance.

@krishvishal
Contributor Author

> can you use crossfire instead of flume? its much more battle-tested than flume and seems to have better performance.

Okay, I will look into it.

Comment on lines +34 to +38
impl<B, J, S, M, T, R> IggyShard<B, J, S, M, T, R>
where
B: MessageBus,
T: ShardsTable,
R: Send,
Contributor Author

@krishvishal krishvishal Mar 3, 2026

@hubcio With crossfire, this code will change to:

impl<B, J, S, M, T, R> IggyShard<B, J, S, M, T, R>
where
    B: MessageBus,
    T: ShardsTable,
    R: Send + 'static,
{

crossfire requires an additional 'static bound, which may be too restrictive. I'm not sure yet whether it would cause problems for us. Let me know what you think.

Contributor

@hubcio hubcio Mar 3, 2026

IMHO it's OK. A shard (and its message router) should live forever in a given thread, so the 'static lifetime fits perfectly. @numinnex any thoughts?

Contributor

I think it's fine, let's move to crossfire.

if planes.0.is_applicable(&request) {
planes.0.on_request(request).await;
let (metadata, (partitions, _)) = self.plane.inner();
if metadata.is_applicable(&request) {
Contributor

Why do we do it this way? The purpose of MuxPlane is to avoid exposing that logic inside the top-level caller.

Contributor Author

@krishvishal krishvishal Mar 3, 2026

The MuxPlane's Plane trait dispatch can't work here because the two planes use different consensus types:

  1. Metadata: VsrConsensus<B> (default LocalPipeline)
  2. Partitions: VsrConsensus<B, NamespacedPipeline>

The variadic Plane impl requires a single C for both, so self.plane.on_request(...) doesn't compile without either:

  • Unifying both planes on NamespacedPipeline which changes metadata's pipeline behavior.
  • Adding a second Plane impl for IggyPartitions with the wrong pipeline type

Both feel like hacky solutions.

/// Dispatch a message and return a receiver that resolves when the target
/// shard has finished processing it.
pub fn dispatch_request(&self, message: Message<GenericHeader>) -> flume::Receiver<R> {
let (operation, namespace, generic) = Self::decompose(&message);
Contributor

Let's get rid of the decompose function; instead, do it inline there and match inline.

Contributor Author

Done.

/// - Partition operations route to the shard that owns the namespace,
/// looked up via the [`ShardsTable`].
/// - Unknown operations fall back to shard 0.
fn resolve(&self, operation: Operation, namespace: IggyNamespace) -> u16 {
Contributor

Inline this as well.

Contributor Author

Done.
