Fix: Re-queue packet on send failure instead of dropping it by OverkillFPV · Pull Request #2141 · meshcore-dev/MeshCore

OverkillFPV · 2026-03-24T07:35:55Z

When the CAD timeout's due to high traffic the packet is force sent after 4 seconds. And StartSendRaw fails due to the radio being busy the packet is dropped. This change changes it so that instead of dropping the packet entirely, it is requeued with a delay and retried. This is why some of my high traffic nodes with 20+ neighbours have become unreliable.

Copilot

Pull request overview

This PR changes the Dispatcher TX failure behavior so that when startSendRaw() fails (e.g., radio busy after CAD timeout), the outbound packet is re-queued for a delayed retry instead of being released back to the pool (dropped).

Changes:

Re-queue the outbound packet on startSendRaw() failure rather than freeing it.
Add a retry delay before attempting further TX after a send-start failure.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-24T07:38:44Z

src/Dispatcher.cpp

+        _mgr->queueOutbound(outbound, 0, futureMillis(getCADFailRetryDelay()));
        outbound = NULL;
+        next_tx_time = futureMillis(getCADFailRetryDelay());


getCADFailRetryDelay() is called twice here (once for queueOutbound and once for next_tx_time). Since getCADFailRetryDelay() can be non-deterministic (e.g., Mesh::getCADFailRetryDelay() uses RNG), the packet’s scheduled retry time and the global next_tx_time throttle can diverge unpredictably. Compute the retry delay once and reuse it for both scheduling and next_tx_time.

Suggested change

_mgr->queueOutbound(outbound, 0, futureMillis(getCADFailRetryDelay()));

outbound = NULL;

next_tx_time = futureMillis(getCADFailRetryDelay());

int retry_delay = getCADFailRetryDelay();

unsigned long retry_time = futureMillis(retry_delay);

_mgr->queueOutbound(outbound, 0, retry_time);

outbound = NULL;

next_tx_time = retry_time;

Copilot · 2026-03-24T07:38:45Z

src/Dispatcher.cpp

-        releasePacket(outbound);  // return to pool
+
+        // re-queue instead of dropping so the packet gets another chance
+        _mgr->queueOutbound(outbound, 0, futureMillis(getCADFailRetryDelay()));


Re-queuing with a hard-coded priority 0 can change ordering semantics: lower numeric values are treated as higher priority in PacketQueue::get(), so this elevates any packet that happened to fail startSendRaw() to highest priority on retry and may starve other queued traffic. Consider preserving the original outbound priority (e.g., by storing priority on the Packet or extending PacketManager::getNextOutbound to return both packet + priority) when re-queuing.

Copilot · 2026-03-24T07:38:45Z

src/Dispatcher.cpp

@@ -330,9 +330,11 @@ void Dispatcher::checkSend() {
        MESH_DEBUG_PRINTLN("%s Dispatcher::loop(): ERROR: send start failed!", getLogDateTime());


The debug string says Dispatcher::loop() but this code is inside Dispatcher::checkSend(), which makes logs misleading when diagnosing TX failures. Update the message to the correct function name.

Suggested change

MESH_DEBUG_PRINTLN("%s Dispatcher::loop(): ERROR: send start failed!", getLogDateTime());

MESH_DEBUG_PRINTLN("%s Dispatcher::checkSend(): ERROR: send start failed!", getLogDateTime());

…ly a solid idea

OverkillFPV · 2026-03-24T10:56:23Z

I was also wondering about increasing the timeout to >4 seconds as for some nodes with 50 neighbours it's not too unheard of to have 4 seconds of traffic

added in requeue for packets that fail to be sent

278f5f6

Copilot AI review requested due to automatic review settings March 24, 2026 07:35

Copilot started reviewing on behalf of OverkillFPV March 24, 2026 07:36 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

Addedff in single delay calc that copilot suggested cause it's actual…

9675823

…ly a solid idea

OverkillFPV force-pushed the fix/requeue-packet-on-cad-timeout branch from 3f9190d to 9675823 Compare March 24, 2026 07:48

Updated timeout to 6 seconds

9e7e001

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Re-queue packet on send failure instead of dropping it#2141

Fix: Re-queue packet on send failure instead of dropping it#2141
OverkillFPV wants to merge 3 commits intomeshcore-dev:devfrom
OverkillFPV:fix/requeue-packet-on-cad-timeout

OverkillFPV commented Mar 24, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

OverkillFPV commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -330,9 +330,11 @@ void Dispatcher::checkSend() {
		MESH_DEBUG_PRINTLN("%s Dispatcher::loop(): ERROR: send start failed!", getLogDateTime());

	MESH_DEBUG_PRINTLN("%s Dispatcher::loop(): ERROR: send start failed!", getLogDateTime());
	MESH_DEBUG_PRINTLN("%s Dispatcher::checkSend(): ERROR: send start failed!", getLogDateTime());

Conversation

OverkillFPV commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

OverkillFPV commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

OverkillFPV commented Mar 24, 2026 •

edited

Loading