[OCTRL-1010] Fix automatic Mesos retry #714

justonedev1 · 2025-05-27T05:38:52Z

I made three major changes:

Introduced "oneRound" variables in the deployment retry in core/task/manager.go so we can control the state of affairs after one retry round.
Removed the two checks of the form if len(descriptorsUndeployable) == 0, because it is possible to continue even if some tasks are "undeployable". More than that, it is desirable to do so: until now, some tasks could be run multiple times, leaving "zombie" tasks forgotten on the machine, as reported by the warning "attempted status update of task not in roster". These tasks were deployed multiple times and only the most recent deployment was deleted, leaving the others behind. So I switched the logic around: we now try to deploy even if some of the tasks were declared "undeployable".
Added tasks that were successfully deployed to the roster immediately, so we can get updates from them.
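
To illustrate the intent of these three changes, here is a minimal, self-contained sketch. It is not the code from core/task/manager.go: the names Descriptor, deployFunc, deployWithRetry and undeployableOneRound are hypothetical, and the roster is simplified to a map. It only shows the idea of tracking failures per round, registering successes right away, and retrying only what failed.

```go
package main

import "fmt"

// Descriptor is a hypothetical stand-in for a task descriptor.
type Descriptor struct{ Name string }

// deployFunc is a hypothetical launcher that reports whether one
// descriptor was deployed successfully.
type deployFunc func(d Descriptor) bool

// deployWithRetry runs at most maxRounds deployment rounds and returns the
// descriptors that are still undeployable at the end.
func deployWithRetry(pending []Descriptor, roster map[string]bool, deploy deployFunc, maxRounds int) []Descriptor {
	for round := 0; round < maxRounds && len(pending) > 0; round++ {
		// Per-round bookkeeping, so the state after one round is explicit.
		var undeployableOneRound []Descriptor
		for _, d := range pending {
			if deploy(d) {
				// Register immediately so that later status updates
				// find the task in the roster.
				roster[d.Name] = true
			} else {
				undeployableOneRound = append(undeployableOneRound, d)
			}
		}
		// Only the failures of this round are retried; tasks that were
		// already deployed are never launched a second time.
		pending = undeployableOneRound
	}
	return pending
}

func main() {
	attempts := map[string]int{}
	// Hypothetical launcher: task "b" fails on its first attempt only.
	deploy := func(d Descriptor) bool {
		attempts[d.Name]++
		return !(d.Name == "b" && attempts[d.Name] == 1)
	}
	roster := map[string]bool{}
	left := deployWithRetry([]Descriptor{{"a"}, {"b"}, {"c"}}, roster, deploy, 3)
	fmt.Println(len(left), len(roster), attempts["a"], attempts["b"])
}
```

In this sketch, task "a" is launched exactly once even though "b" needed a second round, which is the behavior the description above aims for: a partially failed round does not cause already-running tasks to be deployed again.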

NOTE: removing the two ifs only reduces the indentation of the enclosed code by one tab level, so the diff looks muddier than the change really is.

knopers8

Thanks, it looks good. Please consider my two pedantic comments, but in any case it is good to go as it is.

core/task/manager.go

justonedev1 requested a review from knopers8 as a code owner May 27, 2025 05:38

knopers8 previously approved these changes May 28, 2025

core/task/manager.go (outdated, resolved)

core/task/manager.go (outdated, resolved)

justonedev1 dismissed knopers8’s stale review via ff6c7ac May 28, 2025 09:41

knopers8 reviewed May 28, 2025

core/task/manager.go (outdated, resolved)

justonedev1 requested a review from knopers8 May 28, 2025 09:45

knopers8 approved these changes May 28, 2025

[core] deployment retry properly retries only failed tasks

7b87bfb

justonedev1 force-pushed the OCTRL-1010 branch from adbbfec to 7b87bfb May 28, 2025 09:48

knopers8 merged commit 8ab57da into master May 28, 2025
4 checks passed

knopers8 deleted the OCTRL-1010 branch May 28, 2025 09:51
