Skip to content

DAOS-18658 rebuild: delay ds_rebuild_regenerate_task 100S#17755

Open
liuxuezhao wants to merge 3 commits intorelease/2.6.4-aurorafrom
lxz/RB_fix_264a
Open

DAOS-18658 rebuild: delay ds_rebuild_regenerate_task 100S#17755
liuxuezhao wants to merge 3 commits intorelease/2.6.4-aurorafrom
lxz/RB_fix_264a

Conversation

@liuxuezhao
Copy link
Contributor

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

Delay trigger rebuild when PS leader step up for at most 100 seconds.

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
@liuxuezhao liuxuezhao requested review from a team as code owners March 23, 2026 10:30
@github-actions
Copy link

Ticket title is 'Hold regenerating rebuild tasks until CaRT events stabilize'
Status is 'Open'
https://daosio.atlassian.net/browse/DAOS-18658

1. add some DEBUG logs for rebuild enumerate
2. add WARN log if some engines did not report EC agg epoch progress in
   600S.
3. Fix a typo for sub_anchors->sa_nr compare

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
Report scanned obj count (riv_toberb_obj_count) in rebuild_tgt_status_check_ult,
in rebuild query status, use rs_rec_nr to be total scanned object to
simulate progress when no object need to reclaim.

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

1 participant