test(e2e): rely on Kimi trajectory acceptance #4153
PR Review Advisor
Findings: 0 needs attention, 0 worth checking, 0 nice ideas.
This is an automated advisory review. A human maintainer must make the final merge decision.
|
No actionable comments were generated in the recent review.
Walkthrough
The PR updates post-run validation logic in the …
Changes
Agent execution validation in e2e test
Estimated code review effort: 2 (Simple), ~10 minutes
Pre-merge checks: 4 passed, 1 failed (1 warning)
…e2e-final-text-accounting
Selective E2E Results: ✅ All requested jobs passed (Run: 26365908238)
|
Selective E2E Results: ✅ All requested jobs passed (Run: 26365944264)
|
|
Prepared for review after #4152 merged:
Validation:
All current PR checks are passing. GitHub still shows some cancelled check runs in the rollup from the base-retarget/draft-to-ready churn, but the latest checks reported by |
Summary
The nightly flake sweep had one kimi-inference-compat-e2e failure where the OpenClaw command exited 0 and the trajectory later proved all split Kimi exec calls completed cleanly, but an earlier command-output text parser had already incremented FAIL. This PR makes the command-output check validate command completion and leaves exact final-answer/tool-result correctness to the existing trajectory acceptance check.
Changes
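The split of responsibilities described in the summary can be sketched roughly as follows. This is an illustrative Python sketch, not the code from this PR: the Results tally, both check functions, and the shape of the exec-call records are hypothetical names and assumptions chosen for the example. It only shows the idea that the command-output check counts command completion, while correctness of the exec calls is judged once, from the trajectory.

```python
from dataclasses import dataclass, field


@dataclass
class Results:
    """Hypothetical tally standing in for the test's PASS/FAIL counters."""
    passed: int = 0
    failed: int = 0
    notes: list[str] = field(default_factory=list)

    def record(self, ok: bool, label: str) -> None:
        # Increment exactly one counter per check and keep a readable note.
        if ok:
            self.passed += 1
        else:
            self.failed += 1
        self.notes.append(f"{'PASS' if ok else 'FAIL'}: {label}")


def check_command_completed(results: Results, exit_code: int) -> None:
    # Only asks whether the command finished cleanly.
    # It deliberately does not parse the command's output text.
    results.record(exit_code == 0, "agent command completed")


def check_trajectory(results: Results, exec_calls: list[dict]) -> None:
    # Assumed record shape: {"command": str, "exit_code": int}.
    # An empty trajectory is treated as a failure in this sketch.
    ok = bool(exec_calls) and all(c.get("exit_code") == 0 for c in exec_calls)
    results.record(ok, "trajectory: all exec calls completed cleanly")


if __name__ == "__main__":
    results = Results()
    check_command_completed(results, exit_code=0)
    check_trajectory(
        results,
        [{"command": "step-1", "exit_code": 0}, {"command": "step-2", "exit_code": 0}],
    )
    print("\n".join(results.notes))
```

With this arrangement, a run whose command exits 0 and whose trajectory shows clean exec calls cannot be failed early by a text parser that misreads the final answer.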
Type of Change
Verification
npx prek run --all-files passes
npm test passes
make docs builds without warnings (doc changes only)
Signed-off-by: Carlos Villela cvillela@nvidia.com