Skip to content

docs(lark-search): improve search workflow from eval run base-b1m-selected2-v2-20260521#1022

Closed
BytedanceSearch wants to merge 1 commit into
mainfrom
eval-search/auto-pr/base-b1m-selected2-v2-20260521-21348
Closed

docs(lark-search): improve search workflow from eval run base-b1m-selected2-v2-20260521#1022
BytedanceSearch wants to merge 1 commit into
mainfrom
eval-search/auto-pr/base-b1m-selected2-v2-20260521-21348

Conversation

@BytedanceSearch
Copy link
Copy Markdown
Collaborator

Eval Summary

  • Run: base-b1m-selected2-v2-20260521
  • Dataset size: 2
  • Scored: 2
  • Total: 2/30 (6.7%)
  • Primary bottleneck: search_strategy

Findings

  • F-002 [medium] tool_capability/drive -> shortcuts/drive: 没有可用 evidence_top_results;drive search shortcut 需要返回更稳定的标题、id/token、时间和摘要字段。 (cases: case_002)
  • F-001 [low] search_strategy/drive -> skills/lark-drive/references/lark-drive-search.md: recall=1;应在 drive 搜索中优先使用 query 的时间/人员/状态过滤,并用非污染 evidence_top_results 验证目标资源。 (cases: case_001)
  • F-003 [low] search_strategy/drive -> skills/lark-drive/references/lark-drive-search.md: recall=0;应在 drive 搜索中优先使用 query 的时间/人员/状态过滤,并用非污染 evidence_top_results 验证目标资源。 (cases: case_002)

Review Notes

This PR was generated from eval-search findings. Reviewers should check that
changes are domain-specific, generic, and not tailored to one case's exact gold
answer.

…ected2-v2-20260521

Driven by /eval-search propose-pr base-b1m-selected2-v2-20260521.

- Primary bottleneck: search_strategy
- Findings: 3

Change-Id: If87977b0315f8a058c537b3eddb8094b08975afc
@coderabbitai
Copy link
Copy Markdown

coderabbitai Bot commented May 21, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 374b12f3-be1b-49bf-8811-cfbeacbc6dae

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch eval-search/auto-pr/base-b1m-selected2-v2-20260521-21348

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@github-actions github-actions Bot added domain/ccm PR touches the ccm domain size/L Large or sensitive change across domains or core paths labels May 21, 2026
@codecov
Copy link
Copy Markdown

codecov Bot commented May 21, 2026

Codecov Report

❌ Patch coverage is 79.65116% with 35 lines in your changes missing coverage. Please review.
✅ Project coverage is 67.78%. Comparing base (c02a38f) to head (014ae49).
⚠️ Report is 13 commits behind head on main.

Files with missing lines Patch % Lines
shortcuts/drive/drive_search.go 79.65% 30 Missing and 5 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1022      +/-   ##
==========================================
+ Coverage   67.66%   67.78%   +0.11%     
==========================================
  Files         576      590      +14     
  Lines       54510    55262     +752     
==========================================
+ Hits        36885    37460     +575     
- Misses      14566    14688     +122     
- Partials     3059     3114      +55     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@BytedanceSearch
Copy link
Copy Markdown
Collaborator Author

Closing this draft because the eval root cause is a broker capability gap around original creator vs owner, not the drive output shape change proposed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

domain/ccm PR touches the ccm domain size/L Large or sensitive change across domains or core paths

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant