Skip to content

chore(skills): Add skill-creator and update managed agent skills#19713

Open
chargome wants to merge 2 commits intodevelopfrom
feat/update-agent-skills
Open

chore(skills): Add skill-creator and update managed agent skills#19713
chargome wants to merge 2 commits intodevelopfrom
feat/update-agent-skills

Conversation

@chargome
Copy link
Member

@chargome chargome commented Mar 9, 2026

Closes #19760 (added automatically)

chargome and others added 2 commits March 9, 2026 13:14
Add `skill-creator` skill from anthropics/skills for creating and
optimizing agent skills. Update `dotagents` and `skill-scanner` skills
to their latest versions.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@chargome chargome self-assigned this Mar 9, 2026
@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2026

size-limit report 📦

Path Size % Change Change
@sentry/browser 25.64 kB - -
@sentry/browser - with treeshaking flags 24.14 kB - -
@sentry/browser (incl. Tracing) 42.44 kB - -
@sentry/browser (incl. Tracing, Profiling) 47.1 kB - -
@sentry/browser (incl. Tracing, Replay) 81.26 kB - -
@sentry/browser (incl. Tracing, Replay) - with treeshaking flags 70.88 kB - -
@sentry/browser (incl. Tracing, Replay with Canvas) 85.95 kB - -
@sentry/browser (incl. Tracing, Replay, Feedback) 98.21 kB - -
@sentry/browser (incl. Feedback) 42.44 kB - -
@sentry/browser (incl. sendFeedback) 30.31 kB - -
@sentry/browser (incl. FeedbackAsync) 35.36 kB - -
@sentry/browser (incl. Metrics) 26.8 kB - -
@sentry/browser (incl. Logs) 26.95 kB - -
@sentry/browser (incl. Metrics & Logs) 27.62 kB - -
@sentry/react 27.39 kB - -
@sentry/react (incl. Tracing) 44.78 kB - -
@sentry/vue 30.09 kB - -
@sentry/vue (incl. Tracing) 44.31 kB - -
@sentry/svelte 25.66 kB - -
CDN Bundle 28.18 kB - -
CDN Bundle (incl. Tracing) 43.27 kB - -
CDN Bundle (incl. Logs, Metrics) 29.02 kB - -
CDN Bundle (incl. Tracing, Logs, Metrics) 44.11 kB - -
CDN Bundle (incl. Replay, Logs, Metrics) 68.1 kB - -
CDN Bundle (incl. Tracing, Replay) 80.15 kB - -
CDN Bundle (incl. Tracing, Replay, Logs, Metrics) 81.01 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback) 85.66 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback, Logs, Metrics) 86.54 kB - -
CDN Bundle - uncompressed 82.38 kB - -
CDN Bundle (incl. Tracing) - uncompressed 128.09 kB - -
CDN Bundle (incl. Logs, Metrics) - uncompressed 85.21 kB - -
CDN Bundle (incl. Tracing, Logs, Metrics) - uncompressed 130.93 kB - -
CDN Bundle (incl. Replay, Logs, Metrics) - uncompressed 208.88 kB - -
CDN Bundle (incl. Tracing, Replay) - uncompressed 244.98 kB - -
CDN Bundle (incl. Tracing, Replay, Logs, Metrics) - uncompressed 247.8 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback) - uncompressed 257.89 kB - -
CDN Bundle (incl. Tracing, Replay, Feedback, Logs, Metrics) - uncompressed 260.7 kB - -
@sentry/nextjs (client) 47.19 kB - -
@sentry/sveltekit (client) 42.9 kB - -
@sentry/node-core 52.27 kB +0.02% +7 B 🔺
@sentry/node 174.77 kB +0.01% +4 B 🔺
@sentry/node - without tracing 97.44 kB +0.02% +11 B 🔺
@sentry/aws-serverless 113.24 kB -0.01% -3 B 🔽

View base workflow run

@chargome chargome marked this pull request as ready for review March 11, 2026 12:37
@chargome chargome requested review from a team, JPeer264, Lms24 and RulaKhaled and removed request for a team March 11, 2026 12:37
accumulated_json = ""
else:
return False

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bug: The function returns after checking only the first tool use in a response, both in the streaming and fallback paths. This will miss valid skill triggers if they aren't the first tool used.
Severity: HIGH

Suggested Fix

In the non-streaming path, move the return triggered statement outside of the for loop to ensure all content_items are checked. In the streaming path, remove the else: return False block and continue processing events to check for subsequent tool uses within the same response.

Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI
agent.
Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not
valid.

Location: .agents/skills/skill-creator/scripts/run_eval.py#L142

Potential issue: The skill trigger detection logic in `run_eval.py` can lead to false
negatives. In both the streaming path (around line 142) and the non-streaming fallback
path (line 164), the function returns prematurely. The streaming path explicitly returns
`False` if the first tool is not `Skill` or `Read`. The fallback path has an indentation
error, placing `return triggered` inside the `for` loop, causing it to exit after
checking only the first `tool_use` item. This means if an assistant response contains
multiple tool calls, and the relevant `Skill` or `Read` call is not the first one, it
will be missed, incorrectly reporting that the skill was not triggered.

Did we get this right? 👍 / 👎 to inform future reviews.

Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 2 potential issues.

Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

"analyzer_model": "<model-name>",
"timestamp": datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ"),
"evals_run": eval_ids,
"runs_per_configuration": 3
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hardcoded runs_per_configuration ignores actual run count

Low Severity

The runs_per_configuration metadata field is hardcoded to 3 instead of being calculated from the actual data in results. This causes both the viewer and the generated benchmark.md to display incorrect information about how many runs were performed per configuration, regardless of the actual number of runs the user executed.

Fix in Cursor Fix in Web


data_json = json.dumps(embedded)

return template.replace("/*__EMBEDDED_DATA__*/", f"const EMBEDDED_DATA = {data_json};")
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Embedded JSON breaks viewer on </script> in output files

Medium Severity

json.dumps does not escape </script> sequences within string values. When generate_html embeds the JSON payload directly inside an HTML <script> tag, any text output file containing </script> will cause the browser's HTML parser to prematurely close the script block, completely breaking the viewer. The standard mitigation is to replace </ with <\/ in the serialized JSON before embedding.

Fix in Cursor Fix in Web

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

chore(skills): Add skill-creator and update managed agent skills

1 participant