chore(skills): Add skill-creator and update managed agent skills #19713
Add `skill-creator` skill from anthropics/skills for creating and optimizing agent skills. Update `dotagents` and `skill-scanner` skills to their latest versions.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
```python
accumulated_json = ""
else:
    return False
```
Bug: The function returns after checking only the first tool use in a response, in both the streaming and fallback paths. This will miss valid skill triggers if they aren't the first tool used.
Severity: HIGH
Suggested Fix
In the non-streaming path, move the `return triggered` statement outside of the `for` loop so that all `content_items` are checked. In the streaming path, remove the `else: return False` block and continue processing events to check for subsequent tool uses within the same response.
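As a minimal sketch of the suggested fix for the non-streaming path: the function and item shape below (`skill_was_triggered`, dict items with `type`/`name` keys) are hypothetical stand-ins for the actual `run_eval.py` code, but they illustrate moving the `return` outside the loop so every `tool_use` item is inspected.

```python
def skill_was_triggered(content_items):
    """Return True if ANY tool_use item invokes Skill or Read.

    Hypothetical sketch: content_items is assumed to be a list of dicts
    like {"type": "tool_use", "name": "Skill"}; the real record shape
    in run_eval.py may differ.
    """
    triggered = False
    for item in content_items:
        if item.get("type") == "tool_use" and item.get("name") in ("Skill", "Read"):
            triggered = True
    # The return sits OUTSIDE the loop, so a Skill/Read call that is not
    # the first tool use is still detected.
    return triggered
```

With the buggy indentation (`return triggered` inside the loop), the second example below would incorrectly report `False`.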
Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.
Location: .agents/skills/skill-creator/scripts/run_eval.py#L142
Potential issue: The skill trigger detection logic in `run_eval.py` can lead to false negatives. In both the streaming path (around line 142) and the non-streaming fallback path (line 164), the function returns prematurely. The streaming path explicitly returns `False` if the first tool is not `Skill` or `Read`. The fallback path has an indentation error, placing `return triggered` inside the `for` loop, causing it to exit after checking only the first `tool_use` item. This means that if an assistant response contains multiple tool calls and the relevant `Skill` or `Read` call is not the first one, it will be missed, incorrectly reporting that the skill was not triggered.
Cursor Bugbot has reviewed your changes and found 2 potential issues.
| "analyzer_model": "<model-name>", | ||
| "timestamp": datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ"), | ||
| "evals_run": eval_ids, | ||
| "runs_per_configuration": 3 |
Hardcoded `runs_per_configuration` ignores actual run count
Low Severity
The `runs_per_configuration` metadata field is hardcoded to `3` instead of being calculated from the actual data in `results`. This causes both the viewer and the generated `benchmark.md` to display incorrect information about how many runs were performed per configuration, regardless of the actual number of runs the user executed.
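A possible fix is to derive the value from the data itself. The helper and record shape below are hypothetical (the real `results` structure in this script is not shown here); the sketch assumes each result carries an `eval_id` and `model` identifying its configuration.

```python
from collections import Counter

def runs_per_configuration(results):
    """Derive runs-per-configuration from the results instead of hardcoding 3.

    Hypothetical sketch: assumes each result is a dict with "eval_id" and
    "model" keys forming the configuration key. If runs are balanced, all
    counts agree; otherwise we report the maximum observed.
    """
    counts = Counter((r["eval_id"], r["model"]) for r in results)
    return max(counts.values(), default=0)
```

The metadata entry then becomes `"runs_per_configuration": runs_per_configuration(results)`, keeping the viewer and `benchmark.md` consistent with what was actually run.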
```python
data_json = json.dumps(embedded)
return template.replace("/*__EMBEDDED_DATA__*/", f"const EMBEDDED_DATA = {data_json};")
```
Embedded JSON breaks viewer on `</script>` in output files
Medium Severity
`json.dumps` does not escape `</script>` sequences within string values. When `generate_html` embeds the JSON payload directly inside an HTML `<script>` tag, any text output file containing `</script>` will cause the browser's HTML parser to prematurely close the script block, completely breaking the viewer. The standard mitigation is to replace `</` with `<\/` in the serialized JSON before embedding.
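That mitigation works because `<\/` is a legal JSON escape that parses back to `</`, so the embedded payload round-trips unchanged while never containing a literal `</script>`. A minimal sketch (the function name is illustrative, not the actual `generate_html`):

```python
import json

def embed_json_for_script_tag(embedded):
    """Serialize data for safe inline <script> embedding.

    Replacing "</" with "<\\/" prevents a "</script>" inside string
    values from prematurely terminating the script block; the escaped
    form is still valid JSON and decodes back to "</".
    """
    data_json = json.dumps(embedded).replace("</", "<\\/")
    return f"const EMBEDDED_DATA = {data_json};"
```

In the snippet above, the fix is a one-line change: `data_json = json.dumps(embedded).replace("</", "<\\/")` before the `template.replace` call.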


Adds the official `skill-creator` skill from anthropics/skills for creating and optimizing agent skills. We should use this going forward with every skill. ref https://claude.com/blog/improving-skill-creator-test-measure-and-refine-agent-skills

Update `dotagents` and `skill-scanner` skills to their latest versions.

Closes #19760 (added automatically)