[prompt-clustering] Copilot Agent Prompt Clustering Analysis — 2026-05-23 #34200
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #34410.
Summary
NLP clustering analysis of 1,000 copilot agent PRs created between 2026-05-05 and 2026-05-23. TF-IDF features (unigrams + bigrams) were clustered with K-means; k=9 was selected by silhouette score on a k-sweep from 3–9.
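The k-selection step described above can be sketched roughly as follows. This is a minimal, dependency-free illustration, not the pipeline that produced this report: the function names (`kmeans`, `silhouette`, `pick_k`), the deterministic farthest-point seeding, and the toy 2-D points are all assumptions made for the example. A real run would more likely operate on the TF-IDF matrix with a library implementation and several random restarts.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def farthest_point_init(points, k):
    """Deterministic seeding (an assumption; the report does not state its init)."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    return [list(c) for c in centers]


def kmeans(points, k, max_iters=100):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centers = farthest_point_init(points, k)
    labels = None
    for _ in range(max_iters):
        new_labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster empties out
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient; singleton clusters score 0 by convention."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return -1.0  # silhouette is undefined for a single cluster
    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)
            continue
        a = sum(dist(p, q) for q in own) / (len(own) - 1)  # mean intra-cluster distance
        b = min(  # mean distance to the nearest other cluster
            sum(dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)


def pick_k(points, k_values):
    """Sweep k and return (best_k, {k: silhouette score})."""
    scores = {k: silhouette(points, kmeans(points, k)) for k in k_values}
    return max(scores, key=scores.get), scores


# Toy data (hypothetical): three well-separated blobs.
toy_points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 10), (10, 11), (11, 10), (11, 11),
    (20, 0), (20, 1), (21, 0), (21, 1),
]
best_k, sweep_scores = pick_k(toy_points, range(2, 6))
print(best_k, sweep_scores)
```

The sweep keeps every score rather than only the winner, which makes it easy to see how close neighbouring values of k are before committing to one.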
Key Findings
+1067/−952 LOC; most of that is regenerated golden fixtures.
Cluster Overview
Merge Rate by Cluster
Cluster Complexity (Avg Files Changed)
Cluster Details
Cluster 3: Workflow / MCP / shared agent plumbing (catch-all) — 435 PRs, 79% merged
Top terms:
workflow, fix, shared, workflows, add, mcp, files, agent, run, validation
Avg files changed: 30.5
Avg LOC: +425 / −297
Avg commits per PR: 4.4
Avg reviews / comments: 1.5 reviews, 2.6 comments
Sample PRs:
#34124 — fix: skip unlock job when activation was skipped ✅ merged
#34123 — Consolidate workflow FieldLocation onto console ErrorPosition ✅ merged
#34120 — Increase audit workflow repo-memory patch budget to prevent push_repo_memory failures ✅ merged
Cluster 2: Safe-outputs, branch & issue handling — 156 PRs, 89% merged
Top terms:
safe, fix, output, safe output, branch, issue, test, outputs, field, safe outputs
Avg files changed: 18.2
Avg LOC: +252 / −108
Avg commits per PR: 5.0
Avg reviews / comments: 1.8 reviews, 3.7 comments
Sample PRs:
#34144 — Refactor workflow helper hotspots from semantic clustering audit ✅ merged
#34117 — Guard OTLP attribute merge against allocation-size overflow ✅ merged
#34066 — safe-outputs: resolve base branch from origin/HEAD and harden full patch base selection ✅ merged
Cluster 0: Docs & CLI surface — 118 PRs, 80% merged
Top terms:
docs, cli, package, pkg, help, add, sites, logger, updated, rename
Avg files changed: 13.7
Avg LOC: +189 / −103
Avg commits per PR: 3.6
Avg reviews / comments: 1.5 reviews, 1.5 comments
Sample PRs:
#34122 — Validate logger migration completeness across targeted packages
#34121 — docs(reference): add non-Copilot engine examples to targeted reference pages
#34018 — Clarify status/list pattern semantics and add CLI help example validation ✅ merged
Cluster 5: Experiments, prompts & token telemetry — 93 PRs, 87% merged
Top terms:
experiment, prompt, token, workflow, agent, inline, report, usage, variant, sub
Avg files changed: 20.1
Avg LOC: +269 / −147
Avg commits per PR: 3.9
Avg reviews / comments: 1.6 reviews, 1.6 comments
Sample PRs:
#34063 — Optimize ab-testing-advisor prompt with inline sub-agents ✅ merged
#34037 — fix: hypothesis always "(not specified)" in daily experiment report ✅ merged
#34036 — Surface OTel token usage from agent-stdio.log when proxy usage logs are missing ✅ merged
Cluster 7: AWF / firewall / gateway version bumps — 65 PRs, 63% merged
Top terms:
awf, bump, gateway, version, golden, firewall, default, awf config, config, mcp
Avg files changed: 104.5
Avg LOC: +1067 / −952
Avg commits per PR: 3.8
Avg reviews / comments: 1.5 reviews, 4.9 comments
Sample PRs:
#34129 — Fix Codex smoke workflow by preserving OPENAI_API_KEY in AWF container env ✅ merged
#34114 — Bump gh-aw-firewall to v0.25.52 and sync embedded AWF schema ✅ merged
#34081 — Bump default MCP Gateway image to gh-aw-mcpg v0.3.18 ✅ merged
Cluster 4: Pull-request / reviewer flows — 48 PRs, 83% merged
Top terms:
request, pull request, pull, reviewer, review, workflows, command, request reviewer, create pull, reaction
Avg files changed: 21.5
Avg LOC: +317 / −75
Avg commits per PR: 6.1
Avg reviews / comments: 1.8 reviews, 5.3 comments
Sample PRs:
#34139 — fix: exclude merged upstream commits from diffSize in push_to_pull_request_branch incremental mode ✅ merged
#34138 — fix: reject create_pull_request/push_to_pull_request_branch when branch equals base_branch after detection ✅ merged
#34133 — Create REQUEST_CHANGES review for create_pull_request threat-warning mode ✅ merged
Cluster 8: Model providers (Copilot/Codex/BYOK) — 38 PRs, 82% merged
Top terms:
model, copilot, multiplier, alias, inventory, provider, model inventory, engine, byok, codex
Avg files changed: 50.7
Avg LOC: +203 / −96
Avg commits per PR: 3.4
Avg reviews / comments: 1.4 reviews, 1.7 comments
Sample PRs:
#34149 — Use Copilot BYOK platform default model instead of hard-coded Claude fallback ✅ merged
#34148 — Pin Matt Pocock reviewer to supported Copilot model ✅ merged
#34136 — Add @app/copilot-swe-agent as a copilot bot alias ✅ merged
Cluster 1: WIP: Failing GitHub Actions auto-fix — 27 PRs, 74% merged
Top terms:
failing github, fix failing, actions job, actions, github actions, job, wip fix, failing, wip, github
Avg files changed: 49.7
Avg LOC: +255 / −58
Avg commits per PR: 3.1
Avg reviews / comments: 0.9 reviews, 0.3 comments
Sample PRs:
#34119 — [WIP] Fix failing GitHub Actions job agent
#33673 — [WIP] Fix failing GitHub Actions job js-typecheck
#33672 — [WIP] Fix failing GitHub Actions job Lint Gate
Cluster 6: SPDD spec safeguards & compliance — 20 PRs, 95% merged
Top terms:
spdd, spec, safeguards, hash, sync, compliance, frontmatter hash, daily, added, norms
Avg files changed: 28.1
Avg LOC: +496 / −312
Avg commits per PR: 4.0
Avg reviews / comments: 1.4 reviews, 0.8 comments
Sample PRs:
#33591 — Close daily SPDD spec gaps across MCP Scripts, Effective Tokens, Forecast, Frontmatter Hash, and Fuzzy Schedule ✅ merged
#33365 — SPDD: close safeguards/consistency gaps across 5 specs and add targeted compliance vectors ✅ merged
#33096 — docs(spdd): strengthen safeguards, OTel norms, and compliance tests across 5 spec files ✅ merged
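The per-cluster figures above (PR count, merge rate, average files changed) are simple group-by aggregates. A minimal sketch of how such a summary could be computed is shown here; the `PR` record and its field names are hypothetical and are not taken from the report's actual data schema, and the sample values are made up for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class PR:
    """Hypothetical PR record; field names are assumptions for this sketch."""
    number: int
    cluster: int
    merged: bool
    files_changed: int


def summarize(prs):
    """Group PRs by cluster and compute count, merge rate, and mean files changed."""
    groups = defaultdict(list)
    for pr in prs:
        groups[pr.cluster].append(pr)
    summary = {}
    for cluster, members in groups.items():
        n = len(members)
        summary[cluster] = {
            "prs": n,
            "merge_rate": sum(pr.merged for pr in members) / n,
            "avg_files_changed": sum(pr.files_changed for pr in members) / n,
        }
    return summary


# Made-up records, not data from this report.
sample = [
    PR(1, cluster=0, merged=True, files_changed=10),
    PR(2, cluster=0, merged=False, files_changed=20),
    PR(3, cluster=1, merged=True, files_changed=4),
]
for cluster, stats in sorted(summarize(sample).items()):
    print(cluster, stats)
```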
Recent PRs by Cluster (latest 50)
[Table of the 50 most recent PRs by cluster; only fragments of PR titles survived extraction.]
Recommendations
min_df lowered and max_df raised) would surface internal structure: likely "logger migration", "validation refactor", "shared workflow helpers", and "MCP plumbing" as distinct sub-buckets.
Methodology
/tmp/gh-aw/prompt-cache/pr-full-data/ (full PR JSON with body, reviews, commits, files).
min_df=5, max_df=0.6, alpha-only tokens.
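To make the vectorizer settings above concrete, here is a small standard-library sketch of TF-IDF over unigrams and bigrams with `min_df` / `max_df` filtering and alphabetic-only tokens. It is an illustration under assumptions, not the code behind this report: the function names are hypothetical, `min_df` is treated as an absolute document count and `max_df` as a fraction of documents, and the smoothed IDF with L2 row normalisation is one common weighting choice that the report does not confirm.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep alphabetic runs only, so digits and punctuation drop out."""
    return re.findall(r"[a-z]+", text.lower())


def with_bigrams(tokens):
    """Unigrams plus adjacent-pair bigrams joined by a space."""
    return tokens + [f"{a} {b}" for a, b in zip(tokens, tokens[1:])]


def tfidf(docs, min_df=5, max_df=0.6):
    """Return (vocab, rows) where each row is a sparse {term: weight} dict."""
    term_lists = [with_bigrams(tokenize(d)) for d in docs]
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for terms in term_lists for t in set(terms))
    # min_df is an absolute count, max_df a fraction of documents (assumed).
    vocab = sorted(t for t, c in df.items() if c >= min_df and c / n <= max_df)
    # Smoothed IDF (assumed weighting variant).
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    rows = []
    for terms in term_lists:
        counts = Counter(t for t in terms if t in idf)
        vec = {t: c * idf[t] for t, c in counts.items()}
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        rows.append({t: v / norm for t, v in vec.items()})  # L2-normalised row
    return vocab, rows


# Tiny demo corpus (PR-title-like strings); min_df is lowered because it is so small.
titles = [
    "Bump gh-aw-firewall to v0.25.52",
    "Bump default MCP Gateway image",
    "fix safe outputs branch",
]
vocab, rows = tfidf(titles, min_df=1, max_df=0.6)
print(vocab)
```

With alphabetic-only tokens, hyphenated names split into separate words, which is consistent with bigrams such as "safe output" appearing among the top terms in the cluster details.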