[English](./README.md) | [简体中文](./README.zh-CN.md)

<p align="center">
  <img src="./assets/hero-banner.svg" alt="Task Bundle hero banner" width="100%" />
</p>

<p align="center"><strong>Turn AI coding runs into portable, replayable, benchmark-ready task bundles.</strong></p>
<p align="center">The missing middle layer between raw chat logs and heavyweight benchmark platforms.</p>
<p align="center">
  <a href="#quickstart"><strong>Quick Start</strong></a> ·
  <a href="#real-bundles"><strong>Real Output</strong></a> ·
  <a href="./docs/bundle-format.md"><strong>Bundle Format</strong></a> ·
  <a href="./ROADMAP.md"><strong>Roadmap</strong></a> ·
  <a href="./docs/branding.md"><strong>Brand Assets</strong></a>
</p>

[![CI](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml/badge.svg)](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml)
[![GitHub stars](https://img.shields.io/github/stars/wimi321/task-bundle?style=social)](https://github.com/wimi321/task-bundle/stargazers)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](./LICENSE)

Task Bundle is a TypeScript + Node.js CLI for teams building agents, evals, coding benchmarks, and reproducible AI workflows.

Package a task once, inspect it later, compare tools on the same starting point, and generate benchmark-style reports from real artifacts.

Why people star it:

- turn one AI coding run into a clean, shareable directory instead of a screenshot, transcript, or loose patch
- compare Codex, Claude Code, Cursor, or internal agents with real metadata, hashes, and outcome fields
It is intentionally not:

- a benchmark platform
- a token-by-token recorder

<a id="quickstart"></a>

## Quick Start

Run the repo against real example bundles in about a minute:

```bash
npm install
npm run build
npm run dev -- compare ./examples/hello-world-bundle ./examples/hello-world-bundle-claude
```

If you want the shortest possible proof that the project already works, this is it.

<a id="real-bundles"></a>

## See It On Real Bundles

Inspect a bundle:

```text
$ npm run dev -- inspect ./examples/hello-world-bundle
Task Bundle
-----------
Title: Fix greeting punctuation
Tool: codex
Model: gpt-5
Status: success
Score: 0.93
Workspace files: 1
Events: 3
```

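The inspect fields suggest a small, flat manifest shape. The interface below is a minimal sketch inferred from the output above; the names and types are assumptions, not the project's actual schema (that lives in [Bundle Format](./docs/bundle-format.md)):

```typescript
// Hypothetical manifest shape inferred from the inspect output above.
// Field names and types are assumptions, not the real bundle schema.
interface BundleManifest {
  title: string;
  tool: string;                   // e.g. "codex", "claude-code"
  model: string;                  // e.g. "gpt-5"
  status: "success" | "failure";
  score: number;                  // outcome score in [0, 1]
  workspaceFiles: number;         // files captured in the workspace snapshot
  events: number;                 // high-level run events, not token-by-token logs
}

const example: BundleManifest = {
  title: "Fix greeting punctuation",
  tool: "codex",
  model: "gpt-5",
  status: "success",
  score: 0.93,
  workspaceFiles: 1,
  events: 3,
};

console.log(`${example.tool} / ${example.model}: ${example.score}`);
```

A shape like this is what makes bundles diffable and machine-comparable, rather than free-form transcripts.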
Compare two tools on the same task:

```text
$ npm run dev -- compare ./examples/hello-world-bundle ./examples/hello-world-bundle-claude
Task Bundle Comparison
----------------------
Left tool: codex
Right tool: claude-code
Left score: 0.93
Right score: 0.89
Score delta: 0.04
Workspace file delta: 0
Event count delta: -1
```

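The deltas in that output are plain left-minus-right arithmetic over each bundle's summary fields. A sketch of that computation, using illustrative field names rather than the CLI's internal API:

```typescript
// Hypothetical delta computation mirroring the compare output above;
// the field names are illustrative, not the CLI's internal API.
interface RunSummary {
  score: number;
  workspaceFiles: number;
  events: number;
}

function compareRuns(left: RunSummary, right: RunSummary) {
  return {
    // Round to two places so float noise (0.93 - 0.89) prints as 0.04.
    scoreDelta: Number((left.score - right.score).toFixed(2)),
    workspaceFileDelta: left.workspaceFiles - right.workspaceFiles,
    eventCountDelta: left.events - right.events,
  };
}

const codexRun = { score: 0.93, workspaceFiles: 1, events: 3 };
const claudeRun = { score: 0.89, workspaceFiles: 1, events: 4 };
const delta = compareRuns(codexRun, claudeRun);
// Matches the sample output: score delta 0.04, file delta 0, event delta -1.
```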
Generate a benchmark-style summary from a directory of runs:

```text
$ npm run dev -- report ./examples --out ./dist/benchmark-report.md
Bundles: 2
Average score: 0.91

Ranking
1. Fix greeting punctuation | codex / gpt-5 | success | score 0.93
2. Fix greeting punctuation | claude-code / claude-sonnet-4 | success | score 0.89
```

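The report's numbers are straightforward aggregates: a mean over bundle scores and a best-first sort for the ranking. A minimal sketch under the same assumed field names, not the CLI's actual report code:

```typescript
// Hypothetical aggregation mirroring the report output above.
interface ScoredBundle {
  title: string;
  tool: string;
  score: number;
}

function summarize(bundles: ScoredBundle[]) {
  const mean = bundles.reduce((sum, b) => sum + b.score, 0) / bundles.length;
  // Rank best-first; copy before sorting so the input array stays untouched.
  const ranking = [...bundles].sort((a, b) => b.score - a.score);
  return { averageScore: Number(mean.toFixed(2)), ranking };
}

const runs: ScoredBundle[] = [
  { title: "Fix greeting punctuation", tool: "claude-code", score: 0.89 },
  { title: "Fix greeting punctuation", tool: "codex", score: 0.93 },
];
const summary = summarize(runs);
// Average of 0.93 and 0.89 is 0.91, as in the sample report.
```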
## Why It Matters

Most AI coding work disappears into screenshots, transcripts, or one-off patches.