fix: close productization gaps and fix documentation issues

SummerOneTwo · claude · SummerOneTwo · commit cbc0e6a7dd15 · 2026-04-09T18:43:36.000+08:00
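- Fix the wrong directory name in the README (autocode-mcp → AutoCode)
- Add a CI smoke test for the built package, verifying that the console script works after the wheel is installed
- Expand the security-boundary notes to spell out the access-control behavior of file_read/file_save
- Add test-layering documentation (tests/README.md) that defines the responsibilities of the L1-L4 test layers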

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
@@ -66,3 +66,35 @@ jobs:
           enable-cache: true
       - run: uv sync --all-extras
       - run: uv run pytest tests/ -v -m "integration"
+
+  # Packaging smoke test: verifies that the console script works after the wheel is installed
+  test-packaging:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-python@v5
+        with:
+          python-version: "3.10"
+      - uses: astral-sh/setup-uv@v4
+        with:
+          enable-cache: true
+      - name: Build wheel
+        run: |
+          uv sync --all-extras
+          uv build
+      - name: Install from wheel
+        run: |
+          # Create a temporary virtual environment for the install test
+          python -m venv /tmp/test-venv
+          source /tmp/test-venv/bin/activate
+          pip install dist/autocode_mcp-*.whl
+      - name: Run packaging smoke tests
+        run: |
+          source /tmp/test-venv/bin/activate
+          pytest tests/test_packaging_smoke.py -v -m "packaging"
+      - name: Verify console script
+        run: |
+          source /tmp/test-venv/bin/activate
+          autocode-mcp --help || true  # --help may exit non-zero, but the command should exist
+          # Verify the MCP handshake
+          echo '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2024-11-05","capabilities":{},"clientInfo":{"name":"test","version":"1.0"}}}' | timeout 5 autocode-mcp | head -1
diff --git a/README.md b/README.md
@@ -40,7 +40,7 @@ uv tool install autocode-mcp
 
 ```bash
 git clone https://github.com/SummerOneTwo/AutoCode.git
-cd autocode-mcp
+cd AutoCode
 uv sync
 ```
 
@@ -454,12 +454,39 @@ problem_pack_polygon(
 
 ⚠️ **Important: This tool is designed for local trusted environments only**
 
-- **File Operations**: `file_read` and `file_save` can read/write arbitrary paths (use `problem_dir` parameter to limit scope)
-- **Code Execution**: Compiles and executes AI-generated C++ code with only time/memory limits, no sandbox isolation
-- **Use Cases**: Local development, competitive programming problem creation, AI-assisted coding in trusted environments
-- **Not Suitable For**: Multi-tenant environments, untrusted code execution, production-grade code execution platforms
+#### File Operations
 
-For stronger isolation, run inside a container or virtual machine.
+- **With `problem_dir` parameter**: `file_read` and `file_save` restrict access to paths within the specified directory
+- **Without `problem_dir` parameter**: These tools can read/write **any arbitrary path** on the filesystem
+- **Recommendation**: Always specify `problem_dir` when calling file operations to limit scope
+
+#### Code Execution
+
+- Compiles and executes AI-generated C++ code with only time/memory limits
+- No sandbox isolation (uses `prlimit` on Linux for memory limits only)
+- **Risk**: Malformed or malicious code could potentially affect the system
+
+#### Use Cases
+
+✅ **Suitable For**:
+- Local development machines
+- Competitive programming problem creation
+- AI-assisted coding in trusted environments
+- Personal workstations with regular backups
+
+❌ **Not Suitable For**:
+- Multi-tenant environments
+- Untrusted code execution
+- Production-grade code execution platforms
+- Shared servers without isolation
+
+#### Mitigation Strategies
+
+For stronger isolation, consider:
+- Running inside a Docker container
+- Using a virtual machine
+- Restricting filesystem permissions at the OS level
+- Running as a non-privileged user
 
 ### Generation Strategies
 
@@ -498,7 +525,7 @@ problems/your-problem/
 
 ```bash
 git clone https://github.com/SummerOneTwo/AutoCode.git
-cd autocode-mcp
+cd AutoCode
 uv sync
 ```
 
diff --git a/README_CN.md b/README_CN.md
@@ -40,7 +40,7 @@ uv tool install autocode-mcp
 
 ```bash
 git clone https://github.com/SummerOneTwo/AutoCode.git
-cd autocode-mcp
+cd AutoCode
 uv sync
 ```
 
@@ -454,12 +454,39 @@ problem_pack_polygon(
 
 ⚠️ **Important: This tool is designed for local trusted environments only**
 
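-- **File Operations**: `file_read` and `file_save` can read/write arbitrary paths (pass the `problem_dir` parameter explicitly to limit scope)
-- **Code Execution**: Compiles and executes AI-generated C++ code with only time/memory limits, no sandbox isolation
-- **Use Cases**: Local development, competitive programming problem creation, AI-assisted coding in trusted environments
-- **Not Suitable For**: Multi-tenant environments, untrusted code execution, production-grade code execution platforms
+#### File Operations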
 
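-For stronger security isolation, run inside a container or virtual machine.
+- **With the `problem_dir` parameter**: `file_read` and `file_save` restrict access to paths within the specified directory
+- **Without the `problem_dir` parameter**: These tools can read/write files at **any path**
+- **Recommendation**: Always specify `problem_dir` when calling file operations to limit the access scope
+
+#### Code Execution
+
+- Compiles and executes AI-generated C++ code with only time/memory limits
+- No sandbox isolation (on Linux, only memory is limited, via `prlimit`)
+- **Risk**: Malformed or malicious code could affect the system
+
+#### Use Cases
+
+✅ **Suitable For**:
+- Local development machines
+- Competitive programming problem creation
+- AI-assisted coding in trusted environments
+- Personal workstations with regular backups
+
+❌ **Not Suitable For**:
+- Multi-tenant environments
+- Untrusted code execution
+- Production-grade code execution platforms
+- Shared servers without isolation
+
+#### Security Hardening Recommendations
+
+For stronger security isolation, consider:
+- Running inside a Docker container
+- Using a virtual machine
+- Restricting filesystem permissions at the OS level
+- Running as a non-privileged user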
 
 ### Generation Strategies
 
@@ -498,7 +525,7 @@ problems/your-problem/
 
 ```bash
 git clone https://github.com/SummerOneTwo/AutoCode.git
-cd autocode-mcp
+cd AutoCode
 uv sync
 ```
 
diff --git a/pyproject.toml b/pyproject.toml
@@ -35,7 +35,10 @@ artifacts = ["src/autocode_mcp/templates"]
 [tool.pytest.ini_options]
 asyncio_mode = "auto"
 testpaths = ["tests"]
-markers = ["integration: marks tests as integration tests (deselect with '-m \"not integration\"')"]
+markers = [
+    "integration: marks tests as integration tests (deselect with '-m \"not integration\"')",
+    "packaging: marks tests as packaging smoke tests (run after uv build)",
+]
 
 [tool.ruff]
 line-length = 100
diff --git a/tests/README.md b/tests/README.md
@@ -0,0 +1,113 @@
+# Test Layering
+
+This project uses a layered testing strategy that provides coverage from unit tests through end-to-end tests.
+
+## Test Layers
+
+```
+┌──────────────────────────────────────────────────────────────────────────┐
+│                        L4: Packaging artifact tests                      │
+│  test_packaging_smoke.py                                                 │
+│  Verifies the console script works after the wheel is installed          │
+│  When: after uv build, in an isolated virtual environment                │
+├──────────────────────────────────────────────────────────────────────────┤
+│                        L3: End-to-end MCP tests                          │
+│  test_e2e_mcp.py                                                         │
+│  Launches a real MCP server over stdio to verify protocol compatibility  │
+│  When: regular CI tests (source environment)                             │
+├──────────────────────────────────────────────────────────────────────────┤
+│                        L2: Integration tests                             │
+│  test_server.py, test_compiler.py, test_*.py                             │
+│  Tests interactions between modules and toolchain integration            │
+│  When: regular CI tests                                                  │
+├──────────────────────────────────────────────────────────────────────────┤
+│                        L1: Unit tests                                    │
+│  test_prompts.py, test_resources.py, test_cache.py                       │
+│  Tests the behavior of individual functions and classes                  │
+│  When: regular CI tests                                                  │
+└──────────────────────────────────────────────────────────────────────────┘
+```
+
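+## Test File Responsibilities
+
+### L1: Unit Tests
+
+| File | Responsibility |
+|------|------|
+| `test_prompts.py` | Tests prompt template generation |
+| `test_resources.py` | Tests resource access |
+| `test_cache.py` | Tests the compilation cache |
+| `test_mixins.py` | Tests tool mixin behavior |
+| `test_resource_limit.py` | Tests the resource-limit utilities |
+| `test_win_job.py` | Tests Windows Job Objects |
+
+### L2: Integration Tests
+
+| File | Responsibility |
+|------|------|
+| `test_server.py` | Tests MCP server tool registration and invocation |
+| `test_compiler.py` | Tests C++ compiler integration |
+| `test_packaging.py` | Tests packaging configuration, template access, and MCP types |
+
+### L3: End-to-End MCP Tests
+
+| File | Responsibility |
+|------|------|
+| `test_e2e_mcp.py` | Real MCP protocol handshake and tool calls |
+
+### L4: Packaging Artifact Tests
+
+| File | Responsibility |
+|------|------|
+| `test_packaging_smoke.py` | Verifies the console script after the wheel is installed |
+
+## CI Test Flow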
+
+```yaml
+# 1. Unit tests + integration tests (multiple Python versions)
+test-unit:
+  - uv run pytest tests/ -v -m "not integration"
+
+# 2. Integration tests (marked as integration)
+test-integration:
+  - uv run pytest tests/ -v -m "integration"
+
+# 3. Packaging artifact tests (after uv build)
+test-packaging:
+  - uv build
+  - pip install dist/*.whl
+  - pytest tests/test_packaging_smoke.py -v -m "packaging"
+```
+
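+## Test Markers
+
+| Marker | Purpose | Example |
+|------|------|------|
+| `@pytest.mark.integration` | Integration tests | Requires g++ or external dependencies |
+| `@pytest.mark.packaging` | Packaging tests | Requires installation from the wheel |
+
+## Running Tests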
+
+```bash
+# Run all unit and integration tests
+uv run pytest tests/ -v
+
+# Run only the unit tests
+uv run pytest tests/ -v -m "not integration"
+
+# Run only the integration tests
+uv run pytest tests/ -v -m "integration"
+
+# Run the end-to-end MCP tests
+uv run pytest tests/test_e2e_mcp.py -v
+
+# Run the packaging artifact tests (install the wheel first)
+pytest tests/test_packaging_smoke.py -v -m "packaging"
+```
+
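+## Coverage Goals
+
+- **L1 unit tests**: cover core logic and give fast feedback
+- **L2 integration tests**: cover module interactions and validate the toolchain
+- **L3 end-to-end tests**: cover MCP protocol compatibility
+- **L4 packaging tests**: cover the usability of release artifacts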
diff --git a/tests/test_packaging_smoke.py b/tests/test_packaging_smoke.py