Skip to content

Latest commit

 

History

History
30 lines (25 loc) · 1.99 KB

File metadata and controls

30 lines (25 loc) · 1.99 KB

🦾 PyOB Evolution Roadmap (v0.2.0)

1. TDD Mode (Test-Driven Development)

  • What it does: Forces PyOB to write a failing test before writing any feature code, ensuring functional correctness instead of just "it didn't crash."
  • Implementation:
    • Create a new prompt template UT.md (Unit Test).
    • In Phase 3, the AI must generate a tests/test_feature.py file first.
    • Update run_pipeline to execute pytest. Success is only declared if the new test passes.

2. GitHub "Librarian" Integration

  • What it does: Automatically manages the project's Git lifecycle, creating branches for every feature and opening Pull Requests for you to review.
  • Implementation:
    • Use subprocess to execute git checkout -b, git commit, and git push.
    • Trigger this in execute_targeted_iteration immediately after a successful FINAL VERIFICATION.
    • Optionally use the GitHub CLI (gh pr create) to automate the PR submission.

3. Agent Debate (Consensus Verification)

  • What it does: Uses a second, different AI model (e.g., Qwen 30B) to "sanity check" the code generated by the first model (e.g., Gemini 1.5) before it is ever applied.
  • Implementation:
    • In get_valid_edit, once a patch is generated, send the diff to the other provider.
    • Use a prompt: "Review this diff for logical flaws or security holes. Respond 'VALID' or provide a critique."
    • If the critique is negative, the system auto-regenerates using the critique as feedback.

4. Cyberpunk Hardware Extensions

  • What it does: Expands the target application's (System Monitor) capabilities to include GPU tracking, network packet monitoring, and audio visualization.
  • Implementation:
    • Install and integrate GPUtil (GPU metrics), scapy (Network), and pyaudio (Sound).
    • Update the SystemMetricsEngine to provide these data streams and create new DetailMode Enums in gui.py to visualize the "pulsing" hardware data.