Evaluate and Improve Example Workflows

## Problem

Currently, we have no systematic way to do the following for our example workflows:
- Measure how well the workflows perform their intended tasks
- Test prompt improvements before deploying them
- Compare different prompt variations or model configurations
- Validate that changes don't regress quality
- Provide quality benchmarks for the community

## Solution

Use the Gemini CLI evaluation framework to systematically test and improve the effectiveness of prompts and configurations used in our example workflows. This will enable data-driven optimization of our provided workflows and give the community tools to evaluate their own Gemini CLI automations.
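As a rough illustration of the kind of comparison this would enable, here is a minimal Python sketch. It does not use the Gemini CLI evaluation framework, whose API is still being defined in the linked issue. Every name in it (`EvalCase`, `keyword_score`, `evaluate_prompt`, `has_regressed`, `stub_runner`) is hypothetical, and the keyword-overlap metric is only a stand-in for whatever scoring the framework ends up providing. The `run_workflow` callable is assumed to wrap a real workflow invocation.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Sequence


@dataclass(frozen=True)
class EvalCase:
    """One evaluation example (hypothetical shape, not a framework type)."""
    input_text: str
    expected_keywords: tuple[str, ...]


def keyword_score(output: str, case: EvalCase) -> float:
    """Fraction of expected keywords found in the output, case-insensitive."""
    if not case.expected_keywords:
        return 1.0
    lowered = output.lower()
    hits = sum(1 for kw in case.expected_keywords if kw.lower() in lowered)
    return hits / len(case.expected_keywords)


def evaluate_prompt(
    prompt: str,
    cases: Sequence[EvalCase],
    run_workflow: Callable[[str, str], str],
) -> float:
    """Mean score of one prompt over all cases (cases must be non-empty)."""
    return mean(
        keyword_score(run_workflow(prompt, case.input_text), case)
        for case in cases
    )


def has_regressed(baseline: float, candidate: float, tolerance: float = 0.05) -> bool:
    """True if the candidate scores worse than the baseline by more than tolerance."""
    return candidate < baseline - tolerance


def stub_runner(prompt: str, input_text: str) -> str:
    """Placeholder for a real workflow call; assumption for demonstration only."""
    if "label" in prompt.lower():
        return "bug" if "crash" in input_text.lower() else "feature"
    return input_text


if __name__ == "__main__":
    cases = [
        EvalCase("Crash on startup", ("bug",)),
        EvalCase("Add dark mode", ("feature",)),
    ]
    baseline = evaluate_prompt("Label this issue.", cases, stub_runner)
    candidate = evaluate_prompt("Summarize this issue.", cases, stub_runner)
    print(f"baseline={baseline:.2f} candidate={candidate:.2f}")
    print("regression detected:", has_regressed(baseline, candidate))
```

Keeping the runner as a parameter means the same cases can be replayed against different prompts or model configurations, and `has_regressed` could serve as a simple gate before a prompt change is merged.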

## Dependencies

- Gemini CLI evaluation framework: https://github.com/google-gemini/gemini-cli/issues/6757
- Prompt Reusability: #76

## References

- https://google.github.io/adk-docs/evaluate/


Evaluate and Improve Example Workflows #219
