Tool Robust Exploration

Studying using mock tool calls to improve prompt robustness.

Layout

src/tool_robust_poc/
- tasks: Datasets we use
- conditions: The prompt conditions
- atttack_opt: The automated redteam attack generation
- runners, reporting.
scripts/ — table generators and final-run launch scripts.
data/ — task input items.
results/ — paper-input result archives (Git LFS).

Setup

uv sync

Note though there is a dependency on fllmingo (a currently internal LLM wrapper package). I need to clean that and figure out about exporting here. If you wanted to run from scratch with this existing code, agents could probably migrate it over to normal APIs fairly easily (the parts used are a thin wrapper and all the parameters passed through to the API apparent). This code is mostly intended as a reference for the writeup.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
results		results
scripts		scripts
src/tool_robust_poc		src/tool_robust_poc
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tool Robust Exploration

Layout

Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tool Robust Exploration

Layout

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages