Using Watch Mode
Automatically re-run evaluations when golden set files change
Watch mode re-runs evaluation whenever a golden set file changes. Useful for iterative development — tweak a test case, save the file, and see results immediately.
Start watching
regtrace watchOutput:
=== regtrace watch ===
i Watching 1 golden set file(s)...
i Press Ctrl+C to stop
Watching: /project/golden-sets/qa.yamlHow it works
- Watches all enabled golden set files from the config
- Debounces changes (500ms) to avoid re-running on partial writes
- Each detected change triggers a full
regtrace run
Workflow
i Change detected: golden-sets/qa.yaml
✓ Evaluating "my-qa-set" (3 test cases)
...
Run Complete
1 suite(s) evaluatedStopping
Press Ctrl+C to stop:
i Watch stoppedUse case: prompt iteration
A typical workflow:
- Run
regtrace watch - Edit a test case's
actual_outputto test a new prompt - Save — watch mode runs evaluation
- Check results
- Iterate
This tight feedback loop is faster than manually running regtrace run after
every change.