← Back to browse
skill

AI agent regression testing with EvalView

From hidai25/eval-view ★ 105

Summary

Detect regressions in AI agent behaviour by comparing current outputs and tool calls against golden baselines. It identifies changes in outputs, tool usage, and significant score drops.