← Back to browse
skill
AI agent regression testing with EvalView
From hidai25/eval-view ★ 105
Summary
Detect regressions in AI agent behaviour by comparing current outputs and tool calls against golden baselines. It identifies changes in outputs, tool usage, and significant score drops.