ai mitts
Tag: evalview

skill ★ 105

AI agent regression testing with EvalView

Detect regressions in AI agent behaviour by comparing current outputs and tool calls against golden baselines. It identifies changes in outputs, tool usage, and significant score drops.

hidai25/eval-view regression-testing ai-agents evalview llm-evaluation
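
As a rough illustration of the comparison this skill describes, here is a minimal Python sketch of checking a current agent run against a golden baseline. It does not use EvalView's actual API: the AgentRun structure, the find_regressions function, and the max_score_drop threshold of 0.1 are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class AgentRun:
    """One recorded agent run (hypothetical structure, not EvalView's schema)."""
    output: str
    tool_calls: list[str] = field(default_factory=list)
    score: float = 0.0


def find_regressions(golden: AgentRun, current: AgentRun,
                     max_score_drop: float = 0.1) -> list[str]:
    """Return human-readable differences between a golden run and a current run."""
    findings = []
    # 1. Output text differs from the baseline.
    if current.output != golden.output:
        findings.append("output changed")
    # 2. The sequence of tool names differs (order matters in this sketch).
    if current.tool_calls != golden.tool_calls:
        findings.append(
            f"tool calls changed: {golden.tool_calls} -> {current.tool_calls}"
        )
    # 3. The score fell by more than the allowed threshold (an assumed value).
    drop = golden.score - current.score
    if drop > max_score_drop:
        findings.append(f"score dropped by {drop:.2f}")
    return findings


# Illustrative data only; these runs are made up for the example.
golden = AgentRun("Refund issued.", ["lookup_order", "issue_refund"], 0.92)
current = AgentRun("Refund issued.", ["lookup_order"], 0.70)

for finding in find_regressions(golden, current):
    print(finding)
```

An exact string match on the output is the strictest possible check; a real tool would likely allow some tolerance, since agent outputs often vary in wording between runs.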
Agentic skills & tools, vector-searched.

About

Auto-discovered from public GitHub. Summarised by Ollama. Searched with nomic-embed-text.

© 2026 aimitts.