skill
Eval-Driven Development Framework for AI Agents
From tan-yong-sheng/ai-vision-mcp ★ 50
Summary
This skill provides a structured framework for Eval-Driven Development (EDD) in AI coding sessions. With it, developers can define capability and regression tests, track agent reliability with metrics such as pass@k, and generate evaluation reports.
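To make the pass@k metric concrete, here is a minimal sketch of one common way to estimate it: the combinatorial estimator 1 - C(n-c, k) / C(n, k), where n is the number of recorded attempts and c the number that passed. The function name `pass_at_k` and its parameters are illustrative assumptions, not part of the skill itself, and the skill may define or compute the metric differently.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate the chance that at least one of k attempts passes.

    n: total attempts recorded for a task (hypothetical parameter name)
    c: how many of those attempts passed
    k: number of attempts the metric is evaluated at
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # With fewer than k failures, every size-k sample must contain a pass.
    if n - c < k:
        return 1.0
    # Probability that a random size-k sample contains only failures,
    # subtracted from 1.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # Example: 10 recorded attempts, 3 of which passed.
    for k in (1, 3, 5):
        print(f"pass@{k} = {pass_at_k(10, 3, k):.3f}")
```

Under this estimator, pass@1 reduces to the plain pass rate c / n, and the estimate rises with k because more attempts give the agent more chances to succeed at least once.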