← Back to browse
skill

Eval-Driven Development Framework for AI Agents

From tan-yong-sheng/ai-vision-mcp ★ 50

Summary

This skill provides a formal framework for implementing Eval-Driven Development (EDD) within AI coding sessions. It enables developers to define capability and regression tests, track agent reliability using metrics like pass@k, and generate comprehensive evaluation reports.