skill
★ 50
Eval-Driven Development Framework for AI Agents
This skill provides a formal framework for implementing Eval-Driven Development (EDD) within AI coding sessions. It enables developers to define capability and regression tests, track agent reliability using metrics like pass@k, and generat…
tan-yong-sheng/ai-vision-mcp
evaluation-framework
edd
ai-testing
regression-testing