ai mitts

Tag: ai-testing

skill ★ 50

Eval-Driven Development Framework for AI Agents

This skill provides a formal framework for implementing Eval-Driven Development (EDD) within AI coding sessions. It enables developers to define capability and regression tests, track agent reliability using metrics like pass@k, and generat…

tan-yong-sheng/ai-vision-mcp evaluation-framework edd ai-testing regression-testing

tool

Comprehensive AI Data and Model Quality Evaluator

Dingo provides a comprehensive framework for evaluating data and AI outputs using both deterministic rule-based checks and advanced LLM-based metrics. It supports complex workflows, including RAG evaluation and autonomous fact-checking, via…

DataEval/dingo data-quality llm-evaluation rule-based rag-metrics

Agentic skills & tools, vector-searched.

About

Auto-discovered from public GitHub. Summarised by Ollama. Searched with nomic-embed-text.

© 2026 aimitts.