Tag: automated-evaluation
skill
★ 21
Parallel Codebase Audit Orchestrator
Orchestrates parallel agent execution to perform codebase evaluations, health checks, and documentation audits. It manages scoping end to end and generates structured audit plans for pipeline integration.
skill
Automated Experiment Comparison and Reporting
Compares baseline and new-implementation results by calculating performance metrics and generating structured JSON reports. It also updates experiment logs, summaries, and comparison datasets.
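As a rough illustration of the kind of comparison this entry describes, here is a minimal Python sketch. It is not the skill's actual implementation: the function name `compare_runs`, the metric names, the sample values, and the report layout are all hypothetical, chosen only to show a baseline-versus-new comparison serialized as JSON.

```python
import json


def compare_runs(baseline, candidate):
    """Compare two metric dictionaries (hypothetical format: name -> float).

    Metrics present in both runs get a delta and a relative change;
    metrics present in only one run are listed under "missing".
    """
    report = {
        "metrics": {},
        "missing": sorted(set(baseline) ^ set(candidate)),
    }
    for name in sorted(set(baseline) & set(candidate)):
        old, new = baseline[name], candidate[name]
        delta = new - old
        report["metrics"][name] = {
            "baseline": old,
            "new": new,
            "delta": delta,
            # Relative change is undefined when the baseline value is zero.
            "relative_change": delta / old if old != 0 else None,
        }
    return report


# Illustrative values only; these are not results from any real experiment.
baseline = {"accuracy": 0.80, "latency_ms": 100.0}
candidate = {"accuracy": 0.84, "latency_ms": 125.0, "f1": 0.70}

report = compare_runs(baseline, candidate)
report_json = json.dumps(report, indent=2, sort_keys=True)
print(report_json)
```

Keeping the report a plain dictionary means the same structure can be written to a log file or appended to a comparison dataset without extra conversion.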