Tag: automated-evaluation

All Skills Tools

Parallel Codebase Audit Orchestrator

Orchestrates parallel agent execution to perform codebase evaluations, health checks, and documentation audits. It organises the end-to-end scoping process and generates structured audit plans for pipeline integration.

HatmanStack/RAGStack-Lambda codebase-audit agentic-workflow automated-evaluation technical-debt

skill

Automated Experiment Comparison and Reporting

Automates the comparison of baseline and new implementation results by calculating performance metrics and generating structured JSON reports. It handles the systematic updating of experiment logs, summaries, and comparison datasets.

Upsonic/gpt-computer-assistant experiment-tracking automated-evaluation metric-comparison data-analysis