← Back to browse
skill
Automated Experiment Benchmarking Skill
Summary
Defines comparison metrics and extracts baseline values from notebook outputs to record them in a structured JSON log for downstream evaluation.