← Back to browse
skill

Automated Experiment Benchmarking Skill

From Upsonic/gpt-computer-assistant

Summary

Defines comparison metrics and extracts baseline values from notebook outputs to record them in a structured JSON log for downstream evaluation.