Tag: machine-learning
Automated experiment benchmarking and metric extraction
Defines comparison metrics and extracts baseline values from notebook outputs to record them in a structured JSON log for downstream evaluation.
Research Method Implementation Skill
Automates the generation of Jupyter notebooks for implementing new research methods, ensuring parity with baseline experiments through consistent data loading and metric computation. It manages dependency updates and logs implementation det…
Analyze Current Baseline Implementation
Extracts model architectures, preprocessing steps, and hyperparameters from existing Jupyter notebooks to create a structured experiment log. It also identifies dependencies and generates a requirements file from the notebook's import state…
Automated Experiment Benchmarking Skill
Defines comparison metrics and extracts baseline values from notebook outputs to record them in a structured JSON log for downstream evaluation.
Analyse Current Experiment Baseline
Analyses existing Jupyter notebooks to extract model architectures, preprocessing steps, and hyperparameters, recording the findings in a structured experiment log.