Tag: metrics
Structured Skill for Metric Benchmarking and Logging
This skill defines and extracts comprehensive comparison metrics and baseline values from existing analysis notebooks. It appends a structured Phase 3 benchmark entry to the experiment's log.json, ensuring all necessary metrics are recorded…
Evaluate research results and compare implementations
This skill automates the final evaluation phase of research experiments. It compares metrics between a baseline and a new implementation, determining a formal verdict and generating comprehensive, machine-readable reports for tracking and c…
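The comparison step described above can be sketched as follows. This is a minimal illustration, not the skill's actual logic: the function name `compare_runs`, the `higher_is_better` map, the absolute `tolerance`, and the verdict labels (`pass`, `fail`, `neutral`) are all assumptions made for the example.

```python
import json

def compare_runs(baseline, candidate, higher_is_better, tolerance=0.01):
    """Compare two metric dicts and derive a verdict (hypothetical rules)."""
    rows = {}
    for name, base in baseline.items():
        new = candidate[name]  # raises KeyError if the candidate lacks a metric
        delta = new - base
        # Flip the sign for metrics where a lower value is better (e.g. latency).
        signed = delta if higher_is_better.get(name, True) else -delta
        if signed > tolerance:
            status = "improved"
        elif signed < -tolerance:
            status = "regressed"
        else:
            status = "unchanged"
        rows[name] = {"baseline": base, "candidate": new,
                      "delta": delta, "status": status}

    statuses = {row["status"] for row in rows.values()}
    # Assumed policy: any regression fails the run; otherwise any improvement passes.
    if "regressed" in statuses:
        verdict = "fail"
    elif "improved" in statuses:
        verdict = "pass"
    else:
        verdict = "neutral"
    return {"verdict": verdict, "metrics": rows}

report = compare_runs(
    baseline={"accuracy": 0.80, "latency_ms": 120.0},
    candidate={"accuracy": 0.85, "latency_ms": 110.0},
    higher_is_better={"accuracy": True, "latency_ms": False},
)
print(json.dumps(report, indent=2))  # machine-readable report
```

A single absolute tolerance is crude when metrics live on different scales; a per-metric or relative threshold would likely be a better fit in practice.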
Evaluate retrieval quality metrics and performance
This utility measures core retrieval metrics (Hit@5, MRR, nDCG@10) against a corpus of golden queries. It compares current performance against a saved baseline and provides actionable interpretation and tuning recommendations for diagnosing…
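For reference, the three metrics named above can be computed as in the sketch below. It assumes binary relevance (a document is either relevant or not) and that golden queries are stored as a mapping from query to the set of relevant document IDs; the function names and data layout are illustrative and not taken from the utility itself.

```python
import math

def hit_at_k(ranked, relevant, k=5):
    """1.0 if any relevant document appears in the top k, else 0.0."""
    return 1.0 if any(doc in relevant for doc in ranked[:k]) else 0.0

def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant document, or 0.0 if none is retrieved."""
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0

def ndcg_at_k(ranked, relevant, k=10):
    """Binary-relevance nDCG: DCG of the ranking divided by the ideal DCG."""
    dcg = sum(1.0 / math.log2(rank + 1)
              for rank, doc in enumerate(ranked[:k], start=1)
              if doc in relevant)
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg else 0.0

def evaluate(golden, run):
    """Average each metric over all golden queries (missing queries score 0)."""
    n = len(golden)
    totals = {"hit@5": 0.0, "mrr": 0.0, "ndcg@10": 0.0}
    for query, relevant in golden.items():
        ranked = run.get(query, [])
        totals["hit@5"] += hit_at_k(ranked, relevant, k=5)
        totals["mrr"] += reciprocal_rank(ranked, relevant)
        totals["ndcg@10"] += ndcg_at_k(ranked, relevant, k=10)
    return {name: value / n for name, value in totals.items()} if n else totals

golden = {"q1": {"d1"}, "q2": {"d9"}}
run = {"q1": ["d3", "d1", "d7"], "q2": ["d2", "d4"]}
scores = evaluate(golden, run)
```

Comparing `scores` against a saved baseline then reduces to a per-metric difference; MRR here is the mean of the per-query reciprocal ranks.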
Generates Octocode usage statistics dashboard
This tool generates a local HTML dashboard visualizing Octocode usage statistics, including estimated tokens saved, cache hits, and total tool calls. It processes the stats.json file to provide a comprehensive report on session performance …
Query Netdata Cloud via REST API
This skill enables querying Netdata Cloud via its REST API to retrieve time-series metrics, logs, network flows, and topology data. It provides the necessary procedures for interacting with spaces, rooms, and nodes using an API token.
Query Netdata Agents via Direct HTTP API
This skill provides methods for querying Netdata agents directly via their HTTP API, enabling access to metrics, logs, and functions. It manages the lifecycle of per-agent bearer tokens, allowing power users to bypass cloud round-trips for …
Netdata Collector Development Best Practices Guide
This guide outlines the mandatory mental models and best practices for authoring production-grade data collectors for the Netdata Agent. It covers critical topics such as dashboard shaping via NIDL, maintaining cardinality discipline, and e…
Datadog-powered production issue debugging skill
This skill facilitates the root-cause analysis of production failures by integrating Datadog's telemetry (APM, logs, metrics) with the Langfuse codebase. It processes incident reports or issue IDs to deliver a structured analysis, including…
Financial Ratio Calculation and Analysis
This skill calculates and interprets a wide range of financial ratios and metrics from provided financial statement data, enabling investment analysis and performance evaluation. It supports various input formats and provides industry bench…
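As a small illustration of the kind of calculation involved, the sketch below derives four standard ratios from a flat dictionary of statement figures. The input field names and the `compute_ratios` helper are assumptions for the example; the skill's real input formats and ratio set are not specified here.

```python
def safe_div(numerator, denominator):
    """Return numerator / denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None

def compute_ratios(fs):
    """Compute a few common ratios from a dict of statement figures.

    Expected keys (hypothetical names): current_assets, current_liabilities,
    total_liabilities, shareholders_equity, net_income, revenue.
    """
    return {
        # Liquidity: ability to cover short-term obligations.
        "current_ratio": safe_div(fs["current_assets"], fs["current_liabilities"]),
        # Leverage: liabilities relative to owners' capital.
        "debt_to_equity": safe_div(fs["total_liabilities"], fs["shareholders_equity"]),
        # Profitability: share of revenue kept as net income.
        "net_profit_margin": safe_div(fs["net_income"], fs["revenue"]),
        # Return on equity: net income generated per unit of equity.
        "return_on_equity": safe_div(fs["net_income"], fs["shareholders_equity"]),
    }

statement = {
    "current_assets": 200.0,
    "current_liabilities": 100.0,
    "total_liabilities": 300.0,
    "shareholders_equity": 150.0,
    "net_income": 30.0,
    "revenue": 300.0,
}
ratios = compute_ratios(statement)
```

Returning `None` for a zero denominator keeps an undefined ratio distinguishable from a ratio of zero, which matters when the results are later compared with benchmarks.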