Skills
Deduplicate and report bugs to Linear
This skill manages the process of triaging measured bug or regression evidence by searching Linear for existing issues. It requires explicit human approval before either commenting on related tickets or creating new, evidence-only bug repor…
Detect Production Regressions in Langfuse
Proactively identifies production regressions by comparing recent Datadog error logs, spans, and API latency against baseline benchmarks across multiple environments. It generates a structured findings table for human review before optional…
Browser Review for Frontend UI Changes
This skill provides a structured workflow for reviewing user-visible frontend changes, ensuring layout, styling, and navigation regressions are caught. It utilizes the Playwright MCP server to facilitate final signoff for UI-affecting work.
datadog-powered production issue debugging skill
This skill facilitates the root-cause analysis of production failures by integrating Datadog's telemetry (APM, logs, metrics) with the Langfuse codebase. It processes incident reports or issue IDs to deliver a structured analysis, including…
Langfuse Datadog Production Query Recipes
Provides predefined Datadog query shapes for investigating Langfuse production telemetry across multiple environments. It facilitates research into tenant activity, API usage, queue behaviour, and system metrics.
Comprehensive Code Review Workflow for Changes
This skill guides the comprehensive review of code changes (PRs, diffs) focusing on correctness, behavioral regressions, security risks, and performance. It enforces structured output, requiring findings to be listed by severity with precis…
Langfuse Backend Development Guidelines
Comprehensive development guidelines for the Langfuse monorepo, covering tRPC routers, BullMQ processors, and database access patterns using Prisma and ClickHouse. It provides standards for architecture, middleware, and testing across web, …
ClickHouse Schema, Query, and Data Best Practices Review
This skill provides comprehensive, rule-based guidance for optimizing ClickHouse database interactions. It enforces best practices across schema design, query writing, and data ingestion, ensuring all recommendations are cited against speci…
Generate user-focused changelog entries
This skill guides the drafting of user-facing changelog entries for completed features. It requires analyzing a feature branch diff and existing documentation patterns to produce structured, user-centric content.
Analyze Langfuse Cloud Infrastructure Costs
This skill provides evidence-backed cost analysis of Langfuse Cloud infrastructure by querying Metabase cost marts. It identifies total spend, provider splits (e.g., AWS vs ClickHouse), and top cost drivers across various services and usage…
Agent Setup and Maintenance Workflow
A workflow for managing shared agent configurations, skills, and discovery surfaces within the Langfuse repository. It provides guidelines for maintaining canonical files, updating sync scripts, and ensuring robust agent setup during instal…
Managing LLM Model Pricing Configurations
This skill guides developers through updating model pricing configurations across multiple providers, including OpenAI, Anthropic, and Gemini. It covers editing pricing JSON, shared LLM types, and complex regex match patterns to ensure accu…
NocturnusAI Knowledge Base Management
Manage a logical knowledge base by asserting facts, defining Horn clause rules, and performing inference-based queries. The interface supports bulk operations, pattern-based retraction, and schema discovery via an MCP-compatible interface.
GitHub Issue and Pull Request Triage
Automates the triage of GitHub issues and pull requests by refreshing local snapshots, evaluating mergeability, and updating a status ledger. It enforces testing standards and provides a prioritised workflow for managing repository tasks.
AWS Infrastructure Security Analysis
This skill provides workflows for analysing AWS security posture, identifying attack paths, and implementing remediations using Cyntrisec MCP tools. It enables automated security assessments, compliance auditing, and IAM permission optimisa…
Natural Language Smart TV Control
This skill enables natural-language control of a smart TV by mapping user intents to stv CLI commands. It supports playback, volume adjustments, app switching, and URL casting in both English and Korean.
Automated Product Demo Generation with Auto-Zoom
Automates the generation of polished product demos by orchestrating a narrative-driven recording process with intelligent auto-zoom. The skill handles product analysis, interaction choreography, and iterative refinement to produce high-qual…
Wireshark Packet Capture Analysis Workflow
This skill provides a structured workflow for analysing packet captures and live network traffic for security triage, incident response, or troubleshooting. It enables developers to transform raw pcap data into evidence-backed findings usin…
Wireshark Traffic Analysis Skill
Provides a disciplined methodology for analysing packet captures and live network traffic using Wireshark MCP tools. It enables structured investigation for security triage, incident response, and protocol troubleshooting through evidence-b…
Wireshark Network Traffic Analysis
A structured methodology for performing deep packet inspection and network forensics using Wireshark MCP tools. It facilitates systematic triage, security hunting, and protocol troubleshooting through evidence-backed analysis of pcap/pcapng…
Visual GUI Automation and Interaction
Provides agents with visual interaction capabilities to manipulate GUIs through mouse movements, keyboard inputs, and window management. It is compatible with local machines, Docker containers, and cloud-based virtual machines.
PromptSpeak Governance Hold Triage
Facilitates the review and processing of pending governance holds flagged by PromptSpeak. It provides a workflow to approve, reject, or skip risky operations that have been paused for human intervention.
PromptSpeak Governance Configuration
Configures PromptSpeak governance profiles by assessing project risk, mapping to predefined profiles, and applying configuration settings via specific tools.
Agent Behavioural Drift Analysis
Analyse behavioural drift patterns across agents to identify anomalies, trends, and dimension shifts. It provides a structured workflow for reviewing agent health and recommending governance actions such as recalibration or halting.