Tag: qa
Automated Code Review and Quality Assurance
This skill provides comprehensive code review assistance, automating style checks and identifying potential bugs, security vulnerabilities, and performance issues. It delivers structured feedback categorized by severity (Critical, Important…
Design QA for Llamactl CLI Changes
This skill facilitates design-QA for llamactl changes, allowing human reviewers to assess the rendered output, naming conventions, and error messaging against a live backend. It generates a matrix of carefully chosen test cases and produces…
Reproducible Redteaming for Prompt Security QA
This tool facilitates reproducible redteaming evaluations of prompts, allowing developers to run, inspect, and triage security scan results. It supports stable evaluation of existing test artifacts or full regeneration, enabling focused rer…
Authoring and Running Promptfoo Evaluation Suites
This skill guides developers through authoring comprehensive promptfoo evaluation suites for robust regression testing and quality assurance. It covers defining prompts, structuring test cases, implementing various assertions, and validatin…
Creating and Managing Promptfoo Evaluation Suites
This skill guides the creation and maintenance of comprehensive promptfoo evaluation suites, enabling rigorous QA for non-redteam coverage, regression testing, and new matrix development. It details structuring configs, writing prompts, sel…
Guidelines for LLM Testing and Development
This skill provides guidelines for writing robust tests of LLM-related functionality. It advises developers on fixture usage, particularly model and provider selection, so that tests remain flexible.
Comprehensive QA Engineering Test Suite Writing
This skill emulates a QA engineer who writes comprehensive pytest suites to validate system functionality. Each suite covers happy paths, identified edge cases, and error modes, and guards against regressions.
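As a rough illustration of the three coverage categories named above, a minimal pytest sketch might look like the following. The function under test, `parse_port`, is hypothetical and exists only to give the tests something to validate; it is not part of the skill itself.

```python
import pytest


def parse_port(value: str) -> int:
    """Hypothetical function under test: parse a TCP port from a string."""
    port = int(value)  # int() raises ValueError for non-numeric or empty input
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return port


def test_happy_path():
    # Typical, well-formed input.
    assert parse_port("8080") == 8080


@pytest.mark.parametrize(
    "value, expected",
    [("1", 1), ("65535", 65535), (" 443 ", 443)],
)
def test_edge_cases(value, expected):
    # Boundary values and surrounding whitespace, which int() tolerates.
    assert parse_port(value) == expected


@pytest.mark.parametrize("value", ["0", "65536", "-1", "http", ""])
def test_error_modes(value):
    # Out-of-range and malformed inputs should all raise ValueError.
    with pytest.raises(ValueError):
        parse_port(value)
```

Keeping each category in its own test function means a failure report shows at a glance whether a regression hit normal behaviour, a boundary, or error handling.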
Automated Diff-Driven QA Validation
This skill automates code validation by analysing git diffs to determine the appropriate testing strategy for frontend, backend, or mixed changes. It executes targeted browser automation, API requests, or repository-native tests to report p…
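One way to picture the diff-driven routing is the sketch below, which classifies a list of changed paths (for example, the output of `git diff --name-only`) as frontend, backend, or mixed. The suffix sets, the `classify_changes` name, and the strategy mapping are assumptions made for illustration; the skill's actual rules are not specified here.

```python
from pathlib import PurePosixPath

# Hypothetical suffix rules; a real repository would define its own.
FRONTEND_SUFFIXES = {".tsx", ".jsx", ".css", ".html", ".vue"}
BACKEND_SUFFIXES = {".py", ".go", ".rs", ".java", ".sql"}

# Assumed mapping from change type to testing strategy.
STRATEGY = {
    "frontend": "browser automation",
    "backend": "API requests",
    "mixed": "browser automation and API requests",
    "unknown": "repository-native tests",
}


def classify_changes(changed_paths):
    """Return 'frontend', 'backend', 'mixed', or 'unknown' for a set of paths."""
    kinds = set()
    for path in changed_paths:
        suffix = PurePosixPath(path).suffix.lower()
        if suffix in FRONTEND_SUFFIXES:
            kinds.add("frontend")
        elif suffix in BACKEND_SUFFIXES:
            kinds.add("backend")
    if kinds == {"frontend", "backend"}:
        return "mixed"
    if kinds:
        return next(iter(kinds))  # exactly one kind was seen
    return "unknown"  # no recognised suffix; fall back to the repo's own tests


if __name__ == "__main__":
    changed = ["web/src/App.tsx", "api/handlers.py"]
    kind = classify_changes(changed)
    print(kind, "->", STRATEGY[kind])
```

Falling back to repository-native tests when nothing matches keeps the sketch conservative: an unrecognised change still gets some validation rather than none.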