← Back to Catalog

skill-eval

Automated CLI for evaluating AI agent skills

Version: 1.0.0
Binaries: 1
Platforms: 2
License: GPL-3.0
Homepage: Link ↗

Install:

brew install matt-riley/tools/skill-eval

README


title: Home description: skill-evaluator is a friendly CLI tool that automates eval-driven iteration for AI skills, helping you run, grade, and benchmark agent outputs.

✨ skill-eval

Welcome to skill-eval! This friendly little CLI tool helps you automate the eval-driven iteration loop for your AI skills. It's inspired by the workflow from agentskills.io.

Skill Evaluator in action!

With skill-eval, you can easily define test cases, run your agent with and without a skill, have an LLM grade the results, and see how everything benchmarks. Let's make your skills amazing! 🚀