home/categories/llm-ai/muratcankoylan-agent-skills-for-context-engineering-skills-evaluation-skill-md

llm-aidata-ai

evaluation

Name: evaluation
Author: muratcankoylan

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

View Source llm-ai

maintainer

muratcankoylan

Updated 3/18/2026

Stars

14945

Forks

1173

quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use evaluation