evaluating-rag

Name: evaluating-rag
Author: tkhongsap

Evaluate RAG systems with hit rate, MRR (mean reciprocal rank), and faithfulness metrics, and compare retrieval strategies. Use this skill when testing retrieval quality, generating evaluation datasets, comparing embeddings or retrievers, running A/B tests, or measuring production RAG performance.
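
To make the two retrieval metrics named above concrete, here is a minimal sketch in plain Python. It is not the skill's own implementation. The function names `hit_rate` and `mean_reciprocal_rank`, the document IDs, and the assumption of exactly one expected document per query are all illustrative.

```python
from typing import Sequence


def hit_rate(retrieved: Sequence[Sequence[str]], expected: Sequence[str], k: int = 5) -> float:
    """Fraction of queries whose expected document ID appears in the top-k results."""
    if not expected:
        return 0.0
    hits = sum(1 for docs, gold in zip(retrieved, expected) if gold in docs[:k])
    return hits / len(expected)


def mean_reciprocal_rank(retrieved: Sequence[Sequence[str]], expected: Sequence[str]) -> float:
    """Average of 1/rank of the expected document; a miss contributes 0."""
    if not expected:
        return 0.0
    total = 0.0
    for docs, gold in zip(retrieved, expected):
        if gold in docs:
            total += 1.0 / (list(docs).index(gold) + 1)  # ranks are 1-based
    return total / len(expected)


# Hypothetical results for three queries (IDs are made up for illustration).
retrieved = [["d1", "d2", "d3"], ["d4", "d5", "d6"], ["d7", "d8", "d9"]]
expected = ["d1", "d5", "dX"]  # the third query's expected document is never retrieved

print(hit_rate(retrieved, expected, k=3))
print(mean_reciprocal_rank(retrieved, expected))
```

Comparing two retrieval strategies then amounts to running both over the same query set and comparing these numbers. Faithfulness is a different kind of metric: it checks whether the generated answer is supported by the retrieved context, and typically relies on an LLM judge rather than a closed-form formula.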

Updated 10/30/2025

quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use evaluating-rag