Search Skills

Search for skills or navigate to categories

SkillforthatSkillforthat
AI & Machine Learning
L

llm-evaluation

Implement comprehensive evaluation strategies for LLM applications.

Category

AI & Machine Learning

Author

wshobson

Updated

Jan 2026

Tags

4

Install Command

claude skill add wshobson/agents

Description

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.

Tags

EvaluationLLMMetricsBenchmarking

Information

Developerwshobson
CategoryAI & Machine Learning
CreatedJan 15, 2026
UpdatedJan 15, 2026

You Might Also Like