genai-4-beginners
Chapter 9 - Evaluation & Testing
Prompt Testing
LLM Benchmarks (HELM, MMLU, TruthfulQA)
Guardrails for Output Control
Evaluating Relevance, Coherence, Safety