genai-4-beginners

Chapter 9 - Evaluation & Testing

  • Prompt Testing
  • LLM Benchmarks (HELM, MMLU, TruthfulQA)
  • Guardrails for Output Control
  • Evaluating Relevance, Coherence, Safety