genai-4-beginners
search
⌘Ctrlk
genai-4-beginners
  • Contents
  • Chapter 1 - Foundations of GenAI
  • Chapter 2 - Models & Architectures
  • Chapter 3 - Model Providers (Expand your existing) Mistral
  • Chapter 4 - Ecosystem Tools & Frameworks
  • Chapter 5 - Memory & Agents
  • Chapter 6 - Infrastructures & Storage
  • Chapter 7 - Ethics & Limitations
  • Chapter 8 - Use Cases & Applications
  • Chapter 9 - Evaluation & Testing
    • Prompt Testing
    • LLM Benchmarks (HELM, MMLU, TruthfulQA)
    • Guardrails for Output Control
    • Evaluating Relevance, Coherence, Safety
  • Chapter 10 - Trends & Future
  • Interview Questions
gitbookPowered by GitBook
block-quoteOn this pagechevron-down

Chapter 9 - Evaluation & Testing

Prompt Testingchevron-rightLLM Benchmarks (HELM, MMLU, TruthfulQA)chevron-rightGuardrails for Output Controlchevron-rightEvaluating Relevance, Coherence, Safetychevron-right
PreviousContent Generation (Text, Code, Images)chevron-leftNextPrompt Testingchevron-right