13. Repeat evaluation

Repeat evaluation

You can add iterations to the experiment.

This is useful when you can repeat the evaluation multiple times:

  • For larger evaluation sets

  • For chains that can generate variable responses

  • Assessments that can generate variable scores (e.g. llm-as-judge )

Reference

Copy

# installation
# !pip install -qU langsmith langchain-teddynotew

Copy

# Configuration file for managing API KEY as environment variable
from dotenv import load_dotenv

# Load API KEY information
load_dotenv()

Copy

Copy

Copy

Define functions for RAG performance testing

Copy

Copy

Repeat evaluation for RAG using GPT model

Copy

Repeat evaluation for RAG using Ollama model

Copy

Last updated