Chapter 3: Instruction Fine-Tuning

  • Why instruction-tune for assistant behavior.

  • Prepare a JSONL dataset of instruction/query/response records, one JSON object per line.

  • Use transformers.Trainer or trlx for fine-tuning.

  • Optionally use LoRA/PEFT to efficiently adapt a model.
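
The dataset step in the outline can be sketched with the standard library alone. This is a minimal illustration, not a prescribed format: the key names simply mirror the instruction/query/response triple, and the sample records, the file name `train.jsonl`, and the `### Instruction` prompt template are all assumptions made for the example.

```python
import json
import os
import tempfile

# Hypothetical sample records; keys mirror the instruction/query/response triple.
records = [
    {"instruction": "Translate to French.",
     "query": "Good morning",
     "response": "Bonjour"},
    {"instruction": "Summarize in one sentence.",
     "query": "The cat sat on the mat all day.",
     "response": "A cat stayed on a mat."},
]


def write_jsonl(path, rows):
    """Write one JSON object per line (the JSONL convention)."""
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            f.write(json.dumps(row, ensure_ascii=False) + "\n")


def read_jsonl(path):
    """Read a JSONL file back into a list of dicts, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def format_example(row):
    """Turn a record into a prompt/completion pair.

    The template below is an assumed layout for illustration only.
    """
    prompt = (
        f"### Instruction:\n{row['instruction']}\n\n"
        f"### Input:\n{row['query']}\n\n"
        "### Response:\n"
    )
    return {"prompt": prompt, "completion": row["response"]}


# Round-trip the records through a temporary file (hypothetical file name).
path = os.path.join(tempfile.mkdtemp(), "train.jsonl")
write_jsonl(path, records)
loaded = read_jsonl(path)
examples = [format_example(r) for r in loaded]

print(examples[0]["prompt"] + examples[0]["completion"])
```

Keeping the prompt and the completion separate makes it straightforward to compute the loss on response tokens only later on, if the chosen training setup supports that.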
