Chapter 3: Instruction Fine-Tuning
Why instruction-tune for assistant behavior.
Prepare a JSONL dataset of instruction/query/response.
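As a rough illustration of the dataset step, the sketch below writes and re-reads a small JSONL file, one JSON object per line. The keys `instruction`, `query`, and `response` follow the layout named above; the sample records, the file name `train.jsonl`, and the helper names `write_jsonl` and `read_jsonl` are assumptions made for this example, not part of any library.

```python
import json
import os
import tempfile

# Hypothetical sample records; only the three key names come from the chapter outline.
records = [
    {
        "instruction": "Translate the text to French.",
        "query": "Good morning.",
        "response": "Bonjour.",
    },
    {
        "instruction": "Summarize the text in one sentence.",
        "query": "The meeting moved from Monday to Wednesday because the room was booked.",
        "response": "The meeting moved to Wednesday due to a room conflict.",
    },
]

REQUIRED_KEYS = {"instruction", "query", "response"}


def write_jsonl(path, rows):
    """Write one JSON object per line (the JSONL convention)."""
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            f.write(json.dumps(row, ensure_ascii=False) + "\n")


def read_jsonl(path):
    """Read a JSONL file, skipping blank lines and checking the required keys."""
    rows = []
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue
            row = json.loads(line)
            missing = REQUIRED_KEYS - row.keys()
            if missing:
                raise ValueError(f"line {line_no}: missing keys {sorted(missing)}")
            rows.append(row)
    return rows


# Write to a temporary directory so the example leaves no files behind in the project.
path = os.path.join(tempfile.mkdtemp(), "train.jsonl")
write_jsonl(path, records)
loaded = read_jsonl(path)
```

Validating keys at load time is a design choice: a malformed line then fails early with its line number rather than surfacing later inside a training loop.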
Use transformers.Trainer or trlx for fine-tuning.
Optionally use LoRA/PEFT to adapt a model efficiently.
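To give a sense of why LoRA counts as an efficient adaptation, the sketch below compares trainable parameter counts for a single weight matrix. LoRA keeps the original weight of shape (d_out, d_in) frozen and trains two low-rank factors of shapes (d_out, r) and (r, d_in). The layer size 4096 and rank 8 are illustrative assumptions, and the helper names are hypothetical; this is plain arithmetic, not a call into the PEFT library.

```python
def full_params(d_out, d_in):
    """Trainable parameters when the whole weight matrix is updated."""
    return d_out * d_in


def lora_params(d_out, d_in, rank):
    """Trainable parameters for LoRA factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


# Assumed sizes for illustration only.
d_out, d_in, rank = 4096, 4096, 8

full = full_params(d_out, d_in)        # 16,777,216
lora = lora_params(d_out, d_in, rank)  # 65,536
ratio = lora / full                    # 0.00390625, i.e. about 0.39%

print(f"full: {full:,}  lora: {lora:,}  ratio: {ratio:.4%}")
```

Under these assumed sizes, the adapter trains well under one percent of the parameters in that matrix; the actual savings for a given model depend on which layers are adapted and the rank chosen.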