

Chapter 3: Instruction Fine-Tuning

Why instruction-tune for assistant behavior.
Prepare a JSONL dataset of instruction/query/response.
Use transformers.Trainer or trlx for fine‑tuning.
Optionally use LoRA/PEFT to efficiently adapt a model.
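The dataset step in the outline above can be sketched with the standard library alone. This is a minimal illustration, not the chapter's own code: the field names instruction, query, and response come from the outline, while the sample records, the helper names (write_jsonl, read_jsonl, format_example, demo), and the prompt template are assumptions made for the example.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical sample records; only the three field names come from the outline.
RECORDS = [
    {
        "instruction": "Summarize the text in one sentence.",
        "query": "Instruction tuning trains a model on prompt/response pairs.",
        "response": "Instruction tuning teaches a model to follow prompts.",
    },
    {
        "instruction": "Translate to French.",
        "query": "Good morning",
        "response": "Bonjour",
    },
]


def write_jsonl(records, path):
    """Write one JSON object per line (the JSONL convention)."""
    with open(path, "w", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec, ensure_ascii=False) + "\n")


def read_jsonl(path):
    """Read a JSONL file back into a list of dicts, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def format_example(rec):
    """Join the three fields into one training string.

    The section markers below are an assumed template, not a required format.
    """
    return (
        f"### Instruction:\n{rec['instruction']}\n\n"
        f"### Input:\n{rec['query']}\n\n"
        f"### Response:\n{rec['response']}"
    )


def demo():
    """Round-trip the sample records through a temporary JSONL file."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "train.jsonl"
        write_jsonl(RECORDS, path)
        return read_jsonl(path)


if __name__ == "__main__":
    for rec in demo():
        print(format_example(rec))
        print("-" * 40)
```

Keeping one record per line means the file can be streamed or appended to without parsing it as a whole, and the formatted strings are what a tokenizer would consume before fine-tuning.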


Last updated 5 months ago