Subject
When prompting runs out of road and you need to actually change the model: full fine-tuning versus LoRA and QLoRA, how to build a dataset that teaches instead of confuses, and how distillation shrinks a big model into a fast one you can afford to run.
Fine-tuning is the option people reach for first and need last. It feels like the serious move, the one that actually changes the model instead of fiddling with words. But changing the model is expensive in ways that do not show up until later: you have to build a dataset, you have to retrain when the base model updates, and you have to keep an eval suite alive to know the fine-tune did not quietly regress. The cheaper options solve most problems, so the real skill is knowing when they run out.
Think of it as a ladder. Each rung is cheaper and faster to undo than the one above it.
You cannot skip rungs honestly. If you have not measured the gap on an eval set, you do not know which rung you are on.
Fine-tuning earns its cost in three situations:
The biggest mistake is using it to add knowledge. Fine-tuning on a pile of facts does not reliably make the model know those facts; it makes the model fluent at producing text that looks like those facts, which often means hallucinating in the same register with more confidence. Knowledge that changes belongs in a retrievable source, not in frozen weights you would have to retrain to update.
Before you commit, price in the parts nobody mentions on the sales slide:
If a clearer prompt plus a few examples plus retrieval closes the gap, ship that. Fine-tune when, and only when, you have measured a behavioral gap those cannot close and your volume justifies the ongoing cost.
A short editorial reading list. Pick whichever fits how you like to learn.
Comments 0
No comments yet. Be the first to share your thoughts.