LoRA
Low-Rank Adaptation — a cheap way to fine-tune large AI models by training a small set of extra weights instead of the whole model.
In plain English
LoRA (Low-Rank Adaptation) is a fine-tuning technique that lets you customise a large model without retraining all its billions of parameters. Instead, you train a small set of additional weights ("adapters") that modify the model's behaviour for your task.
Why LoRA matters:
- Cost — full fine-tuning a 70B model costs thousands; LoRA can cost dollars
- Storage — adapters are MBs, not GBs; you can have hundreds for one base model
- Speed — train in hours instead of days
- Composability — combine multiple LoRAs (one for tone, one for domain knowledge)
Where you'll see it:
- Image generation — Stable Diffusion communities share thousands of LoRAs for specific styles and subjects
- Open-weight LLMs — most fine-tunes of Llama, Mistral, etc. use LoRA
- Enterprise customisation — internal models tuned on company data
LoRA is the dominant approach to lightweight fine-tuning today.