Training

LoRA

Low-Rank Adaptation — a cheap way to fine-tune large AI models by training a small set of extra weights instead of the whole model.

01 ——

In plain English

LoRA (Low-Rank Adaptation) is a fine-tuning technique that lets you customise a large model without retraining all its billions of parameters. Instead, you train a small set of additional weights ("adapters") that modify the model's behaviour for your task.

Why LoRA matters:

Cost — full fine-tuning a 70B model costs thousands; LoRA can cost dollars
Storage — adapters are MBs, not GBs; you can have hundreds for one base model
Speed — train in hours instead of days
Composability — combine multiple LoRAs (one for tone, one for domain knowledge)

Where you'll see it:

Image generation — Stable Diffusion communities share thousands of LoRAs for specific styles and subjects
Open-weight LLMs — most fine-tunes of Llama, Mistral, etc. use LoRA
Enterprise customisation — internal models tuned on company data

LoRA is the dominant approach to lightweight fine-tuning today.

02 ——