‌
‌

Core concepts

Mixture of Experts

A model architecture that has many "expert" subnetworks but activates only a few per token — getting big-model quality at small-model inference cost.

01 ——

In plain English

Mixture of Experts (MoE) is a neural network design where the model has many specialised sub-networks ("experts"), but a small routing layer picks only a few of them to run on any given token. The result: you get the parameter count of a giant model with the inference cost of a much smaller one.

Why MoE wins on inference economics:

A "1 trillion parameter" MoE might only activate 50B parameters per token
Training cost is high (you train the whole model), but serving cost is what matters at scale
Higher quality per compute dollar than a dense model of the same active size

Notable MoE models:

Mixtral (Mistral) — popularised open-source MoE
DeepSeek-V3 / R1 — large MoE that competes with frontier closed models
Qwen-MoE (Alibaba)
Grok (xAI), reportedly MoE
GPT-4 is widely believed to be MoE (never confirmed by OpenAI)

Trade-offs:

More complex to train and fine-tune
Memory-heavy at inference even though FLOPs are lower
Routing imbalance can leave some experts under-trained

02 ——

Related terms

The neural network architecture introduced in 2017 that powers nearly every modern LLM, image generator, and AI breakthrough.

Small Language Model

A compact language model — typically 1B to 15B parameters — designed to run cheaply, fast, or on-device while still being useful for focused tasks.

Foundation Model

A large, general-purpose AI model trained on broad data that can be adapted (via prompting or fine-tuning) to many downstream tasks.

The process of running a trained AI model to generate a response — as opposed to training the model.

Shrinking an AI model by storing its weights in lower-precision numbers — making it smaller, faster, and cheaper with minimal quality loss.

Open-weight Model

An AI model whose trained weights are publicly released, so anyone can download, run, or fine-tune it themselves.

Back to glossaryLast reviewed June 2026

Vol. 4 · Issue 21 · Last reviewed 2026-06-27

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI

AI Tools Directory

The AI tools directory for discovering, exploring, and comparing the most innovative AI tools in the industry

Explore

All AI tools

Top 100 AI tools

Best AI tools

Curated collections

AI tool alternatives

AI categories

Pricing

AI glossary

Compare AI tools

Blog

Methodology

Editorial team

AI graveyard

Research

MCP server

Latest collections

Policy

Terms & conditions

Privacy policy

FAQ

Refund policy

Affiliate disclosure