Core concepts

GPT

Generative Pre-trained Transformer — the architecture behind OpenAI's models, and now used as shorthand for any LLM-powered chatbot.

01 ——

In plain English

GPT stands for Generative Pre-trained Transformer. It's both the technical name for the architecture OpenAI uses (GPT-2, GPT-3, GPT-4, GPT-5) and a generic term most people use to refer to LLM-based chatbots in general.

Breaking down the name:

  • Generative — produces new text rather than classifying
  • Pre-trained — trained on massive text data before being released
  • Transformer — the underlying neural network architecture

Why GPT became famous: ChatGPT (built on GPT-3.5 and later GPT-4) was the first AI product to reach 100 million users in two months. It made LLMs mainstream and reset what consumers expect from software.

Other LLM families (Claude, Gemini, Llama) use the same transformer architecture but aren't technically "GPTs" — though many people use the word loosely for any chatbot.

02 ——

Related terms

Back to glossaryLast reviewed May 2026
Vol. 4 · Issue 19 · Last reviewed 2026-05-30

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI