Core concepts

GPT

Generative Pre-trained Transformer — the architecture behind OpenAI's models, and now used as shorthand for any LLM-powered chatbot.

01 ——

In plain English

GPT stands for Generative Pre-trained Transformer. It is both the name of OpenAI's model family (GPT-2, GPT-3, GPT-4, GPT-5) and a generic term many people use for LLM-based chatbots in general.

Breaking down the name:

Generative — produces new text rather than classifying
Pre-trained — trained on a massive general text corpus first, before being fine-tuned for specific uses such as chat
Transformer — the underlying neural network architecture
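
To make "generative" and "pre-trained" concrete, here is a minimal sketch in Python. It is not a transformer and not how GPT models are actually built: a simple word-pair counter stands in for the neural network, and the names pretrain, generate, and the toy corpus are all hypothetical, chosen only for illustration. The overall shape is the same, though: learn from text first, then produce new text one word at a time.

```python
import random
from collections import Counter, defaultdict


def pretrain(corpus):
    """'Pre-trained': learn from text up front by counting which word follows which."""
    counts = defaultdict(Counter)
    words = corpus.split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return dict(counts)


def generate(model, prompt, max_new_words=5, seed=0):
    """'Generative': extend the prompt one word at a time (prompt must be non-empty)."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same text
    words = prompt.split()
    for _ in range(max_new_words):
        followers = model.get(words[-1])
        if not followers:  # the model never saw this word followed by anything
            break
        choices, weights = zip(*followers.items())
        # Sample the next word in proportion to how often it followed the last one.
        words.append(rng.choices(choices, weights=weights, k=1)[0])
    return " ".join(words)


# Toy training text (an assumption for this sketch; real models train on vastly more).
corpus = "the cat sat on the mat and the cat slept on the mat"
model = pretrain(corpus)
print(generate(model, "the cat"))
```

A real GPT replaces the counting table with a transformer network that predicts the next token from the whole preceding context rather than just the last word, but the loop of "predict, append, repeat" is the part this sketch is meant to show.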

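Why GPT became famous: ChatGPT (launched in November 2022 on GPT-3.5, later upgraded to GPT-4) reached an estimated 100 million monthly users about two months after launch, at the time the fastest adoption recorded for a consumer application. It made LLMs mainstream and reset what consumers expect from software.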

Other LLM families (Claude, Gemini, Llama) use the same transformer architecture but aren't technically "GPTs" — though many people use the word loosely for any chatbot.

02 ——