GPT
Generative Pre-trained Transformer — the architecture behind OpenAI's models, and now used as shorthand for any LLM-powered chatbot.
In plain English
GPT stands for Generative Pre-trained Transformer. It's both the technical name for the architecture OpenAI uses (GPT-2, GPT-3, GPT-4, GPT-5) and a generic term most people use to refer to LLM-based chatbots in general.
Breaking down the name:
- Generative — produces new text rather than classifying
- Pre-trained — trained on massive text data before being released
- Transformer — the underlying neural network architecture
Why GPT became famous: ChatGPT (built on GPT-3.5 and later GPT-4) was the first AI product to reach 100 million users in two months. It made LLMs mainstream and reset what consumers expect from software.
Other LLM families (Claude, Gemini, Llama) use the same transformer architecture but aren't technically "GPTs" — though many people use the word loosely for any chatbot.