‌
‌

Safety

Data Poisoning

An attack that corrupts a model’s training data to make it behave incorrectly — either degrading performance or installing hidden backdoors.

01 ——

In plain English

Data poisoning is an attack on the AI supply chain: an attacker injects bad data into a training set so the resulting model behaves in a way they want. Because frontier models scrape much of the public web, the threat is concrete and growing.

Two main types:

Availability attacks — degrade the model's general performance (subtle quality drops across many tasks)
Backdoor attacks — make the model behave normally except when triggered by a specific phrase, image, or pattern — at which point it produces malicious output

Real-world surface area:

Web-scraped pretraining data (anyone with a website can try)
Open datasets (Common Crawl, LAION, Stack Overflow dumps)
User-uploaded fine-tuning data
RAG corpora (a poisoned doc in a vector DB can hijack answers)

Defences: Data filtering, provenance tracking, training-data deduplication, and post-hoc red teaming. No defence is foolproof; this is an active research area.

02 ——

Related terms

Adversarial Attack

Deliberately crafted inputs that trick an AI model into producing wrong or harmful outputs — a key category of AI security threat.

Prompt Injection

A security attack where malicious instructions hidden in user input or external content trick an AI model into ignoring its real instructions.

Deliberately trying to make an AI model misbehave — find jailbreaks, exploits, and failure modes — before adversaries do.

The dataset an AI model learns from — its quality, diversity, and biases directly shape what the model can do and how well it does it.

Retrieval-Augmented Generation — a technique that gives an AI model access to external documents before it answers, so it can cite real, up-to-date sources.

Rules and filters that constrain what an AI model can output — used to block harmful, off-topic, or non-compliant responses.

Back to glossaryLast reviewed June 2026

Vol. 4 · Issue 21 · Last reviewed 2026-06-27

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI

AI Tools Directory

The AI tools directory for discovering, exploring, and comparing the most innovative AI tools in the industry

Explore

All AI tools

Top 100 AI tools

Best AI tools

Curated collections

AI tool alternatives

AI categories

Pricing

AI glossary

Compare AI tools

Blog

Methodology

Editorial team

AI graveyard

Research

MCP server

Latest collections

Policy

Terms & conditions

Privacy policy

FAQ

Refund policy

Affiliate disclosure