Modalities

Voice Cloning

AI that learns to mimic a specific person's voice from a short sample, then generates new speech in that voice from any text.

01 ——

In plain English

Voice cloning is AI technology that captures a specific person's voice — often from just a few seconds or minutes of sample audio — and lets you generate new spoken content in that voice from any text.

Legitimate uses:

  • Audiobook narration — let an author "read" their own book
  • Localisation — keep the original actor's voice across languages
  • Accessibility — preserve the voice of someone losing their ability to speak
  • Content creation — podcasters and creators producing scaled audio

Misuse risks:

  • Fraud — voice-cloned scam calls impersonating CEOs or family
  • Deepfake media — fake political audio, defamation
  • Identity theft — bypassing voice-based authentication

Major providers: ElevenLabs leads on quality. Resemble, PlayHT, and Descript also offer voice cloning. All require some form of consent and watermarking — though enforcement varies.

02 ——

Related terms

Back to glossaryLast reviewed May 2026
Vol. 4 · Issue 19 · Last reviewed 2026-05-30

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI