Skip to main content

AI can now duplicate anyone's voice based on just one minute of training

ai lyrebird duplicate anyones voice 60965918 l
Ian Allenden/123RF
Do you remember the cool Mission Impossible tech that lets Tom Cruise’s character Ethan Hunt mimic the voice of other characters using some nifty speech synthesis technology?

Well, a Montreal-based startup called Lyrebird (named after the sound-imitating bird) just invented it for real.

“We are developing new speech synthesis technologies which, among other features, allow us to copy the voice of someone with very little data,” Alexandre de Brebisson, one of the PhD students who developed the deep-learning tech behind the project. “Our experiments show that one minute of audio already contains a lot of the DNA of a human voice. We are able to learn a new voice with as little data because our model is able to capture similarities between the new voice and all the voices it already knows. Our models understand the underlying variables that make [one] voice different from another.”

Since the tech was shown off this week, de Brebisson said his team have received dozens of different suggested use-cases by email, some containing applications they’d thought of, and others containing ones that they hadn’t.

Some companies, for example, are interested in letting their users choose to have audio books read in the voice of either famous people or family members. The same is true of medical companies, which could allow people with voice disabilities to train their synthetic voices to sound like themselves, if recorded samples of their speaking voices exist. Another interesting idea is for video game companies to offer the ability for in-game characters to speak with the voice of the human player.

There are plenty more exciting opportunities, which have led to 10,000 people already signing up to be informed of the forthcoming beta version. “We will then add features over time, such as letting companies design a unique voice tailored for their needs, and control the emotion of the [voice] generation,” de Brebisson continued.

While it doesn’t sound perfect yet, it’s not hard to imagine how this might sound in just a few years. Combined with technology such as software for making convincing edits to the moving lips of a person who is speaking, “fake news” circa 2025 should certainly be a whole lot of fun.

Right?

Editors' Recommendations

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
This AI cloned my voice using just three minutes of audio
acapela group voice cloning ad

There's a scene in Mission Impossible 3 that you might recall. In it, our hero Ethan Hunt (Tom Cruise) tackles the movie's villain, holds him at gunpoint, and forces him to read a bizarre series of sentences aloud.

"The pleasure of Busby's company is what I most enjoy," he reluctantly reads. "He put a tack on Miss Yancy's chair, and she called him a horrible boy. At the end of the month, he was flinging two kittens across the width of the room ..."

Read more
Digital Trends’ Top Tech of CES 2023 Awards
Best of CES 2023 Awards Our Top Tech from the Show Feature

Let there be no doubt: CES isn’t just alive in 2023; it’s thriving. Take one glance at the taxi gridlock outside the Las Vegas Convention Center and it’s evident that two quiet COVID years didn’t kill the world’s desire for an overcrowded in-person tech extravaganza -- they just built up a ravenous demand.

From VR to AI, eVTOLs and QD-OLED, the acronyms were flying and fresh technologies populated every corner of the show floor, and even the parking lot. So naturally, we poked, prodded, and tried on everything we could. They weren’t all revolutionary. But they didn’t have to be. We’ve watched enough waves of “game-changing” technologies that never quite arrive to know that sometimes it’s the little tweaks that really count.

Read more
Digital Trends’ Tech For Change CES 2023 Awards
Digital Trends CES 2023 Tech For Change Award Winners Feature

CES is more than just a neon-drenched show-and-tell session for the world’s biggest tech manufacturers. More and more, it’s also a place where companies showcase innovations that could truly make the world a better place — and at CES 2023, this type of tech was on full display. We saw everything from accessibility-minded PS5 controllers to pedal-powered smart desks. But of all the amazing innovations on display this year, these three impressed us the most:

Samsung's Relumino Mode
Across the globe, roughly 300 million people suffer from moderate to severe vision loss, and generally speaking, most TVs don’t take that into account. So in an effort to make television more accessible and enjoyable for those millions of people suffering from impaired vision, Samsung is adding a new picture mode to many of its new TVs.
[CES 2023] Relumino Mode: Innovation for every need | Samsung
Relumino Mode, as it’s called, works by adding a bunch of different visual filters to the picture simultaneously. Outlines of people and objects on screen are highlighted, the contrast and brightness of the overall picture are cranked up, and extra sharpness is applied to everything. The resulting video would likely look strange to people with normal vision, but for folks with low vision, it should look clearer and closer to "normal" than it otherwise would.
Excitingly, since Relumino Mode is ultimately just a clever software trick, this technology could theoretically be pushed out via a software update and installed on millions of existing Samsung TVs -- not just new and recently purchased ones.

Read more