Adobe working on technology that's basically Photoshop for audio recordings

Adobe is one of the best-known and most influential companies in software for processing audio and visual information. It’s no surprise, then, that the company is breaking new ground in editing recorded data and turning it into something else.

One specific project that Adobe is working on with Princeton University is called Project VoCo, which received a sneak preview at Adobe’s MAX 2016 conference. Adobe developer Zeyu Jin showed off the technology in a brief presentation, describing the process as essentially Photoshop for audio, as The Verge reports.

According to the demonstration, Project VoCo lets a user add words to a speech recording that were not in the original. The software’s algorithm requires roughly 20 minutes of recorded speech from a given individual; it analyzes that audio and then replicates the individual’s voice. Project VoCo can then insert new, never-spoken words into recordings.

Adobe has a rather innocent idea of what the technology might be used for, as it indicated in an official statement: “When recording voiceovers, dialogue, and narration, people would often like to change or insert a word or a few words due to either a mistake they made or simply because they would like to change part of the narrative. We have developed a technology called Project VoCo in which you can simply type in the word or words that you would like to change or insert into the voiceover. The algorithm does the rest and makes it sound like the original speaker said those words.”

Just as with Photoshopped images, however, it’s easy to imagine much more nefarious uses for such a technology. In fact, if the algorithm is good enough and can create a synthetic voice that’s identical to the actual speaker’s, then the very notion of using sound recordings as evidence could go out the window. Like many technologies, then, Project VoCo has both its light and its dark sides.

There’s no information on if or when Project VoCo technology will make it into a commercially available product. It’s not particularly surprising that a company like Adobe is working on the ability to create artificial recordings, and it would be surprising if the technology did not reach the market at some point.

Mark Coppock