OpenAI’s new voice synthesizer can copy your voice from simply 15 seconds of audio

OpenAI has been quickly creating its ChatGPT generative AI chatbot and Sora AI video creator during the last 12 months, and it is now acquired a brand new artificial intelligence software to point out off: Voice Technology, which may create artificial voices from simply 15 seconds of audio.

In a blog post (by way of The Verge), OpenAI says it has been working “a small-scale preview” of Voice Engine, which has been in improvement since late 2022. It is really already being utilized in the Read Aloud feature within the ChatGPT app, which (because the title suggests) reads out solutions to you.

As soon as you’ve got educated the voice from a 15-second pattern, you may then get it to learn out any textual content you want, in an “emotive and practical” means. OpenAI says it could possibly be used for instructional functions, for translating podcasts into new languages, for reaching distant communities, and for supporting people who find themselves non-verbal.

This is not one thing everybody can use proper now, however you may go and listen to the samples created by Voice Engine. The clips OpenAI has revealed sound fairly spectacular, although there’s a slight robotic and stilted edge to them.

Security first

Voice Engine is already utilized in ChatGPT’s Learn Aloud function (Picture credit score: OpenAI)

Worries about misuse are the primary motive Voice Engine is just in a restricted preview for now: OpenAI says it needs to do extra analysis into the way it can defend instruments like this from getting used to unfold misinformation and replica voices with out consent.

“We hope to begin a dialogue on the accountable deployment of artificial voices, and the way society can adapt to those new capabilities,” says OpenAI. “Primarily based on these conversations and the outcomes of those small scale checks, we’ll make a extra knowledgeable choice about whether or not and deploy this expertise at scale.”

With main elections due in each the US and UK this 12 months, and generative AI instruments getting extra superior on a regular basis, it is a concern throughout each kind of AI content material – audio, textual content, and video – and it is getting more and more troublesome to know what to belief.

As OpenAI itself factors out, this has the potential to trigger issues with voice authentication measures, and scams the place you may not know who you are speaking to over the cellphone, or who’s left you a voicemail. These aren’t straightforward points to unravel – however we’ll have to search out methods to cope with them.

OpenAI’s new voice synthesizer can copy your voice from simply 15 seconds of audio

16 Finest Gaming Headsets (2024): Wired, Wi-fi, for Change, PC, Xbox, PS5, and PS4

Podcast #766 – AMD FSR 3.1, Radeon 7900 GRE Reminiscence OC Unlock, Microsoft DirectSR, LIVA Z5 PLUS, Apple CPU Vulnerability + MORE

admin

Podcast #766 - AMD FSR 3.1, Radeon 7900 GRE Reminiscence OC Unlock, Microsoft DirectSR, LIVA Z5 PLUS, Apple CPU Vulnerability + MORE

Discussion about this post