OpenAI unveils AI voice cloning tech that only needs a 15-second sample to work
OpenAI said another use case is for people who cannot speak or have difficulty speaking to give them a voice, which does not sound like a robot. As an employee, it could mean you’ll do fewer routine tasks and spend more time thinking, making decisions, or even managing these AI tools directly. But you’ll still need to learn to work with AI, knowing what it’s good at, how to manage it when it makes mistakes, and how to communicate with AI tools that are always listening. While tools like this save time and offer easier control, the issues of privacy, consent, surveillance, and etiquette in meetings (especially in hybrid or remote work environments) can be substantive. This isn’t a backend utility, either; rather, tools like Otter are becoming active participants in meetings, giving us a glimpse at a future where AI agents might take on even more collaborative roles.
Technology News
One of them used call spoofing so a loved one showed up on the victim’s phone as the person calling. Another one used an AI voice clone to try and extort ransom money from a mother to release her daughter – that wasn’t kidnapped. Further, the Senate Human Rights Subcommittee, chaired by Sen. Jon Ossoff, D-Ga, held a hearing on June 13 to hear from witnesses impacted by AI. “I will never be able to shake that voice and the desperate cry for help from my mind,” DeStefano said. It’s clear that greater clarity is needed to protect the intellectual property rights of individual voices.
Meet the world’s best programmer: A Polish man who beat ChatGPT
Recent machine-learning advances have made it possible for people’s voices to be imitated with only a few short seconds of a voice sample as training data. To mitigate these risks, it is essential to use voice cloning technology responsibly. Always obtain clear and explicit consent from individuals whose voices are being cloned.
“We recognise that generating speech that resembles people’s voices has serious risks, which are especially top of mind in an election year,” the San Francisco-based company said in a statement. OpenAI has made its artificial intelligence (AI) even more humanly eerie with a text-to-voice tool that generates natural speech from a 15-second clip of someone’s voice to sound like the original speaker. Resemble published multiple voice clone samples showcasing the prowess of the new technology. The company uses voice assistants to help seniors choose the right medicare plan. Seniors are often overwhelmed by their choices and struggle with impatient agents. But Fair Square has built a generative AI voice platform built on GPT-4 that can guide seniors through the process, often without long waits.
- The ALS patient’s voice was cloned from recordings that were made of his voice before the disease took away his ability to speak.
- The move, Resemble says, marks a significant development and will make voice cloning technology more accessible, empowering more users to create custom voices for their applications.
- Neural networks (code that processes patterns of information) use deep learning models to ingest and process huge datasets of human speech.
- But it’s also ripe with potential for abuse, as it could easily be used to commit fraud, spread misinformation and generate fake audio evidence.
- Resemble published multiple voice clone samples showcasing the prowess of the new technology.
In the publicly reported cases of AI voice-cloning scams, victims have recounted how the voice sounded “just like” the person being cloned. By using Chatterbox thoughtfully and adhering to best practices, you can unlock its full potential while making sure that its use aligns with ethical and legal standards. This balance between innovation and responsibility is key to maximizing the benefits of AI voice technology while safeguarding against its potential risks.
- There was no waiting, and my issue, which had festered for more than a week, was solved in about 15 minutes.
- Available today, Rapid Voice Cloning can duplicate voices from relatively short datasets and produce an output in just about a minute.
- The challenge lies in ensuring commercial providers implement safeguards to prevent misuse.
- They are easier and cheaper to create compared to deepfake videos, and there are fewer contextual clues to detect with the naked eye,” said Tandon.
I spoke with an AI version of myself, thanks to Hume’s free tool – how to try it
So it shouldn’t come as a surprise that there are at least some heroes out there fighting the good fight with AI voice cloning technology. Whether through their application of these kinds of softwares directly, or the conditions under which they had to use it, I have a lot of respect for people using technology for good. Of course, I know you’re all smart enough to understand that it’s not voice cloning technology itself that’s inherently bad, but the companies and individuals utilising them for evil that are to blame. In all its forms, artificial intelligence has pierced popular culture with a razor sharp bitterness of late.
A startup in Bengaluru (formerly Bangalore) in southern India is offering a no-code conversational AI platform that lets other businesses deploy real-time, multilingual voice agents for customer support and services. Ring AI has expanded beyond India to the Middle East and Latin America, supporting languages like Arabic and Spanish. A recent UK government study showed that civil servants who used Microsoft’s Copilot AI for administrative tasks saved an average of 26 minutes a day, which works out to about two weeks of gained time per year. Voice AI plays a key role in this shift from the pre-AI workplace to the current AI workplace, becoming useful in transcription, summarization, and virtual meeting assistants. Previously, you’d have to have a human assistant sit in on your Zoom or Microsoft Teams meetings to take notes, or delegate someone who may or may not be good at live note-taking to manage these.