1. ElevenLabs has launched a redesigned speech synthesis page with natural-sounding synthetic voices and voice cloning capabilities.
2. OpenAI’s Voice Engine, delayed due to safety concerns, may challenge ElevenLabs’ platform in the future.
3. ElevenLabs offers fast and simple voice cloning options, including Instant Voice Cloning and Professional Cloning, with potential privacy risks and restrictions in place.
ElevenLabs has revamped its speech synthesis page to make it easier for users to create AI voices and use them for text-to-speech. The platform offers natural sounding synthetic voices and voice clones, with a new design that simplifies the creation process by starting with just a text box and adding controls as users interact with the tool. The platform also offers instant voice cloning and professional cloning options, with the latter requiring equipment verification and up to six hours to receive the final clone. Users can get a remarkably accurate clone of their voice using about three minutes of sample audio for instant cloning, which is available in about 20 minutes.
Voice cloning can be used for various purposes, including creating radio dramas with one actor and improving audio quality. The technology has the potential to bring long-dead performers back to life, raising concerns from organizations like SAG-AFTRA about its implications. While AI-generated speech can sound very realistic, there are restrictions in place within the ElevenLabs system, such as detecting AI-generated clips and preventing the creation of clones of elected officials or candidates.
OpenAI has delayed the launch of its Voice Engine due to safety concerns, opting to first discuss the responsible deployment of synthetic voices and their impact on society. With advancements in text-to-speech technology, there are concerns about the risk of identity fraud and the need for protections against misuse of voice cloning technology. Despite the progress in open-source text-to-speech projects, ElevenLabs continues to innovate in the field with its user-friendly interface and advanced voice cloning capabilities.