💥 Have you tried the vocal capabilities of OpenAI's ChatGPT? If not, do it immediately because it's mind-blowing! 🤯 The comprehension capabilities are amazing, but the thing that impresses me the most is how fast it replies and the quality of the voice - it seems real!
👨💻 I have to admit, I've never been a big fan of voice assistants. I've never had such devices at home (despite my passion for home automation and IoT), and I don't use them regularly on my smartphone.
However, I do have to say that the combination of the potential of generative AI and the impressive quality of TTS/STT systems is starting to fascinate me. 🔥 In fact, I think that combined with SLMs (Small Language Models) that can run on edge devices and LAMs (Large Action Models) or agents, they could become a revolution for many sectors.
🚀 Companies that make devices using these technologies effectively are already becoming a reality (as I've already mentioned in this post https://lnkd.in/d7DDfX3s) and step-by-step, we're getting closer and closer to having something like Tony Stark's Jarvis in our homes! 🏡
📢 In the last period, I've also become interested in other technologies, at the same time impressive and frightening, which make it very easy to do something as complex as voice cloning! The novelty lies not so much in the subject matter, which has already been on the market for some time, but in the quality of the result and the ease with which we can access and use this type of functionality. The result obtained is practically indistinguishable from the original voice! ElevenLabs is probably the benchmark for this type of solution.
🧐 Personally, I'm still conflicted about this type of application of AI. I can understand and appreciate its value, but I also see so many associated risks (copyright, deepfakes, and seeing professions such as voice actors disappear in the future).
What do you guys think?
#AI #GenerativeAI #VoiceAI #ChatGPT #OpenAI #TTS #STT #VoiceCloning #ElevenLabs #HomeAutomation #IoT #VirtualAssistants #Jarvis #AIPotential #AIRisks
We’re excited to launch the ElevenLabs Dubbing API — enabling any developer to add audio or video translation to their product while preserving the unique characteristics of the original speaker’s voices.
To get started, create an account and grab an API key, then follow our Python tutorial https://lnkd.in/eJ2gV_gT
Or visit our API reference to learn how to integrate it with any major language: https://lnkd.in/eC7b38KM
We’ve open-sourced our demo app “ElevenVideos” to give you a full end-to-end example: https://lnkd.in/eErkdZ-3