OpenAI has opened speech AI from ChatGPT to third-party developers – we are waiting for a wave of talking applications

OpenAI has introduced new features to simplify the process of creating applications based on artificial intelligence. Developers can now use an online tool to create voice-based AI solutions using a single set of instructions.

Image source: OpenAI

OpenAI gets most of its revenue from businesses that use the company’s neural networks to create their own AI applications. Expanding the ability to create such products makes sense as the AI ​​battle escalates with companies like Google introducing algorithms into their products that can process different types of information, including text, images and video.

The process of creating voice assistants requires developers to go through at least three stages: converting audio into text, processing the request and generating a text response to it, and converting the received response into audio. As part of the rollout of new capabilities for creating voice AI applications, OpenAI introduced a tool for fine-tuning large language models after completing the training phase. This approach will improve the quality of responses that algorithms created by developers generate in response to queries in text format and using images. The fine-tuning phase can be accompanied by feedback from people who evaluate how well the algorithm produces answers.

OpenAI believes that using images to fine-tune models will give developers greater opportunities to improve AI algorithms’ understanding of what is shown in images. Applications created in this way can act, for example, as an advanced search for visual elements. In addition to this, OpenAI introduced a tool that will allow smaller AI models to learn from larger models, as well as “Fast Caching”, which will significantly reduce development costs by reusing text fragments previously processed by the algorithm. All presented innovations are already being tested with a limited number of OpenAI clients.

admin

Share
Published by
admin

Recent Posts

OnePlus smartphones are again banned from sale in Germany

OnePlus and its parent company Oppo have been embroiled in a protracted patent dispute with…

6 mins ago

Microsoft has released Office 2024 for PC and Mac, which works without a subscription

Microsoft has released a new version of the Office suite for customers who don't want…

40 mins ago

Lian Li presented a compact but roomy case Lancool 207

Lian Li has introduced the Lancool 207 ventilated Mid-Tower case. The dimensions of the new…

60 mins ago

Adobe releases Photoshop and Premiere Elements 2025 with advanced AI features and $90 price

Photoshop Elements and Premiere Elements are lightweight versions of Adobe's most popular image and video…

1 hour ago

Nvidia has released an open-source multimodal AI model, and it’s as good as GPT-4

Nvidia introduced a new family of large multimodal language models, NVLM 1.0, including the NVLM-D-72B,…

2 hours ago