OpenAI Slashes the Cost of Using Its AI With a “Mini” Model

OpenAI today announced a cut-price “mini” model that it says will allow more companies and programs to tap into its artificial intelligence. The new model, called GPT-4o mini and available starting today, is 60 percent cheaper than OpenAI’s most inexpensive existing model while offering higher performance, the company says.

OpenAI characterizes the move as part of an effort to make AI “as broadly accessible as possible,” but it also reflects growing competition among AI cloud providers as well as rising interest in small and free open source AI models. Meta, the social media giant, is expected to debut the largest version of its very capable free offering, Llama 3, next week.

“The whole point of OpenAI is to build and distribute AI safely and make it broadly accessible,” Olivier Godement, a product manager at OpenAI responsible for the new model tells WIRED. “Making intelligence available at a lower cost is one of the most efficient ways for us to do that.”

Godement says the company developed a cheaper offering by improving the model architecture and refining the training data and the training regimen. GPT-4o mini outperforms other “small” models on the market in several common benchmarks, OpenAI says.

OpenAI has gained a significant foothold in the cloud AI market thanks to interest in tapping into the remarkable capabilities of its chatbot, ChatGPT, which debuted in late 2022. The company lets outsiders access the large language model that powers ChatGPT, called GPT-4o, for a fee. It also offers a less powerful model called GPT-3.5 Turbo for about a tenth of the cost of GPT-4o.

The interest in language models triggered by ChatGPT’s wild success has prompted competitors to develop similar offerings. Google, a pioneer in AI, has made a major push to build and commercialize a large language model and chatbot under the brand name Gemini. Startups such as Anthropic, Cohere, and AI21 have raised millions to develop and market their own large language models to business customers and developers.

Source : Wired