Google has released a new AI model designed to deliver high performance with a focus on efficiency. It’s called Gemini 2.5 Flash and will soon be available as part of Google Cloud’s Vertex AI platform for deploying and managing artificial intelligence (AI) models.
Image source: Google
The company notes that Gemini 2.5 Flash offers “dynamic and controlled” computing, allowing developers to adjust request processing time based on their complexity.
«You can customize the speed, accuracy, and cost balance to your specific needs. This flexibility is key to optimizing Flash performance in high-load, cost-sensitive applications,” the company wrote in its official blog.
With the rising cost of using flagship AI models, the Gemini 2.5 Flash could prove extremely useful. Cheaper, more powerful models like the 2.5 Flash are an attractive alternative to the expensive flagship options, but at the cost of some precision.
Gemini 2.5 Flash is a “reasoning” model, similar to OpenAI’s o3-mini and DeepSeek’s R1. This means it takes a little longer to answer queries for fact-checking. Google says 2.5 Flash is ideal for working with large amounts of data and real-time use, particularly for tasks like customer service and document analysis.
«This working model is optimized specifically for low latency and low costs. It is an ideal engine for responsive virtual assistants and real-time summarization tools, where efficiency at scale is key,” the company describes the new AI model.
Google has not released a security or technical report for Gemini 2.5 Flash, making it difficult to determine its strengths and weaknesses. The company has previously said it does not publish reports for models it considers experimental.
Google also announced that it plans to integrate Gemini models such as 2.5 Flash into on-premises environments starting in the third quarter. They will be available on Google Distributed Cloud (GDC), Google’s on-premises solution for customers with strict data management requirements. The company added that it is working with Nvidia to install Gemini on GDC-compatible Nvidia Blackwell systems that customers can purchase through Google or their own channels.
Asia has become the second region in the world to achieve 50% IPv6 compatibility, according…
Asia has become the second region in the world to achieve 50% IPv6 compatibility, according…
Video games set in the iconic cyberpunk universe of Blade Runner can be counted on…
Video games set in the iconic cyberpunk universe of Blade Runner can be counted on…
Eye medicine is poised to reach a new level. A number of degenerative retinal diseases,…
The Elder Scrolls IV: Oblivion remaster from Bethesda Game Studios and Virtuos Studios, which came…