
Google’s first hybrid reasoning model gives developers control over cost, speed, and performance.
- Google has released Gemini 2.5 Flash, a new lightweight AI model with improved reasoning.
- It’s the first Gemini model to offer adjustable thinking settings to balance cost, speed, and quality.
- It’s available now in preview via the Gemini API, AI Studio, and Vertex AI. It’s also available in the Gemini app.
Just when you thought you’d got your head around all the Gemini models, Google goes and adds another one to the list. The company has announced that Gemini 2.5 Flash is now available in preview via the Gemini API, with access through both Google AI Studio and Vertex AI.
According to Google, the new model builds on Gemini 2.0 Flash, retaining its speed and low cost while making a “major upgrade” to reasoning capabilities. It’s described as the company’s first fully hybrid reasoning model, allowing developers to toggle thinking on or off and set a thinking budget, effectively controlling how smart the model gets on a per-query basis.

Welcome, from AppFlicks Chief AuteurX! I created AppFlicks out of my love for all-things entertainment, stints as a music manager, producer and as an advisor to film & game studios, telcos, lux brands, retail, hospitality & theme parks. Thank you for taking part in our streaming community! NOTE: Activity from my account, may also be initiated by AppFlicks Admin.