We designed Gemini 2.5 as a family of hybrid reasoning models that deliver incredible performance while being at the highest level Pareto frontier costs and speed. Today we are taking the next step with our 2.5 Pro and Flash models by releasing them as stable and generally available. Introducing the 2.5 Flash-Lite Preview – our most fuel-efficient and fastest 2.5 model ever.
Widespread release of versions 2.5 Flash and 2.5 Pro
Thanks to all your feedback, today we are releasing stable versions of Flash 2.5 and Pro, so you can create production applications with confidence. Developers like Spline and Rooms and organizations such as Snap and SmartBear have been using the latest versions in production for several weeks now.
Introducing Gemini 2.5 Flash-Lite
We're also previewing the new Gemini 2.5 Flash-Lite, our most fuel-efficient and fastest 2.5 model ever. You can start creating a preview now. We look forward to hearing from you.
Version 2.5 Flash-Lite provides overall higher quality than version 2.0 Flash-Lite in coding, math, science, reasoning, and multimodal benchmarks. It excels at high-volume and latency-sensitive tasks such as translation and classification, with lower latency than 2.0 Flash-Lite and 2.0 Flash versions for a wide sample of hints. It includes the same features that make Gemini 2.5 useful, including the ability to enable thinking across budgets, connecting to tools like Google Search and code execution, multimodal input, and a context length of 1 million tokens.
More details about our 2.5 family of models can be found in the latest issue Gemini Technical Report.


















