• News
  • Google I/O 2025: Gemini 2.5 Pro gets improved reasoning, audio features and multilingual support

Google I/O 2025: Gemini 2.5 Pro gets improved reasoning, audio features and multilingual support

Google unveiled significant upgrades to its Gemini 2.5 model series at I/O 2025, enhancing reasoning capabilities and introducing native audio output. The updated Gemini 2.5 Pro excels in coding and learning tasks, boasting a 1 million-token context window. A new 'Deep Think' mode is being tested for complex problem-solving, while Gemini 2.5 Flash achieves greater efficiency with fewer tokens.
Google I/O 2025: Gemini 2.5 Pro gets improved reasoning, audio features and multilingual support
At Google I/O 2025, the company announced new updates to its Gemini 2.5 model series adding more powerful reasoning, native audio output, security upgrades, and improved tools for developers. “In March, we announced Gemini 2.5 Pro, our most intelligent model yet…Today, We’re bringing new capabilities to 2.5 Pro and 2.5 Flash,” Google said, announcing the new updates. The upgraded Gemini 2.5 Pro model now tops performance charts, including WebDev Arena for coding and LMArena for human preference testing. It also features a 1 million-token context window, which allows it to handle longer inputs and video understanding tasks.Google said that thanks to LearnLM — a version of Gemini developed with educational experts — the model now leads in learning-related tasks as well.“Educators and experts preferred Gemini 2.5 Pro over other models across a diverse range of scenarios,” the company said.

Native audio, emotional dialogue and multilingual support

Google also introduced native audio output for a more natural AI experience. Gemini can now speak with different tones, accents, and styles — such as a dramatic voice when telling a story. It can also:
  • Detect user emotions and respond accordingly (Affective Dialogue)
  • Ignore background noise (Proactive Audio)
  • Handle more complex voice tasks (Thinking in the Live API)
The text-to-speech tool now supports multiple speakers and over 24 languages, and it can switch between languages mid-conversation. These features will be available later today through the Gemini API.

New ‘Deep Think’ for complex tasks

Google said that it is testing an enhanced reasoning mode called Deep Think, which helps Gemini consider multiple answers before responding. It's aimed at tough challenges like advanced math and programming.“We’re starting to test an enhanced reasoning mode called Deep Think,” the company said.“We’re taking extra time to conduct more frontier safety evaluations and get further input from safety experts.”Deep Think is already leading benchmarks like the 2025 USAMO (math), LiveCodeBench (coding), and MMMU (multimodal reasoning).

Gemini 2.5 Flash gets faster and more efficient

Gemini 2.5 Flash, the lightweight version of the model, now uses 20–30% fewer tokens while improving performance across reasoning, code, and multimodal tasks, the company announced. It is now available in the Gemini app, Google AI Studio, and Vertex AI.A general release of the updated model is expected in early June, with 2.5 Pro following soon after.
author
About the Author
TOI Tech Desk

The TOI Tech Desk is a dedicated team of journalists committed to delivering the latest and most relevant news from the world of technology to readers of The Times of India. TOI Tech Desk’s news coverage spans a wide spectrum across gadget launches, gadget reviews, trends, in-depth analysis, exclusive reports and breaking stories that impact technology and the digital universe. Be it how-tos or the latest happenings in AI, cybersecurity, personal gadgets, platforms like WhatsApp, Instagram, Facebook and more; TOI Tech Desk brings the news with accuracy and authenticity.

End of Article
Follow Us On Social Media
OSZAR »