Ajla Karajko

Google adds real-time audio translation to any headphones

Google has unveiled new Gemini translation enhancements, including a beta feature that enables real-time speech translation delivered directly to any connected headphones. Previously limited to Pixel Buds, this capability is now available to all Android users and supports more than 70 languages, while preserving the speaker’s tone, rhythm, and speaking style.

The updated Gemini 2.5 Flash Native Audio model improves guided conversations, instruction-following, and the use of real-time information. With broader world knowledge, the system better interprets slang and culturally specific expressions, resulting in more natural and accurate translations. Google has also expanded its Duolingo-style language practice mode to 20 additional countries, adding learning streak tracking and pronunciation feedback.

With this move, Google brings the science-fiction vision of a universal translator closer to reality. When any pair of headphones can function as an instant translation tool — and the technology extends to platforms like YouTube and social media — language barriers in the AI era may become nearly invisible.


In Brief: Tech World Highlights

  • MIT enhanced the power of biohybrid robots by combining lab-grown muscles with hydrogel “artificial tendons” and integrating the muscle–tendon module into the fingers of a robotic gripper.
  • Uber and Avride launched a commercial robotaxi service in Dallas, where some Uber rides now arrive in Avride-branded vehicles with safety drivers operating in a limited zone.
  • Beeple’s latest installation, Regular Animals, drew attention at Art Basel Miami Beach with a surreal display of robotic dogs modeled after powerful figures in the tech industry.
  • Pudu Robotics launched the Pudu D5 Series, a new line of quadruped robots designed for complex industrial and outdoor environments.
  • Zurich-based startup Flexion Robotics raised $50 million in a Series A round to build a general-purpose autonomous software stack for humanoid robots.


AI Trending Tools:

  • Vidi2 – ByteDance’s AI video editor with spatio-temporal grounding.
  • Runway Gen-4.5 – Runway’s newly released, highest-rated video model.
  • DeepSeek V3.2 – The latest powerful open-source release from DeepSeek.
