Kerry Wan/ZDNET reports that Google has had a busy year so far, with the rebranding of its AI chatbot from Bard to Gemini and the release of several new AI models. At the Google I/O developer conference, the company made several announcements regarding AI and its integration into various apps and services.
One major announcement was the introduction of Gemini 1.5 Flash, the fastest Gemini model available in the API. This model offers a more cost-efficient alternative to Gemini 1.5 Pro while still being highly capable. Gemini 1.5 Pro has also been upgraded to provide better-quality responses in areas such as translation, reasoning, and coding.
Google also announced the expansion of Gemini Nano to include images in addition to text, as well as the upcoming launch of Gemma 2 in June. The company introduced PaliGemma, its first vision-language model, as part of the Gemma family of models.
In addition to the updates to the Gemini family of models, Google unveiled enhancements to Google Search, including AI-generated overviews and AI-organized search results. The company also introduced Veo, its most advanced text-to-video generator, and Imagen 3, its next-generation text-to-image generator.
Google is expanding its SynthID technology to include text and video modalities, and introducing Ask Photos, an AI solution in Google Photos that allows users to find specific images using conversational prompts.
Finally, Google is upgrading its Gemini Advanced subscription tier with unique experiences, including access to Gemini 1.5 Pro and the new Gemini Live mobile experience, which allows users to have conversations with Gemini using natural-sounding voices.