Google just kicked off its I/O Developer’s Conference
Google just kicked off its I/O Developer’s Conference, announcing a wide array of updates across its AI ecosystem — including enhancements across its flagship Gemini model family and a new video generation model to rival OpenAI’s Sora. Gemini model updates: - New updates to 1.5 Pro include a massive 2M context window extension and enhanced performance in code, logic, and image understanding. - Gemini 1.5 Pro can also utilize the long context to analyze a range media types, including documents, videos, audio, and codebases. - Google announced Gemini 1.5 Flash, a new model optimized for speed and efficiency with a context window of 1M tokens. - Gemma 2, the next generation of Google’s open-source models, is launching in the coming weeks, along with a new vision-language model called PaliGemma. - Gemini Advanced subscribers can soon create customized personas called ‘Gems’ from a simple text description, similar to ChatGPT GPTs. Video and image model upgrades: - Google revealed a new video model called Veo, capable of generating over 60-second, 1080p resolution videos from text, image, and video prompts. - The new Imagen 3 text-to-image model was also unveiled with better detail, text generation, and natural language understanding than its predecessor. - VideoFX text-to-video tool, featuring storyboard scene-by-scene creation and the ability to add music to generations. - VideoFX is launching in a ‘private preview’ in the U.S. for select creators, while ImageFX (with Imagen 3) is available to try via a waitlist