Google DeepMind at ICML 2024

New approaches to generative artificial intelligence and multimodality

Generative AI technologies and multimodal capabilities expand the creative possibilities of digital media.

We will present VideoPoetwhich uses LLM to generate state-of-the-art video and audio from multimodal inputs, including images, text, audio, and other video.

And share Gin (generative interactive environments) that can generate a range of playable environments for training AI agents based on text prompts, images, photos or sketches.

Finally, we present MagicLensa novel image retrieval system that uses text instructions to find images with richer relationships beyond visual similarity.

Supporting the AI ​​community

We are proud to sponsor ICML and support the diverse AI and machine learning community by supporting initiatives led by Disability in AI,Queer in AI,LatinX in artificial intelligence ANDWomen in machine learning.

If you're attending the conference, visit the Google DeepMind and Google Research booths to meet our teams, watch live presentations, and learn more about our research.

LEAVE A REPLY

Please enter your comment!
Please enter your name here