Hands-on with Gemini: Interacting with multimodal AI

Gemini is our natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of our favorite interactions with Gemini.

For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.

0:00 Intro
0:19 Multimodal Dialogue
1:32 Multilinguality
2:04 Game Creation
2:31 Visual Puzzles
3:17 Making Connections
3:39 Image & Text Generation
4:06 Logic & Spatial Reasoning
4:55 Translating Visuals
5:27 Cultural Understanding
