Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Google and Google AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google and Google AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Behind the scenes of Google's state-of-the-art "nano-banana" image model

30:32
 
Share
 

Manage episode 502640984 series 3624003
Content provided by Google and Google AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google and Google AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.
Watch on YouTube:
Chapters:
0:37 - New model introduction
1:21 -Demo - Image Editing
3:44 - Text rendering capabilities
4:44 Beyond human preference evals
6:44 - Text rendering as a proxy for quality
8:38 - Positive transfer between modalities
11:25 - Demo - Multi-turn, context aware image generation
13:54 - Pixel-perfect editing and character consistency
15:51 - Interleaved image generation
17:59 - Specialized vs. native models
19:52 - Understanding nuanced prompts
20:59 - User feedback shaping model development
22:37 - Improvements in character consistency
24:17 - More natural looking images from team collaboration
26:41 - What’s next for image generation models

  continue reading

14 episodes

Artwork
iconShare
 
Manage episode 502640984 series 3624003
Content provided by Google and Google AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Google and Google AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.
Watch on YouTube:
Chapters:
0:37 - New model introduction
1:21 -Demo - Image Editing
3:44 - Text rendering capabilities
4:44 Beyond human preference evals
6:44 - Text rendering as a proxy for quality
8:38 - Positive transfer between modalities
11:25 - Demo - Multi-turn, context aware image generation
13:54 - Pixel-perfect editing and character consistency
15:51 - Interleaved image generation
17:59 - Specialized vs. native models
19:52 - Understanding nuanced prompts
20:59 - User feedback shaping model development
22:37 - Improvements in character consistency
24:17 - More natural looking images from team collaboration
26:41 - What’s next for image generation models

  continue reading

14 episodes

ทุกตอน

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play