Building Conversational Agents — Thor Schaeff and Philipp Schmid, Google DeepMind

By ai.engineer

Categories: AI, Tools

Summary

Google DeepMind's developers showcase how to build conversational agents with Gemini API at no cost—the free tier covers all hands-on demos in this workshop. Learn practical setup, API key management, and real-world deployment patterns from engineers shipping production AI features.

Key Takeaways

Access Gemini API free tier with just a Gmail account—no credit card required. Visit ai.dev or aistudio.com to create API keys in minutes for hands-on development.
Treat API keys as secrets like passwords. Developers frequently leak keys to GitHub through committed code—use environment variables (.bashrc, .zshrc) or inline injection instead.
Google AI Studio provides a zero-code interface to test Gemini models before building API integrations. Ideal for rapid prototyping before committing engineering resources.
Gemini supports WebSocket connections for real-time conversational interactions—enabling live voice and multi-language agent capabilities demonstrated via phone software.
Multi-language support is native to Gemini. Workshop attendees from 10+ countries tested models across Spanish, Romanian, Czech, Farsi, and Hindi in real-time.

Topics

Gemini API Setup
Conversational Agents
WebSocket Real-time Agents
API Key Security
Free Tier Development

Transcript Excerpt

Hello everyone. >> Hello. >> Perfect. >> It's just that Philip and I were both Germans and we thought it was funny. Maybe we we can do it in German. Actually, looks like there's a there's a German crew there, which is nice. Uh, no, no worries. We'll we'll we'll do it in English. We'll do it in a couple different languages. Maybe we'll find out. Do we have other languages in the room? >> Other nationalities? Yeah. What do we have? >> Shout it out. >> So, Spanish. >> Any Icelandic? >> No. >> D. Ok...