Accelerating AI on Edge — Chintan Parikh and Weiyi Wang, Google DeepMind
Summary
As models get smaller and more capable, more AI workloads can move onto the device itself. In this talk, Chintan Parikh from Google DeepMind walks through what that looks like in practice, from Gemma
Related topics
Transcript Excerpt
Afternoon everyone. We'll get started. Uh my name is uh Chintan Purik uh product manager for uh light RT which is part of Google AI edge. And uh I also have my uh my colleague here ve also he'll be joining us also for the uh Q&A part of this session. Uh quick show of hands. How many of you are working on deploying on edge or are keen on learning more about like what it's going to be the big benefits. Okay. And some of you already deploying. So um I'm going to go through the slides. I'll try to you know also leave it open to understand if you guys have any use cases or any sort of things you're working on that you know you'd like to discuss. Uh so we can kind of keep it open in that sense. Great. So uh here's going to be a quick uh set of agenda items and I'm going to go through some of the…
More from ai.engineer
- The agent-ready web: Simplify user actions with WebMCP — Tara Agyemang, Google
- Why Eval++ Is the Next Great Compute Primitive — Sunil Pai & Matt Carey, Cloudflare
- How to Keep Shipping When You Walk Away from Your Desk — Zack Proser, WorkOS
- Why More Context Makes Your Agent Dumber and What to Do About It — Nupur Sharma, Qodo
- RAG is dead, right?? — Kuba Rogut, Turbopuffer