Accelerating AI on Edge — Chintan Parikh and Weiyi Wang, Google DeepMind
Summary
As models get smaller and more capable, more AI workloads can move onto the device itself. In this talk, Chintan Parikh from Google DeepMind walks through what that looks like in practice, from Gemma
Transcript Excerpt
Afternoon everyone. We'll get started. Uh my name is uh Chintan Purik uh product manager for uh light RT which is part of Google AI edge. And uh I also have my uh my colleague here ve also he'll be joining us also for the uh Q&A part of this session. Uh quick show of hands. How many of you are working on deploying on edge or are keen on learning more about like what it's going to be the big benefits. Okay. And some of you already deploying. So um I'm going to go through the slides. I'll try to you know also leave it open to understand if you guys have any use cases or any sort of things you're working on that you know you'd like to discuss. Uh so we can kind of keep it open in that sense. Great. So uh here's going to be a quick uh set of agenda items and I'm going to go through some of the…