Back To Schedule
Monday, May 20 • 18:36 - 18:41
Lightning Talk: Managing Drivers in a Kubernetes Cluster - Renaud Gaubert, NVIDIA

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
As a cluster operator, managing drivers (Mellanox networking, NVIDIA compute and graphics drivers, ...) at scale today is a real issue, from installation to upgrade every step you take brings you further away from Kubernetes.

Drivers are frequently needed for enabling users (e.g: run AI workloads) or reducing cost (RDMA over converged ethernet), yet there are no clear consensus or tools that allows you to solve the issues encountered by requiring drivers on your machines.

During this Lightning talk we’ll take a look at the different strategies you can use in Kubernetes to manage drivers (containers vs base image) and the available update strategies that will help you minimize disruption and maximize cost.

Finally we will take a look at the challenges and solutions that VM based runtimes introduce.

avatar for Renaud Gaubert

Renaud Gaubert

Software Engineer, Nvidia
Renaud Gaubert has been working since 2017 at NVIDIA on making GPU applications easier to deploy and manage in data centers. He focuses on supporting GPU-accelerated machine learning frameworks in container orchestration systems such as Kubernetes, Docker swarm, and Nomad. He is an... Read More →

Monday May 20, 2019 18:36 - 18:41 CEST
Hall 8.0 A1