Training
19 sessions
08:31 Project Lightning Talk: How To Run Kubernetes Pods On My Slurm-Based HPC Center Elicium 2 · Diego Ciangottini → 12:33 Project Lightning Talk: Next-Gen AI Orchestration With Volcano On Kubernetes Elicium 2 · Zhonghu Xu → 08:42 Sponsored Keynote: From Complexity to Clarity: Engineering an Invisible Kubernetes ▶ Hall 12 · Jesse Butler → 08:56 Keynote: From Inference to Agents: Where Open Source AI Is Headed ▶ Hall 12 · Jonathan Bryce, Brian Stevens, Mark Collier, Lin Sun → 14:15 Instrumenting Kueue Scheduling for ML Training Hall 8 | Room F · Amy Chen, Gabriel Saba → 14:15 Kubeflow in Cloud Native AI: Orchestrating the Next Wave of Agentic AI and LLMOps E103-105 · Johnu George, Valentina Rodriguez Sosa, Antonin Stefanutti, Alexander Perlman, Michael Zazula → 14:15 To Swap or Not To Swap: Memory Management Design Patterns for AI Workloads in Kubernetes 1.34+ Amtrium 1+2 · Nic Vermande → 15:15 Slinky Expanded: Slurm, Kubernetes, and DRA Amtrium 1+2 · Praveen Krishna, Marlow Warnicke → 16:00 Volcano: Orchestrating the Full AI Lifecycle – From Training To Inference and Agents E106-108 · Chen Zicong, Hajnal Máté → 17:15 Sponsored Demo: Beyond Training: Volcano for Inference & Agents Hall 1-5 | Tram Zone | Demo Theater → 12:15 🪧 Poster Session: Efficient Inference for Training Hurricane Data and Predicting Future Movement Hall 1-5 | Gouda Zone | Poster Pavilion · Avery Yang → 13:45 Sponsored Demo: The Path to AI Nirvana Is Through Unified Identity Hall 1-5 | Tram Zone | Demo Theater → 14:00 Operationalizing AI Workloads on Kubernetes With OpenKruise E103-105 · Zhang Zhen, Vec Sun → 16:34 ⚡ Lightning Talk: The $100K GPU Mystery: Why Your AI Training Dies at 99% Auditorium · Michael Ifeanyi → 12:30 Project Demo: Kubeflow SDK: Expanding the Python Experience Across Kubeflow Hall 1-5 | Gouda Zone | Project Pavilion → 13:30 BoF: Infrastructure Optimization for GPUs / Inference / Training / Networking G106 → 13:30 Optimizing Error Recovery for Cost-Efficient Distributed AI Model Training with Kubernetes Elicium 2 · Radostin Stoyanov, Andrey Velichkevich, Viktória Spišáková → 14:15 BoF: AI Observability G106 → 14:15 Collisions in the Dark: Illuminating the 95% of Kubeflow You Can't See Hall 7 | Room A · Amine Lahouel, Laura Llinares →