ML

27 sessions

10:15 Least-Privilege for AI: Authorizing Agents and MCP Tools with Agentgateway and Kyverno Hall 8 | Room F · Luc Chmielowski, Nina Polshakova → 10:15 When an Agent Acts on Your Behalf, Who Holds the Keys? Amtrium 1+2 · Mariusz Sabath, Maia Iyer → 11:00 Breaking the Monolith: Decomposing and Governing Giant LLM Jobs Across Clusters Elicium 2 · Kevin Wang → 11:00 Locking Down Ray Serve: How to Secure Ur ML Models? Hall 8 | Room F · Kateryna Hrytsaienko → 11:00 Your Models Are Vulnerable: How KitOps Turns KServe Into a Zero-Trust Inference Platform Amtrium 1+2 · Brad Micklea, Gavrish Prabhu → 13:30 Driving Adoption and Automation With MCP in Production at Liftoff Amtrium 1+2 · Tommy Nguyen → 13:30 Make GenAI Production-Ready With Kubernetes Patterns Hall 8 | Room F · Roland Huss, Bilgin Ibryam → 14:15 Instrumenting Kueue Scheduling for ML Training Hall 8 | Room F · Amy Chen, Gabriel Saba → 14:15 To Swap or Not To Swap: Memory Management Design Patterns for AI Workloads in Kubernetes 1.34+ Amtrium 1+2 · Nic Vermande → 15:15 Lessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI with NVIDIA KAI (G/H200) Hall 8 | Room F · Luca Berton → 15:15 Slinky Expanded: Slurm, Kubernetes, and DRA Amtrium 1+2 · Praveen Krishna, Marlow Warnicke → 16:00 Securing the AI/ML Lifecycle With MLSecOps: Open Source Best Practices Amtrium 1+2 · Bahaulddin Shammary, Andrey Shorov → 10:00 GPUs on Kubernetes: What Actually Happens When You Request Nvidia.com/gpu: 1 Hall 8 | Room G · Gulcan Topcu, Daniele Polencic → 10:00 Sandbox Operator: Enabling Session-Aware, Efficient MCP Tool Execution in Kubernetes Auditorium · Mingshan Zhao, Zhen Zhang → 10:45 Fusing FinOps, Forecasting, and Kubernetes at Scale Hall 8 | Room G · Ankur Singh, Satyam Bhardwaj → 10:45 Route, Serve, Adapt, Repeat: Adaptive Routing for AI Inference Workloads in Kubernetes Auditorium · Nir Rozenbaum, Kellen Swain → 13:15 REST in Peace: AI Needs to Be Async - Meet Asya🎭 Auditorium · Artem Yushkovskiy → 14:00 Enterprise Challenges with MCP Adoption Hall 7 | Room C · Christian Posta → 14:00 Peeking Into the GPU Black Box: Continuous Profiling on Kubernetes With eBPF Auditorium · Zahari Dichev → 15:00 Optimizing LLM Inference for the Rest of Us F002-005 · Abdel Sghiouar → 10:00 Achieving Resilient Multi-Cluster AI Inference on Kubernetes With Karmada and KubeRay Auditorium · Wei-Cheng Lai, Han-Ju Chen → 10:45 AI Agents & Platform Engineering: Efficiency Boost or New Source of Trouble? Auditorium · Hasith Kalpage, Vincent Caldeira, Sara Qasmi, Idit Levine, Carlos Santana → 10:45 CANCELED: Building a Scalable and Cost-Effective MLOps Platform at PepsiCo Using CNCF Tools Hall 8 | Room D · Chaitanya G → 12:45 Cloud Native at the Far(m) Edge: Running Kubernetes and AI on Tractors Auditorium · Mauro Morales, Jordan Karapanagiotis → 13:30 Optimizing Error Recovery for Cost-Efficient Distributed AI Model Training with Kubernetes Elicium 2 · Radostin Stoyanov, Andrey Velichkevich, Viktória Spišáková → 13:30 Virtualizing Large Scale GPU Cluster for Sovereign AI: Petasus AI Cloud Journey with Kubernetes Auditorium · Jian Li → 14:15 From Logs to Decisions: Autonomous AI Agents for Real-Time Kubernetes Threat Response Elicium 1 · Willem Berroubache →

← Back to schedule