Video

Podcast - Emerging Kubernetes tools for AI and optimizing GPU workloads

Discover how Kubernetes is evolving to support AI/ML workloads in this interview with John Platt, CTO at StormForge (now part of CloudBolt) on KubeFM's podcast series.

This episode will cover:
 

  • Notable new Kubernetes tools, such as in-place pod resizing for smoother workload rightsizing, EKS Auto Mode to streamline node management, and NOS for simplifying GPU virtualization.
  • The growing adoption of Kubernetes as the preferred platform for AI/ML workloads, delivering substantial cost savings (up to 90%) over managed services like OpenAI, along with enhanced scalability and flexibility.
  • Key obstacles to running AI on Kubernetes, including managing GPU drivers, ensuring CUDA version compatibility, and accommodating very large models that may exceed tens of gigabytes.

 

Latest Resources

Seeing is Believing

Start getting resizing recommendations minutes from now.

Watch An Install

Free trial includes full version on 1 cluster for 30 days!

We use cookies to provide you with a better website experience and to analyze the site traffic. Please read our "privacy policy" for more information.