Artwork

Sisällön tarjoaa Kubernetes Bytes, Ryan Wallner, and Bhavin Shah. Kubernetes Bytes, Ryan Wallner, and Bhavin Shah tai sen podcast-alustan kumppani lataa ja toimittaa kaiken podcast-sisällön, mukaan lukien jaksot, grafiikat ja podcast-kuvaukset. Jos uskot jonkun käyttävän tekijänoikeudella suojattua teostasi ilman lupaasi, voit seurata tässä https://fi.player.fm/legal kuvattua prosessia.
Player FM - Podcast-sovellus
Siirry offline-tilaan Player FM avulla!

Training Machine Learning (ML) models on Kubernetes

55:29
 
Jaa
 

Fetch error

Hmmm there seems to be a problem fetching this series right now. Last successful fetch was on March 03, 2025 17:11 (9M ago)

What now? This series will be checked again in the next day. If you believe it should be working, please verify the publisher's feed link below is valid and includes actual episode links. You can contact support to request the feed be immediately fetched.

Manage episode 421319868 series 3332465
Sisällön tarjoaa Kubernetes Bytes, Ryan Wallner, and Bhavin Shah. Kubernetes Bytes, Ryan Wallner, and Bhavin Shah tai sen podcast-alustan kumppani lataa ja toimittaa kaiken podcast-sisällön, mukaan lukien jaksot, grafiikat ja podcast-kuvaukset. Jos uskot jonkun käyttävän tekijänoikeudella suojattua teostasi ilman lupaasi, voit seurata tässä https://fi.player.fm/legal kuvattua prosessia.

In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Bernie Wu, VP Strategic Partnerships and AI/CXL/Kubernetes Initiatives at Memverge. They discuss about how Kubernetes is the most popular platform to run AI model training and model inferencing jobs. The discussion dives into model training, talking about different phases of a DAG, and then talk about how Memverge can help users with efficient and cost-effective model checkpoints. The discussion goes into topics like saving costs by using spot instances, hot restart of training jobs, reclaiming unused GPU resources, etc.

Check out our website at https://kubernetesbytes.com/

Episode Sponsor: Nethopper

Cloud Native News:

  • https://www.aquasec.com/blog/linguistic-lumberjack-understanding-cve-2024-4323-in-fluent-bit/
  • https://kubernetes.io/blog/2024/05/20/completing-cloud-provider-migration/
  • https://thenewstack.io/introducing-aks-automatic-managed-kubernetes-for-developers/
  • https://www.harness.io/blog/harness-to-acquire-split

Show Links:

  • https://www.linkedin.com/in/berniewu/
  • https://criu.org/Main_Page
  • https://memverge.com/
  • https://youtu.be/tY8YOMRuqWI?si=yB3hHqLUpYPZ-KWN
  • https://youtu.be/ND4seSKpJHI?si=shh0iuA9qC-dO6eb

Timestamps:


  continue reading

88 jaksoa

Artwork
iconJaa
 

Fetch error

Hmmm there seems to be a problem fetching this series right now. Last successful fetch was on March 03, 2025 17:11 (9M ago)

What now? This series will be checked again in the next day. If you believe it should be working, please verify the publisher's feed link below is valid and includes actual episode links. You can contact support to request the feed be immediately fetched.

Manage episode 421319868 series 3332465
Sisällön tarjoaa Kubernetes Bytes, Ryan Wallner, and Bhavin Shah. Kubernetes Bytes, Ryan Wallner, and Bhavin Shah tai sen podcast-alustan kumppani lataa ja toimittaa kaiken podcast-sisällön, mukaan lukien jaksot, grafiikat ja podcast-kuvaukset. Jos uskot jonkun käyttävän tekijänoikeudella suojattua teostasi ilman lupaasi, voit seurata tässä https://fi.player.fm/legal kuvattua prosessia.

In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Bernie Wu, VP Strategic Partnerships and AI/CXL/Kubernetes Initiatives at Memverge. They discuss about how Kubernetes is the most popular platform to run AI model training and model inferencing jobs. The discussion dives into model training, talking about different phases of a DAG, and then talk about how Memverge can help users with efficient and cost-effective model checkpoints. The discussion goes into topics like saving costs by using spot instances, hot restart of training jobs, reclaiming unused GPU resources, etc.

Check out our website at https://kubernetesbytes.com/

Episode Sponsor: Nethopper

Cloud Native News:

  • https://www.aquasec.com/blog/linguistic-lumberjack-understanding-cve-2024-4323-in-fluent-bit/
  • https://kubernetes.io/blog/2024/05/20/completing-cloud-provider-migration/
  • https://thenewstack.io/introducing-aks-automatic-managed-kubernetes-for-developers/
  • https://www.harness.io/blog/harness-to-acquire-split

Show Links:

  • https://www.linkedin.com/in/berniewu/
  • https://criu.org/Main_Page
  • https://memverge.com/
  • https://youtu.be/tY8YOMRuqWI?si=yB3hHqLUpYPZ-KWN
  • https://youtu.be/ND4seSKpJHI?si=shh0iuA9qC-dO6eb

Timestamps:


  continue reading

88 jaksoa

Kaikki jaksot

×
 
Loading …

Tervetuloa Player FM:n!

Player FM skannaa verkkoa löytääkseen korkealaatuisia podcasteja, joista voit nauttia juuri nyt. Se on paras podcast-sovellus ja toimii Androidilla, iPhonela, ja verkossa. Rekisteröidy sykronoidaksesi tilaukset laitteiden välillä.

 

Pikakäyttöopas

Tekijänoikeudet 2025 | Tietosuojakäytäntö | Käyttöehdot | | Tekijänoikeus
Kuuntele tämä ohjelma tutkiessasi
Toista