Deploy and fine-tune LLM models on Kubernetes using KAITO

From Kubernetes Bytes


Length: 44 minutes
Released: Aug 7, 2024
Format: Podcast episode

Description

In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager, and Paul Yu, Sr. Cloud Advocate, at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.

Check out our website at https://kubernetesbytes.com/

Cloud Native News:
https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
https://github.blog/news-insights/product-news/introducing-github-models/

Show links:
Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
https://paulyu.dev/article/soaring-with-kaito/
Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models

Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/

Timestamps:
00:02:15 Cloud Native News
00:05:34 Interview with Sachi and Paul
00:42:08 Key takeaways

Titles in the series (87)

Kubernetes Bytes is a podcast bringing you the latest from the world of cloud native data management. Hosts Ryan Wallner and Bhavin Shah come to you from Boston, Massachusetts with experienced backgrounds in cloud-native tech. They'll be sharing their thoughts on recent cloud native news and talking to industry experts about their experiences and challenges managing the wealth of data in today's cloud-native ecosystem.