Machine Learning on Kubernetes: A practical handbook for building and using a complete open source machine learning platform on Kubernetes

Ebook, 586 pages, 4 hours

Name: Machine Learning on Kubernetes: A practical handbook for building and using a complete open source machine learning platform on Kubernetes
Author: Faisal Masood
ISBN: 9781803231655

By Faisal Masood and Ross Brigoli

Rating: 0 out of 5 stars

About this ebook

MLOps is an emerging field that aims to bring repeatability, automation, and standardization of the software engineering domain to data science and machine learning engineering. By implementing MLOps with Kubernetes, data scientists, IT professionals, and data engineers can collaborate and build machine learning solutions that deliver business value for their organization.
You'll begin by understanding the different components of a machine learning project. Then, you'll design and build a practical end-to-end machine learning project using open source software. As you progress, you'll understand the basics of MLOps and the value it can bring to machine learning projects. You will also gain experience in building, configuring, and using an open source, containerized machine learning platform. In later chapters, you will prepare data, build and deploy machine learning models, and automate workflow tasks using the same platform. Finally, the exercises in this book will help you get hands-on experience in Kubernetes and open source tools, such as JupyterHub, MLflow, and Airflow.
By the end of this book, you'll have learned how to effectively build, train, and deploy a machine learning model using the machine learning platform you built.

Computers

Language: English

Publisher: Packt Publishing

Release date: Jun 24, 2022

ISBN: 9781803231655

Author

Faisal Masood

Related to Machine Learning on Kubernetes

Related ebooks

Accelerating DevSecOps on AWS: Create secure CI/CD pipelines using Chaos and AIOps
Ebook by Nikit Swaraj. Rating: 0 out of 5 stars (0 ratings).
The Kubernetes Operator Framework Book: Overcome complex Kubernetes cluster management challenges with automation toolkits
Ebook by Michael Dame. Rating: 0 out of 5 stars (0 ratings).
Big Data on Kubernetes: A practical guide to building efficient and scalable data solutions
Ebook by Neylson Crepalde. Rating: 0 out of 5 stars (0 ratings).
Architecting Cloud-Native Serverless Solutions: Design, build, and operate serverless solutions on cloud and open source platforms
Ebook by Safeer Cm. Rating: 0 out of 5 stars (0 ratings).
Hybrid Cloud Management with Red Hat CloudForms
Ebook by Sangram Rath. Rating: 0 out of 5 stars (0 ratings).
Azure for Developers: Implement rich Azure PaaS ecosystems using containers, serverless services, and storage solutions
Ebook by Kamil Mrzygłód. Rating: 0 out of 5 stars (0 ratings).
The Azure Cloud Native Architecture Mapbook: Explore Microsoft Cloud's infrastructure, application, data, and security architecture
Ebook by Stéphane Eyskens. Rating: 0 out of 5 stars (0 ratings).
Azure Stack Hub Demystified: Building hybrid cloud, IaaS, and PaaS solutions
Ebook by Richard Young. Rating: 0 out of 5 stars (0 ratings).
Kubernetes in Production Best Practices: Build and manage highly available production-ready Kubernetes clusters
Ebook by Aly Saleh. Rating: 0 out of 5 stars (0 ratings).
IoT Edge Computing with MicroK8s: A hands-on approach to building, deploying, and distributing production-ready Kubernetes on IoT and Edge platforms
Ebook by Karthikeyan Shanmugam. Rating: 0 out of 5 stars (0 ratings).
Azure Containers Explained: Leverage Azure container technologies for effective application migration and deployment
Ebook by Wesley Haakman. Rating: 0 out of 5 stars (0 ratings).
Hands-On Microservices with Kubernetes: Build, deploy, and manage scalable microservices on Kubernetes
Ebook by Gigi Sayfan. Rating: 5 out of 5 stars.
A Developer's Guide to .NET in Azure: Build quick, scalable cloud-native applications and microservices with .NET 6.0 and Azure
Ebook by Anuraj Parameswaran. Rating: 0 out of 5 stars (0 ratings).
Mastering AWS CloudFormation: Build resilient and production-ready infrastructure in Amazon Web Services with CloudFormation
Ebook by Karen Tovmasyan. Rating: 0 out of 5 stars (0 ratings).
MLOps with Red Hat OpenShift: A cloud-native approach to machine learning operations
Ebook by Ross Brigoli. Rating: 0 out of 5 stars (0 ratings).
Mastering Azure Machine Learning: Execute large-scale end-to-end machine learning with Azure
Ebook by Körner Christoph. Rating: 0 out of 5 stars (0 ratings).
Kubernetes – An Enterprise Guide: Effectively containerize applications, integrate enterprise systems, and scale applications in your enterprise
Ebook by Marc Boorshtein. Rating: 0 out of 5 stars (0 ratings).
Cloud Native with Kubernetes: Deploy, configure, and run modern cloud native applications on Kubernetes
Ebook by Alexander Raul. Rating: 0 out of 5 stars (0 ratings).
Hands-On Azure for Developers: Implement rich Azure PaaS ecosystems using containers, serverless services, and storage solutions
Ebook by Kamil Mrzygłód. Rating: 0 out of 5 stars (0 ratings).
The Kubernetes Bible: The definitive guide to deploying and managing Kubernetes across major cloud platforms
Ebook by Nassim Kebbani. Rating: 4 out of 5 stars.
Windows Azure programming patterns for Start-ups
Ebook by Becker Riccardo. Rating: 0 out of 5 stars (0 ratings).
Learning AWS
Ebook by Amit Shah. Rating: 4 out of 5 stars.
Machine Learning Engineering on AWS: Build, scale, and secure machine learning systems and MLOps pipelines in production
Ebook by Joshua Arvin Lat. Rating: 0 out of 5 stars (0 ratings).
Mastering Azure Kubernetes Service (AKS): Rapidly Build and Scale Your Containerized Applications with Microsoft Azure Kubernetes Service (English Edition)
Ebook by Abhishek Mishra. Rating: 0 out of 5 stars (0 ratings).
Learning Docker
Ebook by Vinod Singh. Rating: 5 out of 5 stars.
The Machine Learning Solutions Architect Handbook: Create machine learning platforms to run solutions in an enterprise setting
Ebook by David Ping. Rating: 0 out of 5 stars (0 ratings).
Rust Web Programming: A hands-on guide to developing fast and secure web apps with the Rust programming language
Ebook by Maxwell Flitton. Rating: 0 out of 5 stars (0 ratings).
Bootstrapping Service Mesh Implementations with Istio: Build reliable, scalable, and secure microservices on Kubernetes with Service Mesh
Ebook by Anand Rai. Rating: 0 out of 5 stars (0 ratings).
50 Kubernetes Concepts Every DevOps Engineer Should Know: Your go-to guide for making production-level decisions on how and why to implement Kubernetes
Ebook by Michael Levan. Rating: 0 out of 5 stars (0 ratings).
Kubernetes on AWS: Deploy and manage production-ready Kubernetes clusters on AWS
Ebook by Ed Robinson. Rating: 0 out of 5 stars (0 ratings).

Computers For You

Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Ebook by Margot Lee Shetterly. Rating: 4 out of 5 stars.
The Innovators: How a Group of Hackers, Geniuses, and Geeks Created the Digital Revolution
Ebook by Walter Isaacson. Rating: 4 out of 5 stars.
The Invisible Rainbow: A History of Electricity and Life
Ebook by Arthur Firstenberg. Rating: 5 out of 5 stars.
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook by Cea West. Rating: 4 out of 5 stars.
Elon Musk
Ebook by Walter Isaacson. Rating: 4 out of 5 stars.
Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics
Ebook by Gary Smith. Rating: 4 out of 5 stars.
The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology
Ebook by TJ Books. Rating: 4 out of 5 stars.
Deep Search: How to Explore the Internet More Effectively
Ebook by Alan Pearce. Rating: 5 out of 5 stars.
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook by Walter Shields. Rating: 4 out of 5 stars.
Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad
Ebook by T.C. Boyle. Rating: 5 out of 5 stars.
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
Ebook by Seth Stephens-Davidowitz. Rating: 4 out of 5 stars.
Excel 101: A Beginner's & Intermediate's Guide for Mastering the Quintessence of Microsoft Excel (2010-2019 & 365) in no time!
Ebook by Johannes Wild. Rating: 0 out of 5 stars (0 ratings).
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook by Steven Cooper. Rating: 4 out of 5 stars.
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
Ebook by Alex Parkinson. Rating: 4 out of 5 stars.
Slenderman: Online Obsession, Mental Illness, and the Violent Crime of Two Midwestern Girls
Ebook by Kathleen Hale. Rating: 4 out of 5 stars.
Computer Science I Essentials
Ebook by Randall Raus. Rating: 5 out of 5 stars.
The Hacker Crackdown: Law and Disorder on the Electronic Frontier
Ebook by Bruce Sterling. Rating: 4 out of 5 stars.
Uncanny Valley: A Memoir
Ebook by Anna Wiener. Rating: 4 out of 5 stars.
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
Ebook by Alec Rowe. Rating: 0 out of 5 stars (0 ratings).
CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61
Ebook by Quentin Docter. Rating: 0 out of 5 stars (0 ratings).
How to Write a Book: An 11-Step Process to Build Habits, Stop Procrastinating, Fuel Self-Motivation, Quiet Your Inner Critic, Bust Through Writer's Block, & Let Your Creative Juices Flow (Short Read)
Ebook by David Kadavy. Rating: 5 out of 5 stars.
Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work
Ebook by Steven Cooper. Rating: 4 out of 5 stars.
CompTia Security 701: Fundamentals of Security
Ebook by AS Snipes. Rating: 0 out of 5 stars (0 ratings).
People Skills for Analytical Thinkers
Ebook by Gilbert Eijkelenboom. Rating: 5 out of 5 stars.
CompTIA Security+ Get Certified Get Ahead: SY0-701 Study Guide
Ebook by Joe Shelley. Rating: 5 out of 5 stars.
Alan Turing: The Enigma: The Book That Inspired the Film The Imitation Game - Updated Edition
Ebook by Andrew Hodges. Rating: 4 out of 5 stars.
The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling
Ebook by Ralph Kimball. Rating: 0 out of 5 stars (0 ratings).
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook by Cea West. Rating: 4 out of 5 stars.
101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters
Ebook by Triumph Books. Rating: 4 out of 5 stars.
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook by Nigel Tillery. Rating: 0 out of 5 stars (0 ratings).

Related podcast episodes

Confidential Computing
Podcast episode by The Cloudcast. 0 ratings.
New Tools for Cloud Native Developers
Podcast episode by The Cloudcast. 0 ratings.
Open Standards Make MLOps Easier and Silos Harder // Cody Peterson // #234
Podcast episode by MLOps.community. 0 ratings.
Machine in Production = Data Engineering + ML + Software Engineering // Satish Chandra Gupta // MLOps Coffee Sessions #16
Podcast episode by MLOps.community. 0 ratings.
MLOps for GenAI Applications // Harcharan Kabbay // #256
Podcast episode by MLOps.community. 0 ratings.
Understanding Machine Learning Features and Platforms
Podcast episode by The Cloudcast. 0 ratings.
SQL Commenter with Nimesh Bhagat and Morgan McLean: First time co-host joins this week to talk about database observability and the cool tools that make it possible. Morgan McLean and Nimesh Bhagat describe database observability, which uses metrics, logs, and other tools to help users understand the...
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
Building Distributed Cognition into Your Business with Sam Ramji: Here on “Screaming” we like to shine the light on peoples’ best work, but with folks like Sam Ramji, Chief Strategy Officer at DataStax, the question is where to start? From early days at Microsoft, to throwing his weight into leading DevOps management at
Podcast episode by Screaming in the Cloud. 0 ratings.
Building Large AI Models
Podcast episode by The Cloudcast. 0 ratings.
A Decade of Kubernetes Contribution: This episode is the first in our four-part Kubernetes 10 Years Anniversary special! The focus of this episode is on Kubernetes maintainers who have been involved with the project since its early days, and who are still active today. Featuring guests:...
Podcast episode by Kubernetes Podcast from Google. 0 ratings.
Building An Internal Database As A Service Platform At Cloudflare: Data persistence is one of the most challenging aspects of computer systems. In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? In this episode Vignesh Ravichandran explains how his team at Cloudflare provides PostgreSQL as a service to their developers for low latency and high uptime services at global scale. This is an interesting and insightful look at pragmatic engineering for reliability and scale.
Podcast episode by Data Engineering Podcast. 0 ratings.
Cloud Dataflow with Frances Perry: Cloud Dataflow and its OSS counterpart Apache Beam are amazing tools for Big Data so we asked Frances Perry, the Tech Lead and PMC for those projects, to join us and tell us more about it.
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
Developer Tools for Kubernetes
Podcast episode by The Cloudcast. 0 ratings.
Cloud SQL Insights with Nimesh Bhagat: This week on the podcast, Mark Mirchandani and Gabi Ferrara talk with Nimesh Bhagat about Cloud SQL Insights.
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
How Data Platforms Affect ML & AI // Jake Watson // #207
Podcast episode by MLOps.community. 0 ratings.
Episode 464 - Azure Deployment Environments: Cale and Russell talk to the Microsoft Program Manager for DevBox and Azure Deployment Environments, Sagar Chandra Reddy Lankala, about how Azure Deployment Environments can enable rapid deployment of on-demand dev/test environments while providing governance, security and cost management - plus some more updates from Microsoft Build 2023! Media File: https://azpodcast.blob.core.windows.net/episodes/Episode464.mp3 Sagar's links: GA blog - https://aka.ms/ade-ga-blog Sign up for Terraform support - https://aka.ms/ade-terraform-signup LinkedIn profile - https://www.linkedin.com/in/sagarchandrareddy Other updates mentioned in this episode: Public preview: Introducing NGads V620-series VMs optimized for cloud gaming | Azure updates | Microsoft Azure Generally available: Azure Data Explorer Kusto Emulator on Linux | Azure updates | Microsoft Azure Explore the latest features for Datadog—An Azure Native ISV Service Microsoft Cost Management updates
Podcast episode by The Azure Podcast. 0 ratings.
Rainforest QA with Russell Smith: Russell Smith, cofounder and CTO of Rainforest QA, joins the podcast to explain how they power their analytics platform with BigQuery, streaming thousands of rows per second.
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
gRPC at CoreOS with Brandon Philips: Brandon Philips, CTO of CoreOS, tells your cohosts Mark and Francesc why they chose gRPC for the newest version of etcd and how this improved its performance and development flow.
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
Building Scalable Data-Streaming Cloud Services
Podcast episode by The Cloudcast. 0 ratings.
Azul and the Current State of the Java Ecosystem with Scott Sellers: Corey is joined by Scott Sellers, CEO & Co-Founder of Azul, to discuss the current state of the Java ecosystem and how Java is changing to adapt to a cloud-native world. Scott describes how he transitioned from hardware to the world of Java software, Java
Podcast episode by Screaming in the Cloud. 0 ratings.
Scalable Databases on Kubernetes
Podcast episode by The Cloudcast. 0 ratings.
Fastly with Tyler McMullen: Tyler McMullen of Fastly is with us today, telling our hosts Mark Mirchandani and Brian Dorsey all about the company, CDNs, and more.
Podcast episode by Google Cloud Platform Podcast. 0 ratings.
A VC's Perspective on AI and Security
Podcast episode by The Cloudcast. 0 ratings.
Understanding Time-Series Database Patterns
Podcast episode by The Cloudcast. 0 ratings.
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library: Cloud data warehouses and the introduction of the ELT paradigm has led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. The challenge is that most of those options are complex to operate and exist in their own silo. The dlt project was created to eliminate overhead and bring data integration into your full control as a library component of your overall data system. In this episode Adrian Brudaru explains how it works, the benefits that it provides over other data integration solutions, and how you can start building pipelines today.
Podcast episode by Data Engineering Podcast. 0 ratings.
Episode 5: The Last Mainframe with a Kickstart and a Double Clutch: How are companies evolving in a world where Cloud is on the rise? Where Cloud providers are bought out and absorbed into other companies? Today, we’re talking to Nell Shamrell-Harrington about Cloud infrastructure. She is a senior software engineer at Che
Podcast episode by Screaming in the Cloud. 0 ratings.
Episode 495 - How APIM powers Backbase's Grand Central: Hearing how our ISVs are using Azure in innovative ways is always eye-opening. So, we are fortunate to have Saquib Rashid, a Technical Strategist in the Azure partner organization, give us insights into how Backbase used APIM and the Azure Service Operator to integrate with their customers and backend systems. Media file: https://azpodcast.blob.core.windows.net/episodes/Episode495.mp3 YouTube: https://youtu.be/qFwOoSNpDME Resources: Backbase Grand Central Microsoft Customer Story-Backbase seeks to usher in a new era of engagement banking with its Grand Central platform and Azure API Management APIM links: Azure API Management - v2 tiers | Microsoft Learn ASO links: Azure/azure-service-operator: Azure Service Operator allows you to create Azure resources using kubectl (github.com) Azure Service Operator v2 Other updates: Public Preview - Azure Compute Fleet | Azure updates | Microsoft Azure http
Podcast episode by The Azure Podcast. 0 ratings.
The Evolution of MongoDB
Podcast episode by The Cloudcast. 0 ratings.
OpenTelemetry for Databases: Empowering DevOps through sqlcommenter with Nimesh Bhagat: Optimizing or debugging database calls has to become as easy as optimizing your application code based on logs, metrics or traces your observability platform provides to developers. It has to be doable by the development and DevOps teams who are...
Podcast episode by PurePerformance. 0 ratings.
2023 Look Ahead to Platform Engineering
Podcast episode by The Cloudcast. 0 ratings.

Skip carousel

Supercomputer On A Platter
Business Today
UNLIMITED
Supercomputer On A Platter
Apr 1, 2022
CHENNAI-HEADQUARTERED automobile major TVS Motor Company uses high-performance computing (HPC) for running R&D simulations and testing the aero-dynamics of two-wheelers, which allows it to make the vehicles stable at speed and more efficient, cool en
7 min read
It’s Great When You’re K8s
Linux Format
UNLIMITED
It’s Great When You’re K8s
Oct 18, 2022
8 min read
Opinion
Linux Format
UNLIMITED
Opinion
Jul 23, 2024
Italo Vignoli is one of the founders of LibreOffice and the Document Foundation. “LibreOffice 24.8 will be announced in the second half of August, and the developers are working hard to optimise the new features that will be included. It will be the
3 min read
Opinion
Linux Format
UNLIMITED
Opinion
Aug 20, 2024
Italo Vignoli is one of the founders of LibreOffice and the Document Foundation. “Think about the personal and confidential information in your office suite documents; it’s essential your office suite respects user privacy. LibreOffice does not ask y
3 min read
Newsdesk
Linux Format
UNLIMITED
Newsdesk
Mar 5, 2024
11 min read
Real World Computing
PC Pro Magazine
UNLIMITED
Real World Computing
May 11, 2023
Migrating to Azure isn’t necessarily the toughest part of a successful cloud migration, explains our guest columnist Many organisations succeed at deploying resources in or migrating to Microsoft Azure. But many of those same organisations fail to en
6 min read
Newsdesk
Linux Format
UNLIMITED
Newsdesk
Dec 13, 2022
OPEN SOURCE FUNDING GitHub, Fastly and Mozilla are all looking for new projects to back, giving a boost to open source development. Small open source projects might be created solely by enthusiasts but most make use of outside developers, often paid
9 min read
Edge and Cloud Computing Can They Coexist Peacefully?
Techfastly
UNLIMITED
Edge and Cloud Computing Can They Coexist Peacefully?
Jun 1, 2022
6 min read
Automotive Grade Linux
Linux Format
UNLIMITED
Automotive Grade Linux
Nov 16, 2021
9 min read
PyScript – Bring Python Coding To The Web
APC
UNLIMITED
PyScript – Bring Python Coding To The Web
Aug 8, 2022
4 min read
In The Clouds
Linux Format
UNLIMITED
In The Clouds
May 28, 2024
On 6th June, it will be the tenth anniversary of the launch of Kubernetes, the container orchestration tool developed by Google. It was developed to make it easier to deploy software in a repeatable and predictable manner using container images, then
1 min read
Cloud Computing For All
Fast Company
UNLIMITED
Cloud Computing For All
May 2, 2023
2 min read
Build A Static Analysis Development Pipeline
Linux Format
UNLIMITED
Build A Static Analysis Development Pipeline
Jul 27, 2021
9 min read
Five Technology Tips For Dark Factories Installation
Techfastly
UNLIMITED
Five Technology Tips For Dark Factories Installation
Jun 1, 2021
6 min read
An easy-to-Understand Overview of Popular extended BPF Tools: BCC, Falco, and More
Techfastly
UNLIMITED
An easy-to-Understand Overview of Popular extended BPF Tools: BCC, Falco, and More
Apr 1, 2022
7 min read
How Netflix’s OTT Architecture Functions?
Techfastly
UNLIMITED
How Netflix’s OTT Architecture Functions?
May 1, 2022
With so many OTT platforms in the market today, Netflix has managed to capture a majority of the audience on a global scale. Netflix has become the go-to source of so much entertainment for consumers in less than 20 years. It can even be said that Ne
4 min read
Liz Rice Chief Open Source Officer at Isovalent
Techfastly
UNLIMITED
Liz Rice Chief Open Source Officer at Isovalent
Apr 1, 2022
5 min read
Data Model For Embedded Machine Learning
The Shed
UNLIMITED
Data Model For Embedded Machine Learning
Feb 13, 2023
4 min read
Data Model For Embedded Machine Learning
The Shed
UNLIMITED
Data Model For Embedded Machine Learning
Feb 13, 2023
4 min read
FLASK Web Frameworks
Linux Format
UNLIMITED
FLASK Web Frameworks
Jun 4, 2019
The main focus of Python has always been to get you cracking on with your coding – the language was never made for web programming. However, this has just made it more interesting to extend the language for the web, or to create an interface to web-b
9 min read
Three Low-code Options
PC Pro Magazine
UNLIMITED
Three Low-code Options
Nov 12, 2020
Counting Intel, Vodafone and VW among its customers, OutSystems helps businesses create cloudbased, on-premises and hybrid applications for mobile and web. Its development environment is predominantly drag-and-drop, with views for processes, data and
3 min read
The Idea Factories
3D World
UNLIMITED
The Idea Factories
Jul 15, 2020
6 min read
On Cloud Nine
Business Today
UNLIMITED
On Cloud Nine
Jul 8, 2022
8 min read
Copilot Will Learn Your OneDrive Files Without Crushing Your PC
Tech Advisor
UNLIMITED
Copilot Will Learn Your OneDrive Files Without Crushing Your PC
Feb 28, 2024
2 min read
News In Brief
PC Pro Magazine
UNLIMITED
News In Brief
Jul 9, 2020
Qualcomm unveiled its Snapdragon 690 chipset, a low-end system that includes support for 5G through its onboard X51 modem. The new addition to the company’s 6-series chips also promises other top-drawer features, such as support for 120Hz displays an
2 min read
Build And Compile For Embedded Systems
Linux Format
UNLIMITED
Build And Compile For Embedded Systems
Nov 17, 2020
Mats Tage Axelsson and a whole pile of compiling time. Mats Tage Axelsson has spent decades figuring out reasons to use Linux computers while sacrificing real social interactions. Embedded systems are small, specialised units with a key attribute
4 min read
Nourishment For The Soul
Linux Format
UNLIMITED
Nourishment For The Soul
Sep 20, 2022
9 min read
Windows 365 launches Microsoft’s Cloud PC era
PCWorld
UNLIMITED
Windows 365 launches Microsoft’s Cloud PC era
Aug 2, 2021
3 min read
News
APC
UNLIMITED
News
Feb 20, 2023
Be careful who you buy from. Some Chinese crypto miners are doing everything they can to offload their heavily used mining GPUs now that they’ve nothing left to mine. They are going to interesting lengths to sell their inventory, like repainting them
4 min read
Newsdesk
Linux Format
UNLIMITED
Newsdesk
Nov 14, 2023
8 min read

Related categories

Skip carousel

Reviews for Machine Learning on Kubernetes

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Machine Learning on Kubernetes - Faisal Masood


BIRMINGHAM—MUMBAI

Machine Learning on Kubernetes

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author(s), nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Publishing Product Manager: Dhruv Jagdish Kataria

Senior Editor: David Sugarman

Content Development Editor: Priyanka Soam

Technical Editor: Devanshi Ayare

Copy Editor: Safis Editing

Project Coordinator: Farheen Fathima

Proofreader: Safis Editing

Indexer: Manju Arasan

Production Designer: Nilesh Mohite

Marketing Coordinators: Shifa Ansari, Abeer Riyaz Dawe

First published: June 2022

Production reference: 1190522

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-80324-180-7

www.packt.com

To my daughter, Yleana Zorelle – hopefully, this book will help you understand what Papa does for a living.

Ross Brigoli

To my wife, Bushra Arif – without your support, none of this would have become a reality.

Faisal Masood

Contributors

About the authors

Faisal Masood is a principal architect at Red Hat. He has been helping teams to design and build data science and application platforms using OpenShift, Red Hat's enterprise Kubernetes offering. Faisal has over 20 years of experience in building software and has been building microservices since the pre-Kubernetes era.

Ross Brigoli is an associate principal architect at Red Hat. He has been designing and building software in various industries for over 18 years. He has designed and built data platforms and workflow automation platforms. Before Red Hat, Ross led a data engineering team as an architect in the financial services industry. He currently designs and builds microservices architectures and machine learning solutions on OpenShift.

About the reviewers

Audrey Reznik is a senior principal software engineer in the Red Hat Cloud Services – OpenShift Data Science team focusing on managed services, AI/ML workloads, and next-generation platforms. She has been working in the IT Industry for over 20 years in full stack development relating to data science roles. As a former technical advisor and data scientist, Audrey has been instrumental in educating data scientists and developers about what the OpenShift platform is and how to use OpenShift containers (images) to organize, develop, train, and deploy intelligent applications using MLOps. She is passionate about data science and, in particular, the current opportunities with machine learning and open source technologies.

Cory Latschkowski has made a number of major stops in various IT fields over the past two decades, including high-performance computing (HPC), cybersecurity, data science, and container platform design. Much of his experience was acquired within large organizations, including one Fortune 100 company. His last name is pronounced Latch - cow - ski. His passions are pretty moderate, but he will admit to a love of automation, Kubernetes, RTFM, and bacon. To learn more about his personal bank security questions, ping him on GitHub.

Shahebaz Sayed is a highly skilled certified cloud computing engineer with exceptional development ability and extensive knowledge of scripting and data serialization languages. Shahebaz has expertise in all three major clouds – AWS, Azure, and GCP. He also has extensive experience with technologies such as Kubernetes, Terraform, Docker, and others from the DevOps domain. Shahebaz is also certified with global certifications, including AWS Certified DevOps Engineer Professional, AWS Solution Architect Associate, Azure DevOps Expert, Azure Developer Associate, and Kubernetes CKA. He has also worked with Packt as a technical reviewer on multiple projects, including AWS Automation Cookbook, Kubernetes on AWS, and Kubernetes for Serverless Applications.

Table of Contents

Preface

Part 1: The Challenges of Adopting ML and Understanding MLOps (What and Why)

Chapter 1: Challenges in Machine Learning

Understanding ML

Delivering ML value

Choosing the right approach

The importance of data

Facing the challenges of adopting ML

Focusing on the big picture

Breaking down silos

Fail-fast culture

An overview of the ML platform

Summary

Further reading

Chapter 2: Understanding MLOps

Comparing ML to traditional programming

Exploring the benefits of DevOps

Understanding MLOps

DevOps

ML project life cycle

Fast feedback loop

Collaborating over the project life cycle

The role of OSS in ML projects

Running ML projects on Kubernetes

Summary

Further reading

Chapter 3: Exploring Kubernetes

Technical requirements

Exploring Kubernetes major components

Control plane

Worker nodes

Kubernetes objects required to run an application

Becoming cloud-agnostic through Kubernetes

Understanding Operators

Setting up your local Kubernetes environment

Installing kubectl

Installing minikube

Installing OLM

Provisioning a VM on GCP

Summary

Part 2: The Building Blocks of an MLOps Platform and How to Build One on Kubernetes

Chapter 4: The Anatomy of a Machine Learning Platform

Technical requirements

Defining a self-service platform

Exploring the data engineering components

Data engineer workflow

Exploring the model development components

Understanding the data scientist workflow

Security, monitoring, and automation

Introducing ODH

Installing the ODH operator on Kubernetes

Enabling the ingress controller on the Kubernetes cluster

Installing Keycloak on Kubernetes

Summary

Further reading

Chapter 5: Data Engineering

Technical requirements

Configuring Keycloak for authentication

Importing the Keycloak configuration for the ODH components

Creating a Keycloak user

Configuring ODH components

Installing ODH

Understanding and using JupyterHub

Validating the JupyterHub installation

Running your first Jupyter notebook

Understanding the basics of Apache Spark

Understanding Apache Spark job execution

Understanding how ODH provisions Apache Spark cluster on-demand

Creating a Spark cluster

Understanding how JupyterHub creates a Spark cluster

Writing and running a Spark application from Jupyter Notebook

Summary

Chapter 6: Machine Learning Engineering

Technical requirements

Understanding ML engineering

Using a custom notebook image

Building a custom notebook container image

Introducing MLflow

Understanding MLflow components

Validating the MLflow installation

Using MLflow as an experiment tracking system

Adding custom data to the experiment run

Using MLflow as a model registry system

Summary

Chapter 7: Model Deployment and Automation

Technical requirements

Understanding model inferencing with Seldon Core

Wrapping the model using Python

Containerizing the model

Deploying the model using the Seldon controller

Packaging, running, and monitoring a model using Seldon Core

Introducing Apache Airflow

Understanding DAG

Exploring Airflow features

Understanding Airflow components

Validating the Airflow installation

Configuring the Airflow DAG repository

Configuring Airflow runtime images

Automating ML model deployments in Airflow

Creating the pipeline by using the pipeline editor

Summary

Part 3: How to Use the MLOps Platform and Build a Full End-to-End Project Using the New Platform

Chapter 8: Building a Complete ML Project Using the Platform

Reviewing the complete picture of the ML platform

Understanding the business problem

Data collection, processing, and cleaning

Understanding data sources, location, and the format

Understanding data processing and cleaning

Performing exploratory data analysis

Understanding sample data

Understanding feature engineering

Data augmentation

Building and evaluating the ML model

Selecting evaluation criteria

Building the model

Deploying the model

Reproducibility

Summary

Chapter 9: Building Your Data Pipeline

Technical requirements

Automated provisioning of a Spark cluster for development

Writing a Spark data pipeline

Preparing the environment

Understanding data

Designing and building the pipeline

Using the Spark UI to monitor your data pipeline

Building and executing a data pipeline using Airflow

Understanding the data pipeline DAG

Building and running the DAG

Summary

Chapter 10: Building, Deploying, and Monitoring Your Model

Technical requirements

Visualizing and exploring data using JupyterHub

Building and tuning your model using JupyterHub

Tracking model experiments and versioning using MLflow

Tracking model experiments

Versioning models

Deploying the model as a service

Calling your model

Monitoring your model

Understanding monitoring components

Configuring Grafana and a dashboard

Summary

Chapter 11: Machine Learning on Kubernetes

Identifying ML platform use cases

Considering AutoML

Commercial platforms

ODH

Operationalizing ML

Setting the business expectations

Dealing with dirty real-world data

Dealing with incorrect results

Maintaining continuous delivery

Managing security

Adhering to compliance policies

Applying governance

Running on Kubernetes

Avoiding vendor lock-ins

Considering other Kubernetes platforms

Roadmap

Summary

Further reading

Other Books You May Enjoy

Preface

Machine Learning (ML) is the new black. Organizations are investing in adopting and uplifting their ML capabilities to build new products and improve customer experience. The focus of this book is on assisting organizations and teams to get business value out of ML initiatives. By implementing MLOps with Kubernetes, data scientists, IT operations professionals, and data engineers will be able to collaborate and build ML solutions that create tangible outcomes for their business. This book gives teams a practical way to work together and bring software engineering discipline to the ML project life cycle.

You'll begin by understanding why MLOps is important and discover the different components of an ML project. Later in the book, you'll design and build a practical end-to-end MLOps project that'll use the most popular OSS components. As you progress, you'll get to grips with the basics of MLOps and the value it can bring to your ML projects, as well as gaining experience in building, configuring, and using an open source, containerized ML platform on Kubernetes. Finally, you'll learn how to prepare data, build and deploy models quickly, and automate tasks for an efficient ML pipeline using a common platform. The exercises in this book will help you get hands-on with using Kubernetes and integrating it with OSS, such as JupyterHub, MLflow, and Airflow.

By the end of this book, you'll have learned how to effectively build, train, and deploy an ML model using the ML platform you built.

Who this book is for

This book is for data scientists, data engineers, IT platform owners, AI product owners, and data architects who want to use open source components to compose an ML platform. Although this book starts with the basics, a good understanding of Python and Kubernetes, along with knowledge of the basic concepts of data science and data engineering, will help you grasp the topics covered in this book much better.

What this book covers

Chapter 1, Challenges in Machine Learning, discusses the challenges organizations face in adopting ML and why a good number of ML initiatives may not deliver the expected outcomes. The chapter further discusses the top few reasons why organizations face these challenges.

Chapter 2, Understanding MLOps, continues building on the identified set of problems from Chapter 1, Challenges in Machine Learning, and discusses how we can tackle the challenges in adopting ML. The chapter will provide the definition of MLOps and how it helps organizations to get value out of their ML initiatives. The chapter also provides a blueprint on how companies can adopt MLOps in their ML projects.

Chapter 3, Exploring Kubernetes, first describes why we have chosen Kubernetes as the basis for MLOps in this book. The chapter further defines the core concept of Kubernetes and assists you in creating an environment where the code can be tested. The world is changing fast and part of this high-velocity disruption is the availability of the cloud and cloud-based solutions. This chapter provides an overview of how the Kubernetes-based platform can give you the flexibility to run your solution anywhere.

Chapter 4, The Anatomy of a Machine Learning Platform, takes a 1,000-foot view of what an ML platform looks like. You already know what problems MLOps solves. This chapter defines the components of an MLOps platform in a technology-agnostic way. You will build a solid foundation on the core components of an MLOps platform.

Chapter 5, Data Engineering, covers an important part of any ML project that is often missed. A good number of ML tutorials/books start with a clean dataset, maybe a CSV file to build your model against. The real world is different. Data comes in many shapes and sizes and it is important that you have a well-defined strategy to harvest, process, and prepare data at scale. This chapter will define the role data engineering plays in a successful ML project. It will discuss OSS tools that can provide the basis for data engineering. The chapter will then talk about how you can install these toolsets on the Kubernetes platform.

Chapter 6, Machine Learning Engineering, will move the discussion to the model building, tuning, and deployment activities of an ML development life cycle. The chapter will discuss providing a self-service solution to data scientists so they can work more efficiently and collaborate with data engineering teams and fellow data scientists using the same platform. It will also discuss OSS tools that can provide the basis for model development. The chapter will then talk about how you can install these toolsets on the Kubernetes platform.

Chapter 7, Model Deployment and Automation, covers the deployment phase of the ML project life cycle. The model you build knows the data you provided to it. In the real world, however, the data changes. This chapter discusses the tools and techniques to monitor your model performance. This performance data could be used to decide whether the model needs retraining on a new dataset or whether it's time to build a new model for the given problem.

Chapter 8, Building a Complete ML Project Using the Platform, will define a typical ML project and how each component of the platform is utilized in every step of the project life cycle. The chapter will define the outcomes and requirements of the project and focus on how the MLOps platform facilitates the project life cycle.

Chapter 9, Building Your Data Pipeline, will show how a Spark cluster can be used to ingest and process data. The chapter will show how the platform enables the data engineer to read the raw data from any storage, process it, and write it back to another storage. The main focus is to demonstrate how a Spark cluster can be created on-demand and how workloads could be isolated in a shared environment.

Chapter 10, Building, Deploying, and Monitoring Your Model, will show how the JupyterHub server can be used to build, train, and tune models on the platform. The chapter will show how the platform enables the data scientist to perform the modeling activities in a self-service fashion. This chapter will also introduce MLflow as the model experiment tracking and model registry component. Once you have a working model, how do you share it for other teams to consume? This chapter will show how the Seldon Core component allows non-programmers to expose their models as REST APIs. You will see how the deployed APIs automatically scale out using the Kubernetes capabilities.

Chapter 11, Machine Learning on Kubernetes, will take you through some key ideas to carry forward as you further your knowledge of the subject. This chapter will cover identifying use cases for the ML platform, operationalizing ML, and running on Kubernetes.

To get the most out of this book

You will need a basic working knowledge of Kubernetes and Python to get the most out of this book's technical exercises. The platform uses multiple software components to cover the full ML development life cycle. You will need the recommended hardware to run all the components with ease.

Running the platform requires a good amount of compute resources. If you do not have the required number of CPU cores and memory on your desktop or laptop computer, we recommend running a virtual machine on Google Cloud or any other cloud platform.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book's GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

A good follow-up after you finish with this book is to create a proof of concept within your team or organization using the platform. Assess the benefits and learn how you can further optimize your organization's data science and ML project life cycle.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Machine-Learning-on-Kubernetes. If there's an update to the code, it will be updated in the GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. You can download it here: https://static.packt-cdn.com/downloads/9781803241807_ColorImages.pdf.

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "Notice that you will need to adjust the following command and change the quay.io/ml-on-k8s/ part before executing the command."

A block of code is set as follows:

docker tag scikit-notebook:v1.1.0 quay.io/ml-on-k8s/scikit-notebook:v1.1.0

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

gcloud compute project-info add-metadata --metadata enable-oslogin=FALSE

Bold: Indicates a new term, an important word, or words that you see onscreen. For instance, words in menus or dialog boxes appear in bold. Here is an example: The installer will present the following License Agreement screen. Click I Agree.

Tips or Important Notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at [email protected].

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata, select your book, click on the Errata Submission Form link, and enter the details.

Piracy: If you come across any illegal copies of our works in any form on the Internet, we would be grateful if you would provide us with the location address or website name. Please contact us at [email protected] with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Reviews

Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

For more information about Packt, please visit packt.com.

Share Your Thoughts

Once you've read Machine Learning on Kubernetes, we'd love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we're delivering excellent quality content.

Part 1: The Challenges of Adopting ML and Understanding MLOps (What and Why)

In this section, we will define what MLOps is and why it is critical to the success of your AI journey. You will go through the challenges organizations may encounter in their AI journey and how MLOps can assist in overcoming those challenges.

The last chapter of this section will provide a refresher on Kubernetes and the role it plays in bringing MLOps to the OSS community. This is by no means a guide to Kubernetes, and you should consult other sources for in-depth coverage.

This section comprises the following chapters:

Chapter 1, Challenges in Machine Learning

Chapter 2, Understanding MLOps

Chapter 3, Exploring Kubernetes

Chapter 1: Challenges in Machine Learning

Many people believe that artificial intelligence (AI) is all about the idea of a humanoid robot or an intelligent computer program that takes over humanity. The shocking news is that we are not even close to this. A better term for such incredible machines is human-like intelligence or artificial general intelligence (AGI).

So, what is AI? A more straightforward answer is a system that uses a combination of data and algorithms to make predictions. AI practitioners call this machine learning, or ML. A particular subset of ML, called deep learning (DL), refers to ML algorithms that use a series of steps, or layers, of computation (Goodfellow, Bengio, and Courville, 2017). This technique employs deep neural networks (DNNs) with multiple layers of artificial neurons that mimic the architecture of the human brain. Although this sounds sophisticated, DL systems do not always perform better than other AI algorithms or even a traditional programming approach.

ML is not always about DL. Sometimes, a basic statistical model may be a better fit for the problem you are trying to solve than a complex DNN. One of the challenges of implementing ML is selecting the right approach. Moreover, delivering an ML project comes with other challenges, not only on the business and technology side but also in people and processes. These challenges are the primary reasons why most ML initiatives fail to deliver their expected value.
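
To make the idea of "a basic statistical model" concrete, here is a minimal sketch of a system that combines data and an algorithm to make a prediction. It is not taken from the book: the dataset is made up, and the names fit_line and predict are hypothetical helpers chosen for illustration. It fits a straight line with ordinary least squares using only the Python standard library.

```python
# Illustrative sketch only: the data and function names are hypothetical,
# not from the book or any library.

def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Unnormalized covariance of x and y, and unnormalized variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply a fitted (slope, intercept) pair to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Made-up training data that follows y = 2x + 1 exactly.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

model = fit_line(xs, ys)          # the "learning" step: parameters come from data
prediction = predict(model, 5.0)  # the "prediction" step on an unseen input
print(model, prediction)
```

On this toy data, the fitted slope and intercept are 2.0 and 1.0, and the prediction for an input of 5.0 is 11.0. Nobody wrote the rule "multiply by two and add one" by hand; the parameters were estimated from the examples. For a problem with this shape, a two-parameter model like this one is easier to train, explain, and maintain than a DNN.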

In this chapter, we will revisit a basic understanding of ML and understand the challenges in delivering ML projects that can lead to a project not delivering its promised value.

The following topics will be covered:

Understanding ML

Delivering ML value

Choosing the right approach

Facing the challenges of adopting ML

An overview of the ML platform

Understanding ML

In traditional computer programming, a human programmer must write a clear set of instructions in order for a computer program to perform an operation or provide an answer to a question. In ML, however, a human (usually an ML engineer or data scientist) uses data and an

Enjoying the preview?

Page 1 of 1

Machine Learning on Kubernetes: A practical handbook for building and using a complete open source machine learning platform on Kubernetes

About this ebook

Faisal Masood

Related authors


