Python Machine Learning By Example: The easiest way to get into machine learning

Ebook409 pages2 hours

Python Machine Learning By Example: The easiest way to get into machine learning

Name: Python Machine Learning By Example: The easiest way to get into machine learning
Brand: Packt Publishing
Rating: 5.0 (1 reviews)

By Yuxi (Hayden) Liu

Rating: 5 out of 5 stars

5/5

()

Read preview

About this ebook

Data science and machine learning are some of the top buzzwords in the technical world today. A resurging interest in machine learning is due to the same factors that have made data mining and Bayesian analysis more popular than ever. This book is your entry point to machine learning.
This book starts with an introduction to machine learning and the Python language and shows you how to complete the setup. Moving ahead, you will learn all the important concepts such as, exploratory data analysis, data preprocessing, feature extraction, data visualization and clustering, classification, regression and model performance evaluation. With the help of various projects included, you will find it intriguing to acquire the mechanics of several important machine learning algorithms – they are no more obscure as they thought. Also, you will be guided step by step to build your own models from scratch. Toward the end, you will gather a broad picture of the machine learning ecosystem and best practices of applying machine learning techniques.
Through this book, you will learn to tackle data-driven problems and implement your solutions with the powerful yet simple language, Python. Interesting and easy-to-follow examples, to name some, news topic classification, spam email detection, online ad click-through prediction, stock prices forecast, will keep you glued till you reach your goal.

Skip carousel

LanguageEnglish

PublisherPackt Publishing

Release dateMay 31, 2017

ISBN9781783553129

Author

Yuxi (Hayden) Liu

Related to Python Machine Learning By Example

Related ebooks

Skip carousel

Machine Learning Algorithms: Popular algorithms for data science and machine learning
Ebook
Machine Learning Algorithms: Popular algorithms for data science and machine learning
byGiuseppe Bonaccorso
Rating: 0 out of 5 stars
0 ratings
Mastering Predictive Analytics with scikit-learn and TensorFlow: Implement machine learning techniques to build advanced predictive models using Python
Ebook
Mastering Predictive Analytics with scikit-learn and TensorFlow: Implement machine learning techniques to build advanced predictive models using Python
byAlan Fontaine
Rating: 0 out of 5 stars
0 ratings
Deep Learning By Example: A hands-on guide to implementing advanced machine learning algorithms and neural networks
Ebook
Deep Learning By Example: A hands-on guide to implementing advanced machine learning algorithms and neural networks
byAhmed Menshawy
Rating: 0 out of 5 stars
0 ratings
Go Machine Learning Projects: Eight projects demonstrating end-to-end machine learning and predictive analytics applications in Go
Ebook
Go Machine Learning Projects: Eight projects demonstrating end-to-end machine learning and predictive analytics applications in Go
byXuanyi Chew
Rating: 0 out of 5 stars
0 ratings
Hands-On Data Science and Python Machine Learning: Perform data mining and machine learning efficiently using Python and Spark
Ebook
Hands-On Data Science and Python Machine Learning: Perform data mining and machine learning efficiently using Python and Spark
byFrank Kane
Rating: 0 out of 5 stars
0 ratings
Hands-On Neural Network Programming with C#: Add powerful neural network capabilities to your C# enterprise applications
Ebook
Hands-On Neural Network Programming with C#: Add powerful neural network capabilities to your C# enterprise applications
byMatt R. Cole
Rating: 0 out of 5 stars
0 ratings
Mastering Machine Learning Algorithms: Expert techniques to implement popular machine learning algorithms and fine-tune your models
Ebook
Mastering Machine Learning Algorithms: Expert techniques to implement popular machine learning algorithms and fine-tune your models
byGiuseppe Bonaccorso c/o Quandoo
Rating: 0 out of 5 stars
0 ratings
Keras Reinforcement Learning Projects: 9 projects exploring popular reinforcement learning techniques to build self-learning agents
Ebook
Keras Reinforcement Learning Projects: 9 projects exploring popular reinforcement learning techniques to build self-learning agents
byGiuseppe Ciaburro
Rating: 0 out of 5 stars
0 ratings
Statistics for Machine Learning
Ebook
Statistics for Machine Learning
byPratap Dangeti
Rating: 3 out of 5 stars
3/5
Hands-On Predictive Analytics with Python: Master the complete predictive analytics process, from problem definition to model deployment
Ebook
Hands-On Predictive Analytics with Python: Master the complete predictive analytics process, from problem definition to model deployment
byAlvaro Fuentes
Rating: 0 out of 5 stars
0 ratings
R Machine Learning Projects: Implement supervised, unsupervised, and reinforcement learning techniques using R 3.5
Ebook
R Machine Learning Projects: Implement supervised, unsupervised, and reinforcement learning techniques using R 3.5
byDr. Sunil Kumar Chinnamgari
Rating: 0 out of 5 stars
0 ratings
R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet
Ebook
R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet
byMark Hodnett
Rating: 0 out of 5 stars
0 ratings
Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch
Ebook
Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch
byVishnu Subramanian
Rating: 0 out of 5 stars
0 ratings
Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
Ebook
Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
byMaxim Lapan
Rating: 0 out of 5 stars
0 ratings
Practical Convolutional Neural Networks: Implement advanced deep learning models using Python
Ebook
Practical Convolutional Neural Networks: Implement advanced deep learning models using Python
byMohit Sewak
Rating: 0 out of 5 stars
0 ratings
Practical Computer Vision: Extract insightful information from images using TensorFlow, Keras, and OpenCV
Ebook
Practical Computer Vision: Extract insightful information from images using TensorFlow, Keras, and OpenCV
byAbhinav Dadhich
Rating: 0 out of 5 stars
0 ratings
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
Ebook
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
byEyal Wirsansky
Rating: 0 out of 5 stars
0 ratings
Hands-on Machine Learning with JavaScript: Solve complex computational web problems using machine learning
Ebook
Hands-on Machine Learning with JavaScript: Solve complex computational web problems using machine learning
byBurak Kanber
Rating: 0 out of 5 stars
0 ratings
Ensemble Machine Learning: A beginner's guide that combines powerful machine learning algorithms to build optimized models
Ebook
Ensemble Machine Learning: A beginner's guide that combines powerful machine learning algorithms to build optimized models
byAnkit Dixit
Rating: 0 out of 5 stars
0 ratings
Machine Learning With Go: Leverage Go's powerful packages to build smart machine learning and predictive applications, 2nd Edition
Ebook
Machine Learning With Go: Leverage Go's powerful packages to build smart machine learning and predictive applications, 2nd Edition
byDaniel Whitenack
Rating: 0 out of 5 stars
0 ratings
Python Machine Learning Blueprints: Put your machine learning concepts to the test by developing real-world smart projects, 2nd Edition
Ebook
Python Machine Learning Blueprints: Put your machine learning concepts to the test by developing real-world smart projects, 2nd Edition
byAlexander Combs
Rating: 0 out of 5 stars
0 ratings
Apache Mahout Essentials
Ebook
Apache Mahout Essentials
byJayani Withanawasam
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence By Example: Develop machine intelligence from scratch using real artificial intelligence use cases
Ebook
Artificial Intelligence By Example: Develop machine intelligence from scratch using real artificial intelligence use cases
byDenis Rothman
Rating: 0 out of 5 stars
0 ratings
Machine Learning with scikit-learn Quick Start Guide: Classification, regression, and clustering techniques in Python
Ebook
Machine Learning with scikit-learn Quick Start Guide: Classification, regression, and clustering techniques in Python
byKevin Jolly
Rating: 0 out of 5 stars
0 ratings
Learning Data Mining with Python
Ebook
Learning Data Mining with Python
byRobert Layton
Rating: 0 out of 5 stars
0 ratings
Machine Learning with Scala Quick Start Guide: Leverage popular machine learning algorithms and techniques and implement them in Scala
Ebook
Machine Learning with Scala Quick Start Guide: Leverage popular machine learning algorithms and techniques and implement them in Scala
byMd. Rezaul Karim
Rating: 0 out of 5 stars
0 ratings
Mastering Machine Learning with scikit-learn - Second Edition
Ebook
Mastering Machine Learning with scikit-learn - Second Edition
byGavin Hackeling
Rating: 0 out of 5 stars
0 ratings
Mastering Numerical Computing with NumPy: Master scientific computing and perform complex operations with ease
Ebook
Mastering Numerical Computing with NumPy: Master scientific computing and perform complex operations with ease
byUmit Mert Cakmak
Rating: 0 out of 5 stars
0 ratings
Practical Machine Learning with Python: Real-World Applications
Ebook
Practical Machine Learning with Python: Real-World Applications
byGloria Cheruto
Rating: 0 out of 5 stars
0 ratings
Learning Data Mining with Python: Use Python to manipulate data and build predictive models
Ebook
Learning Data Mining with Python: Use Python to manipulate data and build predictive models
byRobert Layton
Rating: 0 out of 5 stars
0 ratings

Programming For You

Skip carousel

Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
Ebook
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
byJames Tudor
Rating: 5 out of 5 stars
5/5
Excel 101: A Beginner's & Intermediate's Guide for Mastering the Quintessence of Microsoft Excel (2010-2019 & 365) in no time!
Ebook
Excel 101: A Beginner's & Intermediate's Guide for Mastering the Quintessence of Microsoft Excel (2010-2019 & 365) in no time!
byJohannes Wild
Rating: 0 out of 5 stars
0 ratings
JavaScript All-in-One For Dummies
Ebook
JavaScript All-in-One For Dummies
byChris Minnick
Rating: 5 out of 5 stars
5/5
SQL All-in-One For Dummies
Ebook
SQL All-in-One For Dummies
byAllen G. Taylor
Rating: 3 out of 5 stars
3/5
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
Ebook
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
byGwendolyn Faraday
Rating: 5 out of 5 stars
5/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
Coding All-in-One For Dummies
Ebook
Coding All-in-One For Dummies
byNikhil Abraham
Rating: 4 out of 5 stars
4/5
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
Ebook
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
byAnthony Adams
Rating: 4 out of 5 stars
4/5
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
byWalter Shields
Rating: 4 out of 5 stars
4/5
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
HTML & CSS: Learn the Fundaments in 7 Days
Ebook
HTML & CSS: Learn the Fundaments in 7 Days
byMichael Knapp
Rating: 4 out of 5 stars
4/5
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
Ebook
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
byJason Scotts
Rating: 4 out of 5 stars
4/5
C# Programming from Zero to Proficiency (Beginner): C# from Zero to Proficiency, #2
Ebook
C# Programming from Zero to Proficiency (Beginner): C# from Zero to Proficiency, #2
byPatrick Felicia
Rating: 0 out of 5 stars
0 ratings
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
Ebook
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
byKevin Clark
Rating: 5 out of 5 stars
5/5
The Advanced Roblox Coding Book: An Unofficial Guide, Updated Edition: Learn How to Script Games, Code Objects and Settings, and Create Your Own World!
Ebook
The Advanced Roblox Coding Book: An Unofficial Guide, Updated Edition: Learn How to Script Games, Code Objects and Settings, and Create Your Own World!
byHeath Haskins
Rating: 5 out of 5 stars
5/5
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
Ebook
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
byTimothy C. Needham
Rating: 4 out of 5 stars
4/5
Linux: Learn in 24 Hours
Ebook
Linux: Learn in 24 Hours
byAlex Nordeen
Rating: 5 out of 5 stars
5/5
CODING FOR ABSOLUTE BEGINNERS: How to Keep Your Data Safe from Hackers by Mastering the Basic Functions of Python, Java, and C++ (2022 Guide for Newbies)
Ebook
CODING FOR ABSOLUTE BEGINNERS: How to Keep Your Data Safe from Hackers by Mastering the Basic Functions of Python, Java, and C++ (2022 Guide for Newbies)
byEric Vargas
Rating: 0 out of 5 stars
0 ratings
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
Ebook
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
byMark Chan
Rating: 5 out of 5 stars
5/5
Python QuickStart Guide: The Simplified Beginner's Guide to Python Programming Using Hands-On Projects and Real-World Applications
Ebook
Python QuickStart Guide: The Simplified Beginner's Guide to Python Programming Using Hands-On Projects and Real-World Applications
byRobert Oliver
Rating: 0 out of 5 stars
0 ratings
C Programming For Beginners: The Simple Guide to Learning C Programming Language Fast!
Ebook
C Programming For Beginners: The Simple Guide to Learning C Programming Language Fast!
byTim Warren
Rating: 5 out of 5 stars
5/5
Beginning Programming with C++ For Dummies
Ebook
Beginning Programming with C++ For Dummies
byStephen R. Davis
Rating: 4 out of 5 stars
4/5
The Python Workshop: Learn to code in Python and kickstart your career in software development or data science
Ebook
The Python Workshop: Learn to code in Python and kickstart your career in software development or data science
byAndrew Bird
Rating: 5 out of 5 stars
5/5
Coding with JavaScript For Dummies
Ebook
Coding with JavaScript For Dummies
byChris Minnick
Rating: 0 out of 5 stars
0 ratings
HTML in 30 Pages
Ebook
HTML in 30 Pages
byU.Q. Magnusson
Rating: 5 out of 5 stars
5/5
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
Ebook
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
byKristen Meinzer
Rating: 3 out of 5 stars
3/5
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
Ebook
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
byJoseph Labrecque
Rating: 5 out of 5 stars
5/5
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
Ebook
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
byi Code Academy
Rating: 5 out of 5 stars
5/5

Related podcast episodes

Skip carousel

Reduce Friction In Your Business Analytics Through Entity Centric Data Modeling: For business analytics the way that you model the data in your warehouse has a lasting impact on what types of questions can be answered quickly and easily. The major strategies in use today were created decades ago when the software and hardware for warehouse databases were far more constrained. In this episode Maxime Beauchemin of Airflow and Superset fame shares his vision for the entity-centric data model and how you can incorporate it into your own warehouse design.
UNLIMITED
Reduce Friction In Your Business Analytics Through Entity Centric Data Modeling: For business analytics the way that you model the data in your warehouse has a lasting impact on what types of questions can be answered quickly and easily. The major strategies in use today were created decades ago when the software and hardware for warehouse databases were far more constrained. In this episode Maxime Beauchemin of Airflow and Superset fame shares his vision for the entity-centric data model and how you can incorporate it into your own warehouse design.
byData Engineering Podcast
0 ratings
0% found this document useful
Understanding Machine Learning Features and Platforms
UNLIMITED
Understanding Machine Learning Features and Platforms
byThe Cloudcast
0 ratings
0% found this document useful
Continuous Application Profiling
UNLIMITED
Continuous Application Profiling
byThe Cloudcast
0 ratings
0% found this document useful
Building A Data Mesh Platform At PayPal: There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational challenges that he encountered in the process.
UNLIMITED
Building A Data Mesh Platform At PayPal: There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational challenges that he encountered in the process.
byData Engineering Podcast
0 ratings
0% found this document useful
AI and ML Networking: bridging the gap between performance and economy
UNLIMITED
AI and ML Networking: bridging the gap between performance and economy
byTechnology Now
0 ratings
0% found this document useful
69: Testing Front End Code: Summary Oren Rubin (@Shexman) goes through why it’s important to not only test the back-end code of our applications but also to test our Front End code, the integration points, and the full user experience. Oren also goes through...
UNLIMITED
69: Testing Front End Code: Summary Oren Rubin (@Shexman) goes through why it’s important to not only test the back-end code of our applications but also to test our Front End code, the integration points, and the full user experience. Oren also goes through...
byThe Web Platform Podcast
0 ratings
0% found this document useful
051: Strategy evaluation techniques, flaws and solutions with Dave Walton: Today we’re covering a topic which can really be a concern for traders of all levels, from beginner to pro, and that is the topic of strategy evaluation. Have you ever found that real-life performance does not match expected results? Or perhaps you...
UNLIMITED
051: Strategy evaluation techniques, flaws and solutions with Dave Walton: Today we’re covering a topic which can really be a concern for traders of all levels, from beginner to pro, and that is the topic of strategy evaluation. Have you ever found that real-life performance does not match expected results? Or perhaps you...
byBetter System Trader
0 ratings
0% found this document useful
Build Your Second Brain One Piece At A Time: Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use. In this episode he explains the data collection and preparation process, the collection of model types and sizes that work together to power the experience, and how to incorporate it into your workflow to act as a second brain.
UNLIMITED
Build Your Second Brain One Piece At A Time: Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use. In this episode he explains the data collection and preparation process, the collection of model types and sizes that work together to power the experience, and how to incorporate it into your workflow to act as a second brain.
byData Engineering Podcast
0 ratings
0% found this document useful
Safely Test Your Applications And Analytics With Production Quality Data Using Tonic AI: The most interesting and challenging bugs always happen in production, but recreating them is a constant challenge due to differences in the data that you are working with. Building your own scripts to replicate data from production is time consuming and error-prone. Tonic is a platform designed to solve the problem of having reliable, production-like data available for developing and testing your software, analytics, and machine learning projects. In this episode Adam Kamor explores the factors that make this such a complex problem to solve, the approach that he and his team have taken to turn it into a reliable product, and how you can start using it to replace your own collection of scripts.
UNLIMITED
Safely Test Your Applications And Analytics With Production Quality Data Using Tonic AI: The most interesting and challenging bugs always happen in production, but recreating them is a constant challenge due to differences in the data that you are working with. Building your own scripts to replicate data from production is time consuming and error-prone. Tonic is a platform designed to solve the problem of having reliable, production-like data available for developing and testing your software, analytics, and machine learning projects. In this episode Adam Kamor explores the factors that make this such a complex problem to solve, the approach that he and his team have taken to turn it into a reliable product, and how you can start using it to replace your own collection of scripts.
byData Engineering Podcast
0 ratings
0% found this document useful
Harnessing Generative AI For Creating Educational Content With Illumidesk: Generative AI has unlocked a massive opportunity for content creation. There is also an unfulfilled need for experts to be able to share their knowledge and build communities. Illumidesk was built to take advantage of this intersection. In this episode Greg Werner explains how they are using generative AI as an assistive tool for creating educational material, as well as building a data driven experience for learners.
UNLIMITED
Harnessing Generative AI For Creating Educational Content With Illumidesk: Generative AI has unlocked a massive opportunity for content creation. There is also an unfulfilled need for experts to be able to share their knowledge and build communities. Illumidesk was built to take advantage of this intersection. In this episode Greg Werner explains how they are using generative AI as an assistive tool for creating educational material, as well as building a data driven experience for learners.
byData Engineering Podcast
0 ratings
0% found this document useful
15: Lifecycle: A Martech Saga part 4: Picking the right MQL model: You need a good MQL model so that marketing leads make it to sales and get followed up. There are a lot of ways to define MQLs and pass them over. It’s very common to have a lead scoring model, and it’s the best way to get to build a scalable, highly auto
UNLIMITED
15: Lifecycle: A Martech Saga part 4: Picking the right MQL model: You need a good MQL model so that marketing leads make it to sales and get followed up. There are a lot of ways to define MQLs and pass them over. It’s very common to have a lead scoring model, and it’s the best way to get to build a scalable, highly auto
byHumans of Martech
0 ratings
0% found this document useful
How Column-Aware Development Tooling Yields Better Data Models: Architectural decisions are all based on certain constraints and a desire to optimize for different outcomes. In data systems one of the core architectural exercises is data modeling, which can have significant impacts on what is and is not possible for downstream use cases. By incorporating column-level lineage in the data modeling process it encourages a more robust and well-informed design. In this episode Satish Jayanthi explores the benefits of incorporating column-aware tooling in the data modeling process.
UNLIMITED
How Column-Aware Development Tooling Yields Better Data Models: Architectural decisions are all based on certain constraints and a desire to optimize for different outcomes. In data systems one of the core architectural exercises is data modeling, which can have significant impacts on what is and is not possible for downstream use cases. By incorporating column-level lineage in the data modeling process it encourages a more robust and well-informed design. In this episode Satish Jayanthi explores the benefits of incorporating column-aware tooling in the data modeling process.
byData Engineering Podcast
0 ratings
0% found this document useful
Increase Your Odds Of Success For Analytics And AI Through More Effective Knowledge Management With AlignAI: Making effective use of data requires proper context around the information that is being used. As the size and complexity of your organization increases the difficulty of ensuring that everyone has the necessary knowledge about how to get their work done scales exponentially. Wikis and intranets are a common way to attempt to solve this problem, but they are frequently ineffective. Rehgan Avon co-founded AlignAI to help address this challenge through a more purposeful platform designed to collect and distribute the knowledge of how and why data is used in a business. In this episode she shares the strategic and tactical elements of how to make more effective use of the technical and organizational resources that are available to you for getting work done with data.
UNLIMITED
Increase Your Odds Of Success For Analytics And AI Through More Effective Knowledge Management With AlignAI: Making effective use of data requires proper context around the information that is being used. As the size and complexity of your organization increases the difficulty of ensuring that everyone has the necessary knowledge about how to get their work done scales exponentially. Wikis and intranets are a common way to attempt to solve this problem, but they are frequently ineffective. Rehgan Avon co-founded AlignAI to help address this challenge through a more purposeful platform designed to collect and distribute the knowledge of how and why data is used in a business. In this episode she shares the strategic and tactical elements of how to make more effective use of the technical and organizational resources that are available to you for getting work done with data.
byData Engineering Podcast
0 ratings
0% found this document useful
Let Your Business Intelligence Platform Build The Models Automatically With Omni Analytics: Business intelligence has gone through many generational shifts, but each generation has largely maintained the same workflow. Data analysts create reports that are used by the business to understand and direct the business, but the process is very labor and time intensive. The team at Omni have taken a new approach by automatically building models based on the queries that are executed. In this episode Chris Merrick shares how they manage integration and automation around the modeling layer and how it improves the organizational experience of business intelligence.
UNLIMITED
Let Your Business Intelligence Platform Build The Models Automatically With Omni Analytics: Business intelligence has gone through many generational shifts, but each generation has largely maintained the same workflow. Data analysts create reports that are used by the business to understand and direct the business, but the process is very labor and time intensive. The team at Omni have taken a new approach by automatically building models based on the queries that are executed. In this episode Chris Merrick shares how they manage integration and automation around the modeling layer and how it improves the organizational experience of business intelligence.
byData Engineering Podcast
0 ratings
0% found this document useful
DevOps and Incident Response Evolution
UNLIMITED
DevOps and Incident Response Evolution
byThe Cloudcast
0 ratings
0% found this document useful
Automating Analytics Teams
UNLIMITED
Automating Analytics Teams
byThe Cloudcast
0 ratings
0% found this document useful
MLOps Coffee Sessions #10 Analyzing the Article “Continuous Delivery and Automation Pipelines in Machine Learning" // Part 2
UNLIMITED
MLOps Coffee Sessions #10 Analyzing the Article “Continuous Delivery and Automation Pipelines in Machine Learning" // Part 2
byMLOps.community
0 ratings
0% found this document useful
How to choose a digital slide scanner w/ Doug Stapleton, Hamamatsu
UNLIMITED
How to choose a digital slide scanner w/ Doug Stapleton, Hamamatsu
byDigital Pathology Podcast
0 ratings
0% found this document useful
Building Vector Search Applications
UNLIMITED
Building Vector Search Applications
byThe Cloudcast
0 ratings
0% found this document useful
Quantifying The Return On Investment For Your Data Team: As businesses increasingly invest in technology and talent focused on data engineering and analytics, they want to know whether they are benefiting. So how do you calculate the return on investment for data? In this episode Barr Moses and Anna Filippova explore that question and provide useful exercises to start answering that in your company.
UNLIMITED
Quantifying The Return On Investment For Your Data Team: As businesses increasingly invest in technology and talent focused on data engineering and analytics, they want to know whether they are benefiting. So how do you calculate the return on investment for data? In this episode Barr Moses and Anna Filippova explore that question and provide useful exercises to start answering that in your company.
byData Engineering Podcast
0 ratings
0% found this document useful
Composable Data Analytics
UNLIMITED
Composable Data Analytics
byThe Cloudcast
0 ratings
0% found this document useful
MLOps for GenAI Applications // Harcharan Kabbay // #256
UNLIMITED
MLOps for GenAI Applications // Harcharan Kabbay // #256
byMLOps.community
0 ratings
0% found this document useful
Use Consistent And Up To Date Customer Profiles To Power Your Business With Segment Unify
UNLIMITED
Use Consistent And Up To Date Customer Profiles To Power Your Business With Segment Unify
byData Engineering Podcast
0 ratings
0% found this document useful
Episode 283: Implementing Design Systems with Dan Mall: How do you make your design system an internal rooted practice? Our guest today is Dan Mall, entrepreneur and author of Design That Scales. You’ll learn how small practices can evolve into a design system, why adoption should tie in to your company’s mission statement, how to declare your strategic plan when establishing a design system, and more.
UNLIMITED
Episode 283: Implementing Design Systems with Dan Mall: How do you make your design system an internal rooted practice? Our guest today is Dan Mall, entrepreneur and author of Design That Scales. You’ll learn how small practices can evolve into a design system, why adoption should tie in to your company’s mission statement, how to declare your strategic plan when establishing a design system, and more.
byUI Breakfast: UI/UX Design and Product Strategy
0 ratings
0% found this document useful
Making Email Better With AI At Shortwave: Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he realized that he had even more potential for a transformative experience. In this episode he shares the technical challenges that he and his team have overcome in integrating AI into their product, as well as the benefits and features that it provides to their customers.
UNLIMITED
Making Email Better With AI At Shortwave: Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he realized that he had even more potential for a transformative experience. In this episode he shares the technical challenges that he and his team have overcome in integrating AI into their product, as well as the benefits and features that it provides to their customers.
byData Engineering Podcast
0 ratings
0% found this document useful
2024 Look Ahead - Using AI to Enable Personal Productivity
UNLIMITED
2024 Look Ahead - Using AI to Enable Personal Productivity
byThe Cloudcast
0 ratings
0% found this document useful
Understanding Time-Series Database Patterns
UNLIMITED
Understanding Time-Series Database Patterns
byThe Cloudcast
0 ratings
0% found this document useful
Value-Stream Mapping
UNLIMITED
Value-Stream Mapping
byThe Cloudcast
0 ratings
0% found this document useful
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
UNLIMITED
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
byData Engineering Podcast
0 ratings
0% found this document useful
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel: Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As the sophistication increases, so does the complexity, leading to challenges for user experience. Jignesh Patel has been researching these areas for several years in his work as a professor at Carnegie Mellon University. In this episode he illuminates the landscape of problems that we are faced with and how his research is aimed at helping to solve these problems.
UNLIMITED
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel: Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As the sophistication increases, so does the complexity, leading to challenges for user experience. Jignesh Patel has been researching these areas for several years in his work as a professor at Carnegie Mellon University. In this episode he illuminates the landscape of problems that we are faced with and how his research is aimed at helping to solve these problems.
byData Engineering Podcast
0 ratings
0% found this document useful

Skip carousel

Powering Costing With Artificial Intelligence: The Case Of Vodafone Procurement
The European Business Review
UNLIMITED
Powering Costing With Artificial Intelligence: The Case Of Vodafone Procurement
May 25, 2021
8 min read
Getting The edge
The European Business Review
UNLIMITED
Getting The edge
Feb 25, 2021
7 min read
COMPETITIVE ADVANTAGE THROUGH SOFTWARE: Contrasting Enterprises & Startups
The European Business Review
UNLIMITED
COMPETITIVE ADVANTAGE THROUGH SOFTWARE: Contrasting Enterprises & Startups
Feb 4, 2019
6 min read
Mining Actionable Information with Smart Capture
The European Business Review
UNLIMITED
Mining Actionable Information with Smart Capture
May 22, 2018
4 min read
Integrated Workplace Management Systems
Facility Management
UNLIMITED
Integrated Workplace Management Systems
Dec 23, 2018
Property and facilities management are data-rich operating worlds. This is becoming even more complex as the Internet of Things (IoT) provides the capability to imbed sensors and diagnostic tools to monitor the use and performance of everything in re
4 min read
2 The Use of Python in AI and ML
Techfastly
UNLIMITED
2 The Use of Python in AI and ML
Nov 30, 2020
3 min read
The Deep Learning Revolution For Artificial Intelligence
Facility Management
UNLIMITED
The Deep Learning Revolution For Artificial Intelligence
Mar 28, 2019
3 min read
Scikit-Learn: The Ultimate Python Library
APC
UNLIMITED
Scikit-Learn: The Ultimate Python Library
Jul 15, 2019
4 min read
An Expert Speaks Up on What You Should Know About Programming Languages
Entrepreneur
UNLIMITED
An Expert Speaks Up on What You Should Know About Programming Languages
Oct 1, 2015
1 min read
Generative AI: What Leaders Need To Know
Rotman Management
UNLIMITED
Generative AI: What Leaders Need To Know
Jan 1, 2024
12 min read
The Current Frontier In Undustrial Manufacturing: BRINGING SOFTWARE SYSTEMS TO MARKET
The European Business Review
UNLIMITED
The Current Frontier In Undustrial Manufacturing: BRINGING SOFTWARE SYSTEMS TO MARKET
Jan 31, 2020
6 min read
Web App Security
Linux Format
UNLIMITED
Web App Security
Jun 29, 2021
8 min read
In Conversation with Surbhi Rathore
Techfastly
UNLIMITED
In Conversation with Surbhi Rathore
Oct 1, 2021
4 min read
STRATEGIC EXCELLENCE: Steps to Maximise ROI in GEN AI Implementations
The European Business Review
UNLIMITED
STRATEGIC EXCELLENCE: Steps to Maximise ROI in GEN AI Implementations
May 28, 2024
4 min read
Machine Learning – With Zero Programming
APC
UNLIMITED
Machine Learning – With Zero Programming
Aug 12, 2019
6 min read
Cognitive Enterprise
Techfastly
UNLIMITED
Cognitive Enterprise
Dec 1, 2021
6 min read
Leadership Forum: Investing in Disruption
Rotman Management
UNLIMITED
Leadership Forum: Investing in Disruption
Jan 1, 2019
10 min read
Interview// From Kyiv, With Love
Essential Apple User Magazine
UNLIMITED
Interview// From Kyiv, With Love
Nov 21, 2019
3 min read
The Race To Exascale Supercomputers
Maximum PC
UNLIMITED
The Race To Exascale Supercomputers
Jun 21, 2022
9 min read
ARTIFICIAL INTELLIGENCE (AI) IN SUPPLY CHAIN PLANNING THE Future is Here & Now
The European Business Review
UNLIMITED
ARTIFICIAL INTELLIGENCE (AI) IN SUPPLY CHAIN PLANNING THE Future is Here & Now
Dec 3, 2019
7 min read
Understanding Your Technological Risks
NZBusiness and Management
UNLIMITED
Understanding Your Technological Risks
Jul 21, 2021
2 min read
Arnab PANDEY
Techfastly
UNLIMITED
Arnab PANDEY
Apr 1, 2021
11 min read
How Can AI Help Your Business?
PC Pro Magazine
UNLIMITED
How Can AI Help Your Business?
Jun 8, 2023
7 min read
Inside APC
APC
UNLIMITED
Inside APC
Oct 31, 2022
2 min read
Network-monitoring software 2024
PC Pro Magazine
UNLIMITED
Network-monitoring software 2024
Feb 8, 2024
4 min read
Thriving As An Ecosystem Partner
The European Business Review
UNLIMITED
Thriving As An Ecosystem Partner
Sep 30, 2022
Researching ecosystems that span industries from e-commerce and publishing to semiconductors and healthcare over the past decade, we found companies that have been successful for years by contributing to an ecosystem. Sometimes, by contributing as pa
10 min read
Quantum Leap
Marketing
UNLIMITED
Quantum Leap
Jul 11, 2019
6 min read
Inside APC
APC
UNLIMITED
Inside APC
Mar 20, 2023
APC is Australia’s oldest consumer technology magazine – having been consistently in print for over forty years, since our first issue way back in May 1980 – and we take that heritage and responsibility very seriously. While our focus is obviously on
2 min read
Inside APC
APC
UNLIMITED
Inside APC
Apr 20, 2023
APC is Australia’s oldest consumer technology magazine – having been consistently in print for over forty years, since our first issue way back in May 1980 – and we take that heritage and responsibility very seriously. While our focus is obviously on
2 min read
Inside APC
APC
UNLIMITED
Inside APC
May 22, 2023
2 min read

Related categories

Skip carousel

Reviews for Python Machine Learning By Example

Rating: 5 out of 5 stars

5/5

1 rating0 reviews

Book preview

Python Machine Learning By Example - Yuxi (Hayden) Liu

Python Machine Learning By Example

Title Page

Python Machine Learning By Example

Easy-to-follow examples that get you up and running with machine learning

Yuxi (Hayden) Liu

BIRMINGHAM - MUMBAI

Copyright

Credits

About the Author

Yuxi (Hayden) Liu is currently a data scientist working on messaging app optimization at a multinational online media corporation in Toronto, Canada. He is focusing on social graph mining, social personalization, user demographics and interests prediction, spam detection, and recommendation systems. He has worked for a few years as a data scientist at several programmatic advertising companies, where he applied his machine learning expertise in ad optimization, click-through rate and conversion rate prediction, and click fraud detection. Yuxi earned his degree from the University of Toronto, and published five IEEE transactions and conference papers during his master's research. He finds it enjoyable to crawl data from websites and derive valuable insights. He is also an investment enthusiast.

About the Reviewer

Alberto Boschetti is a data scientist with strong expertise in signal processing and statistics. He holds a PhD in telecommunication engineering and currently lives and works in London. In his work projects, he faces challenges daily, spanning across natural language processing (NLP), machine learning, and distributed processing. He is very passionate about his job and always tries to be updated on the latest developments of data science technologies, attending meetups, conferences, and other events. He is the author of Python Data Science Essentials, Regression Analysis with Python, and Large Scale Machine Learning with Python, all published by Packt.

I would like to thank my family, my friends, and my colleagues. Also, a big thanks to the open source community.

www.PacktPub.com

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at [email protected] for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www.packtpub.com/mapt

Get the most in-demand software skills with Mapt. Mapt gives you full access to all Packt books and video courses, as well as industry-leading tools to help you plan your personal development and advance your career.

Why subscribe?

Fully searchable across every book published by Packt

Copy and paste, print, and bookmark content

On demand and accessible via a web browser

Customer Feedback

Thanks for purchasing this Packt book. At Packt, quality is at the heart of our editorial process. To help us improve, please leave us an honest review on this book's Amazon page at https://www.amazon.com/dp/1783553111.

If you'd like to join our team of regular reviewers, you can e-mail us at [email protected]. We award our regular reviewers with free eBooks and videos in exchange for their valuable feedback. Help us be relentless in improving our products!

Credits

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

Getting Started with Python and Machine Learning

What is machine learning and why do we need it?

A very high level overview of machine learning

A brief history of the development of machine learning algorithms

Generalizing with data

Overfitting, underfitting and the bias-variance tradeoff

Avoid overfitting with cross-validation

Avoid overfitting with regularization

Avoid overfitting with feature selection and dimensionality reduction

Preprocessing, exploration, and feature engineering

Missing values

Label encoding

One-hot-encoding

Scaling

Polynomial features

Power transformations

Binning

Combining models

Bagging

Boosting

Stacking

Blending

Voting and averaging

Installing software and setting up

Troubleshooting and asking for help

Summary

Exploring the 20 Newsgroups Dataset with Text Analysis Algorithms

What is NLP?

Touring powerful NLP libraries in Python

The newsgroups data

Getting the data

Thinking about features

Visualization

Data preprocessing

Clustering

Topic modeling

Summary

Spam Email Detection with Naive Bayes

Getting started with classification

Types of classification

Applications of text classification

Exploring naive Bayes

Bayes' theorem by examples

The mechanics of naive Bayes

The naive Bayes implementations

Classifier performance evaluation

Model tuning and cross-validation

Summary

News Topic Classification with Support Vector Machine

Recap and inverse document frequency

Support vector machine

The mechanics of SVM

Scenario 1 - identifying the separating hyperplane

Scenario 2 - determining the optimal hyperplane

Scenario 3 - handling outliers

The implementations of SVM

Scenario 4 - dealing with more than two classes

The kernels of SVM

Choosing between the linear and RBF kernel

News topic classification with support vector machine

More examples - fetal state classification on cardiotocography with SVM

Summary

Click-Through Prediction with Tree-Based Algorithms

Brief overview of advertising click-through prediction

Getting started with two types of data, numerical and categorical

Decision tree classifier

The construction of a decision tree

The metrics to measure a split

The implementations of decision tree

Click-through prediction with decision tree

Random forest - feature bagging of decision tree

Summary

Click-Through Prediction with Logistic Regression

One-hot encoding - converting categorical features to numerical

Logistic regression classifier

Getting started with the logistic function

The mechanics of logistic regression

Training a logistic regression model via gradient descent

Click-through prediction with logistic regression by gradient descent

Training a logistic regression model via stochastic gradient descent

Training a logistic regression model with regularization

Training on large-scale datasets with online learning

Handling multiclass classification

Feature selection via random forest

Summary

Stock Price Prediction with Regression Algorithms

Brief overview of the stock market and stock price

What is regression?

Predicting stock price with regression algorithms

Feature engineering

Data acquisition and feature generation

Linear regression

Decision tree regression

Support vector regression

Regression performance evaluation

Stock price prediction with regression algorithms

Summary

Best Practices

Machine learning workflow

Best practices in the data preparation stage

Best practice 1 - completely understand the project goal

Best practice 2 - collect all fields that are relevant

Best practice 3 - maintain consistency of field values

Best practice 4 - deal with missing data

Best practices in the training sets generation stage

Best practice 5 - determine categorical features with numerical values

Best practice 6 - decide on whether or not to encode categorical features

Best practice 7 - decide on whether or not to select features and if so, how

Best practice 8 - decide on whether or not to reduce dimensionality and if so how

Best practice 9 - decide on whether or not to scale features

Best practice 10 - perform feature engineering with domain expertise

Best practice 11 - perform feature engineering without domain expertise

Best practice 12 - document how each feature is generated

Best practices in the model training, evaluation, and selection stage

Best practice 13 - choose the right algorithm(s) to start with

Naive Bayes

Logistic regression

SVM

Random forest (or decision tree)

Neural networks

Best practice 14 - reduce overfitting

Best practice 15 - diagnose overfitting and underfitting

Best practices in the deployment and monitoring stage

Best practice 16 - save, load, and reuse models

Best practice 17 - monitor model performance

Best practice 18 - update models regularly

Summary

Python Machine Learning By Example

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: May 2017

Production reference: 1290517

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-78355-311-2

www.packtpub.com

Preface

What this book covers

Chapter 1, Getting Started with Python and Machine Learning, is the starting point for someone who is looking forward to enter the field of ML with Python. You will get familiar with the basics of Python and ML in this chapter and set up the software on your machine.

Chapter 2, Exploring the 20 Newsgroups Dataset with Text Analysis Algorithms, explains important concepts such as getting the data, its features, and pre-processing. It also covers the dimension reduction technique, principal component analysis, and the k-nearest neighbors algorithm.

Chapter 3, Spam Email Detection with Naive Bayes, covers classification, naive Bayes, and its in-depth implementation, classification performance evaluation, model selection and tuning, and cross-validation. Examples such as spam e-mail detection are demonstrated.

Chapter 4, News Topic Classification with Support Vector Machine, covers multiclass classification, Support Vector Machine, and how it is applied in topic classification. Other important concepts, such as kernel machine, overfitting, and regularization, are discussed as well.

Chapter 5, Click-Through Prediction with Tree-Based Algorithms, explains decision trees and random forests in depth over the course of solving an advertising click-through rate problem.

Chapter 6, Click-Through Prediction with Logistic Regression, explains in depth the logistic regression classifier. Also, concepts such as categorical variable encoding, L1 and L2 regularization, feature selection, online learning, and stochastic gradient descent are detailed.

Chapter 7, Stock Price Prediction with Regression Algorithms, analyzes predicting stock market prices using Yahoo/Google Finance data and maybe additional data. Also, it covers the challenges in finance and brief explanations of related concepts.

Chapter 8, Best Practices, aims to foolproof your learning and get you ready for production.

After covering multiple projects in this book, the readers will have gathered a broad picture of the ML ecosystem using Python.

What you need for this book

The following are required for you to utilize this book:

scikit-learn 0.18.0

Numpy 1.1

Matplotlib 1.5.1

NLTK 3.2.2

pandas 0.19.2

GraphViz

Quandl Python API

You can use a 64-bit architecture, 2GHz CPU, and 8GB RAM to perform all the steps in this book. You will require at least 8GB of hard disk space.

Who this book is for

This book is for anyone interested in entering data science with machine learning. Basic familiarity with Python is assumed.

Conventions

In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: The target_names key gives the newsgroups names.

Any command-line input or output is written as follows:

ls -1 enron1/ham/*.txt | wc -l

3672

ls -1 enron1/spam/*.txt | wc -l

1500

New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: Heterogeneity Activity Recognition Data Set.

Warnings or important notes appear in a box like this.

Tips and tricks appear like this.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book-what you liked or disliked. Reader feedback is important for us as it helps us develop titles that you will really get the most out of.

To send us general feedback, simply e-mail [email protected], and mention the book's title in the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for this book from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you.

You can download the code files by following these steps:

Hover the mouse pointer on the SUPPORT tab at the top.

Click on Code Downloads & Errata.

Enter the name of the book in the Search box.

Select the book for which you're looking to download the code files.

Choose from the drop-down menu where you purchased this book from.

Click on Code Download.

Once the file is downloaded, please make sure that you unzip or extract the folder using the latest version of:

WinRAR / 7-Zip for Windows

Zipeg / iZip / UnRarX for Mac

7-Zip / PeaZip for Linux

The code bundle for the book is also hosted on GitHub at https://github.com/PacktPublishing/Python-Machine-Learning-By-Example. We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books-maybe a mistake in the text or the code-we would be grateful if you could report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded to our website or added to any list of existing errata under the Errata section of that title.

To view the previously submitted errata, go to https://www.packtpub.com/books/content/support and enter the name of the book in the search field. The required information will appear under the Errata section.

Piracy

Piracy of copyrighted material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works in any form on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.

Please contact us at [email protected] with a link to the suspected pirated material.

We appreciate your help in protecting our authors and our ability to bring you valuable content.

Questions

If you have a problem with any aspect of this book, you can contact us at [email protected], and we will do our best to address the problem.

Getting Started with Python and Machine Learning

We kick off our Python and machine learning journey with the basic, yet important concepts of machine learning. We will start with what machine learning is about, why we need it, and its evolution over the last few decades. We will then discuss typical machine learning tasks and explore several essential techniques of working with data and working with models. It is a great starting point of the subject and we will learn it in a fun way. Trust me. At the end, we will also set up the software and tools needed in this book.

We will get into details for the topics mentioned:

What is machine learning and why do we need it?

A very high level overview of machine learning

Generalizing with data

Overfitting and the bias variance trade off

Cross validation

Regularization

Dimensions and features

Preprocessing, exploration, and feature engineering

Missing Values

Label encoding

One hot encoding

Scaling

Polynomial features

Power transformations

Binning

Combining models

Bagging

Boosting

Stacking

Blending

Voting and averaging

Installing software and setting up

Troubleshooting and asking for help

What is machine learning and why do we need it?

Machine learning is a term coined around 1960 composed of two words—machine corresponding to a computer, robot, or other device, and learning an activity, or event patterns, which humans are good at.

So why do we need machine learning, why do we want a machine to learn as a human? There are many problems involving huge datasets, or complex calculations for instance, where it makes sense to let computers do all the work. In general, of course, computers and robots don't get tired, don't have to sleep, and may be cheaper. There is also an emerging school of thought called active learning or human-in-the-loop, which advocates combining the efforts of machine learners and humans. The idea is that there are routine boring tasks more suitable for computers, and creative tasks more suitable for humans. According to this philosophy, machines are able to learn, by following rules (or algorithms) designed by humans and to do repetitive and logic tasks desired by a human.

Machine learning does not involve the traditional type of programming that uses business rules. A popular myth says that the majority of the code in the world has to do with simple rules possibly programmed in Cobol, which covers the bulk of all the possible scenarios of client interactions. So why can't we just hire many software programmers and continue programming new rules?

One reason is that defining, maintaining, and updating rules becomes more and more expensive over time. The number of possible patterns for an activity or event could be enormous and therefore exhausting all enumeration is not practically feasible. It gets even more challenging to do so when

Enjoying the preview?

Page 1 of 1

Python Machine Learning By Example: The easiest way to get into machine learning

About this ebook

Yuxi (Hayden) Liu

Read more from Yuxi (Hayden) Liu

Python Machine Learning By Example: Unlock machine learning best practices with real-world use cases

Python Machine Learning By Example

Machine Learning with PyTorch and Scikit-Learn: Develop machine learning and deep learning models with Python

R Deep Learning Projects: Master the techniques to design and develop neural network models in R

PyTorch 1.x Reinforcement Learning Cookbook: Over 60 recipes to design, develop, and deploy self-learning AI models using Python

Related authors

Related to Python Machine Learning By Example

Related ebooks

Machine Learning Algorithms: Popular algorithms for data science and machine learning

Mastering Predictive Analytics with scikit-learn and TensorFlow: Implement machine learning techniques to build advanced predictive models using Python

Deep Learning By Example: A hands-on guide to implementing advanced machine learning algorithms and neural networks

Go Machine Learning Projects: Eight projects demonstrating end-to-end machine learning and predictive analytics applications in Go

Hands-On Data Science and Python Machine Learning: Perform data mining and machine learning efficiently using Python and Spark

Hands-On Neural Network Programming with C#: Add powerful neural network capabilities to your C# enterprise applications

Mastering Machine Learning Algorithms: Expert techniques to implement popular machine learning algorithms and fine-tune your models

Keras Reinforcement Learning Projects: 9 projects exploring popular reinforcement learning techniques to build self-learning agents

Statistics for Machine Learning

Hands-On Predictive Analytics with Python: Master the complete predictive analytics process, from problem definition to model deployment

R Machine Learning Projects: Implement supervised, unsupervised, and reinforcement learning techniques using R 3.5

R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet

Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch

Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more

Practical Convolutional Neural Networks: Implement advanced deep learning models using Python

Practical Computer Vision: Extract insightful information from images using TensorFlow, Keras, and OpenCV

Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems

Hands-on Machine Learning with JavaScript: Solve complex computational web problems using machine learning

Ensemble Machine Learning: A beginner's guide that combines powerful machine learning algorithms to build optimized models

Machine Learning With Go: Leverage Go's powerful packages to build smart machine learning and predictive applications, 2nd Edition

Python Machine Learning Blueprints: Put your machine learning concepts to the test by developing real-world smart projects, 2nd Edition

Apache Mahout Essentials

Artificial Intelligence By Example: Develop machine intelligence from scratch using real artificial intelligence use cases

Machine Learning with scikit-learn Quick Start Guide: Classification, regression, and clustering techniques in Python

Learning Data Mining with Python

Machine Learning with Scala Quick Start Guide: Leverage popular machine learning algorithms and techniques and implement them in Scala

Mastering Machine Learning with scikit-learn - Second Edition

Mastering Numerical Computing with NumPy: Master scientific computing and perform complex operations with ease

Practical Machine Learning with Python: Real-World Applications

Learning Data Mining with Python: Use Python to manipulate data and build predictive models

Programming For You

Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)

Excel 101: A Beginner's & Intermediate's Guide for Mastering the Quintessence of Microsoft Excel (2010-2019 & 365) in no time!

JavaScript All-in-One For Dummies

SQL All-in-One For Dummies

Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.

Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence

Coding All-in-One For Dummies

Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning

SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL

Grokking Algorithms: An illustrated guide for programmers and other curious people

Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees

HTML & CSS: Learn the Fundaments in 7 Days

Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps

C# Programming from Zero to Proficiency (Beginner): C# from Zero to Proficiency, #2

Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1

The Advanced Roblox Coding Book: An Unofficial Guide, Updated Edition: Learn How to Script Games, Code Objects and Settings, and Create Your Own World!

Python: For Beginners A Crash Course Guide To Learn Python in 1 Week

Linux: Learn in 24 Hours

CODING FOR ABSOLUTE BEGINNERS: How to Keep Your Data Safe from Hackers by Mastering the Basic Functions of Python, Java, and C++ (2022 Guide for Newbies)

PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project

Python QuickStart Guide: The Simplified Beginner's Guide to Python Programming Using Hands-On Projects and Real-World Applications

C Programming For Beginners: The Simple Guide to Learning C Programming Language Fast!

Beginning Programming with C++ For Dummies

The Python Workshop: Learn to code in Python and kickstart your career in software development or data science

Coding with JavaScript For Dummies

HTML in 30 Pages

Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)

So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen

The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code

SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days

Related podcast episodes

Related articles

Related categories

Reviews for Python Machine Learning By Example

What did you think?

Book preview

Python Machine Learning By Example - Yuxi (Hayden) Liu