Ist 407 Presentation

Uploaded by

The document summarizes an analysis of factors that affect health insurance costs and compares machine learning algorithms for predicting costs. It explores how age, sex, BMI, family size, region, and smoking affect costs using a dataset of over 1,300 insurance records. It tests decision trees, support vector machines, KNN, naive Bayes, neural networks, and XGBoost for prediction and finds that KNN performs best with an accuracy of 94.69%. Smoking is identified as the strongest predictor of costs.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Ist 407 Presentation

Uploaded by

api-529383903

0% found this document useful (0 votes)

151 views12 pages

Original Description:

Original Title

ist 407 presentation

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

151 views12 pages

Ist 407 Presentation

Uploaded by

api-529383903

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Jump to Page

You are on page 1of 12

Search inside document

Insurance

Connor Hanan, Emma Lehr, Natalie Ruppel

Research Questions

1. What factors constitute the

beneficiary getting charged the most?
2. Which machine learning algorithm is
best at predicting the charges?
About Our Dataset
● We are using the insurance.csv dataset found on kaggle:
https://www.kaggle.com/mirichoi0218/insurance
● There are 1,338 records
● There are 7 attributes: Age, Sex, BMI, Number of Children, Region, Smoking (yes or
no), and charges (how much was the person charged for insurance)
● Our defined target variable is “charges”
For added complexity:
We included an additional dataset that gave health attributes about specific states.
We linked this to our dataset by assigning the states to the already existing regions. We
made the addition because we wanted to see if there was more evidence for confirming a
relationship between health insurance costs and region beyond the given attributes in our
main dataset.
Experiments
Decision
Tree/Pruned
Decision Tree
What is it? A decision tree is a specific
type of flow chart used to visualize the
decision-making process by mapping
out different courses of action, as well
as their potential outcomes.
Support Vector Machines
What is it? SVMs work by trying to Kernels Used:
divide up all the data points using
● Linear
the kernel trick. This draws a line
● Radial
(called a “hyperplane”), trying to
● Polynomial
maximize the distance between the
different classes of points as
possible.
KNN
What is it? KNN works by finding the distances between a query
and all the examples in the data, selecting the specified number
examples (K) closest to the query, then votes for the most frequent
label (in the case of classification) or averages the labels (in the
case of regression).
Naive Bayes
What is it? It is a classification technique based on Bayes' Theorem with an
assumption of independence among predictors. In simple terms, a Naive Bayes
classifier assumes that the presence of a particular feature in a class is unrelated to
the presence of any other feature.
Multiclass Artificial Neural Network

What is it? In multi-class classiﬁcation, the

neural network has the same number of
output nodes as the number of classes.
Each output node belongs to some class
and outputs a score for that class.
Multi-Class Classiﬁcation (3 classes)
Scores from the last layer are passed
through a softmax layer.
Multiclass XGBoost
What is it? XGBoost is a decision-tree-based ensemble Machine Learning
algorithm that uses an extreme gradient boosting framework. The
extreme version is the exact same as the original, with the extreme one
being focused on speed and performance.
Comparison of the Results
Conclusion:
Being a smoker had the highest information gain in determining the health
insurance cost.

The machine learning algorithm best at predicting charges is the KNN model. The
machine algorithm the worst at predicting charges is Naive Bayes.

KNN had the highest classiﬁcation accuracy of 94.69%

Coincent - Data Science With Python Assignment
Document23 pages
Coincent - Data Science With Python Assignment
Sai Nikhil Nellore
100% (2)
Machine Learning QNA
Document1 page
Machine Learning QNA
pratikmovie999
No ratings yet
Machine Learning Midterm
Document18 pages
Machine Learning Midterm
serialkillerseeyou
No ratings yet
Unit 5
Document8 pages
Unit 5
arinkamble1711
No ratings yet
1.0 Modeling: 1.1 Classification
Document5 pages
1.0 Modeling: 1.1 Classification
Banujan Kuhaneswaran
No ratings yet
U02Lecture08 Statistical Machine Learning
Document41 pages
U02Lecture08 Statistical Machine Learning
tunio.bscsf21
No ratings yet
Decision Tree Classifiers With Ga Based Feature Selection
Document10 pages
Decision Tree Classifiers With Ga Based Feature Selection
TJPRC Publications
No ratings yet
Machine Learning 1707965934
Document15 pages
Machine Learning 1707965934
robson110770
No ratings yet
U21amg05 Aif and ML Unit 04 Notes
Document42 pages
U21amg05 Aif and ML Unit 04 Notes
22cs103
No ratings yet
Decision Tree
Document16 pages
Decision Tree
aecsaranyadurai
No ratings yet
INT 354 CA1 Mokshagna
Document8 pages
INT 354 CA1 Mokshagna
Praveen Kumar Ummidi
No ratings yet
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
Document3 pages
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
EighthSenseGroup
No ratings yet
Ijcsea 2
Document13 pages
Ijcsea 2
Billy Bryan
No ratings yet
Interview AI Algo
Document3 pages
Interview AI Algo
ripal.ranpara
No ratings yet
Asign-3 DWDM
Document27 pages
Asign-3 DWDM
Rohilla Jatin
No ratings yet
Decision Tree
Document18 pages
Decision Tree
Mo Shah
No ratings yet
Breast Cancer Classification
Document16 pages
Breast Cancer Classification
Tester
100% (2)
DWBI4
Document10 pages
DWBI4
Dhanraj Deore
No ratings yet
ML (Interview)
Document20 pages
ML (Interview)
ratnadepp
No ratings yet
Decision Tree and Related Techniques For Classification in Scalation
Document12 pages
Decision Tree and Related Techniques For Classification in Scalation
Zazkyeya
No ratings yet
Machine Learning Algorithms For Breast Cancer Prediction
Document8 pages
Machine Learning Algorithms For Breast Cancer Prediction
Vartika Anand
No ratings yet
DM Lecture 06
Document32 pages
DM Lecture 06
Sameer Ahmad
No ratings yet
Machine Learning (AR)
Document5 pages
Machine Learning (AR)
MIRAJ MIAH
No ratings yet
Hyper-Heuristic Decision Tree Induction: Alan Vella, David Corne Chris Murphy
Document6 pages
Hyper-Heuristic Decision Tree Induction: Alan Vella, David Corne Chris Murphy
Sigfrid Sigfridson
No ratings yet
Unit 4 - Machine Learning - WWW - Rgpvnotes.in PDF
Document27 pages
Unit 4 - Machine Learning - WWW - Rgpvnotes.in PDF
Youth Maker
No ratings yet
Classification
Document7 pages
Classification
divyanshu.chouhan786
No ratings yet
Recommendation Systems
Document27 pages
Recommendation Systems
Rexline S J
No ratings yet
Lecture 13 - Unsupervised Learning, PCA ICA
Document50 pages
Lecture 13 - Unsupervised Learning, PCA ICA
kateryna.koval
No ratings yet
MLunit 2 Mynotes
Document15 pages
MLunit 2 Mynotes
Vali Bhasha
No ratings yet
Heart Disease Prediction
Document16 pages
Heart Disease Prediction
Ritika Mandliya
No ratings yet
Data Science Intervieew Questions
Document16 pages
Data Science Intervieew Questions
Satyam Anand
100% (1)
Adbms Assignment 5: Q.1) Comparison of All Classification Algorithms Logistic Regression
Document4 pages
Adbms Assignment 5: Q.1) Comparison of All Classification Algorithms Logistic Regression
Shivam Israni
No ratings yet
Unit 1
Document15 pages
Unit 1
Lavanya Venkata
No ratings yet
Lecture 12 - Unsupervised Learning - Shoould Be Marged
Document31 pages
Lecture 12 - Unsupervised Learning - Shoould Be Marged
kateryna.koval
No ratings yet
DS - UNIT - III - QB & Ans
Document25 pages
DS - UNIT - III - QB & Ans
sarangrao2304
No ratings yet
Top 10 Machine Learning Algorithms With Their Use
Document12 pages
Top 10 Machine Learning Algorithms With Their Use
irma komariah
No ratings yet
Decision Tree Algorithm, Explained-1-22
Document22 pages
Decision Tree Algorithm, Explained-1-22
shyla
No ratings yet
Interview Questions For DS & DA (ML)
Document66 pages
Interview Questions For DS & DA (ML)
pratikmovie999
100% (1)
Data Science Technical Interview Questions
Document24 pages
Data Science Technical Interview Questions
pablo.villegas.mills
No ratings yet
Assignment 2
Document111 pages
Assignment 2
BWENGYE JUSTUS
No ratings yet
DW Ans
Document11 pages
DW Ans
IT11 BHAGYA LAKSHMI V
No ratings yet
I Am Sharing 'Interview' With You
Document65 pages
I Am Sharing 'Interview' With You
Branch Reed
100% (3)
Decision Tree
Document57 pages
Decision Tree
Prabhjit Singh
100% (1)
UNIT 2 - Notes
Document31 pages
UNIT 2 - Notes
126Monish B
No ratings yet
Models For Machine Learning: M. Tim Jones
Document10 pages
Models For Machine Learning: M. Tim Jones
Shanti Guru
No ratings yet
Machine Learning (Part 1) : Iykra Data Fellowship Batch 3
Document28 pages
Machine Learning (Part 1) : Iykra Data Fellowship Batch 3
aril dan
No ratings yet
Big Data Analytics Algorithm, Tools in Systematic Review
Document7 pages
Big Data Analytics Algorithm, Tools in Systematic Review
tharani devi
No ratings yet
Chapter - 1: 1.1 Overview
Document50 pages
Chapter - 1: 1.1 Overview
karthik0484
No ratings yet
Project Report
Document13 pages
Project Report
Sanjay Kumar
No ratings yet
Ist 407 Final Paper
Document6 pages
Ist 407 Final Paper
api-529383903
No ratings yet
Machine Learning Techniques Assignment-7: Name:Ishaan Kapoor Rollno:1/15/Fet/Bcs/1/055
Document5 pages
Machine Learning Techniques Assignment-7: Name:Ishaan Kapoor Rollno:1/15/Fet/Bcs/1/055
bharti goyal
No ratings yet
ASCAI - Adaptive Sampling For Acquiring Compact AI
Document8 pages
ASCAI - Adaptive Sampling For Acquiring Compact AI
notsure.g6rp1
No ratings yet
Machine Learning Section4 Ebook v03
Document20 pages
Machine Learning Section4 Ebook v03
camgova
No ratings yet
Assignment 1
Document2 pages
Assignment 1
A054 Shubham funday
No ratings yet
Unit 4 AI LASK
Document7 pages
Unit 4 AI LASK
TEJASHREE KUMAR
No ratings yet
Chapter 2 Statistics Review 2023
Document21 pages
Chapter 2 Statistics Review 2023
Minh Khánh
No ratings yet
Wa0000.
Document26 pages
Wa0000.
Lakkarsu Poojitha
No ratings yet
Data Science Interview Questions
Document68 pages
Data Science Interview Questions
Ava White
100% (1)
Survey Paper On Classification
Document6 pages
Survey Paper On Classification
Pratik
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
CTRLX Automation Brochure
Document60 pages
CTRLX Automation Brochure
Ninja do Sofá
No ratings yet
Advanced Programming Language Concepts
Document29 pages
Advanced Programming Language Concepts
dropjar
No ratings yet
Scheduling Algorithm
Document27 pages
Scheduling Algorithm
Saranya A
No ratings yet
®epartment of Bucatton
Document2 pages
®epartment of Bucatton
Lea Parcia
No ratings yet
Project PPT Present1
Document24 pages
Project PPT Present1
khoda tako
No ratings yet
Slide Arduino IDEASasd
Document49 pages
Slide Arduino IDEASasd
Samz Adrian
No ratings yet
Estimate GSQT112 30-07-2020 PDF
Document1 page
Estimate GSQT112 30-07-2020 PDF
Samarendra Jena
No ratings yet
Section 1: Correct
Document10 pages
Section 1: Correct
misbahul
No ratings yet
Room Units Controllers Remote I/O Gateways
Document3 pages
Room Units Controllers Remote I/O Gateways
Minh Tu
No ratings yet
Artificial Intelligence Help Twitter To Verify Information
Document3 pages
Artificial Intelligence Help Twitter To Verify Information
ilyes bouallagui
No ratings yet
Website Authoring
Document24 pages
Website Authoring
Thein Zaw Min
No ratings yet
Study On Cloud Computing
Document6 pages
Study On Cloud Computing
Manas Bhatia
No ratings yet
Developer Training For Apache Spark and Hadoop: Hands-On Exercises
Document113 pages
Developer Training For Apache Spark and Hadoop: Hands-On Exercises
Aiswarya Nimmagadda
No ratings yet
5 Data Types - Lists
Document37 pages
5 Data Types - Lists
Rakshitsingh127021
No ratings yet
User Interface Design: Prashamsa Mishra
Document64 pages
User Interface Design: Prashamsa Mishra
Sagar Hooda
No ratings yet
W3Schools Quiz Results1
Document8 pages
W3Schools Quiz Results1
GAUTAMI UPPADA
No ratings yet
SLA-Asset Cost Account
Document12 pages
SLA-Asset Cost Account
Mahmoud Kamal
No ratings yet
Vdocuments - MX - Anonimo El Libro Hacker 55993a3834ba2
Document163 pages
Vdocuments - MX - Anonimo El Libro Hacker 55993a3834ba2
David Benavent Ferrer
No ratings yet
Redescobrindo o C++ Com Problemas NP-completos, Lambdas, Monads, IA e Paralelismo
Document54 pages
Redescobrindo o C++ Com Problemas NP-completos, Lambdas, Monads, IA e Paralelismo
Fábio da Silva Santana
No ratings yet
1.problem Description
Document185 pages
1.problem Description
Najma Begum S A
No ratings yet
Iso 24517-1-2008
Document34 pages
Iso 24517-1-2008
irdynamics.2020
No ratings yet
Somachine Software 4.3: Release Notes
Document30 pages
Somachine Software 4.3: Release Notes
Cesar Augusto Navarro Salas
No ratings yet
TD Osb Sinc
Document163 pages
TD Osb Sinc
Gabriel Abelha Dos Santos
No ratings yet
Guidelines - 1
Document6 pages
Guidelines - 1
Александра Ивановска
No ratings yet
SSC Service Utility For Epson Stylus Printers.: Russian Version
Document4 pages
SSC Service Utility For Epson Stylus Printers.: Russian Version
Master Cartuchos
No ratings yet
Question Bank Bca 605: Visual Basics Programming Unit 1
Document2 pages
Question Bank Bca 605: Visual Basics Programming Unit 1
Rutuja
No ratings yet
The Quantum Technology Monitor: Facts and Figures
Document45 pages
The Quantum Technology Monitor: Facts and Figures
w;pjo2h
No ratings yet
Unlock An Iphone in Activation Lock Mode: What Information Do I Need Before Calling Apple?
Document3 pages
Unlock An Iphone in Activation Lock Mode: What Information Do I Need Before Calling Apple?
lzad6136
No ratings yet
Classification of Peripheral Devices
Document2 pages
Classification of Peripheral Devices
Ashleyn Mary Sanders
No ratings yet
Ar86ctrl v1.41 Manual
Document22 pages
Ar86ctrl v1.41 Manual
Achmad Arizki Kosasih
No ratings yet