Midterm Solutions
This exam is open book and open notes, but no cell phones or computers are allowed.
Question   Max score
1          14
2          8
3          9
4          19
Total      50
1. Short Questions.
a. (2 pts) Using a kernel function is equivalent to mapping the data into a higher-dimensional space and then taking the linear dot product in that space. Please explain the advantage of using the kernel function compared to doing the explicit mapping.
Solution: The main advantage is computational efficiency: the kernel evaluates the inner product in the feature space without ever constructing the high-dimensional feature vectors. It also allows us to map to an infinite-dimensional space when using the RBF kernel, which is not possible via explicit mapping.
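For illustration (not part of the original solution), here is a minimal Python sketch contrasting an explicit degree-2 feature map with the corresponding polynomial kernel; the map phi and the toy vectors are arbitrary choices.

```python
import numpy as np

def phi(x):
    """Explicit degree-2 feature map for a 2-D input (illustrative choice)."""
    x1, x2 = x
    return np.array([x1**2, x2**2, np.sqrt(2) * x1 * x2,
                     np.sqrt(2) * x1, np.sqrt(2) * x2, 1.0])

def poly_kernel(x, z, degree=2):
    """Polynomial kernel k(x, z) = (x.z + 1)^degree, computed without mapping."""
    return (np.dot(x, z) + 1.0) ** degree

x = np.array([1.0, 2.0])
z = np.array([3.0, -1.0])

# Both quantities agree, but the kernel never forms the 6-dimensional vectors.
print(np.dot(phi(x), phi(z)))   # explicit mapping, then dot product
print(poly_kernel(x, z))        # kernel trick
```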
c. (4 pts) Consider applying a soft-margin SVM to the 1-dimensional data shown below.
C = 0: All points will be support vectors. In this case there is no penalty for positive slack variables ξ_i, so the learned boundary will place all points inside the two lines w^T x + b = 1 and w^T x + b = −1, making every point a support vector.
C = ∞: X3 and X4. This is equivalent to a hard-margin SVM because any positive ξ_i would incur an infinitely large penalty and is therefore strictly avoided.
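To make the role of C concrete, a small sketch using scikit-learn's SVC is given below. The figure's points X1-X4 are not reproduced here, so the 1-D coordinates are hypothetical, and since SVC requires C > 0, a very small and a very large C stand in for C = 0 and C = ∞.

```python
import numpy as np
from sklearn.svm import SVC

# Hypothetical 1-D data standing in for the figure (actual X1..X4 not shown here).
X = np.array([[-2.0], [-1.0], [1.0], [2.0]])
y = np.array([-1, -1, 1, 1])

for C in (1e-3, 1e6):   # tiny C approximates "C = 0"; huge C approximates "C = infinity"
    clf = SVC(kernel="linear", C=C).fit(X, y)
    print(f"C={C:g}: {len(clf.support_)} support vectors")
```

With the tiny C, the slack penalty is negligible and every point typically ends up inside the margin (all four become support vectors); with the huge C, the fit behaves like a hard-margin SVM and only the two closest points remain support vectors.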
d. (2 pts) True or false. Provide a brief explanation. Consider applying Naive Bayes to a classification problem. Suppose the modeling assumptions made by Naive Bayes are true and we have infinite training data; then the learned Naive Bayes classifier will have zero training error.
False. When the modeling assumption is correct, a generative model like Naive Bayes learns the optimal decision boundary given infinite training data. However, it may not achieve zero training error if the two classes overlap.
e. (4 pts) What impact will the following operation have on overfitting: increase, decrease, or no impact?
Remove non-support-vector instances from the training set for SVM: no impact. The SVM decision boundary is determined only by the support vectors, so retraining without the non-support vectors yields the same classifier.
2 (2 pts each) Linear decision boundaries. Consider the binary classification problem shown in the figure below, where two possible linear decision boundaries are given. For each of the following algorithms, specify whether it will produce boundary #1, #2, or possibly both, and provide a one-sentence explanation for your answer.
Linear discriminant analysis: #1. The two classes appear to be Gaussian with the same covariance structure; under such conditions, LDA learns the boundary that optimally separates the two distributions.
Logistic regression: Both are possible, depending on the initial weights and the learning rate of gradient ascent.
Perceptron (stochastic gradient descent): Both are possible, depending on the initial weights, the learning rate of gradient descent, and the order in which it receives the training data.
3 Linear regression. Consider linear regression using polynomial basis functions of order M. The expected loss can be decomposed into three parts: the bias, the variance, and the noise.
a. (3 pts) Please provide the expressions for these three components.
See linear regression slide 36.
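Since the slide is not reproduced here, a standard form of the decomposition of the expected squared loss (following Bishop's notation; the course slide may use different symbols) is:

```latex
% Standard bias-variance-noise decomposition of the expected squared loss,
% with y(x; D) the model learned from training set D and h(x) = E[t | x].
\mathbb{E}[L] =
  \underbrace{\int \bigl(\mathbb{E}_{D}[y(x;D)] - h(x)\bigr)^{2}\, p(x)\,dx}_{\text{bias}^{2}}
+ \underbrace{\int \mathbb{E}_{D}\!\bigl[\bigl(y(x;D) - \mathbb{E}_{D}[y(x;D)]\bigr)^{2}\bigr]\, p(x)\,dx}_{\text{variance}}
+ \underbrace{\iint \bigl(h(x) - t\bigr)^{2}\, p(x,t)\,dx\,dt}_{\text{noise}}
```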
b. (3 pts) If we increase the order of the polynomial basis function (e.g., from quadratic to cubic, or higher-order polynomials), will it increase, decrease, or have no impact on each of the three components? Briefly explain.
Noise: no impact, because it is inherent to the data and has nothing to do with the learning algorithm.
Bias: decrease, because the increased model flexibility with higher order M allows a better fit to the data.
Variance: increase, because the increased flexibility makes it easier to fit the particularities of the training set D, leading to larger variance.
c. (3 pts) If we increase the training set size, will it increase, decrease, or have no impact on each of the three components? Briefly explain.
Noise: no impact, because the training set D does not appear in the noise expression at all.
Bias: we don't expect the bias to change. As we increase the training set size, the output model does not change much in expectation; in particular, as D grows infinitely large, the output model converges to its expectation, so the bias remains the same.
Variance: decrease, because a larger training set makes it less likely to overfit to the particularities of D; as D grows infinitely large, the variance decreases to zero.
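As an illustrative sketch (not from the original solution), the following simulation estimates bias² and variance of polynomial least-squares fits empirically; the sinusoidal target, noise level, and sizes are arbitrary assumptions, but the trends mirror the answers to parts b and c.

```python
import numpy as np

rng = np.random.default_rng(0)

def true_fn(x):
    return np.sin(2 * np.pi * x)          # h(x), arbitrary illustrative target

def experiment(M, N, n_trials=200, noise_std=0.3):
    """Estimate bias^2 and variance of order-M polynomial fits on size-N training sets."""
    x_grid = np.linspace(0, 1, 50)
    preds = np.empty((n_trials, x_grid.size))
    for t in range(n_trials):
        x = rng.uniform(0, 1, N)                       # a fresh training set D
        y = true_fn(x) + rng.normal(0, noise_std, N)
        coeffs = np.polyfit(x, y, deg=M)               # fit y(x; D)
        preds[t] = np.polyval(coeffs, x_grid)
    mean_pred = preds.mean(axis=0)                     # E_D[y(x; D)]
    bias2 = np.mean((mean_pred - true_fn(x_grid)) ** 2)
    variance = np.mean(preds.var(axis=0))
    return bias2, variance

for M in (1, 3, 9):
    print(f"M={M}, N=15 :", experiment(M, 15))         # bias falls, variance rises with M
for N in (15, 60, 240):
    print(f"M=9, N={N}:", experiment(9, N))            # variance falls as N grows
```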
4 Naive Bayes. Consider a binary classification problem with variable X1 ∈ {0, 1} and label Y ∈ {0, 1}. The true generative distribution P(X1, Y) = P(Y)P(X1 | Y) is shown in Table 1 and Table 2.

Table 1: P(Y)
Y = 0    Y = 1
0.8      0.2

Table 2: P(X1 | Y)
         X1 = 0   X1 = 1
Y = 0    0.7      0.3
Y = 1    0.3      0.7
b. (3 pts) What is the expected error rate of this classifier on training examples generated according to Table 1 and Table 2? In other words, what is P(Y ≠ Ŷ(X1))?
(Hint: P(Y ≠ Ŷ(X1)) = P(Y ≠ Ŷ(X1), X1 = 1) + P(Y ≠ Ŷ(X1), X1 = 0))
Since P(Y = 0, X1 = 0) = 0.56 > P(Y = 1, X1 = 0) = 0.06 and P(Y = 0, X1 = 1) = 0.24 > P(Y = 1, X1 = 1) = 0.14, the classifier predicts Ŷ = 0 for both values of X1. Therefore
P(Y ≠ Ŷ(X1)) = P(Y = 1, X1 = 0) + P(Y = 1, X1 = 1) = 0.06 + 0.14 = 0.2
c. Now we add a feature X2 to this data such that X2 is an exact duplicate of X1. Suppose we have trained a Naive Bayes classifier using infinite training data generated according to Tables 1-2, after adding the duplicate feature X2. Please fill in the following tables.
i. (2 pts) Fill in the probabilities for P(X2 | Y) in Table 4.
ii. (4 pts) Fill in the probabilities for Table 5 and write down the predictions of Y for different combinations of X1 and X2 values.
Table 4: P(X2 | Y) (identical to P(X1 | Y), since X2 duplicates X1)
         X2 = 0   X2 = 1
Y = 0    0.7      0.3
Y = 1    0.3      0.7

Table 5: Naive Bayes scores P(Y)P(X1 | Y)P(X2 | Y) and the resulting predictions
X1   X2   Y = 0    Y = 1    Prediction
0    0    0.392    0.018    Ŷ = 0
0    1    0.168    0.042    Ŷ = 0
1    0    0.168    0.042    Ŷ = 0
1    1    0.072    0.098    Ŷ = 1
d. (3 pts) What is the expected error rate of this Naive Bayes classifier on this data?
P(Y ≠ Ŷ(X1, X2))
= P(Y ≠ Ŷ(X1, X2), X1 = 0) + P(Y ≠ Ŷ(X1, X2), X1 = 1)
= P(Y = 1, X1 = 0) + P(Y = 0, X1 = 1)
= 0.06 + 0.24 = 0.3
e. (3 pts) Compare the error rate in d to the error rate in b. What is the reason for the difference?
The error rate increases. This is because X1 and X2 are not conditionally independent given Y, violating the Naive Bayes assumption.
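As a sanity check (not part of the original solution), the following sketch enumerates the joint distribution from Tables 1-2 and reproduces the 0.2 and 0.3 error rates from parts b and d.

```python
# Enumerate the true joint distribution from Tables 1-2 and compute the
# expected error of (i) the classifier using X1 alone (part b) and
# (ii) Naive Bayes with the duplicated feature X2 = X1 (part d).
p_y = {0: 0.8, 1: 0.2}                      # Table 1: P(Y)
p_x_given_y = {0: {0: 0.7, 1: 0.3},         # Table 2: P(X1 | Y = 0)
               1: {0: 0.3, 1: 0.7}}         #          P(X1 | Y = 1)

def predict_single(x1):
    """argmax_y P(y) P(x1 | y): the classifier from part b."""
    return max((0, 1), key=lambda y: p_y[y] * p_x_given_y[y][x1])

def predict_duplicate(x1, x2):
    """Naive Bayes with X2 an exact copy of X1 (part d): P(x | y) counted twice."""
    return max((0, 1), key=lambda y: p_y[y] * p_x_given_y[y][x1] * p_x_given_y[y][x2])

err_b = sum(p_y[y] * p_x_given_y[y][x1]
            for y in (0, 1) for x1 in (0, 1)
            if predict_single(x1) != y)
err_d = sum(p_y[y] * p_x_given_y[y][x1]
            for y in (0, 1) for x1 in (0, 1)
            if predict_duplicate(x1, x1) != y)
print(err_b, err_d)   # 0.2 and 0.3
```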