ML Int


Subject: ML - Machine Learning

Topic or Unit of Study: first 2 units

Grade/Level: Internal Theory Exam

Questions:

01. What is P(X | Y)?
a. prior probability without observing data X
b. likelihood function*
c. probability of features independent of Y
d. conditional probability

02. To classify a new instance, we retrieve from memory the most similar training instance
(i.e., the training instance with the smallest Euclidean distance from the instance to be
classified), and simply assign that training instance’s class. Which classifier is it?
a. Bayesian classifier
b. Support Vector Machine
c. nearest-neighbor classifier*
d. k-cluster classifier
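
The rule described in question 02 can be sketched in a few lines. This is only an illustration under assumptions: the function name `nearest_neighbor_predict` and the toy `training_set` are hypothetical and do not come from the exam, and features are assumed to be plain numeric tuples.

```python
import math

def nearest_neighbor_predict(training_set, query):
    """Return the label of the training instance closest to `query`.

    `training_set` is a list of (features, label) pairs; closeness is
    measured with the Euclidean distance.
    """
    _, label = min(training_set, key=lambda pair: math.dist(pair[0], query))
    return label

# Hypothetical toy data, invented for illustration only.
training_set = [
    ((1.0, 1.0), "A"),
    ((1.5, 2.0), "A"),
    ((5.0, 5.0), "B"),
    ((6.0, 4.5), "B"),
]

# The closest stored instance to (5.5, 5.0) is (5.0, 5.0), which has class "B".
prediction = nearest_neighbor_predict(training_set, (5.5, 5.0))
```

No model is fitted here: the whole training set is kept in memory and searched at prediction time, which is what the question describes.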

03. Suppose you have trained a logistic regression classifier, and for a new example x
it outputs the prediction h_θ(x) = 0.2. This means:
a. Our estimate for P(y=0 | x)
b. Our estimate for P(y=1 | x)*
c. Our estimate for P(y=1 | x)
d. Our estimate for P(y=0 | x)
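
As a rough sketch of how the output in question 03 is read: in logistic regression the hypothesis h_θ(x) is the sigmoid of θ·x and is interpreted as the estimated P(y=1 | x), so the estimate for P(y=0 | x) is its complement. The parameter vector `theta` and the example `x` below are made-up values, assumed only for illustration.

```python
import math

def sigmoid(z):
    """Logistic function, mapping any real z into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def hypothesis(theta, x):
    """h_theta(x) = sigmoid(theta . x), read as the estimate of P(y=1 | x)."""
    z = sum(t * xi for t, xi in zip(theta, x))
    return sigmoid(z)

# Hypothetical parameters and input; x[0] = 1.0 plays the role of the bias term.
theta = [-2.0, 0.5]
x = [1.0, 1.0]

p_y1 = hypothesis(theta, x)   # estimated P(y=1 | x)
p_y0 = 1.0 - p_y1             # estimated P(y=0 | x)
```

With an output of 0.2, the same reading gives an estimate of 0.2 for P(y=1 | x) and 0.8 for P(y=0 | x).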

04. Google's self-driving vehicle is an example of which of the following characteristics of
ML?
a. The ability to take efficiency to the next level when merged with IoT
b. The ability to perform automated data visualization*
c. Customer engagement like never before
d. Accurate data analysis.

05. Which of the following classifications would best suit a student performance
classification system?
a. Cluster analysis
b. If-then analysis*
c. Market-basket analysis
d. Regression analysis

06. Which of the following is a widely used and effective machine learning algorithm
based on the idea of bagging?
a. Regression
b. Random Forest*
c. Classification
d. Decision Tree

07. Regression comes under the ________ model and subgroup discovery comes under
_____ learning.
a. Descriptive, supervised
b. Predictive, supervised*
c. Predictive, unsupervised
d. Descriptive, unsupervised

08. A computer program is said to learn from experience E with respect to some task T
and some performance measure P if its performance on T, as measured by P, improves
with experience E. Suppose we feed a learning algorithm a lot of historical weather
data, and have it learn to predict the weather. In this setting, what is P?
a. The weather prediction task.
b. The process of the algorithm examining a large amount of historical weather
data.
c. The probability of it correctly predicting a future date’s weather.*
d. The weather prediction task over the experience.

09. What is the formula for Euclidean distance between two points?
a. d = √((x_2 - x_1)² + (y_2 - y_1)²)*
b.
c.
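
A short worked check of the Euclidean distance formula in option a, assuming two points in the plane; the function name `euclidean_distance` and the sample points are chosen here for illustration.

```python
import math

def euclidean_distance(p1, p2):
    """Distance between two 2-D points: sqrt((x2 - x1)^2 + (y2 - y1)^2)."""
    (x1, y1), (x2, y2) = p1, p2
    return math.sqrt((x2 - x1) ** 2 + (y2 - y1) ** 2)

# The differences are 3 and 4, so the distance is sqrt(9 + 16) = 5.
d = euclidean_distance((1, 2), (4, 6))
```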

10. What is the disadvantage of decision trees?
a. Factor analysis
b. Decision trees are robust to outliers
c. Decision trees are prone to be overfit*
d. All of the above

11. When performing regression or classification, which of the following is the correct way
to preprocess the data?
a. Normalize the data -> PCA -> training*
b. PCA -> normalize PCA output -> training
c. Normalize the data -> PCA -> normalize PCA output -> training
d. All of the above

12. ________ is a way to ensemble multiple classification or regression models.
a. stacking
b. boosting*
c. blending
d. bagging

13. Which of these is a reasonable definition of machine learning?
a. Machine learning is the science of programming computers.
b. Machine learning is the field of allowing robots to act intelligently.
c. Machine learning is the field of study that gives computers the ability to
learn without being explicitly programmed.*
d. Machine learning learns from labeled data.

14. Which of the following is not a chatbot for addressing financial queries?
a. Finn AI
b. PayPal
c. Alexa*
d. Digital Genius

15. Suppose you are working on weather prediction, and your weather station makes one
of three predictions for each day's weather: Sunny, Cloudy or Rainy. You'd like to use a
learning algorithm to predict tomorrow's weather. Would you treat this as a
classification or a regression problem?
a. classification*
